Abstract
Keywords
1. Introduction
To calculate variance estimates in survey samples, commonly used methods include Taylor Series Linearization (TSL), jackknife, and Balanced Repeated Replication (BRR; interested readers can refer to Cohen (1979), Burt and Cohen (1984), Rao and Wu (1988), Rao (1988) and Wolter (2007), etc.). However, in many large-scale sample surveys such as the Current Population Survey (CPS) and the Canadian Labour Force Survey (CLFS), calculating thousands of variance estimates using these standard methods requires a significant amount of labor.
The Generalized Variance Functions (GVFs) have been used by CPS since 1947 (U.S. Census Bureau 2006). GVF develops regression estimates using a set of variance estimates obtained through BRR or TSL methods, along with their associated survey statistics. It allows for the direct calculation of standard errors for a large volume of potential survey statistics through a fitted regression model. Valliant (1987) demonstrated that the GVF model produces consistent estimates of the variance for a certain class of superpopulation models. However, while many current surveys follow the same households at regular time intervals, GVF does not take advantage of the longitudinal feature of the data.
Zhang et al. (2019) proposed a longitudinal generalized variance function (LGVF) by incorporating the time effect into the modeling of longitudinal survey data. The proposed LGVF maintains the property of GVF, where the estimated variances are often more stable than the direct estimates as they smooth out some of the variability from variable to variable when the design effects (determined by comparing the variance of a variable under a particular survey design with the variance under a simple random sampling [SRS] design with the same size) for the variables are similar to each other. However, a challenge faced by both GVF and LGVF is selecting a group of variables with similar design effects (deffs) to build a regression line, which is often practically difficult. Johnson and King (1986) studied GVF estimators using a national survey of reading ability among young adults and found that one way to markedly improve upon the GVF model is to use prior information about the design effect (deff) of individual estimators.
Building upon the groundwork laid by Zhang et al. (2019) and Johnson and King (1986), this research introduces the Longitudinal Adjusted Design Effect model (LADE). The LADE integrates both design and time effects using a model-based approach. The structure of the paper is as follows: Section 2 provides a comprehensive review of Generalized Variance Function (GVF) and Longitudinal Generalized Variance Function (LGVF) models; Section 3 establishes the theoretical framework, introduces the LADEs, and outlines the asymptotic properties of these models; Section 4 presents results from simulation studies conducted with CPS data. The research concludes in Section 5.
2. Generalized Variance Functions (GVFs) and Longitudinal Generalized Variance Functions (LGVFs)
In this section, we briefly review GVF and LGVF models. More detailed description of GVF can be found from textbooks by Wolter (2007), and Lohr (2021). Detailed description of LGVF can be found from Zhang et al. (2019).
2.1. Generalized Variance Functions (GVF)
Let
where
assuming that
2.2. LGVF
Zhang et al. (2019) introduced the Longitudinal Generalized Variance Function (LGVF), incorporating longitudinal information and addressing for time effects. Let
where
Let
The predicted relative variance by LGVF value is
Under certain conditions, the ratios of relative variances and predicted relative variances from LGVF converge in probability to 1. The study by Zhang et al. (2019) demonstrated that LGVF was more efficient in reducing mean squared prediction errors (MSPE) compared to GVF.
3. The Longitudinal Adjusted Design Effects Models
In this section, we begin by introducing notation that closely adheres to conventions set by Valliant (1987) and Zhang et al. (2019). Subsequently, we present the Longitudinal Adjusted Design Effect models (LADEs), seamlessly integrating both the design effect and time effect. Finally, we explore the properties exhibited by specific types of estimators within the framework of LADE.
3.1. Notation
In a stratified two-stage cluster sampling design, various indices are employed to represent distinct levels within the sampling process. Specifically,
At the PSU level, we denote
Moving to the sample level, we denote
At time
Here,
For example, the Horvitz-Thompson estimator, which is commonly used when PSUs are selected with probabilities proportional to
where
For prediction purposes, the following model assumptions apply:
where within a PSU
Similar formulations can be found in the works of Scott and Smith (1969), Royall (1976, 1986), and Burdick and Sielken (1979). A general direct variance estimator for
where,
Let
For example, for a binary distribution,
where
3.2. The LADE Models
Recall that

Direct estimates of relative variance (plot (a)) and adjusted direct estimates of relative variance (plot (b)) versus reciprocal of estimated total.
Motivated by this observation, we propose the Longitudinal Adjusted Design Effect (LADE) model, which adjusts the relative variance (relvar) by the design effect (deff) and incorporates time effect. To get an overview of the deffs of different variables, Figure 2 displays the design effects of the eighteen variables obtained from the mean of the five hundred simulation runs using the 2008 to 2010 ASEC data. It is observed that the design effects for the variables remain relatively stable over the three-year period. However, it is worth noting that the magnitudes of the design effects vary across the different variables.

Design effects of the eighteen variables for 08 to 10 ASEC data from five hundred simulation runs.
To incorporate the design effects into the analysis, we adjust the relative variance by taking the mean of the design effects for each year across the eighteen variables. Let
where
Recall that
Let
The weighted least square estimators (WLS) of
and
where
3.3. Properties of Proposed Estimators
In this section, we examine a specific class of estimators defined as
Given the assumptions in Equation (7) and the structure of estimators
Theorem 2 enables the construction of confidence intervals using the normal distribution.
4. Simulation Studies
This section presents a simulation study that compares the performance of two types of estimators: LGVFs and LADEs. The study utilizes data from the CPS Annual Social and Economic Supplement (ASEC), specifically focusing on the data restricted to New Mexico for the years 2008 to 2011.
The analysis considers eighteen binary variables derived from the “Source of Income” section of the ASEC data. These variables capture characteristics such as self-employment, unemployment compensation, and more. In this context, a binary variable takes a value of 1 if a person possesses a specific characteristic and 0 if they do not possess that characteristic. The available ASEC data includes 2,059 observations for 2008, 2,188 observations for 2009, 2,108 observations for 2010, and 1,975 observations for 2011. Each household in the data is associated with an ultimate sampling unit (USU) defined by the CPS. However, the specific USU information is not publicly released.
To replicate the sampling design, the households within each year were sorted in ascending order based on their sequence numbers. Then, four consecutive households were combined to form a PSU while maintaining the original order. As a result, there were 205, 208, 193, and 184 PSUs created for the years 2008, 2009, 2010, and 2011, respectively. On average, each of these PSUs accommodates about ten individuals across the four households. Consequently, for example, the total number of individuals in 2008 can be calculated as the number of PSUs (205) multiplied by the average number of individuals residing in the four households, resulting in approximately 2,059 observations. The simulation study is conducted using data from March 2008 to March 2010 for bias square and empirical mean squared error (EMSE) comparisons, and data from March 2011 for mean squared prediction error (MSPE) comparisons. The simulation steps are as follows:
(a) Within each year (2008, 2009, and 2010), a sample of
(b) Estimates
(c) The time adjustment, denoted as
Here,
(d) Deff adjustment
(e) Two regression models are fitted. Model (16) is used to fit LGVFs, and Model (17) is used to fit LADEs.
and
Fitting methods include ordinary linear regression (OLS) LGVF1 and LADE1 and WOLS with weights
(f) The direct estimates of relvar calculated by Equations (6) and (8) is recorded, as well as the fitted relvar values by LGVF1, LGVF2, LADE1, and LADE2 multiplied by the deff adjustment
(g) The simulation process is repeated
and
where
The simulation results are summarized in Table 1 which compares the bias square, EMSE, and MSPE values for the estimators LGVF1, LGVF2, LADE1, and LADE2. The results indicate that LADEs consistently outperform LGVFs across all metrics, showcasing lower bias square, EMSE, and MSPE values. Notably, there is no discernible advantage in using the WLS estimators LGVF2 and LADE2 over the OLS estimators LGVF1 and LADE1. This absence of advantage can be attributed to the overall equal variance assumption of the regression model not being violated.
Simulation Results of a Comparison Between LGVFs and LADEs with
Figure 3 provides a graphical representation of the estimated relvars plotted against the logs of population totals. The solid line represents the direct estimates of relvar values calculated by Equations (6) and (8). The dashed line and dotted line represent the estimates from LGVFs 1 to 2, while the dot-dashed line and the thickest line represent the estimates from LADEs 1 to 2. It is observed that the WLS estimators outperform the OLS estimators in estimating the relvars, especially when the population totals of the variables are large. Figure 3 suggests that heteroscedasticity is more pronounced in the range of large values. WLS is then able to better handle the heteroscedasticity, leading to more efficient and less biased estimates of the relative variance compared to OLS.

Logs of estimates of relvar plotted versus logs of population totals.
Although LADE2 exhibits a slight improvement over LGVF2, the distinction between LGVFs and LADEs is not evident as that in Table 1. This discrepancy can be attributed to the differing perspectives and scaling from which Table 1 and Figure 3 approach the data. Table 1 offers a comprehensive evaluation of bias in relative variance, whereas Figure 3 explores the intricate connection between the logarithm of relative variance and log totals.
5. Conclusions
In this study, we extended the Longitudinal Generalized Variance Functions (LGVFs) to a novel model known as Longitudinal Adjusted Design Effect models (LADEs), incorporating both design effects and time effects into the modeling process. We demonstrated that, under certain conditions, the ratios of relative variances and predicted relative variances obtained from LADEs converge to 1. A comparative analysis between LGVFs and LADEs revealed that the latter outperforms in reducing bias, empirical mean squared error, and mean squared prediction errors. While our simulation study supports the superiority of LADE over LGVF, we acknowledge its limitations and encourage users to carefully select the model based on their specific context.
Our research hasn’t specifically addressed survey nonresponse, but techniques like multiple imputation and weighted adjustments for regression methods can be seamlessly integrated into our framework. Notably, our analysis is currently confined to estimators derived from the LADE model without auxiliary variables in the regression model.
For future investigations, exploring mixed models and nonparametric smoothing methods for fitting regression models could be valuable. Additionally, a detailed examination of estimators in cases where variances within one PSU differ from another PSU could offer insightful perspectives.
