Abstract
Keywords
Introduction
Estimates of the prevalence of various health-related behaviours and statuses in a population, typically derived from health surveys, are important for policy and service provision. Whilst designed to be representative of the target population, national health surveys have experienced declining participation levels over recent decades, resulting in smaller and potentially less reliable samples [1]. It is becoming increasingly possible to obtain additional information on health survey participants through record linkage, provided consent is given, in a number of countries where the population register can be used for statistical purposes [2]. Although non-participation does not necessarily result in bias [3], comparisons often find participants alone are not representative of the target population [4,5]. Comparisons of participants and non-participants, in settings where the latter are able to be identified, have revealed differences in baseline socio-demographic characteristics [6] and health outcomes [7–9].
We have previously developed an advanced methodology which aims to mitigate the effect of non-participation bias in health surveys through record linkage to administrative data sources [10,11]. This methodology has been applied in Scotland [12]. It aims to infer observations on non-participants (see terminology in Box 1) using data on the invited survey participants and the population to simulate partial observations on non-participants, which can then be completed through multiple imputation, assuming both Missing At Random and Missing Not At Random scenarios, with the latter offering benefits over conventional weighting approaches. The reliability of this approach depends on the success of the simulation of the inferred non-participants observations. However, the validity of the assumption that the inferred non-participants are representative of the true non-participants is uncertain. Here, we aim to validate this assumption using survey-derived alcohol consumption as the exemplar. The measurement and monitoring of alcohol consumption within a population is of increasing importance, given that it is implicated in a high burden of morbidity and mortality worldwide [13]. The validation exercise requires a health survey setting whereby some information on the true non-participants is known in order to compare to the synthetic observations on non-participants inferred by our methodology. Finland is an ideal setting for such an exercise, as it maintains a nationally representative population register which forms the sampling frame for health surveys and has the ability to record link socio-demographic information, morbidity and mortality databases and survey responses at the individual level using personal identification codes [14]. In particular, socio-demographic characteristics and the occurrence of alcohol-related hospitalisations and all deaths among both the participants and true non-participants are known, and the invited survey sample as a whole can act as a gold-standard comparator. Such population registers are not widely available beyond the Nordic countries, Belgium and the Netherlands [2], and so our validated methodology will provide the opportunity for valuable corrections to survey-derived estimates in many settings.
Terminology used in this paper.
Methods
Data
To apply the methodology, socio-economic and health data relevant to the survey outcome of interest on the participants and contemporaneous population are required (individually linked for the participant data). For the validation process, we additionally require data on the true non-participants of the invited survey sample, that is, those who were invited to participate but refused. Figure 1 describes the data used and subsequently generated through this validation process.

Visual representation of the data used and subsequently generated through this validation process.
Invited survey sample data
The Finnish Health 2000 survey (thl.fi/health2000) used two-stage cluster sampling to identify 8028 persons aged at least 30 years in 2000 from the individual-level population register in order to create a representative sample of the Finnish population [15]. For this validation exercise, the invited survey sample was restricted to those aged 30–79 years at baseline (
The measures of interest for this analysis were collected via questionnaire and included drinking status (current/non-drinker) and average weekly alcohol consumption (grams per week, 1 unit=12 g (Finland)) [17]. We restricted the definition of participants to those who had returned this questionnaire (
Population sample data
Analyses of population data in Finland are restricted to an 11% sample of the population aged ⩾15 years permanently residing in Finland at the end of any years in 1987 to 2007. Aggregate counts of those aged 30–79 years and alive on 20 October 2000 (median baseline date for Health 2000;
Educational attainment
Highest level of education attained was available from Statistics Finland’s Register of Completed Education and Degrees [18] and individually linked to the survey and population samples, categorised as primary, secondary and tertiary levels (see Appendix).
Linked health outcomes
Records of alcohol-related inpatient hospitalisations and deaths from any cause were individually linked to the invited survey sample and the population sample (1996–2012, obtained from the Finnish Institute for Health and Welfare (THL, hospitalisations [19]) and Statistics Finland (deaths))). Alcohol-related hospitalisations and deaths (harm) were identified using the main diagnosis and additional symptoms (coded using ICD-10; see Appendix). Informed consent for the record linkage was obtained from the Health 2000 participants. Consent was not required for the non-participants and the population sample, as their data were sourced only from register data and used for statistical and scientific purposes [14]. Therefore, hospitalisation and mortality records for the participants, true non-participants and the population sample were available for analysis. Occurrences of alcohol-related admissions prior to the survey participation date (or date of invitation for the non-participants) or 20 October 2000 for the population were excluded.
Statistical methodology
Creating synthetic observations on non-participants: obtaining the inferred survey sample estimates
The methodology used to correct for non-participation bias is described in greater detail elsewhere [11] and in the Appendix. First, we assumed that the expected distributions of age, sex, educational attainment and occurrences of harm and deaths in the invited survey sample were equal to that of the population sample. This is based on the assumption that both samples were representative of the population [11]. Therefore, any deviations in the simultaneous socio-demographic–harm comparisons between the weighted participants of the survey and the population sample elucidated the characteristics of those missing (the ‘non-participants’). Seventy sets of non-participant synthetic observations were inferred based on the deviations in the socio-demographic–harm comparisons, allowing for sampling variation, and combined with the participant observations (total inferred weighted sample sizes ranged between 8006.3 and 8042.3). The resultant inferred survey sample data sets, combining the inferred non-participants and the observed participants, were individual level, containing variables on participation status, age, sex, educational attainment and indicators of whether they had experienced alcohol-related harm, death from any cause, both or neither during follow-up. From the survey, self-reported drinking status and alcohol consumption were available for the participants, and regression-based multiple imputation was used to impute these for the inferred non-participants, as per our methodology [11]. The imputation model contained age and indicators of alcohol-related harm and all-cause mortality, assumed a Missing At Random (MAR) [20] scenario and was stratified by groups of sex and educational attainment, as consumption can vary by socio-demographic status [12]. The results of the imputed data sets were combined using Rubin’s Rules [21] to obtain the inferred survey sample estimates.
Imputing alcohol consumption of the true non-participants: obtaining the invited survey sample estimates
The invited survey sample consisted of participants and true non-participants of the Health 2000 survey aged 30–79 (
Validation assessment
In order to assess our methodology’s validity, we followed the approach set out previously [22]. We (a) explored how well the creation and inclusion of the inferred non-participant observations aligned the inferred sample with the population sample; and (b) compared the estimated weekly alcohol consumption of the inferred survey sample to that of the invited survey sample. Each comparison was explored in terms of sex and educational attainment breakdowns. The comparison made in (a) was necessary to check that a sufficient number of inferred samples had been generated and the random rounding used to induce sampling variation was successful, whilst the comparison detailed in (b) enabled an assessment of how well the application of the developed methodology yielded results in line with the invited survey sample if all invited participants had agreed to participate. Differences in each comparison were assessed using the percentage relative difference in mean weekly alcohol consumption estimates, calculated as the difference in alcohol consumption estimates divided by the invited survey sample’s estimate. Relative differences were calculated overall, by sex and by sex and educational attainment. An acceptability limit of ±5% was used to assess the overall validity of the method, with 95% confidence intervals (CI) generated through bootstrap sampling, allowing determination of the statistical significance of any differences [22].
Sensitivity analysis
The MAR assumption was relaxed to allow for the possibility that the non-participants are Missing Not At Random (MNAR)—that is, that their non-participation was dependent on their alcohol consumption habits, based on the assumption that non-participation would be associated with higher consumption. Therefore, we allowed for higher consumption in the subgroup of non-participants who experienced harm relative to participants who had also experienced harm. A pattern-mixture approach was used [11] with a hypothesised mean upper limit of 1200 g of alcohol per week for non-participants. This limit was derived through the extrapolation of evidence used in an earlier application of this methodology [23]. This resulted in a series of simulations which explored the impact of the alcohol consumption of non-participants who had experienced harm being re-scaled up to 17 times the sex-specific mean weekly alcohol consumption of participants who had experienced harm. Further details are available in the Appendix. This approach resulted in an adjusted mean consumption of 1189 g and 1152 g among those who had experienced harm in the inferred and invited survey samples, respectively, compared to the MAR estimates of 252 g and 258 g. All analyses were performed using Stata v14.0 (StataCorp, College Station, TX) and the user-contributed ice package [24].
Results
Comparison of overall proportions, alcohol-related harm and mortality in each sample
Comparisons of the sex-by-educational attainment compositions of the population, invited survey and inferred survey samples are given in Table I. Whilst the proportions between the weighted participants and the population sample are not vastly different overall, there are indications of the weighting alone not sufficiently adjusting within levels of educational attainment: there is a lower proportion of male participants with primary education (17.7% vs. 18.6% in the population) and a corresponding increased representation of men with tertiary education (13.3%) compared to the population (12.5%). Women with tertiary education are similarly overrepresented in the participants (15.5% vs. 14.7%).
Breakdown of overall proportions, alcohol-related harm and mortality (95% CI) by sex and educational attainment for the sample of the Finnish general population, participants, inferred non-participants, the inferred total, true non-participants and the invited survey sample of the Health 2000 survey.
All aged 30–79 years.
Incorporates sampling weights calculated in the Health 2000 survey.
Participants and true non-participants combined; no survey weights are incorporated.
Participants and inferred non-participants combined; participants are weighted, and all inferred non-participants have the null weight value of 1.0.
CI: confidence interval.
Comparison of the composition of the inferred survey sample and the population sample reveal that the appropriate balance has been achieved through the creation of the synthetic observations on non-participants. All health outcome comparisons of the inferred survey sample and the population sample reveal a ⩽0.3% absolute difference.
Relative to the invited survey sample, there was a higher absolute proportion of men in the inferred survey sample (+1.0%), with the greatest difference found in those with primary education (+0.6%); there was a lower proportion of women in the inferred survey sample (–1.0%), with the greatest absolute difference concentrated in those educated to a tertiary level (–0.9%). In terms of health outcomes, a greater absolute proportion of men and women with secondary levels of education experienced alcohol-related harm in the invited survey sample compared to the inferred (men: +0.6%; women: +0.5%), whilst lower absolute proportions of the invited survey sample died during follow-up (men: –0.5%; women: –0.6%).
Comparison of weekly alcohol consumption estimates in the inferred and invited survey samples
Overall, weekly alcohol consumption is estimated in both samples to be approximately 80 g per week, with a relative difference of just –0.3% (95% CI –1.0% to 0.5%) between the inferred and invited survey samples (Table II), indicating that the creation of the non-participants and their subsequent imputation were successful overall. By sex, the inferred survey sample underestimated the male weekly consumption by 1.6% (95% CI –2.2% to –0.04%), whilst the relative difference for female estimates was –1.1% (95% CI –2.4% to –0.8%). All relative differences remained within the 5% acceptability limit.
MAR imputed estimates of alcohol consumption (g/week) in the inferred and true Health 2000 survey samples for those aged 30–79 years by sex and educational attainment.
Participants are weighted; inferred non-participants have a null weight of 1.0.
No survey weights are incorporated.
MAR: Missing At Random.
Sensitivity analyses
The results for three MNAR sensitivity analyses are reported in Table III (see Appendix for all MNAR adjustments). Overall weekly alcohol consumption increased from 80 g per week under MAR to approximately 115 g per week under the most extreme MNAR scenario. The relative differences remained within the 5% acceptability limits until it was assumed that non-participants who experienced harm consume up to and including four times the sex-specific mean consumption of the participants.
MNAR imputed estimates of alcohol consumption (g/week) in the inferred and true Health 2000 survey samples for those aged 30–79 years by sex and educational attainment.
Participants are weighted; inferred non-participants have a null weight of 1.0.
No survey weights are incorporated.
MNAR: Missing Not At Random.
Discussion
This analysis aimed to validate the assumption of equivalence in simulation of non-participants using a developed methodology which harnesses record linkage and reference population data to create partial observations on non-participants of a health survey and imputes to correct for non-participation bias. We explored differences between the invited and inferred survey samples under both MAR and MNAR assumptions.
The evidence yielded mixed results. The relative differences of the estimates of alcohol consumption for the inferred and invited survey samples were all within our 5% acceptability limits, assuming MAR. However, statistically significant relative differences were estimated for men and women overall and men with secondary levels of education, suggestive of an underestimation from the methodology. Under MNAR, the possibility of the non-participants being systematically heavier drinkers was explored by increasing the amount of alcohol imputed for the non-participants who had experienced harm in both the inferred and invited survey samples. Mean weekly alcohol consumption increased from 80 g overall per week to 115 g in the most extreme scenario. The relative differences between the inferred and invited survey samples increased as the scenarios extremity increased (largest RD=16.4%, women with primary levels of education). Caution is therefore advised in future applications when MNAR assumptions reach their extremities. Results from comparisons of both the inferred sample and the invited sample differed from the results of the weighted participants alone mainly for the MNAR (data not shown).
Broadly speaking, the creation of the observations on non-participants was successful, with the proportion breakdown in the inferred survey sample generally closely reflecting that of the population sample. Men with secondary levels of education were found to consume the largest amounts of alcohol across the sexes and attainment levels in the inferred and invited survey samples and in the participants alone. Table I revealed that higher proportions of the male invited survey sample experienced alcohol-related harm at all levels of educational attainment than would be expected based on the population sample. The greatest difference in rates of incident and any alcohol-related harm between Health 2000 participants and non-participants have previously been found to be within men with secondary levels of education [25].
The strengths of this validation exercise lie within the linked health survey and population sampled data available. Each had 12 years of complete individually linked follow-up data available for participants, non-participants and the population sample. There are several limitations to consider. As alcohol consumption is a survey-derived variable, we do not have observed alcohol consumption for the total sample available for use as a true gold standard. Differences in the proportion breakdown between the invited survey sample and the population sample, especially within those who experienced alcohol-related harm, may indicate violation of the assumption of the survey sample being representative of the population. Third, the relatively high participation rate of the Health 2000 survey (85.5% of those aged 30–79 years) is rare in recent health surveys, with many experiencing response rates <50% [26]. This methodology has been applied to surveys with lower response rates [12], where the inferred sample reflected the population breakdown well but may still require further validation. Finally, this methodology was developed with application to data from Scotland using an area-based measure of deprivation as the measure of socio-economic status. In Finland, no such measure officially exists that is contemporaneous with the survey baseline (the year 2000). Therefore, an individual measure – educational attainment – was used. The elevated risk of alcohol-related harm in lower socio-economic groups has been found to be stronger when measured at the individual level rather than area level [27]. Therefore, this methodology may require further validation for settings with area-level measures.
Alternative approaches to correcting for non-participation include applying calibration weights for MAR or Bayesian modelling [28] and selection modelling [29] for MNAR scenarios. Our methodology offers an advantage over the developed Bayesian approach in that it infers the necessary information on non-participants and can therefore be applied in a wider range of settings, such as those where individual-level register data are not available. The calibration weights and selection modelling approaches will be explored in future work, and they may offer advantages over this methodology, as the technical requirements are more modest. Simulation to compare estimates’ relative finite sample properties will be further explored. Whilst this methodology had previously been used to improve estimates of population-level alcohol consumption [11,12], and this validation exercise continues to make use of alcohol consumption, it could equally be applied to other health-related behaviours of interest, where suitable health records are available and there is a clear link between the health behaviour and harm resulting in hospitalisation or death, such as tobacco smoking.
In conclusion, the validation process presented here indicates that the methodology may be a valid approach to correcting for non-participation bias in health surveys, though consideration of alternatives such as selection modelling is warranted. Especially where lower levels of participation are experienced, the absence of such methodological correction is likely to yield biased results.
Supplemental Material
sj-docx-1-sjp-10.1177_14034948211031383 – Supplemental material for Adjustment for survey non-participation using record linkage and multiple imputation: A validity assessment exercise using the Health 2000 survey
Supplemental material, sj-docx-1-sjp-10.1177_14034948211031383 for Adjustment for survey non-participation using record linkage and multiple imputation: A validity assessment exercise using the Health 2000 survey by Megan A. Mcminn, Pekka Martikainen, Tommi Härkänen, Hanna Tolonen, Joonas Pitkänen, Alastair H. Leyland and Linsay Gray in Scandinavian Journal of Public Health
Footnotes
Declaration of conflicting interests
Funding
Supplemental material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
