Abstract
INTRODUCTION
The vast majority of age-related dementias involve insidious cognitive decline that can be difficult to distinguish from normal aging [1]; thus, identification of older adults at risk for dementia presents a challenge to prevention efforts. It is increasingly recognized that these insidious declines may occur even in adults with normal range cognition [2–5]. Single exam diagnosis of mild cognitive impairment (MCI) cannot capture these subtle cognitive changes, and is an inherently unstable diagnosis that is of uncertain prognostic significance [1].
Consistent with recent changes to diagnostic guidelines for research, efforts are currently focused on treating individuals before symptoms emerge, underscoring the need for tools to assess subtle cognitive changes in asymptomatic individuals [6]. Detecting cognitive change within normal range performance requires serial cognitive testing, but there are several factors complicating interpretation of serial cognitive exams. Many tests show substantial practice effects that lead to improved scores on follow-up examinations [7], and regression to the mean causes baseline scores more distal to the mean to trend toward the mean at follow-up [8]. Although normative-referenced scores are commonly employed to aid interpretation of raw neuropsychological scores at baseline, interpretation of serial exam scores is rarely anchored to normative data on expected serial exam performance in healthy individuals. Use of linear regression methods to determine expected change on serial exams in a robustly normal population may aid in efforts to characterize neuropsychological decline (NP decline) relative to expectations [8, 9]. Practice effects and regression to the mean would be reflected in these norm-referenced NP decline scores, making a lack of a practice effect a potential indicator of subtle decline, as suggested by prior research [10, 11].
Numerous studies support the value of decline on neuropsychological testing in the prediction of dementia [3, 12], and prior work has related preclinical cognitive decline to biomarker abnormalities [13–15]. However, the vast majority of studies have focused on how biomarker-based pathological assessments influence preclinical decline [16, 17], or have used methods that cannot operationalize NP decline in asymptomatic individuals who are performing within normal range [3, 12]. To our knowledge no studies have used a norm-referencing approach to examine whether longitudinal NP decline contributes to dementia prognosis beyond single exam cognitive diagnosis and Alzheimer’s disease (AD) biomarkers. We sought to determine whether longitudinal NP decline adds unique prognostic information and to provide a simple tool to identify older adults with and without NP decline on a standard 12-month follow-up examination. We used linear regression in a robustly normal sample to generate norm-referenced NP decline scores representing worse than expected performance at 12-month follow-up [8, 9]. These norm-referenced scores aid in the interpretation of neuropsychological performance over a one-year interval. In order to determine the value of our specific method, we also compared our results to the simpler approach of calculating norm-referenced change scores to quantify decline over 12 months. Finally, we also investigated differences in brain volumes between adults with and without NP decline in order to better understand the neuroanatomical basis for NP decline.
MATERIALS AND METHODS
Participants
Data were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). Participants were recruited from more than 50 sites across the United States and Canada. Inclusion criteria include: age 55–91 years, permitted medications stable for 4 weeks, study partner accompanying participant to visits, Geriatric Depression Scale <6, Hachinski Ischemic Score ≤4, adequate visual/auditory acuity, good general health, 6th grade education or work history equivalent, and fluent English or Spanish. Exclusion criteria were any significant neurological disease or head trauma history. This study was conducted according to Good Clinical Practice guidelines, the Declaration of Helsinki, US 21CFR Part 50- Protection of Human Subjects, and Part 56- Institutional Review Boards, and pursuant to state and federal HIPAA regulations. All participants and/or authorized representatives provided written informed consent.
We included 1,074 older adults enrolled in ADNI-1, ADNI-Grand Opportunity, and ADNI-2 who were dementia-free at baseline, remained dementia-free at 12-month follow-up, and underwent variable follow-up with serial neuropsychological and clinical exams (18–120 months from baseline). A subset (
AD biomarkers and genetic data
Roche Elecsys immunoassays determined CSF amyloid-
Apolipoprotein E (APOE)
Neuropsychologically-defined MCI diagnosis
Cluster analysis of ADNI neuropsychological measures was used as previously described to obtain empirically-derived groups of baseline cognitively normal and MCI participants for our analysis [20, 21]. Briefly, participants who were ADNI-diagnosed cognitively normal at baseline and 12-month follow-up, and remained normal for all available follow-up (12–120 months), were used as a robust norm-reference group (
Participants from the robust normal group were combined with the cluster-derived normal group to form the cognitively normal (CN) group (
Quantification of neuropsychological (NP) decline
Performance at baseline and 12-month follow-up was used to identify NP decline averaged across tests of memory, attention/executive function, and language function. Test scores by domain were as follows: memory, AVLT trials 1-5 total score, AVLT trials 6 and 7 average; attention/executive function, Trails A and Trails B; language function, Animal Fluency and Vegetable Fluency.
Linear regression models were used to calculate NP decline based on the set of robust norm-reference controls described above. Prediction equations were developed using linear regression models with baseline neuropsychological performance as a predictor of future performance at 12-month follow-up. Demographic factors were not used in regression models, but instead were included as covariates in Cox regression models predicting progression to dementia. All normative regression models were visually inspected for linearity. Where outliers were observed, model fit was examined with and without outliers. All data were retained in statistical models, except for one subject in the robustly normal sample who exhibited an influential outlier on Trails A.
We used the linear regression model (Equation 1) assuming that baseline performance ( Predicted AVLT Trials 1–5 total = 13.214 + (0.720×Baseline AVLT Trials 1–5 total) Predicted AVLT Trials 6-7 average = 3.270 + (0.645×Baseline AVLT Trials 6-7 average) Predicted Trails A (log) = 0.589 + (0.598×Baseline Trails A (log)) Predicted Trails B (log) = 0.656 + (0.643×Baseline Trails B (log)) Predicted Animal Fluency = 8.410 + (0.623×Baseline Animal Fluency) Predicted Vegetable Fluency = 4.464 + (0.687×Baseline Vegetable Fluency)
For our measure of NP decline, we calculated the discrepancy between actual performance at 12-month follow up (
Diagnosis of AD dementia
Our primary study outcome was progression to AD dementia. In order to avoid circularity, dementia diagnostic criteria did not include any of our predictive NP decline markers described above. We used the ADNI study criteria based on modes of assessment that are independent of our NP decline measure. Specifically, participants diagnosed with dementia had to 1) have memory complaints, 2) score between 20–26 (inclusive) on the Mini-Mental State Examination (MMSE), 2) have a Clinical Dementia Rating (CDR) 0.5 or 1.0, 3) score <8 for 16 or more years of education, <4 for 8–15 years of education, or <2 for 0–7 years of education on the Logical Memory II subscale (Delayed Paragraph Recall) from the Wechsler Memory Scale-Revised (maximum score of 25), and 5) meet the National Institute of Neurological and Communicative Disorders and Stroke-Alzheimer’s Disease and Related Disorders Association (NINCDS/ADRDA) criteria for probable AD [22].
Statistical analyses
Hierarchical Cox regression analyses evaluated the predictive utility of NP decline in identifying those at risk for more rapid progression to dementia over all follow-up, and all findings were confirmed over fixed 48-month follow-up. We also examined progression to MCI or dementia over all follow-up among cognitively normal older adults. The first model included all participants and corrected for age, sex, education, APOE
NP decline was examined as a continuous measure in all Cox regression analyses. To visualize improved prediction of dementia with addition of NP decline,
In order to determine whether our NP decline indicator was superior to simple change score calculations, we compared our results to those derived from a simple normalized change score approach. We also conducted additional analyses to ensure study findings were not biased by the inclusion of the robustly normal sample in all models. First, we divided our robustly normal sample into two roughly equal samples using a binary random number generator. Second, we re-calculated NP decline using half of the robustly normal sample. Finally, we replicated our primary analysis of NP decline as an independent predictor of progression to dementia after excluding the half of the robustly normal sample we used to generate the NP decline scores.
All statistical analyses were conducted in SPSS version 24. Significance tests were two-tailed and used a cutoff of
Neuroimaging acquisition, processing, and analyses
A subset of 1,061 participants from the ADNI study underwent 3D T1-weighted brain MRI at baseline. Scans were downloaded from the online ADNI database (http://adni.loni.usc.edu/). Images were acquired across 64 sites on Siemens, GE, and Phillips scanners at 1.5T or 3T magnet strength. T1-weighted sequences were either magnetization prepared rapid acquisition gradient echo (MP-RAGE) or inversion recovery spoiled gradient recalled (IR-SPGR). Specific scanning parameters for each scanner can be viewed online (http://adni.loni.usc.edu/methods/documents/mri-protocols/tocols/). Prior to analyses, images were individually checked for quality and aligned along the anterior and posterior commissures.
For group comparisons of grey matter volumes, voxel-based morphometry (VBM) analyses were conducted in Matlab using SPM12 and the DARTEL toolbox, as previously described [23, 24]. Briefly, T1-weighted images were first segmented into grey and white matter tissue classes using SPM12’s unified segmentation procedure, followed by the creation of a study-specific DARTEL template. Images were then iteratively aligned to the template, spatially normalized, and smoothed with an 8 mm full-width at half-maximum isotropic Gaussian kernel. Voxel-wise
RESULTS
Participants
Table 1 displays the ADNI sample demographic and clinical characteristics for all non-demented participants at baseline and 12-month follow-up exam.
Demographics and clinical characteristics
NP decline predicting dementia risk beyond baseline cognitive diagnosis
There were 221 patients who progressed to dementia (see Supplementary Table 1 for all events by month of follow-up and cognitive diagnosis). NP decline significantly improved the model and was linked to approximately 2.8-fold increased risk for future dementia after controlling for age, sex, education, APOE
Cox regression model for neuropsychological (NP) decline predicting future dementia beyond baseline diagnosis
ROC analysis indicated an optimal cutoff for predicting dementia of

Neuropsychological decline predicts dementia beyond cognitive diagnosis. Cox regression models predicting progression to dementia over all follow-up among those who were cognitively normal (CN) without neuropsychological decline (CN NP-, dashed blue line) versus CN with neuropsychological decline (CN NP+, solid blue line), and MCI patients without neuropsychological decline (MCI NP-, dashed red line) versus MCI with neuropsychological decline (MCI NP+, solid red line). See Supplementary Table 2 for subject numbers, MMSE, CDR, and Logical Memory Story A delayed free recall scores by participant subgroup.
Additional analyses focused on the ADNI cognitively normal subgroup demonstrated that NP decline predicted progression to either MCI (by ADNI criteria) or dementia over all follow-up in those who were cognitively normal by ADNI criteria at both baseline and 12-month follow up (Supplementary Table 5).
Comparison of study results using regression-based norms (Table 2) to those of normalized simple difference scores revealed an apparent superiority of regression-based norming (OR 1.959,
NP decline predicting dementia risk beyond biomarker profiles
There were 134 patients who progressed to dementia (see Supplementary Table 7 for all events and censored cases for each month of follow-up). NP decline significantly improved the model and was linked to greater than 2.2-fold increased risk for future dementia after controlling for age, sex, education, APOE
Cox regression model for neuropsychological (NP) decline predicting future dementia beyond AD biomarkers

Neuropsychological decline predicts dementia beyond AD biomarkers. Cox regression models predicting progression to dementia over all follow-up by cognitive diagnosis without neuropsychological decline (CN dashed blue; MCI dashed red) versus those cognitive diagnosis with neuropsychological decline (CN solid blue; MCI solid red) for each biomarker profile, including normal AD biomarkers, A
Baseline brain volumetric analysis
Figure 3 displays FWE corrected results of VBM analysis comparing brain volumes at baseline CN participants without NP decline to those with NP decline (Fig. 3A), as well as MCI patients without NP decline versus with NP decline (Fig. 3B) (see Supplementary Tables 9 and 10 for coordinates and other cluster details). Findings indicated smaller brain volume specifically within the hippocampus of CN participants with NP decline relative to those without NP decline, and smaller volumes within the hippocampus and medial temporal lobes of MCI participants with NP decline relative to those without NP decline.

Neuropsychological decline linked to baseline hippocampal and medial temporal lobe volume. Results of voxel-based morphometry analysis after FWE correction are displayed as
DISCUSSION
Asymptomatic older adults performing below expectations at follow-up exhibit more rapid progression to dementia, regardless of demographic factors, APOE
Prior studies have found that cognitively normal older adults who are biomarker positive for both CSF A
The current conceptualization of Stage 3 neurocognitive decline includes both abnormal baseline performance and evidence of longitudinal decline [6], consistent with a tradition that makes no distinction between progressive and non-progressive forms of MCI [33]. Although an individual’s absolute level of cognitive ability at baseline is likely to be a more powerful predictor of imminent dementia risk, as suggested by the greater odds ratios for baseline diagnosis versus decline status, this baseline ability presents an incomplete picture of an individual’s overall risk in that it contains no information on cognitive trajectory. Premorbid cognitive ability, cultural/ethnic background, and linguistic variability limit the ability of baseline exams to optimally predict risk for dementia [34]. These factors may be less salient when evaluating within-subject variance from baseline to follow-up, suggesting NP decline may have value in evaluating more diverse populations. However, this remains an open question for future studies to evaluate using more ethnically diverse samples.
There are no operationalized criteria for cognitive decline as described in the research recommendations for diagnosis and staging of neurocognitive decline in AD [6]. Subjective reports of cognitive decline are prone to errors [35] and reliable informants are not always available or able to notice subtle cognitive changes. For these and other reasons, serial neuropsychological data may be the favored method for establishing cognitive trajectories. However, the known influence of practice effects and regression to the mean makes clinical judgments based on raw score changes extremely difficult [7]. The present study demonstrates that the use of regression-based norming practices can aid in the interpretation of serial neuropsychological test data. We also provide normative regression equations and optimal cutoff values for ascertaining the presence or absence of NP decline, which may be of clinical and investigational value in patients and participants similar to those studied as part of ADNI. Although more research is needed across different populations and test batteries, the present approach provides a relatively simple framework that may be readily applied in clinical and research settings. This practice is analogous to the widely adopted method of norm-referenced demographic corrections, but adds a norm-referencing method for neuropsychological change over 12 months. Future studies may help determine whether regression, mixed modeling, simple change scores, or other approaches are the most appropriate for obtaining norm-referenced decline metrics, but the present study findings suggested regression-based methods may be superior to simple change scores.
Despite the advantages of using neuropsychological markers as risk indicators for dementia, such an approach has also been understandably criticized for circularity since neuropsychological markers are often also used to arrive at a diagnosis of dementia. For this reason, the present study used independent modes of assessment to ascertain dementia diagnosis as the criterion measure. Thus, it is unlikely that shared methods variance alone can account for the study findings. The face validity of neuropsychological markers as indicators of cognitive impairment should not preclude their use as prognostic instruments, particularly since these markers predict progression over lengthy follow-up intervals in patients who are initially asymptomatic. Ultimately NP decline demonstrates considerable utility in predicting who will progress to dementia, indicating critical value for clinicians and scientists working with this population.
Prior work has demonstrated correlations between hippocampal volume and both AD biomarkers [36, 37] and cognitive decline [37] during the early stages of the disease. Consistent with these findings, older adults with NP decline exhibited smaller regional baseline brain volumes than those without NP decline in a pattern that was remarkably specific to the hippocampus and surrounding medial temporal regions. Although medial temporal regions are traditionally associated with memory functions and most attempts to identify preclinical decline have focused predominantly on memory measures [38], we chose to use tests that also assessed executive function and semantic fluency. Recent studies have suggested more widespread impact of preclinical AD across neuropsychological domains [2]. Consistent with these findings, research recommendations acknowledge that AD-related cognitive impairment may involve widespread versus specific cognitive domain impairment [6, 25]. This may be particularly true for NP decline versus single exam findings since subthreshold declines may occur across domains. Increased reliability is another advantage of global composites that average scores across domains, which may improve the likelihood of detected subtle cognitive changes. Future studies should use longitudinal and multimodal imaging to identify additional structural and functional characteristics of early NP decline, improving our understanding of the neuropathology underlying progressive forms of cognitive impairment.
Single exam diagnosis or biomarker-only evaluations are not likely to yield accurate prediction of when an individual will progress to dementia. Inclusion of asymptomatic individuals with no NP decline in clinical trials may require much larger samples and greatly extended follow-up due to high variability in progression rates. This leads to increased costs and delayed progress toward predicting and preventing dementia, as well as exposure of low risk individuals to potentially detrimental medication side effects. If additional studies can establish widely accepted standards for establishing cognitive trajectories, trial design may be improved by adding cognitive trajectory to single exam diagnoses to further classify participant risk.
Study limitations include the ADNI sample being comprised of relatively highly educated and demographically homogenous individuals recruited from over 50 sites across the US and Canada with variable sampling bias and methodology, and inclusion/exclusion criteria that limited comorbid neuropathological factors. Extension of the present study findings in larger community-based samples may further support the predictive utility of NP decline as a prognostic supplement to biomarkers and baseline cognitive diagnosis. Another potential limitation was the inclusion of robustly normal participants in both the generation of NP decline prediction equations and the testing of those equations as predictors of future dementia. However, we were able to replicate the main study findings in sub-analysis that excluded the robustly normal participants used to generate the equations, alleviating concern that circularity may have influenced our results. An additional concern may be raised regarding the application of normative regression equations to both cognitively normal and MCI cases since these two groups would have different anticipated cognitive trajectories. We chose to use normative models for both groups in an effort to not only predict future dementia in patients who are initially normal, but also to improve the prognostic value of MCI diagnosis, an inherently unstable category. Future studies will also evaluate whether use of 3 time-points further improves evaluation of NP decline, determine the degree to which individual tests contribute to overall decline prediction, and further characterize heterogeneity in patterns of decline across MCI subtypes.
