Abstract
INTRODUCTION
The biological presence of the Alzheimer’s disease (AD) is defined by amyloid-β (Aβ) and tau pathology. The early impact of Aβ and tau in cognition has been reported even in the cognitively healthy population [1, 2]. The effects of Aβ and tau have been mainly associated with a decline in memory function, but recent evidence suggested that beyond memory, other cognitive processes such as executive functioning are affected at early stages of preclinical AD [3]. A comprehensive assessment of cognitive function across cognitive domains, and not just of memory function, might help to delineate better the subtle cognitive decline associated with subjacent Aβ deposition, and aggregated tau at the preclinical stage of AD.
In 2018, the research criteria of the National Institute on Aging-Alzheimer’s Association (NIA-AA) defined subtle cognitive decline as the stage of transitional decline between healthy cognition and mild cognitive impairment (MCI) [4]. The characterization of subtle cognitive decline in preclinical AD aims to advance clinical diagnosis at earlier asymptomatic stages, and to improve sensitivity in clinical trials targeting cognitive function. The NIA-AA provided a classification system based on the status of the three AT(N) biomarker types that describe the biological profile along the AD continuum, Aβ (A), p-tau (T), and non-specific neurodegeneration ([N]) [4]. Thus, the AT(N) system allowed the classification of individuals with normal AD biomarkers, individuals within the AD continuum, or individuals with a non-AD pathologic change, providing a research framework for delineating the earliest relationship between biological changes and the decline of cognitive function at preclinical stages.
Normative data for clinical neuropsychological testing are conventionally obtained from cognitively unimpaired (CU) individuals. Thus, cognitive impairment could be identified by comparing individual performances with normal range while adjusting by sociodemographic factors, such as age, education, and sex. Nevertheless, this canonical procedure might yield some deflation in normative cognitive scores according to the presence of subtle cognitive decline in CU individuals due to incipient abnormal levels of AT(N) biomarkers indicating underlying AD pathology.
Since conventional neuropsychological norms might lack enough sensitivity for detecting subtle cognitive decline in preclinical stages, robust normative procedures typically account for the reliable performance of clinically stable populations (including only individuals who maintain a status of healthy cognitive performance on longitudinal follow-up and excluding those that latter on develop MCI/dementia). Robust normative approaches based on longitudinal assessment demonstrated an increase in the probability of labeling cognitive performance as below average in comparison to conventional norms [5–7], by establishing higher thresholds for what is considered normal performance or decline according to these robust procedures [8, 9].
An alternative approach is to rely on biomarker data for constructing robust references sensitive enough to identify subtle decline in cognitive performance. The presence of Aβ positivity in CU individuals ranging from 50 to 70 years has an estimated prevalence of between 10.4% and 23.1% [10], and the presence of either Aβ, tau pathology, or neurodegeneration in CU individuals older than 65 years has an estimated prevalence of up to 44% [11]. In fact, the effect of aging in the cognitive performance of older adults is attenuated when AD biomarker status is accounted for in the analysis [12]. Accordingly, excluding individuals with signs of altered Aβ, p-tau, neurodegeneration (neurofilament light) biomarker levels, cerebrovascular pathology, or uncontrolled systemic medical illness, results in more rigorous standards for detecting cognitive decline than those of conventional norms [13, 14]. Therefore, accounting for cerebrospinal fluid (CSF) AD biomarkers for the adjustment of normal performance in cognitive testing may enhance the sensitivity of normative neuropsychological data beyond standard published norms for detecting subtle cognitive decline in individuals not meeting yet MCI criteria [15, 16].
Previous research identified that the use of norms excluding individuals with positive Aβ biomarker levels, as criteria for a more robust normative procedure, allowed the identification of individuals at risk of dementia and increased the predictive accuracy of dementia progression using memory measures [16]. Thus, Bos et al. proposed a “normal or impaired” classification model based on three groups combining the classification of conventional norms and Aβ negative norms (i.e., Group 1: normal with both norms; Group 2: impaired only with robust Aβ negative norms, Group 3: impaired with both norms [16]). Such classification model provides a complementary interpretational framework for characterizing subtle cognitive decline. Although in Bos et al. [16] it was only necessary to adjust for AD biomarkers in memory measures, other evidence pointed out that removing preclinical AD participants from normative samples also results in higher means and less variability in visuospatial ability and executive function [17]. For this reason, we considered relevant to explore the usefulness of AD biomarker-based normative data in the assessment of other several cognitive domains apart from memory processes.
We have recently published norms for the verbal memory tests Free and Cued Selective Reminding Test and the WMS-IV Logical Memory Test in a sample of CU individuals with normal AD CSF biomarker levels. The use of such biomarker-based norms in combination to standard references may be useful for the identification of subtle impairments in memory function [15]. These biomarker-based normative data have demonstrated to capture cognitive decline in individuals with evidence of positive Aβ and tau status in a 3-year follow-up within the ALFA+study, while conventional norms failed to do so (in preparation).
In the present work, as a robust normative procedure we only used data of CU individuals with normal (non-pathological) levels of AD biomarkers. We excluded from our analyses individuals with positive status of underlying AD pathology, defined by CSF Aβ (A), p-tau (T), t-tau ([N]) biomarkers, with the aim to provide a more sensitive normative reference for cognitive evaluation. The AD biomarker-based normative procedure is a complementary interpretational framework for the neuropsychological assessment of subtle cognitive decline in preclinical AD.
METHODS
Participants
We included 248 Spanish participants that completed the first visit (2016–2019) of the ongoing longitudinal ALFA+(ALzheimer and FAmilies) study. ALFA+is a research cohort of middle-aged CU individuals, most of them with family history of AD (153 participants [61.7%] had at least one parent diagnosed with AD before the age of 75). ALFA+participants have been thoroughly characterized with clinical interviews, lifestyle and risk factors questionnaires, cognitive testing, CSF biomarkers, and neuroimaging procedures, including magnetic resonance imaging (MRI), and positron emission tomography (PET). All tests and procedures are repeated every 3 years with the aim to identify the earliest pathophysiological changes in the preclinical AD continuum [18].
ALFA+ inclusion criteria were: 1) participants who had previously participated in the 45–65/FPM2012 study (ALFA parent cohort [18]); 2) range of 45–65 years-old at the moment of the inclusion in the 45–65/FPM2012 study; 3) long-term commitment to the study: inclusion and follow-up visits and agreement to undergo all tests and procedures (e.g., MRI and lumbar puncture). ALFA+ exclusion criteria included: 1) cognitive impairment (Clinical Dementia Rating [CDR] > 0, Mini-Mental State Examination [MMSE] < 27, semantic fluency < 12); 2) significant systemic illness or unstable medical condition which could lead to difficulty complying with the protocol; 3) contraindication to any test or procedure; 4) suspected family history of monogenic AD.
AD biomarker status definition
CSF analyses defined Aβ, p-tau, and t-tau status. CSF collection and processing have been described previously [19]. CSF p-tau and t-tau were measured using the electrochemiluminescence Elecsys® Phospho-Tau (181P) CSF and Total-Tau CSF immunoassays, respectively, on a fully automated cobas e601 module (Roche Diagnostics International Ltd.). CSF Aβ42 and Aβ40 were measured with the exploratory Roche NeuroToolKit immunoassays (Roche Diagnostics International Ltd, Rotkreuz, Switzerland) on a cobas e601 module. Measurements were performed at the Clinical Neurochemistry Laboratory, Sahlgrenska University Hospital, Mölndal, Sweden. Aβ status was defined using the cutoff of 0.071 for the ratio Aβ42/40, tau status was defined using the cutoff of 24 pg/ml for p-tau and neurodegeneration status was defined using the cutoff of 300 pg/ml for t-tau as previously described [19].
Cognitive measures
We included standard neuropsychological tests for the clinical assessment of attentional (WAIS-IV: Digit Span; WMS-IV: Symbol Span; TMT-A), executive (TMT-B; WAIS-IV: Coding; WAIS-IV: Matrix reasoning), linguistic (Semantic Fluency Test), and visual processing (WAIS-IV: Visual Puzzles; RBANS: Judgement of line orientation).
WAIS-IV: Digit Span
The Digit Span subtest used is included in the WAIS-IV [20]. This test measures attentional processing, specifically executive attention understood as the relationship between working memory and attentional control. It consists of three parts: forward, backward, and sequencing. The forward part requires the participant to repeat, in the same order, the numbers recited by the examiner. The backward part requires the participant to repeat, in reverse order, the numbers recited by the examiner. The sequencing part requires the participant to repeat, in ascending order, the numbers recited by the examiner. The test consists of 8 items for each part. Each item has two attempts that are scored with 1 or 0. The administration of each part is suspended if the participant obtains a score of 0 in both attempts of the same item. The maximum score is 16 points for each part, with a maximum score of 48 for the total test. The main variables of the test are forward span (0–9), backward span (0–8), and the total (0–48). We present normative data and equations to calculate z-scores for the forward span, backward span, as well as the total score.
WMS-IV: Symbol Span
The Symbol Span subtest used is included in the Spanish version of the Wechsler Memory Scale-IV [21]. This test is used to measure visual working memory. The participant briefly observes a series of abstract symbols. The examiner then removes these from his view and asks the participant to sequentially identify them, following the order in which he has seen them, from a larger series of symbols. The first two items are scored with 0 or 1 point, while the remaining items (from 3 to 26) are scored with 0, 1, or 2. The main variable of interest is the total sum of the scores obtained in the items from 1 to 26 (0–50). We present normative data and equations to calculate z-scores for the total score.
Trail Making Test (TMT)
The Trail Making Test [22] consists of two parts, A and B. Part A is used to assess selective attention and it consists of the connection, by means of drawing a line, of 25 numbers randomly distributed on a sheet. The participant must consecutively join the numbers following their natural order from 1 to 25. Part B is used to assess cognitive flexibility and it consists of the alternating joining of numbers and letters following the natural order of the numbers and the order of the alphabet. The main variables of interest are measured in seconds for Part A (completion time) and Part B (completion time). We present normative data and equations to calculate z-scores for the completion times of Part A and Part B.
WAIS-IV: Coding
The Coding subtest used is included in WAIS-IV [20]. This test assesses processing speed. The participant must reproduce the symbol that corresponds to each number according to the model presented on the same sheet, where there are numbers from 1 to 9 and the symbol that corresponds to each of the numbers. The participant tries to complete as many items as possible for two minutes. The main variable of interest is the total score (0–135). We present normative data and equations to calculate z-scores for the total score.
WAIS-IV: Matrix reasoning
The Matrix reasoning subtest used is included in the WAIS-IV [20]. It captures fluid intelligence, general visual intelligence, spatial aptitude, and classification. The participant must look at an incomplete matrix or series and select, from among five options, the one that best completes the matrix or series. The test has 26 items, and a maximum time of 30 seconds is allowed for each response, which will be scored with 0 or 1. After three consecutive scores of 0, the administration of the test is suspended. The main variable of interest is the total score (0–26). We present normative data and equations to calculate z-scores for the total score.
Semantic Fluency Test
The Semantic Fluency Test [23] assesses semantic verbal fluency. The test requires the participant to say the maximum number of words within the semantic category “animals” in one minute. The main variable of interest is the total score, composed by the number of correct productions. We present normative data and equations to calculate z-scores for the total score.
WAIS-IV: Visual Puzzles
The Visual Puzzles subtest used is included in the WAIS-IV [20]. It captures non-verbal reasoning. Within a time limit (20 or 30 s, depending on the complexity), the participant must select the three pieces that allow the reconstruction of the presented puzzle. Each item is scored with 0 or 1 and the administration is suspended after 3 consecutive scores of 0. The test consists of 26 items. The main variable of interest is the total score (0–26). We present normative data and equations to calculate z-scores for the total score.
Repeatable Battery for the Assessment of Neuropsychological Status (RBANS): Judgment of line orientation.
The Judgment of line orientation subtest is included in the RBANS [24]. It captures visuospatial processing. The participant must estimate spatial relationships between line segments by matching them to a series of sample lines. Ten items are offered with two segments to match for each of them. The main variable of interest is the total score (0–20). We present normative data and equations to calculate z-scores for the total score.
Development of normative data
Normative data were developed following multiple linear regression-based methods. The procedure is as follows: 1) Centering the age of the participants by subtracting the mean group age from each individual’s chronological age. 2) Constructing a set of multiple regression models (one for each cognitive score of interest), with the cognitive score as dependent variable and age-centered, education (with 4 category levels [elementary = 0, secondary = 1, graduate = 2, postgraduate = 3]), and sex (male = 0; female = 1) as predictors. In this study we defined education categories as the highest completed education level. The number of years of formal education may overlap among levels depending on the education system and the number of additional years attended from an uncompleted educational program. A backward stepwise method was used, with a criterion of
4) Calculating the residuals between each possible value of the cognitive score and each possible expected score (using the relevant predictors for each variable) by subtracting them. Then, the residuals were converted to a z-score by dividing them by the standard deviation of the unstandardized residuals of the regression model. Clinicians may use the equations with the coefficients provided in the results to calculate the z-scores associated with the specific raw scores of a patient. 5) To simplify the use of the normative data, we provide tables for percentiles 1, 2, 5, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 95, and 98. In each table, the theoretical raw scores associated with each percentile value are shown. Since age accounted for a relevant effect in all cognitive variables, age groups were collapsed considering the distribution of the percentiles along the age range to reduce the number of tables presented. 6) To facilitate the use of biomarker-based cognitive norms, we provide online calculators for expected scores and actual z-scores and percentiles associated.
Comparison of normative frameworks
To explore the sensitivity of the current biomarker-based cognitive references we compared the number of lower scores in our sample by using both the biomarker-based references provided in this work and the available Spanish data from the NEURONORMA project and the WAIS-IV and WMS-IV norming studies. We assessed the proportion of lower scores in: Digit Span (forward and backward), Symbol Span, TMT (A and B), Coding, Matrix Reasoning, Semantic Fluency, and Visual Puzzles. Sociodemographic-adjusted scaled scores equal or below to 5 (equivalent to percentile 5) were considered as impaired. Individuals were classified as having or not a lower score (impaired or normal) using this definition. The McNemar test for related proportions (adjusted for continuity) was used to analyze the distribution of individuals labeled as impaired between both references. The Fleiss’ Kappa interrater correlation coefficient for categorical variables was used to test the agreement for the labeling of the same individuals between biomarker-based and standard references.
RESULTS
Demographic, presence of the
Participants characteristics (
Education was coded as follows: Elementary equals to finished elementary school (range of formal effective education 8–11 years); Secondary equals to finished secondary studies (range of formal effective education 9–14 years); Graduate equals to a university or superior degree (range of formal effective education 14–18 years); Postgraduate equals to Master or PhD (range of formal effective education 15–20 years). MMSE, Mini-Mental State Examination. The cutoffs of CSF biomarkers to classify individuals in the AT(N) triple-negative group were > 0.071 for the Aβ42/40 ratio, <24 pg/ml for p-tau181 and < 300 pg/ml for total-tau [19]. Data only included individuals classified in the AT(N) triple-negative group.
Descriptive data of the neuropsychological assessment
WAIS-IV, Wechsler Adult Intelligence Scale –IV; WMS-IV, Wechsler Memory Scale –IV; TMT, Trail Making Test; RBANS, Repeatable Battery for the Assessment of Neuropsychological Status.
Results of the multiple linear regression analyses
WAIS-IV, Wechsler Adult Intelligence Scale – IV; WMS-IV, Wechsler Memory Scale – IV; TMT, Trail Making Test; RBANS, Repeatable Battery for the Assessment of Neuropsychological Status. Education was entered as: Elementary = 0; Secondary = 1; Graduate = 2; Postgraduate = 3. Sex was entered as: Male = 0; Female = 1. Age was centered to 60.5. Significance codes: ‘***’ <0.001 ‘**’ <0.01 ‘*’ <0.05 ‘·’ <0.1.
Z-scores calculation formula
Education should be entered as: Elementary = 0; Secondary = 1; Graduate = 2; Postgraduate = 3. Sex should be entered as: Male = 0; Female = 1. Age should be centered to 60.5. Elementary education equals to finished elementary school (range of formal effective education 8–11 years); Secondary equals to finished secondary studies (range of formal effective education 9–14 years); Graduate equals to a university or superior degree (range of formal effective education 14–18 years); Postgraduate equals to Master or PhD (range of formal effective education 15–20 years).
We assessed the proportion of impaired scores when using the biomarker-based in contrast to the standard references. Using biomarker-based references, the percentage of at least one or more lower scores among the 9 measures considered was 44.76%. Using the standard references, the percentage was 21.77%. The McNemar test for related proportions was statistically significant when comparing the distribution of impaired individuals between both references (McNemar X2 (df = 1,
To facilitate the use of the reference data presented here and the previously published norms for the Free and Cued Selective Reminding Test and the WMS-IV Logical Memory Task developed with the same sample and approach [15], we provide online interactive tables with automatic calculators in the Supplementary Material.
DISCUSSION
In this study we provided regression-based normative data obtained from a sample of CU individuals (aged between 50 and 70 years) without underlying AD pathology, assessed by CSF determination of core AD biomarkers (AT[N]). Normative data is presented for the following tests: the Digit Span of the WAIS-IV, the Symbol Span of the WMS-IV, the TMT, the Coding of the WAIS-IV, the Matrix reasoning of the WAIS-IV, the Semantic Fluency Test, the Visual Puzzles of the WAIS-IV, and the Judgement of line orientation of the RBANS. The current biomarker-based cognitive references aim to provide an interpretational framework for the characterization of subtle cognitive decline in preclinical AD.
Given the potential limitations of conventional neuropsychological norms in identifying subtle cognitive decline, robust normative procedures are commonly employed to ensure the inclusion of clinically stable populations. These procedures involve selecting individuals who consistently demonstrate healthy cognitive performance over longitudinal follow-ups, while excluding those who later develop MCI or dementia. Such robust normative procedures aimed to increase the sensitivity of norms for the detection of MCI [5, 8]. However, robust normative procedures are not limited to clinical stability over longitudinal follow-up, nor to the detection of MCI. Recent biomarker-based normative approaches have expanded beyond these parameters by excluding data obtained from individuals with underlying AD pathology. Such norms render higher cut-offs and more sensitive standards than those provided by conventional norms aiming to detect the subtle cognitive decline that precedes MCI [12, 13]. By incorporating such innovative methodologies, researchers can enhance the accuracy and sensitivity of cognitive assessment tools, thereby facilitating the early identification of individuals at risk of dementia and potentially enabling interventions at preclinical stages.
The present work may help in the characterization of subtle cognitive decline. As defined by the NIA-AA in 2018, subtle cognitive decline is the stage of transitional decline before MCI [4]. This criterion proposed that this stage can be documented through subjective reports of cognitive decline or objectively by a longitudinal follow-up. Since most individuals with subjective reports of decline do not progress to dementia [25], objective cognitive measures may be more predictive of future dementia [16]. A direct approach is to use cross-sectional performance to predict clinical progression. The use of the common demographically adjusted cut-offs of< -1.5 SD below the mean, or percentile < 5, using a group of CU individuals without biological evidence of AD pathology as a reference has already been reported to improve the accuracy prediction of dementia progression [16]. By combining conventional and biomarker-based normative data, it is possible to identify those CU individuals who perform in the normal range according to standard normative data (without using negative-biomarker CU as reference) but fall in the impaired range according to normative data based on AD biomarkers (Group 2 classification according to Bos et al. [16]). We propose that cognitive scores showing these discrepancies could serve as indicators of subtle cognitive decline.
We assessed the sensitivity of the biomarker-based norms provided in this work in comparison to the conventional published references. We demonstrated that the proportion of lower scores (individuals labeled as impaired) significantly differed between the datasets, being higher for the biomarker-based norms. In addition, there was a low-intermediate agreement for impaired labeling between the norms. Considering the classification discrepancies between references and the fit of biomarker-based approaches for the characterization of preclinical AD, results suggest that the presented norms work as intended and indeed were more sensitive to subtle cognitive changes than the standard references.
As it has been previously reported, removing individuals with positive status of amyloid-β and/or tau pathology in normative data, yields higher reference scores than those of standard procedures [15–17]. As a result, the sensitivity was enhanced but the specificity of the norms decreased. To explore the characterization of subtle cognitive decline, the norms provided here do not aim to replace previously published ones. We suggest using these norms as a complementary tool (e.g., not in isolation but rather in combination with published conventional norms) for an interpretative framework in characterizing subtle cognitive decline. For reference, we provide hypothetical examples of the use of the normative data provided in this work in comparison with standard norms (see the Supplementary Material).
We found a relevant effect of age on the performance of all tasks, except for the Digit Span forward and backward parts. Education affected the performance for all tasks except for two variables within the attentional domain, the Digit Span forward, and the TMT-A. Education was positively associated with cognitive performance, except for the TMT-B, in which the association was negative, as expected (higher education associated with lower completion times). Sex affected the performance in all tasks except for two tasks within the executive domain, the TMT-B, and the WAIS-IV: Coding, and for the task in the linguistic domain, the Semantic Fluency Test. Males outperformed women on the TMT-A, WAIS-IV: Digit Span, WMS-IV: Symbol Span, WAIS-IV: Matrix reasoning, WAIS-IV: Visual Puzzles, RBANS: Judgment of line orientation, indicating that males tend to have higher scores than females. These associations between sex and cognitive performance showing better scores in men are consistent with previous studies in the studied cognitive domains [26–29], but are in contrast with our previously published normative data from the same sample in verbal episodic memory, in which women consistently outperformed men [15].
The influence of sociodemographic variables on cognitive function is well known [30]. However, available normative data typically presents adjustments for age, but adjustments for education are not always included, and including them for sex is even less frequent. It is of high relevance to adjust cognitive norms for sex, since not accounting for sex differences could lead to misdiagnoses [31]. Following this consideration, and in contrast with available normative data, the present study included adjustments for education for the WMS-IV: Symbol Span; the WAIS-IV: Coding; and the WAIS-IV: Matrix reasoning, and additional adjustments for sex for the WAIS-IV: Digit Span; the WMS-IV: Symbol Span; the TMT-A; the WAIS-IV: Matrix reasoning; and the WAIS-IV: Visual Puzzles.
The main limitation of our work resides in the partial applicability of the normative data provided. Individuals tested were aged from 50 to 70 years-old, thus constraining the age range of application. Nevertheless, the proposal of normative data based on AD biomarkers is especially useful in this age range, since the estimated prevalence of subjective cognitive decline is 25% for individuals older than 60 years [32]. The range of application is also constrained by the fact that our sample is mainly composed of individuals with at least elementary studies. Therefore, there is an important gap in the lower levels of education and the presented norms are not directly applicable to individuals with lower educational levels than those of the present investigation. Considering sample characteristics, the percentage of
In conclusion, we developed socio-demographically fully adjusted reference data obtained from individuals with evidence of normal (non-pathological) (AT[N]) biomarker levels in CSF. In combination with conventional norms, the biomarker-based cognitive references presented here may improve the objective characterization of subtle cognitive decline in preclinical AD.
