Abstract
Many caregivers of autistic children report developmental concerns as early as the first or second years of life (Chawarska et al., 2007; Crane et al., 2016; De Giacomo & Fombonne, 1998; Herlihy et al., 2015). However, families face many barriers in accessing autism-specific evaluations and services at these young ages, including hesitancy by primary care providers to initiate referrals, recommendations to “wait and see,” long waitlists at centers with appropriate expertise, and limited numbers of community-based providers. This results in years between parents’ first concerns and their child’s ultimate diagnosis, and consequently, lost time to access supportive services during this important developmental period (Dababnah et al., 2018; Edwards et al., 2021; Zuckerman et al., 2015). One key challenge is the lack of tools available to guide clinical decision-making regarding an individual infant’s likelihood of autism or other non-typical outcomes during the first year. Development of such tools could help improve responsiveness to families with early developmental concerns, prioritize waitlists for specialty evaluations, and “fast track” infants most in need of early supportive services. In this article, we conducted a secondary data analysis of 256 infants with an older autistic sibling to examine whether combining multiple behavioral assessment tools could help identify individual infants with elevated likelihood of non-typical outcomes, including autism, in the first year of life.
There are several unique challenges in identifying infants with elevated likelihood for autism and other developmental differences in the first year of life. Most of the data on very early development in autism comes from studies of younger siblings of autistic children followed prospectively through toddlerhood when diagnostic outcomes can be ascertained. These studies have demonstrated that symptoms emerge over time and at different rates across children; even with multiple serial developmental evaluations, only approximately 40% of infants ultimately diagnosed with autism meet criteria by 18 months of age, with many others not meeting formal diagnostic criteria until 24 or 36 months (Chawarska et al., 2014; Ozonoff et al., 2015; Szatmari et al., 2016). Given this heterogeneity in the unfolding of autism symptoms, it is not surprising that existing behavioral assessments appropriate for use in the first year of life have relatively modest sensitivity (i.e. ability to identify autism among those ultimately diagnosed) and specificity (i.e. ability to accurately identify cases without autism in the sample), as behavioral signs may not yet be measurable in many children. For example, at 12 months, the Autism Observation Scale for Infants (AOSI) demonstrated 0.52 sensitivity and 0.74 specificity in predicting 36-month autism outcomes within an infant sibling sample (Zwaigenbaum, Bryson, et al., 2021).
A second key finding to emerge from this literature is that infant siblings have an elevated likelihood of atypical developmental outcomes more broadly, with approximately 20% meeting criteria for autism and another 20%–30% exhibiting other developmental differences such as elevated attention-deficit hyperactivity disorder (ADHD) and anxiety symptoms, speech-language problems, and lower scores on adaptive functioning measures (Messinger et al., 2013; Shephard et al., 2017). Thus, assessments used in the first year of life that identify multiple types of early delays, including but not limited to autism, could help more children benefit from early intervention.
Improving detection in the first year of life
Screening tools used in the second year of life in general settings such as well-baby visits prioritize sensitivity and specificity in order to identify as many affected children as possible while limiting referrals for children who do not need them (McCarty & Frye, 2020). This approach may be less helpful in the first year of life when signs of autism are rarely clear (Zwaigenbaum, Bryson, et al., 2021; Zwaigenbaum, Brian, et al., 2021), limiting sensitivity. Instead, the challenge is to determine whether any behavioral differences present are clinically meaningful and predictive of later challenges. Prioritizing positive predictive value (PPV) over sensitivity and specificity may better address the specific needs of this developmental period. PPV is defined as the proportion of individuals scoring above a given threshold who have the outcome of interest. Developing tools with high PPV will be useful for guiding feedback to families with DC about their infant’s likelihood of autism or other developmental differences.
PPV is heavily influenced by the base rate of the outcome within a given population. This means that even screeners with high sensitivity and specificity tend to have poor PPV in general population samples, since the base rate of autism is relatively low (approximately 3%; Maenner et al., 2023). Therefore, it may be useful to examine PPV in a population with a higher base rate of autism and other developmental delays, such as infant siblings of autistic children or children for whom concerns have already been raised by parents or practitioners. Such an approach is consistent with increasing calls for a stepped and iterative approach to early screening wherein infants are initially identified based on universal level 1 screening or history (e.g. autistic sibling, prematurity, genetic syndrome, caregiver concerns), and subsequently evaluated using a level 2 screening or assessment to differentiate between autism or other developmental challenges (Dow et al., 2020; Green et al., 2022; Hyman et al., 2020; Khowaja et al., 2018; Robins, 2020; Sturner et al., 2023; Talbott & Miller, 2020; Wieckowski et al., 2022). This two-stage approach has been shown to improve diagnostic accuracy beginning in the second year of life (Brewer et al., 2020; Khowaja et al., 2018; McCarty & Frye, 2020; Towle & Patrick, 2016). We propose that applying this multi-stage framework could help to identify some infants in the first year of life with high likelihood of non-typical outcomes.
This study
This study investigated the potential for existing behavioral assessments, when used in the first year of life, to identify infants with elevated likelihood of autism and other developmental differences in toddlerhood. That is, are there some infants for whom early behavioral differences are predictive of subsequent non-typical developmental outcomes? To answer this question, we examined the PPV of three behavioral measures collected at 6, 9, and 12 months of age, individually and in combination, to predict non-typical developmental outcomes at age 3. For each measure, we defined cutoffs representing clear developmental differences in the first year of life (markers). We hypothesized that combining multiple measures would decrease the number of false positives and therefore increase the PPV, and thus the clinical utility, of such a screening approach. Given the relatively low overall prevalence of autism and other non-typical outcomes among general population samples, this article focused on infants with a family history of autism (FH). If combining multiple measures yields greater predictive utility within this sample, future studies could test the tools in general community samples.
Methods
Participants
This is a secondary analysis of data collected from three cohorts enrolled in longitudinal studies of infants with (
Procedures
Data were collected at multiple time points across the first 3 years of life for each cohort (at 6, 9, 12, 18, and 24 months of age), and a diagnostic outcome visit at 36 months of age. Participants in this analyses included all infants with an FH who had at least one assessment visit at 6, 9, or 12 months of age and a final visit at 36 months. Participants were evaluated at all ages by examiners unaware of family history and the results of prior assessments. Examiners were all MA or PhD level, with multiple years of experience in autism evaluations, and supervised by a licensed clinical psychologist who verified all diagnostic outcomes. Of the 256 FH participants, 158 had data at all three infant time points (6, 9, and 12), 47 had visits at 6 and 12 months, 30 had visits at 9 and 12 months, and 21 had visits at 12 months only. Demographic information and clinical outcomes for all participants are shown in Table 1. All data were collected at UC Davis Health, with approval from the Institutional Review Board. Written informed consent was obtained from all parents or legal guardians.
Participant demographics.
FH: family history of autism; GED: General Educational Development.
Includes courses toward college and/or associates or 2-year degree.
Includes Masters and Doctorate.
Measures
Autism observation scale for infants
This semi-structured behavioral observation measure was designed to measure behavioral constructs related to autism in infants 6- to 18-months of age (Bryson et al., 2008). A series of play interactions and systematic presses assess target behaviors including visual tracking and attention disengagement, coordination of eye gaze and action, imitation, affective responses, early social-communicative behaviors, behavioral reactivity, and sensory–motor development. It has excellent inter-rater reliability and fair-to-good test–retest reliability on standard laboratory-based administration (Bryson et al., 2008). AOSI total scores range from 0 to 38. At 12 months, a cut-off score of 7 results in a relative risk ratio of 1.58, with infants having scores at or above 7 more likely to have an autism outcome (Sacrey et al., 2018). AOSI total score was used as one of the three marker variables, described further in the behavioral markers definitions section below.
Mullen scales of early learning
This standardized developmental measure for children from birth to 68 months contains four cognitive subscales: Expressive Language, Receptive Language, Visual Reception, and Fine Motor (Mullen et al., 1995). Raw scores can be converted to T-scores and age equivalents. The original Mullen norming sample demonstrated good internal, test–retest, and interrater reliability (Mullen, 1995). The Mullen scales of early learning (MSEL) also shows good convergent validity with other standardized tests of cognitive functioning (Bishop et al., 2011). In this study, an Overall developmental quotient (DQ) was calculated by dividing the mean age-equivalent score for the four subtests by the child’s chronological age and multiplying by 100.
Parent concerns
At the end of each visit, examiners asked parents to describe any “ . . . current concerns about [their child’s] development or behavior at this time” and recorded responses verbatim (Ozonoff et al., 2009). All responses were coded into one of eight categories of concerns or no concerns, following the procedures detailed by Ozonoff and colleagues (2009). In previous use of this instrument, parents of infants who are later diagnosed with autism report significantly more concerns than parents of infants with other developmental delays or typically developing outcomes beginning at 12 months of age (Ozonoff et al., 2009). The current study used a dichotomous variable indicating the presence/absence of autism-related concerns (defined as parent concerns about speech/language/communication, social development, repetitive behavior, and/or unspecified general worries about autism).
Behavioral marker definitions in infancy
We generated cutoffs for each of the three measures collected at 6, 9, and 12 months. MSEL scores were dichotomized based on standard scores, with low MSEL scores defined as a DQ less than or equal to 85 (e.g. 1 standard deviation below the mean). Parent concerns were dichotomized as absence or presence of autism-related concerns (none versus one or more). High AOSI scores at 12 months were defined as 7 or above, as recommended by the measure’s developers (Sacrey et al., 2018). There are no recommended cutoffs for 6 or 9 months of age because the receiver operating characteristics (ROC) analyses used to develop the existing guidelines did not statistically differentiate between infants with and without autism outcomes until 12 months of age (Zwaigenbaum, Bryson, et al., 2021). We defined high AOSI scores as 1.5 standard deviations above the mean of our no family history (NFH) group collected as part of the larger infant sibling study. High AOSI in our sample was defined as ⩾ 13 at 6 months and ⩾ 10 at 9 months. Of note, at 12 months, the published and recommended cutoff of 7 was identical to the cutoff generated using the 1.5 SD above the NFH group definition.
Using these cutoffs, we determined whether a child demonstrated one, two, or three of the behavioral markers and examined the psychometric properties of each marker, individually and in combination, at each age. Mean scores and standard deviations for the three measures collected at 6, 9, and 12 months for the FH and NFH groups are presented in Table 2. At the group level, the FH group had a significantly greater number of parent concerns across all ages. At 6 months, the FH group also had significantly lower MSEL DQ scores. By 12 months, the FH group also had significantly higher AOSI total scores than the NFH group.
Sample characteristics, by recruitment group.
FH: family history of autism; NFH: non-FH; AOSI: Autism Observation Scale for Infants; MSEL: Mullen Scales of Early Learning; DQ: developmental quotient.
36-Month diagnostic outcome measures and procedures
Autism diagnostic observation schedule (ADOS), second edition
This is a semi-structured, play-based interaction and observation tool assessing social communication and repetitive behavior (Lord et al., 2012). Comparison scores, ranging from 1 to 10, allow for comparison of ADOS scores across different administration modules, with scores of 4 and above indicative of autism (Gotham et al., 2009). Here, the ADOS was used to confirm older sibling diagnosis and to determine 36-month infant outcomes.
Outcome classification procedure
At 36 months, infants were classified into three outcome groups: autism, typical development (TD), or non-typical development (Non-TD), following the procedures of Ozonoff and colleagues (2014). Criteria for the autism outcome group included (1) comparison score at or above the autism spectrum disorder (ASD) cutoff of 4 on the ADOS-2 and (2) meeting Diagnostic and Statistical Manual of Mental Disorders (5th ed.;
Community involvement statement: community members were not involved in this study.
Results
Mean scores and standard deviations for the three measures collected at 6, 9, and 12 months for the TD and DC groups are presented in Table 3. At the group level, the DC group had significantly higher AOSI scores than the TD outcome group by 9 months, and a significantly greater number of parent concerns and lower MSEL DQ scores at 12 months.
Sample characteristics, by outcome group, FH group only.
FH: family history of autism; TD: typical development; DC: developmental concerns; AOSI: Autism Observation Scale for Infants; MSEL: Mullen Scales of Early Learning; DQ: developmental quotient.
We started by examining whether there were fewer infants who met the multiple marker versus single marker definitions. Across all ages and outcome groups, few infants demonstrated all three behavioral markers (only 6 at 6 months, 5 at 9 months, and 13 at 12 months). The number of infants with two behavioral markers was higher, though still a small number of infants overall. Table 4 lists the distribution of infants at or above the cutoffs on single and combined marker definitions at each of the three ages.
Number of infants scoring above each marker definition, by age, FH group only.
FH: family history of autism; TD: typical development; DC: developmental concerns; AOSI: Autism Observation Scale for Infants; MSEL: Mullen Scales of Early Learning; PC: autism-related parent concerns.
The Any 1, Any 2, or All 3 definitions includes infants whose scores met behavior marker definitions on at least one, at least two, or all three of the measures.
Differences in the relative proportions of infants meeting the single versus multiple marker definitions were explored using McNemar’s exact tests, comparing the proportion of infants who met any single-marker definition (“Any 1”) versus the Any 2 marker definition in infants for whom at least two measures were available at a given age. Across all three ages, the proportions of infants who met the Any 2 definition were significantly smaller than the Any 1 definition (all
Next, we examined how well each definition performed in identifying infants belonging to the typically developing and DC groups at 36 months. We calculated the PPV, negative predictive value (NPV), sensitivity, and specificity of each definition in predicting classification into the DC group at 36 months (see Table 5). At each age, the behavioral marker definition with the highest PPV involved the combination of at least two measures. At 6 months, this was the All 3 definition, with a PPV of 0.67 (95% confidence interval (CI): 0.22–0.96). At 9 and 12 months, PPV was highest for the AOSI + MSEL definition, 0.83 (95% CI: 0.36–1.0) and 0.95 (95% CI: 0.75–1.0), respectively.
PPV, NPV, sensitivity, and specificity of marker definitions, by age, for DC outcomes versus TD outcomes, FH group only.
DC: developmental concerns; TD: typical development; FH: family history of autism; CI: confidence interval; AOSI: Autism Observation Scale for Infants; MSEL: Mullen Scales of Early Learning; PC: autism-related parent concerns.
The Any 1, Any 2, or All 3 definitions includes infants whose scores met behavior marker definitions on at least one, at least two, or all three of the measures.
The high PPV for these combined marker definitions contrasts with their low sensitivity values, which were expected and consistent with previous research showing most infants ultimately classified with DC do not demonstrate clear differences in the first year of life (Cleary et al., 2023; Zwaigenbaum et al., 2015). However, the high PPVs suggest that when multiple developmental differences are present, there is a strong likelihood those differences will persist into toddlerhood. At both 9 and 12 months, all but one infant who met either the AOSI + MSEL or All 3 definitions were classified into the DC group at outcome. Furthermore, of the 13 infants who met the All 3 definition at 12 months, 11 were diagnosed with ASD, 1 classified as Non-TD, and only 1 was typically developing. Supplemental Table 1 shows the number of infants who scored above each marker definition separately for autism and Non-TD outcomes. Supplemental Table 2 presents the associated PPV, NPV, sensitivity, and specificity values for differentiating autism from Non-TD outcomes. In general, the pattern of findings was similar in the autism-only outcome group, with the highest PPV at each age involving the combination of at least two measures. As expected, given the smaller sample size in this sub-group, the 95% CIs were wider and overlapped across all definitions. Sensitivity was also quite low across most definitions.
Because the measures available at each age differed across cohorts and because the criteria for each marker definition were not independent, it was not possible to directly compare each definition pair. Instead, we examined differences in the PPV values for the single-marker (e.g. AOSI, MSEL, PC, and Any 1) versus multiple marker (e.g. AOSI + PC, AOSI + MSEL, MSEL + PC, Any 2, All 3) definitions by calculating the 95% CI for each definition at each age. Non-overlapping CIs suggest a true difference in the proportion of infants scoring above the cutoff who belonged to the DC outcome group. These estimates are included in Table 5, and can be visualized in Figure 1 to facilitate comparison across the different definitions.

PPV with confidence intervals for DC outcomes, by age and marker definition, FH group only.
As can be seen in Figure 1, although the PPVs for the multiple marker definitions were generally higher than the single-marker definitions across all ages, at both 6 and 9 months, the 95% CIs for these proportion estimates overlapped across all definitions. This suggests that the multiple-marker definitions may not identify a significantly higher proportion of infants with DC outcomes than the single-marker definitions at these earliest ages. However, at 12 months, the proportion of infants with DC correctly identified by the MSEL alone, AOSI + MSEL, MSEL + PC, Any 2, and All 3 definitions were all significantly higher than Any 1 or Parent Concerns alone, as these CIs do not overlap. The AOSI + MSEL definition also identified a significantly higher proportion of DC outcomes infants than the AOSI alone, as the 95% CIs do not overlap. Importantly, 19 of 20 infants meeting the AOSI + MSEL definition belonged to the DC outcomes group. This suggests that when infants are showing a combination of high AOSI and low MSEL scores, there is a high likelihood these differences will persist into toddlerhood and result in a classification of autism or other Non-TD.
We used the same CI approach to compare differences in the proportion of infants with TD outcomes who met the single-marker (e.g. AOSI alone) versus multiple marker (e.g. AOSI + MSEL) definitions. This represents the proportion of infants incorrectly flagged by the each of the definitions (e.g. the false-positive rate or FPR). As can be seen in Figure 2, across all ages, the multiple-marker definitions resulted in proportionally fewer infants incorrectly identified. At all ages, the CIs around the estimates for the Any 1 and Parent Concerns Alone were higher than, and did not overlap with, any of the multiple-marker definitions. This indicates these two measures incorrectly identified a significantly higher proportion of infants with TD outcomes than the other definitions. At 6 and 9 months, the intervals around the AOSI and MSEL alone and each of the multiple-marker definitions all overlapped, suggesting that although a smaller proportion of infants were incorrectly identified by the multiple-marker definitions than the AOSI or MSEL alone, the differences were not significant. At 12 months, the MSEL alone, AOSI + MSEL, MSEL + PC, and All 3 definitions identified a significantly smaller proportion of infants with typical outcome classifications than the Any 1, AOSI alone, and Parent Concerns definitions. Thus, a similar pattern of improved identification rates was observed in the FPR as the PPV, with the multiple-marker definitions associated with fewer infants falsely identified than the AOSI alone, particularly at 12 months.

PPV with confidence intervals for TD outcomes, by age and marker definition, FH group only.
Overall, we found that combining multiple behavioral markers increased the proportion of infants scoring above the cutoff who are ultimately classified into the DC group (e.g. PPV), particularly at 12 months. At 6 months, the highest PPV was observed for the high AOSI and parent concern definition (PPV 0.64); adding low Mullen scores increases PPV slightly (0.67). While the combined marker definitions had higher PPV than the single-measure marker definitions, the 95% CIs for each proportion estimate overlaps for all definitions. This suggests that at 6 months, combining multiple markers does not significantly improve PPV. At 9 months, the highest PPV is obtained for the definition involving high AOSI and low MSEL scores (PPV 0.83). However, the 95% CIs around the proportion estimates overlap for all definitions, again suggesting that at 9 months, combining multiple markers does not significantly improve PPV. At 12 months, the highest PPV is again observed for the high AOSI and low MSEL definition (PPV 0.95). This definition significantly improves PPV over AOSI alone, parent concerns, or Any 1 definitions. Nineteen of 20 infants with both high AOSI and low MSEL scores at 12 months were classified into the DC outcome group.
Discussion
In this study, we examined the potential of combining three early behavioral measures to identify infants 6–12 months with elevated likelihood of DC, including autism or other non-typical outcomes, in toddlerhood. We generated cutoffs for each of three measures collected at 6, 9, and 12 months: the AOSI, Mullen Scales of Early Learning (MSEL), and the presence of autism-related Parent Concerns (PC). We found that combining two or more measures yielded a PPV between 0.67 and 0.95 in predicting any non-typical outcome (including autism as well as other DC, such as speech-language delays). While many infants with typical development outcomes met single-marker definitions, very few met multiple marker definitions. For example, at 6 months, 15 infants with TD outcomes met the high AOSI score definition, but only 7 met both the high AOSI and low MSEL definition. At 12 months, 22 infants with TD outcomes met the high AOSI definition, yet only one participant with a typical outcome at 36 months met both high AOSI and low MSEL markers. This suggests that using this multiple-marker approach could reduce the number of infants unnecessarily referred for additional assessments or services.
As expected, sensitivity was low across all ages and groups. This is consistent with the existing literature demonstrating the heterogeneous developmental patterns among infant siblings of autistic children and the protracted emergence of developmental differences in toddlerhood. It is clear that repeated serial screening will be needed to progressively identify infants and toddlers over time (Lipkin, 2023). Our findings build on this literature by identifying one potential approach to identifying DC at the very earliest ages, requiring multiple behavioral differences across multiple domains. While differentiating between autism and other developmental differences is, by definition, not possible during the pre-diagnostic period, we suggest that infants demonstrating differences on multiple measures are those most likely to benefit from referrals and services. Existing autism screeners for older toddlers also perform much better when predicting developmental differences that are more broadly defined, rather than autism specifically. For example, the initial validation of the M-CHAT-R/F reported a PPV of 0.48 (95% CI: 0.41–0.54) in predicting autism specifically; it rose to 0.95 (95% CI: 0.92–0.98) when predicting any developmental condition (e.g. Global Developmental Delay, Language Disorder). For this reason, we used a broader outcome classification that included not only autism but also other developmental delays and/or clinical concerns, as defined by elevated scores on the ADOS-2 and/or low scores on the Mullen. These infants are likely to benefit from evaluation and supports, regardless of specific diagnostic outcome. It is worth noting that we observed a similar pattern of results, with similar PPV and sensitivity indices, when predicting to autism-specific outcomes as well, further demonstrating the utility of this approach.
Clinical Implications
Although our sample is small, we suggest our approach offers a new lens for thinking about early detection processes. Existing recommendations and detection practices are inconsistent. For example, the US Preventative Services Task Force Recommendation Statement found insufficient evidence to assess the balance of benefits and harms for autism screening (Siu, 2016). However, the American Academy of Pediatrics (AAP) recommends integration of autism-specific screening at 18 and 24 months embedded within ongoing developmental surveillance through routine primary care visits (Hyman et al., 2020; Lipkin & Macias, 2020). There have now been several trials demonstrating the effectiveness of this approach in improving screening and autism identification rates in community settings (Barbaro et al., 2022; Karlov et al., 2024; McNally Keehn et al., 2020). However, a key challenge is the lack of tools validated for screening infants in the first year of life. The most recent AAP guidance lists several screening tools under development for infants less than 18 months of age (Hyman et al., 2020). However, only one of these includes infants younger than 12 months: The Infant/Toddler Checklist (ITC) from the Communication and Symbolic Behavior Scales (Wetherby et al., 2008). Prior work with the ITC in a sample of infant siblings of autistic children reported PPV ranging from 31% at 6 months to 62% at 24 months in predicting outcomes of any developmental delay (Autism + Non-TD; Parikh et al., 2021). In light of these challenges, there have been increased calls for repeated early screening and successive identification of infants most likely to benefit from referrals for evaluations and services at the time of screening (Constantino et al., 2021; Dai et al., 2020; Eisenhower et al., 2021; Robins, 2020).
Our findings are consistent with this serial screening and evaluation approach. Rather than trying to identify all infants who will ultimately be classified in the DC group (i.e. prioritizing sensitivity), we focused on identifying infants with the highest likelihood for autism and other developmental differences at each timepoint (i.e. prioritizing PPV). We suggest that for infants showing developmental differences across multiple domains, there is no need to “wait and see” to initiate services. When parents or others have concerns about infants’ development, they should be referred for developmental evaluation, and if clear behavioral differences are observed on multiple tools, they should be connected with early intervention services. Currently, these services might include existing low-intensity early intervention programs such as those administered through federally funded programs in the United States (Individuals with Disabilities Education Improvement Act, 2004). Our findings also support the development of additional transdiagnostic supports for infants and their families during this pre-diagnostic period (Charman et al., 2023; Constantino et al., 2021; Finlay-Jones et al., 2019). These supports might also include heightened developmental surveillance, including more frequent or repeated screening and evaluation of social communication development.
Limitations and Future Directions
There are some limitations to this approach. First is the small number of infants who met multiple screening criteria and dependencies in the multiple marker definitions, which limits statistical power to compare across definitions and ages. This limitation is inherent to comparisons of PPV values, since PPV is calculated as a proportion of the number of infants who met the screening criteria for each definition. This means the denominator varies across PPV definitions, even within a single age, prohibiting the direct comparison of PPV values within a single set of participants. This is not an issue when comparing sensitivity values, in which the denominator (i.e. the total number of children with the outcome of interest) remains constant and only the numerator (i.e. the number of children who meet the screening criteria for each definition) varies. A second related limitation of this study is that infants were from families already familiar with autism, since they had an older diagnosed child. Such infants, and their families, may not be representative of community-ascertained samples. The use of standardized behavioral measures administered by well-trained examiners is also much more costly to implement than brief screening questionnaires so the methods described here may not generalize to clinical settings. However, the goal was more narrowly focused on understanding the clinical significance of early behavioral differences, rather than testing any specific screening tool. An important strength of the infant sibling design used here is the ability to follow the entire cohort of children—including those who screen negative—which is often not feasible in large community-based screening studies. While costly to implement, the use of standardized behavioral assessments supports the validity of our findings. There is also some initial evidence that the AOSI performs similarly in community-ascertained infants as in infant sibling samples (Hudry et al., 2021; Talbott et al., 2022). Future work focused on longitudinal examination of infants ascertained on the basis of DC from parents, providers, and/or universal screening tools, which will help to clarify the utility of this approach for identifying infants with the highest likelihood of non-typical outcomes. In particular, it will be helpful to examine the impact of combining multiple tools within the context of a developmental surveillance approach.
Conclusion
This study demonstrates an approach that can identify infants with an elevated likelihood of autism and other non-typical outcomes in the first year of life. Further refinement of this approach may help to improve early detection practices by identifying which infants to prioritize for referrals. Our findings underscore the need for continued development of strategies for repeated screening and developmental monitoring. Overall, our findings indicate that although only a small number of infants will be identified at these very young ages, when significant developmental differences are present, particularly differences across multiple measures, they are likely to be clinically meaningful.
Supplemental Material
sj-docx-1-aut-10.1177_13623613241275455 – Supplemental material for Can combining existing behavioral tools improve identification of infants at elevated likelihood of autism in the first year of life?
Supplemental material, sj-docx-1-aut-10.1177_13623613241275455 for Can combining existing behavioral tools improve identification of infants at elevated likelihood of autism in the first year of life? by Meagan R Talbott, Gregory S Young and Sally Ozonoff in Autism
Footnotes
Data availability statement
Declaration of conflicting interests
Funding
Supplemental material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
