Abstract
Keywords
Children with autism spectrum disorder (ASD) present with varied cognitive, language, adaptive, and social skills. The presence of early functional speech has been identified as an important predictor of better long-term outcomes (Gillberg & Steffenburg, 1987; Howlin, Goode, Hutton, & Rutter, 2004; Mayo, Chlebowski, Fein, & Eigsti, 2013). Although early intervention programs for ASD differ, communication is often a key treatment goal (Rogers, 2006; Schreibman et al., 2015).
Characterizing language skills
Despite its importance as a predictor and treatment outcome, functional speech has been a challenging construct to define and measure. Many outcome studies use direct, standardized measures to characterize skills. Although there are important advantages to this assessment approach (established reliability and validity, well-defined norms), there are also disadvantages (Condouris, Meyer, & Tager-Flusberg, 2003). Children with ASD may be less comfortable in novel situations, such as testing environments. In addition, they may have difficulty with test-taking skills such as attending to instructions and focusing on task materials. Most standardized measures of expressive language have few items targeting very early developmental levels (e.g. <24 months), such that a small change in raw scores can have disproportionate effects on standard scores and age equivalencies (Tager-Flusberg et al., 2009). These variables can influence the utility of standardized measures when planning for intervention or assessing change over time.
Parent-report measures and natural language samples are important alternative sources of information to characterize functional language. Parent-report measures, such as the MacArthur-Bates Communicative Development Inventory (Fenson, Bates, Dale, Marchman, Reznick, & Thal, 2007), allow for the collection of information about everyday skills in young children and can be strongly correlated with other measurement approaches (Luyster, Kadlec, Carter, & Tager-Flusberg, 2008). Natural language samples can be an especially valuable approach to characterizing early language development (Costanza-Smith, 2010). They provide opportunities to measure a wide range of expressive language abilities, including pragmatic skills, and are often collected in contexts that are familiar to the child. Importantly, natural language samples assess functional use of language: the degree to which the child uses varied language spontaneously, intelligibly, and across contexts, which may be especially valuable when planning for treatment and examining intervention impact.
The language benchmarks framework
In 2006–2007, the National Institute on Deafness and Other Communication Disorders convened a panel of researchers with expertise in the study of language development in young children with ASD to address this issue (Tager-Flusberg et al., 2009). Their goal was to explore strategies to encourage greater uniformity in the definition and measurement of spoken language in children with ASD. They recommended the use of a developmental framework with five phases of expressive language development: Phase 1: Pre-verbal; Phase 2: First Words; Phase 3: Word Combinations; Phase 4: Sentences; and Phase 5: Complex Language. They also recommended that all domains of expressive language be considered when characterizing language level: phonology, vocabulary, grammar, and pragmatics. The group provided detailed guidelines about the specific skills that a child must demonstrate to meet each phase in these four domains across a range of assessment sources.
In 2015, Ellawadi and Weismer used the language benchmarks framework to characterize communication profiles in 105 toddlers with ASD aged 23–39 months. They assessed phonology using a speech analysis, vocabulary and grammar using the Preschool Language Scale—Fourth Edition (Zimmerman, Steiner, & Pond, 2002), and pragmatics using the Early Social Communication Scales (Mundy et al., 2003). An acknowledged limitation of their assessment approach was that the highest level of pragmatics attainable was Phase 3: Word Combinations. Results indicated that communication profiles of young children with ASD were often uneven, with only 10% of the sample scoring at the same level across all domains. The two most common profiles were: (1) strength in phonology, even scores in vocabulary/grammar, and weakness in pragmatics (present in 36% of the sample) and (2) weakness in pragmatics and even scores in other areas (present in 33% of the sample). Other profiles were rare (present in ≤10% of the sample).
In a related study, Kover, Davidson, Sindberg, and Weismer (2014) compared the impact of different language sampling contexts on developmental language phase in 63 three- to four-year-olds with ASD. Language phase was determined using 15-min natural language samples. Across sampling contexts, few children had even skills across domains: 16% had even profiles when the Autism Diagnostic Observation Schedule (ADOS; Lord et al., 2008) was used as a sampling context, 6% had even profiles when examiner–child play was used, and 14% had even profiles when parent–child play was used. Of note, this study only included children who produced at least 30 vocal utterances during the 15-min play sessions.
The language benchmarks framework also offers a valuable approach to characterizing treatment impact. For example, Paul, Campbell, Gilbert, and Tsiouri (2013) found that approximately half of a sample of 22 minimally verbal children with ASD aged 2–6 years assigned to receive either a discrete trial training intervention or a milieu communication training intervention attained benchmarks consistent with the First Words phase of language development. They used the expressive language domain of the Vineland Adaptive Behavior Scales—II (VABS-II; Sparrow, Balla, & Cicchetti, 2005), the Communication and Symbolic Behavior Scales—Behavioral Observation (Wetherby & Prizant, 2003), and the MacArthur-Bates Communicative Development Inventory (Fenson et al., 2007) to establish language phase. Importantly, Paul et al.s’ criteria for attaining the First Words phase for pragmatics were less stringent than those recommended by Tager-Flusberg et al. (2009). Tager-Flusberg et al. recommended that children use words for a minimum of two communicative functions, including commenting (p. 650), whereas Paul et al. accepted any two pragmatic functions, without requiring commenting (e.g. requesting and social interaction).
Development of the assessment of phase of preschool language
In previous studies applying the language benchmarks framework, the same measures were used across all children (e.g. Ellawadi and Weismer (2015) used the Preschool Language Scale—Fourth Edition and the Early Social Communication Scales). We propose that the language benchmarks framework can also be meaningfully applied in contexts where a wide range of measures may be used (e.g. clinicians in different regions and multi-site research projects). Condouris et al. (2003) found that key language scores from natural language samples in children with ASD (mean length of utterance and number of different word roots) were significantly correlated with scores tapping similar constructs on standardized. As the criteria proposed by Tager-Flusberg et al. (2009) are stringent and robust, clinicians and researchers could, in principle, determine a child’s phase using varied measurement sources.
We further propose that there are advantages to including skills using augmentative and alternative communication (AAC) when characterizing language level in children with ASD. Tager-Flusberg et al. (2009) acknowledged that some children with ASD successfully acquire expressive language skills using AAC systems, but limited their recommendations about measures and benchmarks to spoken language. Recent estimates suggest that approximately 30% of children with ASD have minimal spoken language ability as they transition from preschool to school-age (see Tager-Flusberg & Kasari, 2013). Various AAC systems are taught to support non- or minimally verbal children (Mirenda, 2003), including sign language and the Picture Exchange Communication System (PECS; Frost & Bondy, 1994), and electronic tablet-based speech-generating devices are also increasingly used (van der Meer & Rispoli, 2010). Inclusion of skills with AAC when characterizing language level allows clinicians and researchers to capture important, functional communication skills in preschool children who nonetheless remain non-verbal; skills that are often the focus of intervention and can substantially improve quality of life.
We (authors HEF and IMS) developed the Assessment of Phase of Preschool Language (APPL) to support assessors in evaluating children using the language benchmarks framework. The APPL outlines possible sources of information that can be used to meet each language phase within each language domain (natural language samples, direct assessment measures, and/or parent rating forms; see
The current study
In this study, we used the APPL to extend previous research exploring language skills and profiles in children with ASD using the language benchmarks framework. We also explored changes in language phase during intervention in a subset of our larger sample. Benchmark criteria were carefully applied, and no children were excluded due to comorbid diagnoses or low skill levels. We addressed the following specific research questions:
Prior to intervention:
How is language phase characterized using the APPL in four- to five-year-old children with ASD assessed by community-based S-LPs? How does inclusion of AAC skills alter characterization of language phase? Is language phase similar or different across domains? Which domains are most often areas of strength or weakness? In a sub-set of children re-assessed after intervention:
How do APPL scores change during participation in a community-based Naturalistic Developmental Behavioral Intervention (NDBI) program providing Pivotal Response Treatment (PRT) and the PECS? How does inclusion of AAC skills alter characterization of change during treatment?
Method
Participants
Age, adaptive functioning, autism symptom severity, and overall language phase at the beginning of intervention.
VABS II: Vineland Adaptive Behavior Scales, second edition (Sparrow, Balla, & Cicchetti, 2005); SRS-2: Social Responsiveness Scale, second edition (Constantino & Gruber, 2012); APPL: Assessment of Phase of Preschool Language.
All participants were beginning the Nova Scotia Early Intensive Behavioral Intervention program (NS EIBI; Bryson et al., 2007) in one of six regions of a small province. Approval was granted by the IWK Health Centre Research Ethics Board and by REBs in each of the provincial health authorities that existed in Nova Scotia at that time. The NS EIBI program is a publicly provided program open to all preschool children with ASD, regardless of symptom severity or the presence of comorbid conditions. As shown in Table 1, adaptive functioning ranged from the Delayed to Average range, and total scores on the Social Responsiveness Scale, second edition (SRS-2) ranged from the Severe to the Normal range. To be eligible for the EIBI program, children must have an ASD diagnosis by a specialist clinician, based on clinical judgment informed by the ADOS and a detailed developmental history (typically the Autism Diagnostic Interview—Revised; Rutter, LeCouteur, & Lord, 2003). Analyses in Smith et al. (2019) suggest that demographic characteristics for families participating in research on the NS EIBI program are similar to census figures.
Measures
VABS-II (Sparrow, Balla, & Cicchetti, 2005)
The VABS-II is a parent-report measure assessing adaptive functioning in the following four areas: Communication, Socialization, Daily Living Skills, and Motor Skills. In this study, the VABS-II was completed as a caregiver rating form at the beginning of intervention, as part of the larger PATI study (Smith et al., 2019). The Adaptive Behavior Composite (overall score) was used as a demographic variable, and the age-equivalent score from the Expressive language sub-domain was used to assess APPL validity.
SRS-2 (Constantino & Gruber, 2012)
The SRS-2 is a parent rating scale that was designed to be a continuous measure of ASD symptoms. Recent research (Hus, Bishop, Gotham, Huerta, & Lord, 2013) suggests that scores are better interpreted as indicating general levels of impairment, as they are influenced by variables such as cognitive level and challenging behavior. The SRS-2 was completed at the beginning of intervention as part of the larger PATI study (Smith et al., 2019). The total
APPL
Description
Assessment sources appropriate for demonstrating attainment of specific phases for each language domain on the APPL.
NLS: natural language sample.
A page of the APPL is dedicated to each expressive language domain: phonology, vocabulary, grammar, and pragmatics (see Figure 1 for the Vocabulary domain and supplementary material for the full APPL). For each domain, the APPL outlines the range of demonstrated skills that could meet criteria for each phase: Phase 2: First Words; Phase 3: Word Combinations, Phase 4: Sentences, or Phase 5: Complex Language, based on minimum criteria suggested by Tager-Flusberg et al. (2009). A child who does not meet the First Words phase criteria is considered to be at Phase 1: Pre-verbal. Assessors can use a range of possible assessment approaches (natural language samples, parent rating forms, and/or direct assessment measures; see Table 2). If multiple approaches are used to assess the same language domain (e.g. a natural language sample and a parent rating form to assess Vocabulary), the child need only meet minimum criteria based on one of these sources.
The vocabulary domain of the Assessment of Phase of Preschool Language (APPL).
Consistent with recommendations by Tager-Flusberg et al. (2009, p. 647), to meet a specific level of overall language functioning (e.g. overall level of First Words), the child must meet at least one benchmark in every language domain that defines that phase. On the APPL, minimum criteria for language benchmarks can be met either using spoken language or an AAC system. In this study, children could move beyond Phase 1 while remaining non- or minimally verbal, if they met criteria using an AAC system.
Current study
In this study, the APPL was completed by 13 different S-LPs working in six regions of the NS EIBI program. It was administered at the beginning of the NS EIBI program for all children in this study and after approximately 10 months of intervention for a representative sub-sample. Prior to this study, S-LPs in the NS EIBI program were already assessing children’s language skills using natural language samples and standardized assessment measures as part of routine clinical care. The APPL was introduced in the context of the Preschool Autism Treatment Impact Study (see Smith et al., 2019). Clinicians from across the province attended an hour-long teleconference session in which the APPL was introduced and described. The APPL form was then distributed. The only guidelines regarding measure administration and selection were those built into the APPL measure itself, as described below (see also Table 2 and supplementary material).
Although not an explicit instruction, clinicians completing the APPL in this study consistently administered a natural language sample. Natural language samples were part of routine clinical practice in the NS EIBI program and can be used to complete the APPL for all language domains at all phases. APPL guidelines indicate that the natural language sample should involve at least 50 spontaneous utterances, if feasible, for a given child, be at least 20-min long, and involve a range of communicative contexts as appropriate to the child, including opportunities to request, comment, answer questions, have a conversation, and tell a narrative. In this study, mean duration of natural language samples was 31.39 (
Measures administered by community-based speech-language pathologists completing the APPL.
Note. A range of measurement sources can be used to complete the APPL. Evidence from at least one appropriate source is needed to meet criteria within each domain.
Natural language samples were administered for all children and are used to complete all APPL domains.
Parent-report measures of expressive language were used to complete the Vocabulary domain of the APPL.
Direct assessment measures of expressive language were used to complete the Vocabulary and the Grammar domains of the APPL.
APPL: Assessment of Phase of Preschool Language.
Inter-rater reliability
The inter-rater reliability of the APPL was assessed for 22 children using weighted kappa (
Validity
To carry out a preliminary exploration of the validity of the APPL, we explored whether children at higher overall APPL language phases had greater mean age-equivalent scores on the expressive language subdomain of the VABS-II (available for 83% of the sample). Similar to the APPL, the VABS-II measures a child’s functional communication skills in their everyday environment. A one-way ANOVA confirmed that VABS-II age-equivalent scores are higher at higher APPL phases, F (3, 75) = 54.68,
Intervention
The NS EIBI program is publicly funded and involves up to one year of behavioral intervention in natural settings (see Bryson et al., 2007; Smith, Flanagan, Garon, & Bryson, 2015). Children in this study participated for an average of 10 months (
The main treatment targets of the NS EIBI program are expressive communication, play, and other functional skills (e.g. toileting, safety, and reducing challenging behavior). The primary treatment approach is Pivotal Response Treatment (see Koegel, Openden, Freeden, & Koegel, 2006), which is a NDBI (Schreibman et al., 2015). To build expressive communication with PRT, clinicians and parents follow a child’s lead to establish preferred objects and activities as part of daily routines in the natural environment. They then pause activities using a strategy called shared control and present clear language opportunities. For example, if a child wants a ball, the adult might hold the ball (i.e. shared control) and provide a verbal model such as “ball”, “throw ball”, or “throw me the red ball”, tailored to the child’s language level. Adults support children to provide reasonable attempts at responding to language opportunities before allowing play to continue (i.e. providing reinforcement). Reasonable attempts vary across children, and can include intentional vocalizations for Pre-verbal children.
In the NS EIBI program, AAC systems are introduced for Pre-verbal children if they do not progress using PRT. The usual AAC approach is PECS (Frost & Bondy, 1994), with a focus on ensuring that a child progresses through Phases 1 (How to Communicate) and 2 (Distance and Persistence) before beginning Phase 3 (Picture Discrimination). Some children are transitioned to an electronic tablet-based AAC system after attained Phase 3B of PECS (Picture Discrimination with Preferred Items; for this subgroup, a decision about whether to introduce an iPad®-based system is made collaboratively with parents). When supporting a child to use an iPad®-based AAC system, clinicians ensure that the child maintains skills learned through the PECS system, such as travelling to the communication partner and attaining his/her attention prior to communicating.
Results
Language phase and profiles in four- and five- year olds prior to intervention
Language phase on the APPL at the beginning of intervention in four- to five-year olds with ASD;
For vocabulary, three children who attained the First Words phase did so using an Augmentative and Alternative Communication (AAC) system. Considering only spoken language/excluding AAC: 25 (26%) were Pre-verbal and 6 (6%) were First Words.
For grammar, one child who attained the Word Combinations phase did so using AAC. Considering only spoken language/excluding AAC: 32 (34%) were at Phase 1 and 23 (24%) were at Phase 2.
For pragmatics, one child who attained First Words and one child who attained Word Combinations did so using AAC. Considering only spoken language/excluding AAC: 25 (26%) were at Phase 1, 5 (5%) were at Phase 2, and 44 (46%) were at Phase 3.
For overall language, one child who attained the First Words phase did so using AAC. Considering only spoken language/excluding AAC: 26 (27%) were at Phase 1 and (8%) were at Phase 2.
For all study results, language phase was determined based on skills using either spoken language or AAC, whichever was highest (see
For all 11 children using an AAC system at the beginning of intervention, spoken language skills for vocabulary, grammar, and pragmatics were at the Pre-verbal phase. For 7 of the 11 children, inclusion of skills using AAC did not affect characterization of language level, as AAC skills were also at the Pre-verbal phase. For four children (4% of overall sample), inclusion of AAC skills did affect language phase for at least one domain, with overall language phase affected for one child (1% of overall sample; see Table 4). Notes in Table 4 outline proportions of the sample at each language phase considering only spoken language (i.e. excluding AAC skills).
Communication profiles across language domains at the beginning of intervention;
Areas of relative strength and weakness in specific language domains at the beginning of intervention;
Note: A domain was considered an area of relative strength if language phase was at least one phase higher in that domain than in two or three of the other three domains. A domain was considered an area of relative weakness if language phase was at least one phase lower in that domain than in two or three of the other three domains.
Changes in APPL scores after intervention
The APPL was completed at the end of intervention for 46 of the 95 children, at a mean age of five years, six months (
Change in specific language domains during community-based natural developmental behavioral intervention,
Note: For each domain, analyses include children who began intervention at Phase 4 or lower.
We also explored changes in overall language phase for subgroups that began intervention at the Pre-verbal, First Words, Word Combinations, or Sentences levels (see Figure 2). Improvements in overall language phase were most common for children who began intervention at the Word Combinations phase (56% improved), followed by the First Words phase (33%), the Pre-verbal Phase (25%), and then the Sentences Phase (20%).
Changes in Overall language during intervention for children starting at different language phases (
An AAC system was taught for 11 of the 46 children included in change-over-time analyses. Eight used PECS only, one used PECS and then Proloquo2go, one used PECS and then TouchChat (Silver Kite, 2016), and one used Speak for Yourself. Including AAC skills in APPL ratings affected characterization of change during intervention. Despite remaining at the Pre-verbal phase for vocabulary, grammar, and pragmatics based on spoken language after participating in the NS EIBI program, 45% attained Phase 2 for vocabulary using AAC (vs. 18% at baseline), 27% attained Phase 3 for grammar using AAC (vs. 9% at baseline), 18% attained Phase 3 for pragmatics using AAC (vs. 9% at baseline; none met Phase 2), and 18% attained Phase 2 for overall language using AAC (vs. 9% at Baseline). Notes in Figure 2 show the proportion of the sample that changed overall language phase considering only spoken language (i.e. excluding AAC skills). While 8% of Pre-verbal four- and five-year olds moved to the First Words phase during intervention based only on spoken language, 25% moved to the First Words based on either spoken language or AAC.
Discussion
There is a pressing need to establish consistent frameworks to characterize skill levels in children with ASD (Kasari & Smith, 2019). We developed a new rating form, the APPL, to facilitate consistent implementation of the Tager-Flusberg et al. (2009) language benchmarks framework by clinicians or researchers assessing individual children with ASD. The language benchmarks framework allows for the categorization of language phase based on a range of possible assessment sources, including natural language samples and parent-report measures, as well as standardized clinician-administered measures. Natural language samples and parent-report measures capture important information about communication, especially in early communicators and in children who may be less verbal in unfamiliar contexts such as testing. Importantly, the language benchmarks framework is designed to incorporate skills across all domains of language, including pragmatics and phonology, areas that are not consistently tapped by commonly used direct assessment measures.
On the APPL, raters can indicate whether a child met language benchmarks using either spoken language or an AAC system. This acknowledges that some children with ASD develop meaningful communication using AAC (large vocabularies, use of pictures across multiple functions, and including commenting). This is an important consideration when conceptualizing intervention outcomes and predicting later development, given that a functional communication system is generally a key goal of intervention programs. The same APPL standards are applied across communication modalities. This is a high standard for AAC users and could be adapted to reflect emerging research specific to AAC users. In this study, language phase was determined based on skills using either spoken language or AAC, whichever was highest.
In this study, the APPL was administered by 13 S-LPs in a province-wide community-based NDBI program primarily providing PRT. The treatment program is publicly provided, and no children are excluded based on skill level or symptom severity. Ninety-five children were assessed at the beginning of intervention and of them, 46 were re-assessed after approximately 10 months of treatment. Inter-rater reliability of APPL characterization was substantial, and validity was supported by relationships between APPL scores and age-equivalent scores on the expressive language domain of the VABS-II.
First, we explored APPL profiles in four- and five-year olds with ASD at the beginning of intervention. APPL phase varied, with the largest number of children scoring at the Word Combinations phase. Including skills using AAC affected overall language phase for one child (1% of sample) and affected scores on at least one specific language domain for four children (4% of sample). Similar to findings with younger samples, a relatively small proportion of children had skills that were evenly developed across language domains: 24% in this study; 10% in Ellawadi & Weismer (2015); and 6–14% in Kover et al. (2014). Relative strengths in phonology and relative weaknesses in pragmatics were common (41% and 43% of our sample, respectively) but not universal, replicating findings by Ellawadi and Weismer (2015). These findings highlight the importance of evaluating distinct domains of language when assessing children with ASD. Assessments that focus only on one or two areas may fail to capture important strengths and challenges that affect communication. The observed variability in language profiles also lends support for the approach to characterizing overall language phase suggested by Tager-Flusberg et al. (2009). Defining overall language phase as the highest phase met across all domains may reduce the possibility that clinicians and researchers overestimate language level by failing to consider communication challenges influenced by specific language domains.
Using the APPL, we explored changes in children’s expressive language phase during participation in a community-based NDBI program. Pivotal Response Treatment was the main form of intervention for most children. PECS and other AAC systems were also taught. Thirty-seven percent of children moved to a new phase of overall language during intervention. The NS EIBI program was associated with more gains in phonology, vocabulary, and grammar (approximately half of the sample gained a phase) than pragmatics (approximately one-third of the sample gained a phase). Including AAC skills in ratings had an impact on characterization of change. Eleven of the 46 children (24%) used an AAC system during intervention, and many made gains in specific language domains despite remaining non-vocal. Twenty-five percent of Pre-verbal children moved to the First Words phase using either spoken language or AAC, versus 8% based on spoken language alone. Inclusion of skills using AAC allows for measurement of important treatment gains in key communication skills that can improve children’s and families’ quality of life. These findings add to results from our larger study demonstrating that children participating in the NS EIBI program experienced significant gains in adaptive functioning and significant reductions in challenging behavior during participation in the intervention (see Smith et al., 2019).
To our knowledge, this is the first study exploring changes during intervention using the language benchmarks framework in a sample of children with ASD that spans initial language levels. This study suggests that the NS EIBI model is especially likely to improve the overall language level of children in this four- to five-year age range who begin intervention at the Word Combinations phase (56% improved), although gains were observed for children starting at all levels.
Paul et al. (2013) compared the impact of a discrete trial training intervention versus a milieu communication training intervention on spoken language in preverbal preschoolers. A greater proportion of children in that study moved to the First Words phase (50%) than in this one (25%, with some meeting criteria using an AAC system). This may have been influenced by sampling differences (e.g. Paul et al. included pre-verbal children with a nonverbal mental age of at least 12 months and children in whom a generalized motor imitation repertoire could be established). It may also have been influenced by differences in application of the benchmarks framework. In Paul et al. (2013), commenting skills were not required to meet the First Words phase (any two pragmatic functions were accepted). In our study, whereas approximately half of the children at the Preverbal phase for Phonology and Vocabulary moved to the First Words phase in these areas (60% for phonology; 50% for vocabulary (some met vocabulary criteria using AAC)), only 25% met criteria for pragmatics, with 75% failing to meet the commenting requirement.
It is important to note that this study included only four- and five- year olds. Language profiles and changes during intervention are likely to differ for younger preschoolers. A further limitation of this study is that it did not include a control or comparison group. A number of children who were already four or five years old when treatment began made gains during 10 months of intervention, moving to a new phase of overall language. However, without a control group, it is not possible to know how many would have made gains without treatment. It is also important to acknowledge that the same clinicians often completed pre- and post- treatment assessments, and that knowledge of pre-treatment scores could have influenced follow-up ratings. Future studies could explore the impact of interventions on language phase using control or comparison groups, as well as blinded outcome ratings.
It is important to acknowledge that differences in measurement approach across APPL administrators may affect characterization of language phase. In previous studies applying the language benchmarks framework, research teams have characterized language phase using a consistent set of measures across all children. In contrast, in this study, S-LPs working in multiple regions within a community-based intervention program determined language phase using a flexible tool that allowed evidence from a range of possible sources to be sufficient in demonstrating that a child had attained a specific language phase for a certain domain (all assessments involved a natural language sample; and 59% incorporated information from at least one standardized direct assessment measure or parent rating form). This increased flexibility will reduce reliability compared to a more consistent testing approach (e.g. an approach in which all children complete a natural language sample during a specific task; and an approach in which grammar and vocabulary is always assessed using a specific standardized tool). For example, Kover et al. (2014) found that the context of the natural language sample affected language benchmark characterization. In the current study, children had varying levels of familiarity with the communication partner during the natural language sample, which could also influence outcomes. Similarly, although Condouris et al. (2003) found that scores from language samples and direct assessment measures were significantly correlated, and noted that measures from natural language samples may be a good proxy for standardized assessment of related constructs, different assessment sources (e.g. direct assessment measures versus natural language samples) may categorize children within different language domains. In the subset of children in this study for whom both a natural language sample and standardized measures were administered, characterization based on all available sources of information was similar to characterization based only on natural language samples, with excellent agreement across domains. There were instances in which administration of a direct assessment measure led to an increase by one phase in the specific language domain assessed. As the APPL defines a child’s phase as the highest level attained using at least one assessment source, the addition of direct assessment measures could not decrease phase below that attained with the natural language sample. Consistency of categorization may be lower if a study using the APPL included some children assessed using only natural language samples and other children assessed using only standardized measures. The APPL includes criteria aimed at minimizing variability in characterization across clinicians, such as specific criteria regarding language sample administration and criteria outlining appropriate rating forms and standardized measures to assess specific language domains. Importantly, characterization of overall language phase requires that a specific level of language be met across all four language domains, which minimizes the impact of differences in categorization in any specific domain.
We believe that important advantages are associated with the APPL’s flexible assessment approach. First, assessors can use clinical judgment to select measures that are the best fit for individual children. As clinicians in this study could determine an appropriate natural language sample context for an individual child, and decide whether to supplement this with information from standardized measures, they had the opportunity to maximize the quality of language phase characterization, increasing the validity of individual APPL scores. Specific assessment approaches may be better suited to certain children, especially when samples contain children at diverse ages and language levels. For example, a direct assessment measure may not sufficiently capture language skills in a child who has attentional or behavioral challenges that limit engagement in standardized testing. Conversely, a natural language sample may not sufficiently capture language skills in a child who is anxious or reserved during unstructured play. A second advantage to the APPL’s flexible measurement approach is that it may maximize the representativeness of study samples. For example, a child who could not be assessed using a specific assessment approach (e.g. could not obtain a basal level on a standardized measure), but could be assessed using another (e.g. a language sample) would not be excluded from analyses. Third, the APPL allows for research and program evaluation studies such as this that amalgamate information from diverse clinical or research sites that used varied measures to explore similar constructs. It provides a framework to ensure that scores from these measures are used to assess relevant language domains in a consistent way (e.g. the use of language to comment and for at least one other function to meet Phase 2 for pragmatics; the attainment of a consistent age equivalent score on standardized measures), and that all domains of language are considered at all phases when defining overall language level.
This study presents the initial use of the APPL in a community-based context. Thirteen different S-LPs in a province-wide intervention program completed the APPL following a brief orientation to the measure. A limitation of this study is that the APPL guidelines were often, but not consistently, applied. Seventeen percent of natural language samples were shorter than 20 min, and 3% of those included in a quality scan did not involve the recommended range of appropriate communicative contexts. A more detailed orientation to the measure would likely have improved administration quality. In this study, all administrators were S-LPs who had received clinical training in the administration and scoring of natural language samples. Additional training in the use of the APPL may be beneficial in future studies, especially if coders have less experience administering and scoring natural language samples. In this study, natural language samples were consistently administered, and any of a wide range of standardized measures was administered. Future APPL users could increase reliability of categorization beyond the acceptable level reported in this study by adding context-specific assessment requirements beyond those specified by the APPL. For example, users could require that the natural language sample is administered with a specific communication partner, or that specified standardized measures are always administered to measure vocabulary and grammar skills.
To conclude, study authors developed a rating form that promotes consistent application of the language benchmark framework. Consistent characterization of language level is central to understanding language profiles and the impact of specific interventions. We found that four- and five-year-old children with ASD typically have language skills that vary across phonology, vocabulary, grammar, and phonology domains. We also found that participation in a community-based NDBI program was associated with gains in language phase, with gains varying based on initial language level, and varying across language domains.
Supplemental Material
Supplemental material for The Assessment of Phase of Preschool Language: Applying the language benchmarks framework to characterize language profiles and change in four- to five-year-olds with autism spectrum disorder
Supplemental Material for The Assessment of Phase of Preschool Language: Applying the language benchmarks framework to characterize language profiles and change in four- to five-year-olds with autism spectrum disorder by Helen E Flanagan, Isabel M Smith Autism Research Centre, IWK Health Centre, Halifax, NS, Canada; Departments of Pediatrics and Psychology & Neuroscience, Dalhousie University, Halifax, NS, Canada Fiona Davidson in Autism & Developmental Language Impairments
Footnotes
Acknowledgements
Declaration of Conflicting Interests
Funding
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
