Abstract
Introduction
The American Academy of Pediatrics (AAP) recommends that all children receive autism-specific screening at 18 and 24 months of age, in addition to broad developmental screening at 9, 18, and 24 months.1,2 Since this recommendation, there has been an increase in the routine use of developmental screening in primary care.3,4 Challenges to the implementation of developmental screening in primary care have been reported.3,4 Systemic challenges include time constraints and inadequate reimbursement.5,6 Also, appropriate training regarding the importance of developmental screening influences the willingness of providers to implement routine screening. 7 Although standard developmental screening tools increase detection of concerns, many primary care providers continue to rely on surveillance alone, trusting their clinical acumen and parental concerns. 7 Therefore, the implementation of developmental screening in primary care practice requires a systematic and comprehensive approach that addresses barriers to implementation. 8 Appropriate training regarding the importance of developmental screening influences the willingness of providers to implement routine screening. 7
The use of developmental screeners in non-English primary language cultures has further limitations due to linguistic and cultural differences.9,10 Adoption of developmental screening in practice requires awareness of culture and setting, in addition to the appropriate translation of screening measures.11,12
The Modified Checklist for Autism in Toddlers (M-CHAT) is one of the most widely used tools for early detection of autism. 5 While the M-CHAT has been translated into multiple languages including Arabic, it is not commonly used in Arab countries. One study involving 9 Arab countries concluded that M-CHAT’s sensitivity, specificity, and positive predictive value were similar to those reported in the initial M-CHAT validation study. 13 A study conducted in Egypt using the Arabic M-CHAT noted concern among pediatricians that the high number of false positives would have a negative impact on resource-limited health care systems. 14 Furthermore, the feasibility of its implementation in primary care settings in Arabic-speaking countries has yet to be described.
As part of a quality improvement (QI) initiative, the Pediatrics Department at John Hopkins Aramco Healthcare (JHAH), Saudi Arabia, evaluated the feasibility and effectiveness of implementing the M-CHAT as a routine screening tool for autism in a low-risk pediatric population for the year of 2014. Feasibility was reflected by the proportion of visits with M-CHAT completed, and reports of workflow challenges and provider satisfaction. The effectiveness of screening as applied in the primary care setting by staff in that setting was evaluated based on the number of referrals for autism evaluation and rates of autism identification. 15 We describe in this article the QI implementation process and report on the feasibility and effectiveness of implementing M-CHAT screening in a large primary care setting in Saudi Arabia.
Methods
Setting
The Primary Care Pediatric Clinic at JHAH serves the employees of Saudi ARAMCO (Saudi Arabian American Oil Company) and their dependents, who are from multiple provinces of Saudi Arabia. In 2014, JHAH served approximately 30 000 children between 0 and 14 years.
Participants
The M-CHAT was distributed to the caregivers of all toddlers between 16 and 32 months attending JHAH’s pediatric clinic between January and December 2014 regardless of visit type (ie, well-visit, sick-visit, or nurse-only immunization visit). If a child had multiple visits during this period, only the results from the first M-CHAT screening were included in analyses, even if the screen was re-administered on subsequent visits.
Project Implementation
We identified the following key steps for project implementation: (1) assessing office protocol and workflow; (2) identifying a physician champion; (3) addressing key drivers to project success; (4) conducting staff orientation and training; and (5) sharing process and outcome data at regular intervals with the staff. 4
Through assessment of the office workflow, we were able to formulate the most suitable process to efficiently distribute M-CHAT screens without significant impact on the workflow. A physician champion led the project and was responsible for training, data collection, and analyses. Key drivers to project implementation were defined, and physician and nursing staff were engaged in training and regular sharing of process and outcome data. Monitoring of the impact of changes in M-CHAT completion rate, the accuracy of scoring, and referral procedures were conducted monthly through charts review.
Addressing Key Drivers
We developed a key driver diagram (Figure 1), highlighting the primary drivers that contribute to the achievement of the project aim: (1) physician engagement, (2) nursing engagement, (3) workflow challenges, and (4) parent/family engagement. We theorized on potential secondary drivers that were used to test change ideas.

Key driver diagram.
Project Timeline
Approval was obtained from the Chiefs of the Pediatric and Nursing Departments prior to project implementation. Nurses implemented a simultaneous QI project involving proper documentation of M-CHAT distribution. Institutional review board approval was not required for purposes of project implementation and reporting; authorization to review medical records of patients was obtained.
Educational training was conducted in December 2013 with physicians and nursing staff on the latest AAP developmental screening recommendations and the diagnostic criteria for autism spectrum disorder. Also, physicians attended educational workshops on the scoring of M-CHAT and referral procedures, as well as how to discuss the results with families.15,16
Following a review of the first quarter data, we noted the frequency of incomplete/missing M-CHATs and erroneous scoring. A change cycle was implemented that included providing more 1:1 guidance on the scoring of M-CHAT with physician providers. During the second quarter, changes to the formatting of the M-CHAT were made following feedback from nurses and families to improve readability. Following the second-quarter review, it was noted that a change in formatting, such that the screening questions were on both front and back of the page, led to an increased number of incomplete M-CHAT screeners. Another PDSA cycle was implemented adding an instruction to “turn the page.” We also created a dedicated referral link on the electronic health system for positive M-CHAT scores. In July 2014, the Arabic M-CHAT-R (Modified Checklist for Autism in Toddlers–Revised) replaced the M-CHAT, scoring was adjusted accordingly, and providers were again given 1:1 guidance on scoring. For purposes of reporting, we continue to use M-CHAT as the collective term to describe both the original M-CHAT and the revised version.
M-CHAT Scoring and Referral Processes
For purposes of our project, M-CHAT scores were classified as low risk (ie, negative score), medium risk, or high risk, as described in the M-CHAT standard instructions. Table 1 notes the action recommended for each score/category on the M-CHAT and M-CHAT-R.
M-CHAT and M-CHAT/R Score, Risk Categories, and Associated Actions.
Abbreviations: M-CHAT, Modified Checklist for Autism in Toddlers; M-CHAT/R, M-CHAT–Revised
Children with a medium risk score were referred to the Neurodevelopmental (ND) Clinic, run by a consultant pediatrician who reviewed and verified M-CHAT responses using the follow-up interview, a method shown to increase the positive predictive value of M-CHAT screening. 17 This was particularly relevant because some of the M-CHAT questions were hard to interpret in Arab culture; some questions created confusion either because the question was not a familiar activity (eg, Q1: Does your child enjoy being swung, bounced on your knee, etc?) or the meaning was not understood (eg, Q18: Does your child make unusual finger movements near his/her face?) Similar difficulties understanding certain M-CHAT questions were also noted in other cultures. 16
The pediatrician in the ND clinic conducted a full history and developmental observation. Children who did not meet criteria for autism were reassured. When needed, referral to an autism-specific treatment program was made, and information/resources were provided to the parents. Few were further referred to the Autism Multidisciplinary Team Diagnostic Clinic (AMTDC) for a definite diagnosis. Thus, the ND clinic served as an intermediate level of evaluation.
Children whose M-CHAT scores were in the high-risk range were referred directly to AMTDC. The AMTDC consisted of a multidisciplinary team including a consultant pediatrician, child psychiatrist, clinical psychologist, speech therapist, occupational therapist, and a social worker. Comprehensive diagnostic evaluations were in keeping with recommendations for autism diagnosis, including (1) detailed health, developmental and behavioral history, eliciting core diagnostic symptoms of autism (
Measures
All charts were reviewed monthly and reported quarterly. The following items were assessed on the chart review:
Completion. M-CHAT fully completed and, in the chart.
Scoring accuracy. Including correction of inaccurate scoring.
Referrals. Documentation of action taken based on score (eg, reassurance, referral to ND clinic, AMTDC clinic, or other referrals).
Diagnoses. Review of charts for patients seen in ND Clinic and/or AMTDC clinic to determine any final diagnoses.
Patient demographic information.
In addition, following project completion, we continued to review the referral and diagnostic rates to AMTDC using data from the clinic itself.
Primary Outcome Measures
Feasibility measurement was based on the percentage of visits with M-CHAT completed. The impact of systematic screening on the number of diagnostic referrals and autism identification rates were estimated by chart review.
Secondary Outcome Measures
To explore the engagement and satisfaction of the providers, a survey was e-mailed to providers in April 2016. The survey consists of 10 questions that targeted providers’ engagement, and their perceptions of barriers, benefits, and potential improvements.
Results
The Effectiveness of M-CHAT at Identifying Children At Risk for Autism
Of the 1207 M-CHAT screens distributed, 129 (11%) were missing or incomplete, and 1078 were completed (Figure 2). Of those completed, the majority of toddlers (88%) scored negative (low risk); these parents were reassured and given regular follow-up appointments. In our study, of those who were screened, a total of 12% were positive (two thirds medium risk and one third high risk).

Flow chart of M-CHAT screening results.
Toddlers with positive M-CHAT in the medium- and high-risk range were evaluated in the ND clinic and/or AMTDC (Figure 3). Of the initial 127 positive M-CHAT scores, 40 (31%) were lost to follow-up. Of those evaluated (n = 87), 41 toddlers (47%) were judged to be developing typically. Twelve were diagnosed with autism (14% of all positive M-CHAT, and 1% of all children screened) at a mean age of 24 months. In addition to autism, positive M-CHAT detected other developmental disorders in 34 toddlers (39% of those evaluated): speech delay, social anxiety, and subthreshold autism symptoms not meeting diagnostic criteria. Of the toddlers who had a speech delay (without autism), 85% were males. On the other hand, of toddlers with social anxiety, 80% were females.

Referral to AMTDC and number of cases diagnosed with autism.
The Effectiveness of M-CHAT Screening in Increasing Autism Diagnostic Referrals
There was a sizable increase in the number of referrals to AMTDC in the year of project implementation and the year after (2014 and 2015). In 2013, there were 21 referred patients seen at AMTDC, while in 2015 there were 43 patients, with the largest increase noted in the year immediately following project implementation. While there was a slight decline in the referrals seen in 2016, the referral numbers remained higher than in the period before M-CHAT implementation.
Feasibility of M-CHAT in Practice
Study findings indicated that 89% of eligible toddlers received M-CHAT screening within the study year, suggesting that overall M-CHAT implementation in primary care practice in Saudi Arabia is feasible. Also, there was a sizeable decrease in the number of missing/incomplete screeners from the first half of the year to the second half.
Figure 4 represents the number of missing/incomplete screeners and inaccurate scoring (errors) over time. It is noteworthy that the implementation of individual guidance on scoring after the first quarter was associated with a reduced number of errors. There was a spike in the number of incomplete and erroneously scored screeners following formatting changes during the second quarter, and again after the M-CHAT-R was initiated in the second half of the implementation. Improvements made to the formatting and further guidance on scoring resulted in the decline in the number of incomplete/missing screeners and inaccurate scoring over time.

Effect of interventions on the rate of missing/incomplete charts, and errors in scorning.
Feasibility of M-CHAT: Provider Engagement and Satisfaction
All pediatric physicians participated in the M-CHAT implementation, but only 4 out of 13 physicians (30%) and 6 out of 8 nurses (75%) completed the survey. All providers reported satisfaction with the process of M-CHAT implementation; they felt involved in the implementation, decision-making, and modifications processes. The majority felt that their concerns were met. Forty percent found it challenging to their workflow, suggesting time associated barriers, even though they were overall satisfied with the project implementation. Eighty percent agreed that M-CHAT improved patient’s care and increased their knowledge about autism. Qualitative responses indicated the following future suggestions:
Send M-CHAT to parents before the clinic visit to ease the clinic flow and allow for more accurate information.
Use the electronic version of M-CHAT which is directly incorporated into electronic medical record.
Change 18-month-old nurse-only immunization visit into a physician visit to complete M-CHAT screening, as specifically recommended by the AAP.
Increase parental awareness and acceptance of screening by providing information regarding the importance of screening and early detection and use email blasts that describe screening process.
Discussion
To our knowledge, this is the first study to evaluate the feasibility and effectiveness of implementing routine screening of toddlers using the Arabic M-CHAT in a primary care setting in Saudi Arabia. In our study, of those children screened, a total of 12% were noted to have concerns on M-CHAT screening (two thirds medium risk and one third high risk). As has been reported based on the use of the M-CHAT in the United States, positive M-CHAT in our study also detected other delays (eg, speech delay) and psychological difficulties (eg, social anxiety). 19 This highlights that children with false-positive screening results are an at-risk group for which diagnostic evaluation draws attention to their need for services. 19
Referral rates for diagnostic assessments increased during the implementation period suggesting the effectiveness of routine M-CHAT screening on identifying those children at risk for autism and other developmental concerns. AMTDC was established as part of JHAH in 2012. Following its establishment, awareness of neurodevelopmental disorders among providers at JHAH increased resulting in an initial spike in 2013 referral rates. The AMTDC clinic received referrals from the primary care clinic, as well as several other subspecialty clinics within JHAH. The increase in referrals during and following M-CHAT screening implementation was above what was expected as a direct result of screening, suggesting that this trend may have partly been contributed by generally increasing awareness and knowledge of neurodevelopmental disorders among pediatric health care providers.
The initial increase in referrals to AMTDC was maintained (with only a slight decline) in the years following initial project implementation. The decline in referral rate might be explained by changes in when M-CHAT was distributed. During project implementation, M-CHAT was distributed to all toddlers (16-32 months) regardless of visit type; however, in subsequent years, the M-CHAT was given only at 18 and 24 months, in keeping with the AAP recommendation for autism-specific screening. This suggests that regular developmental surveillance is needed in addition to the use of formal screening instrument.
Within our sample, 32% of children who screened positive on M-CHAT were later felt to be typically developing following evaluation. Two thirds of those screening positive on M-CHAT (medium risk) were evaluated in ND clinic, which served as an intermediate step by reviewing the questions with parents based on the M-CHAT follow-up interview and by addressing the cultural interpretation of questions. Using a screening tool needs it to be adapted to the cultural norms and familiarities of the studied population.12,20 The majority of the children with a positive M-CHAT were therein determined not to have specific developmental concerns. Other studies have indicated the importance of the M-CHAT follow-up interview in increasing the positive predictive value of concerning results. 21 The M-CHAT follow-up interview can be done on the same visit, or a close follow-up visit. Using this 2-step screen can reduce the number of false positive and the referral rate to the specialized clinic.22,23 Pediatricians might be trained on using 2-step screening procedures to assure appropriate referrals for comprehensive evaluations. 24
Study results also demonstrated that implementation of M-CHAT in primary care practices in an Arabic-speaking country is feasible, with a very high rate of properly completed M-CHAT screeners and a high level of provider satisfaction. We found that incorporation of QI strategies helped ensure the feasibility and fidelity of M-CHAT implementation. The number of missing, incomplete, and erroneously scored screeners diminished over time. Providers’ knowledge of correct scoring rules, the format of the survey, and parental awareness of the purpose of screening contributed to successful implementation.
In the first half of the QI cycle, regular individual scoring guidance with providers improved rates of correct scoring as well as decreasing the number of missing/incomplete charts. This supports studies that suggest that providers’ lack of comfort with screening instruments is a barrier to successful screening in primary care practices. 7 Through the QI process, nursing staff noted that lack of parental awareness of the purpose of screening influenced survey completion. Therefore, an essential change strategy included increasing family/parental engagement by providing information regarding the importance of screening.
In the second half of the QI cycle, with the use of M-CHAT-R, we noticed less error in scoring, more completed forms, and less parental confusion answering the screen. Also, the scoring is easier in the M-CHAT-R, with fewer questions, which were more clearly stated and included added examples. 17 Even minor details such as survey formatting affected the completion rate of the questionnaire.
It has been noted before that after the success of a QI project, it is important to implement strategies to sustain the change. 25 Four areas have been identified that lead to sustainability: awareness of the practice, agreement with it, adoption of it, and adherence. 26 Our QI process concluded by the routine policy of screening using the M-CHAT at the 18- and 24-months visits. To sustain the use of the M-CHAT for autism screening, systematic changes that directly incorporate the procedure into the workflow and staff engagement continue to be vital to its ongoing success. 27
Limitations
We were not able to follow-up on the toddlers who scored in the low risk or those apparently “typically developing.” As such, calculation of positive predictive value, sensitivity, and specificity are not within the scope of this study. Some of the toddlers who scored positive on the M-CHAT were lost to follow-up. These toddlers may be at particular risk, and cultural barriers may impede recognition of developmental concern or willingness to pursue further evaluation.
Implications and Conclusions
Implementation of autism screening using the Arabic M-CHAT is effective and feasible among a low-risk toddler population in Saudi Arabia. While it was not within the scope of the study to evaluate the psychometric properties of M-CHAT in Arabic population, study findings highlight the need to support the cultural interpretation of screening instruments and to have in place an adequate referral and evaluation system. In addition, sustaining implementation within a primary care setting requires engagement of the entire clinic staff, initial and ongoing training, as well as the incorporation of ongoing change cycles with monitoring of effect.
