Abstract
Understanding and Assessing Youth Strengths in a Criminal Justice Context
Assessment is a core component of effective practice with youth involved in the criminal justice system. Sound assessment provides information regarding risk to reoffend and guides case management, including the intensity and nature of intervention intended to prevent reoffending. In many jurisdictions in North America and internationally (Cullen & Gilbert, 2012), correctional policies and practices—including assessment—are guided by the Risk–Need–Responsivity (RNR) framework, a theoretically informed, evidence-based rehabilitation model (Bonta & Andrews, 2017). Until recently, assessment in criminal justice contexts has focused predominantly on risk factors that are strongly and directly associated with recidivism (Gagnier, 2013; Singh, Desmarais, Hurducas, et al., 2014). For youth, these include antisocial or pro-criminal attitudes, antisocial personality features, associating with antisocial peers, substance abuse, family dysfunction, challenges in education/employment, and inappropriate use of leisure time (Bonta & Andrews, 2017; Gendreau et al., 1996). However, there is growing interest in the assessment of strengths in addition to risk. The inclusion of strengths is posited to improve the accuracy of assessments, enrich intervention planning, and enhance youth engagement in intervention (de Ruiter & Nicholls, 2011; de Vries Robbé & Willis, 2017; Singh, Desmarais, Sellers, et al., 2014). Furthermore, the RNR framework has explicitly highlighted the importance of strengths or protective factors for more than two decades. For example, both versions of the YLS/CMI manual (risk/needs tool based on RNR framework, Hoge & Andrews, 2002, 2011) articulate the importance of strengths as responsivity factors, with the more recent version including direct scoring of strengths within each area of criminogenic need. Furthermore, in more recent discussions of incorporating strengths within the RNR framework, it is noted that strengths should be assessed not only to enhance treatment effects but also to enhance prediction (Bonta & Andrews, 2017).
While strengths can be understood as personal and/or environmental characteristics that are associated with positive outcomes, there is conceptual diversity and debate regarding what constitutes a strength. Some scholars conceptualize strengths as the opposite pole of a risk factor (e.g., not having antisocial peers; McAra & McVie, 2016; Stouthamer-Loeber et al., 2002). Others contend that strengths must provide unique information not captured in the assessment of risk factors (Borum et al., 2006; de Vogel et al., 2009). Empirically, risk and strength measures have been found to be moderately to highly negatively correlated (de Vries Robbé et al., 2011; Kleeven et al., 2022; Viljoen et al., 2020), which calls into question their conceptual distinctiveness from risk. In addition, the evidence regarding the unique contribution of strengths in assessments (e.g., of likelihood of reoffending or future violence) is mixed (Wanamaker et al., 2018).
Scholars have suggested that the brief nature of the strength measures that are part of widely used risk assessment tools may account for their lack of incremental prediction when included alongside risk measures (e.g., Viljoen et al., 2020), which signals that
The SAPROF-YV: A Dedicated Measure of Youth Strengths in the Context of Violence and Offending
The Structured Assessment of Protective Factors for Violence Risk—Youth Version (SAPROF-YV; de Vries Robbé et al., 2015), an adaptation of the adult version of this measure, is intended for the sole purpose of assessing strengths and has been proposed as a more robust and comprehensive tool compared with risk-focused instruments that include only a handful of strength items. The SAPROF-YV has 16 items grouped into four strength domains: resilience (social competence, coping, self-control, and perseverance); motivational (future orientation, motivation for treatment, attitudes toward agreements and conditions, and medication); relational (school/work, leisure activities, parents/guardians, peers, and other supportive relationships); and external (pedagogical climate, professional care, and court order). The tool is intended to be used in conjunction with a risk-focused measure, such as the SAVRY or YLS/CMI, to help determine risk of violence or offending and to inform intervention plans.
In terms of psychometric properties, strong inter-rater reliability has been reported in several studies (Chu et al., 2020; de Vries Robbé et al., 2015; Kleeven et al., 2022; Li et al., 2019; Lovatt et al., 2022). Correlations between the SAPROF-YV and other strength measures provide evidence for convergent validity. Moderate to strong positive correlations have been reported in a number of studies, including with the SAVRY protective scale (e.g.,
The literature on strength measures in general—and the SAPROF-YV in particular—has grown over the past several years, with a number of recent studies focusing specifically on predictive validity and examining different aspects of how strengths function, including whether strengths have an independent, direct effect on outcome (i.e., serve as “promotive” factors; Farrington et al., 2016), whether strengths add incremental predictive validity over risk, and whether strengths moderate, or buffer, the effect of risk on outcome (i.e., serve as “protective” factors; Farrington et al., 2016). Outcomes in these studies include binary measures of any (overall) recidivism, violent recidivism, and nonviolent recidivism, with some studies focusing on the presence of reoffense and others focusing on the absence of reoffense.
Evidence that strengths function as promotive factors is available from a number of recent studies in which the predictive validity of the SAPROF-YV was assessed using receiver operating characteristic (ROC) analysis. Area under the curves (AUCs) reported in these studies vary in size (Rice & Harris, 2005), including large effects (AUC = .80 for any 2-year recidivism in a sample of community-sentenced Singaporean youth; Chu et al., 2020) and medium effects (e.g., AUCs = .65-.70 for any, violent, and nonviolent recidivism at follow-up periods of 6, 12, and 24 months after discharge from a Dutch juvenile justice institution; Kleeven et al., 2022, and AUCs = .65-.68 for any, violent, and nonviolent 3-year recidivism in a Canadian youth sample serving community or custodial sentences; Lovatt et al., 2022). Similarly, de Ruigh et al. (2021) found moderate predictive validity for absence of violent recidivism (AUC = .64) and strong predictive validity for absence of any recidivism (AUC = .72) at 1-year follow-up in a Dutch youth sample. In contrast, Koh et al. (2022) found that the SAPROF-YV was a small predictor of any, violent, and nonviolent recidivism at both 1- and 3-year follow-ups (AUCs = .58-.62) in a sample of Singaporean youth serving custodial or community sentences. Similarly, in a sample of Canadian youth, Goodwin et al. (2022) reported that the SAPROF-YV evidenced a small association with any (AUC = .58) and violent recidivism (AUC = .56) using a 3-year follow-up.
Evidence for incremental predictive validity is mixed. Kleeven et al. (2022) reported that that the SAPROF-YV total score added predictive power over the SAVRY risk factors for any, violent, and nonviolent recidivism at 6-month follow-up in their sample of Dutch youth with a history of violent behavior. In contrast, de Ruigh et al. (2021) found that the SAPROF-YV total score added predictive validity to SAVRY risk factors for any reoffending but not for violent reoffending at 1-year follow-up. Koh et al. (2022) found that the SAPROF-YV did not add to the ability of the SAVRY or the VRS-YV to predict any, violent, or nonviolent reoffending in a sample of Singaporean youth sentenced to probation or juvenile detention. Similarly, in a Canadian sample of youth, Lovatt et al. (2022) found that the SAPROF-YV did not contribute incremental predictive ability over the SAVRY or the VRS-YV to subsequent convictions for any, violent, and nonviolent offenses. Turning to another widely used risk assessment tool for youth recidivism, Chu et al. (2020) reported that the SAPROF-YV significantly improved predictive accuracy for any recidivism over and above the YLS/CMI risk score at 2-year follow-up. In contrast, Goodwin et al. (2022) reported that the SAPROF-YV did not significantly improve predictive accuracy for any or violent recidivism over and above the YLS/CMI at a 3-year follow-up (in males or females).
In addressing whether strengths moderate the relationship between risk and recidivism, Chu et al. (2020) found the relationship between the SAPROF-YV total and any recidivism did not vary across YLS/CMI risk scores. Several studies have examined this question with other strength measures; while some have found evidence for a protective effect (de Vries Robbé et al., 2013; Jones et al., 2016; Wanamaker et al., 2018), others have not (Dolan & Rennie, 2008; Goodwin et al., 2022; Schmidt et al., 2011; Viljoen et al., 2020).
The Importance of Examining Validity Across Groups
In the criminal justice context, validity of assessment tools across subgroups, including gender and race/ethnicity, has become an increasingly pressing issue in policy and research (Kitzmiller et al., 2022; Lovatt et al., 2022). The consequences associated with assessment can be extremely serious, including inappropriate labeling, failure to accurately identify and provide service for relevant criminogenic needs (Kitzmiller et al., 2022), and disproportionately negative outcomes for racialized groups that lead to continued overrepresentation in corrections systems (Lovatt et al., 2022). As a result, there has been increased focus on examining the validity of risk assessment tools in females as well as males, and across ethno-racial groups. While the psychometric properties of the SAPROF-YV have been examined in youth samples from several different countries (primarily Canada, the Netherlands, and Singapore), the samples were all or mostly male and in all but two, subgroups were not compared. Lovatt et al. (2022) compared the predictive validity of the SAPROF-YV (in addition to the SAVRY and VRS-YV) for males and females, and for Indigenous and nonIndigenous Canadian justice system-impacted youth referred by the court or their probation officer to a mental health facility for specialized assessment and intervention. AUCs for any 3-year recidivism were .72 for males and .64 for females, .65 for Indigenous youth, and .75 for nonIndigenous youth, suggesting that the SAPROF-YV better predicts nonrecidivism for White and male youth. Similarly, in a sample of Canadian justice-impacted youth Goodwin et al. (2022) reported that the SAPROF-YV predicted any recidivism (AUC = .68) and violent recidivism (AUC = .60) for boys but not for girls (AUCs = .53-.54); a 3-year fixed follow-up period was used.
The Present Study
In the current study, we retrospectively reviewed existing forensic clinical data to provide a validation of the SAPROF-YV in an ethnically diverse Canadian sample of justice-impacted youth on community supervision, comparing males and females, and Black and White youth (the two ethno-racial groups with sufficient sample size for group comparisons). First, we examined the SAPROF-YV’s inter-rater reliability, internal consistency, and convergent validity with the SAVRY and YLS/CMI. Then, we tested its structural validity in the sample as a whole; to our knowledge this is the first study to empirically examine the proposed four-domain structure of the measure. Notwithstanding that the SAPROF-YV is a Structured Professional Judgment (SPJ)—as opposed to an actuarial—tool, exploring its factor structure can inform its theoretical conceptualization and clinical use. Next, we analyzed its predictive validity for the total sample, and by subgroup (males and females, Black and White youth), including whether strengths measured by the SAPROF-YV operate via a promotive effect (direct relationship to recidivism), provide incremental predictive validity when considered alongside risks, or serve as protective factors (i.e., moderate the relationship between risks and recidivism).
Consistent with the studies reported above, we expected to find strong inter-rater reliability and convergent validity. We did not make any predictions about the factor structure of the SAPROF-YV given the lack of prior research. Evidencing a direct—or promotive—effect, we hypothesized that the SAPROF-YV total score would discriminate between recidivists and nonrecidivists and significantly predict recidivism, consistent with prior research (Chu et al., 2020; de Ruigh et al., 2021; Kleeven et al., 2022; Koh et al., 2022) and that it would predict recidivism better for males than for females (Goodwin et al., 2022; Lovatt et al., 2022). Given the lack of clear theoretical or empirical (Chu et al., 2020; de Ruigh et al., 2021; Kleeven et al., 2022; Koh et al., 2022; Lovatt et al., 2022) justification, we did not hypothesize that the SAPROF-YV would provide incremental validity to a measure of risk (the YLS/CMI) in predicting recidivism. Similarly, given the mixed findings from previous studies, no hypothesis was made as to whether the SAPROF-YV would function as a protective factor—that is, would moderate the relationship between risk and recidivism. We also made no hypotheses in terms of whether the SAPROF-YV would perform better for White or Black youth.
Method
Participants
Participants were 305 justice-impacted youth (58% male) between the ages of 12 and 19 seen for assessment in the forensic service of a mental health agency in a Canadian city between October 2001 and 2017 for court-ordered assessments to assist with sentencing. Data from youth who consented for clinical information to be used in research (84% of clients seen over this time period) were included in the study, which received institutional ethics approval. The sample included all consenting female youth and a random sample of male youth for whom data were available over the same assessment period; there were no exclusion criteria. The sample was ethnically diverse (see Table 1). Approximately, two-thirds of youth were charged with a violent offense, which is consistent with their status as a forensic sample; 27% were charged with a nonviolent offense, and a small minority were charged with a sexual offense. As shown in Table 1, females were more likely than males to be charged with a violent offense, whereas males were more likely to be charged with a sexual offense. The reoffense rate was higher for males than females.
Demographic, Criminal History, Risk, Strength, and Recidivism Characteristics for Males, Females and Total Sample
Not including Item 8 (Medication).
Measures
Strength Measures
SAPROF-YV
The SAPROF-YV (de Vries Robbé et al., 2015) comprised 16 strength items scored as either absent (0), partially present (1), or fully present (2), with higher scores representing more strengths. The manual outlines the type of information that would qualify a strength item as absent, partially present, or fully present. For example, the item “Parents/Guardians” would be coded as
Structured Assessment of Violence Risk in Youth
The SAVRY (Borum et al., 2006) comprised 24 risk and six protective items designed to assess violence risk in 12- to 18-year-old adolescents. Only the protective factors scale was used in this study. Protective items, coded as present or absent, include prosocial involvement, strong social support, strong attachments and bonds, positive attitude toward intervention and authority, strong commitment to school, and resilient personality traits. The protective items were summed to create a protective total score, in line with previous studies (Lodewijks et al., 2010; Viljoen et al., 2020). We did not create an SPJ rating retrospectively.
The SAVRY has established reliability and validity, including strong internal consistency of subscales and total score (Borum et al., 2010), excellent inter-rater reliability for scoring of individual items, robust positive correlations between its risk total score other established measures of risk, such as the YLS/CMI (Borum et al., 2006), negative correlations between its strength total score and the YLS/CMI (Borum et al., 2006), and positive correlations of the strength scale with the SAPROF-YV total (de Vries Robbé et al., 2015). Meta-analyses have shown the SAVRY risk total exhibits moderate to strong predictive validity for any and violent recidivism (Olver et al., 2009; Singh et al., 2011). There is also evidence for good predictive validity of the SAVRY protective scale for any recidivism (Kleeven et al., 2022; Koh et al., 2022; Lovatt et al., 2022), with mixed evidence with respect to violent recidivism (Kleeven et al., 2022; Lovatt et al., 2022).
Risk Measure
The Youth Level of Service/Case Management Inventory (YLS/CMI) (Hoge & Andrews, 2002) is a standardized instrument used to assess the criminogenic needs and risk to reoffend of adolescents 12-18 years of age. It consists of 42 yes/no items divided into eight domains: Offense History, Family Circumstances/Parenting, Education/Employment, Peer Relations, Substance Abuse, Leisure/Recreation, Personality/Behavior, and Attitudes/Orientation. A total risk score is derived from summing all items.
The YLS/CMI was completed by the assessing clinician at the time of the assessment. Inter-rater reliability for the total score is high, with correlations among clinicians ranging from .80 to .98 (average
Recidivism
Information on recidivism was obtained from the Royal Canadian Mounted Police (RCMP) Canadian Police Information Center (CPIC) database. Any recidivism (yes/no) was defined as one or more new convictions that occurred within a 3-year fixed period following the youth’s assessment; violent recidivism included convictions in the same timeframe for offenses against the person (e.g., robbery, assault). Criminal records were also reviewed against the offense history summary in youths’ clinical reports to exclude new offenses that were committed prior to their assessment but that resulted in convictions following their assessment due to the time it takes for cases to make their way through the court system. For 41 cases, no recidivism data were provided. As the result of a provision in Canada’s youth criminal justice legislation, a CPIC record for a youth with a previous conviction is expunged following a specified period during which no new offense is committed. In this case, a records request would be returned with no information for that youth. It was also the case that recidivism data were collected as part of a larger program of research; therefore, requests for recidivism data were not made for all youth in this study.
Procedure
During their forensic assessment, youth—and typically a parent—met with a clinician (psychologist or psychiatrist and a social worker) to complete an in-depth clinical forensic interview (7-10 hours in total); youth and parents also typically completed a variety of questionnaires and standardized tests. Clinicians also interviewed additional collateral sources of information (e.g., probation officers, school staff, and previous service providers). Clinicians produced a comprehensive report, which was typically 25 to 35 pages in length focused on psychosocial history, criminogenic needs, cognitive and academic functioning, reoffense risk, mental health, and strengths. In this study, the first author reviewed the comprehensive reports to gather information on demographics, offense history, charges leading to referral for assessment, and YLS/CMI scores, and to retrospectively code for strengths using the SAPROF-YV and the SAVRY protective factors scale. For both instruments, the first author was trained to score the tools by one of the study’s co-authors, a youth forensic psychologist and head of the clinic at which the assessments were conducted—and who received formal training on a variety of adult and youth risk assessment tools, including the SAPROF and the SAVRY. The forensic psychologist undertook a detailed review of the training manuals for the SAPROF, the SAPROF-YV and the SAVRY with the first author. All coding adhered to the manual instructions and agreement was established through coding a number of practice cases. Inter-rater reliability between the first author and a second, similarly trained, student coder is reported in the “Results” section. All coders were blind to the outcome (reoffending) when coding the SAPROF-YV and SAVRY. Total scores (the sum of all items) rather than SPJ ratings were used for both the SAPROF-YV and the SAVRY. There is evidence that both SPJ ratings and total scores on SPJ measures are equally effective at predicting recidivism in justice system-impacted youth (Dolan & Rennie, 2008; Schmidt et al., 2011) and this approach is consistent with previous research (Lovatt et al., 2022; Shepherd et al., 2016; Viljoen et al., 2020).
Analytic Plan
Descriptive statistics on risk and strength variables were provided for the total sample, males, females, Black youth, and White youth. Descriptive statistics were further disaggregated within the female and male subsamples as a function race (i.e., Black females, White females, Black males, White males). In terms of race/ethnicity, the only subgroups large enough for inferential analyses were Black and White youth. Within this subsample, a multivariate analysis of variance (MANOVA) was run to explore potential gender and race differences in scores on the SAPROF-YV, SAVRY protective total, and YLS/CMI scores. For the total sample (as well as separately for males and females, and for Black and White youth), inter-rater reliability and internal consistency were examined using intra-class coefficients (ICCs) and Cronbach’s alpha, respectively.
The internal structure of the SAPROF-YV was examined with both confirmatory factor analysis (CFA) based on polychoric correlations using the weighted least squares means and variance (WLSMV) estimator and exploratory factor analysis (EFA) with principal axis factoring extraction using Promax for the oblique rotation, using Stata 15 (StataCorp, 2017). Factor solutions were considered potentially appropriate if they had eigenvalues greater than 1.0 (Kaiser, 1958). The deletion of items was considered for factors with loadings less than .40 (Pett et al., 2003), cross-loadings that exceeded .32 on two or more factors (Tabachnick & Fidell, 2007), or communalities below .40 and uniqueness values above .60 (Costello & Osborne, 2005); theoretical conformity was also considered when evaluating models (Sellbom & Tellegen, 2019). For the total sample (as well as separately for males and females, and for Black and White youth), correlations between the SAPROF-YV and the SAVRY protective total and the YLS/CMI total scores were run to examine convergent validity.
To examine predictive validity and to test whether strengths operate via a promotive (direct) effect, for the total sample—and for the subsamples of Black vs. White youth and female vs. male youth—we conducted ROC analyses with recidivism as the outcome. Next, we ran hierarchical logistic regressions to examine the incremental predictive validity of the SAPROF-YV over the YLS/CMI; YLS/CMI scores were entered in the first block, and the SAPROF-YV was added to the second block. Analyses were conducted separately for the total sample, Black youth, White youth, female youth and male youth. To test for a protective (moderation) effect of strengths on the relationship between risk and recidivism, a risk by strength interaction term was added in the second block of the regression for the total sample. Subgroup sample sizes were too small to reliably conduct moderation regressions.
Results
Descriptive Statistics
Table 2 provides the descriptive statistics (
Overall, youth who reoffended had lower SAPROF-YV scores than youth who did not (10.23 vs. 7.18,
Mean SAPROF-YV Total, SAVRY Protective Total, and YLS/CMI Total Scores as a Function of Gender and Race
Reliability
To evaluate inter-rater reliability, a trained research assistant, blind to outcome, independently coded the SAPROF-YV and SAVRY for approximately 20% of cases in the sample. Substantial inter-rater agreement was found for both measures, using ICC, two-way mixed effects model, single measure, absolute agreement. The SAVRY total score (
Structural Validity
To examine the internal structure of the SAPROF-YV, we ran a CFA based on the resilience, motivational, relational, and external subscales. Item 8 (medication) was omitted because it was coded “not applicable” for 72% of the sample. The solution did not converge, and no output was provided, indicating that this structure did not hold in our sample. Other factor solutions were explored using EFA with principal axis factoring as an extraction method and Promax oblique rotation. When unrestricted, the EFA produced a three-factor solution, accounting for 72% of the variance in scores, with three eigenvalues greater than 1 (5.7, 1.5, and 1.3) (see Supplemental Table S1, available in the online version of this article). However, one factor included fewer than three items (the suggested minimum to constitute a factor; Costello & Osborne, 2005), the solution contained four cross loaded items, and the three factors did not make sense theoretically when considering the content of the scale item groupings. We repeated the EFA, forcing a two-factor solution that accounted for 61% of the variance (see Supplemental Table S2, available in the online version of this article). While the two factors made some theoretical sense, there were issues around uniqueness values and cross loadings, and the internal consistency of the second factor was quite poor (α = .36). We ran a third EFA with principal axis factoring extraction, forcing a one-factor solution that accounted for 48% of the variance (see Supplemental Table S3, available in the online version of this article). Three items had factor loadings below .4 (motivation for treatment; professional care; and court order). When the EFA was re-run with these items removed, the factor accounted for 68.5% of the variance (see Supplemental Table S4, available in the online version of this article) and showed good internal consistency (alphas of .84 for the total sample, .83 for males, .81 for females, .80 for Black youth and .79 for White youth). These results suggest that the SAPROF-YV total score held together as a construct but that the subscales did not. Thus, we used the full SAPROF-YV scale (minus the medication item) for all subsequent analyses.
Convergent Validity
Support for convergent validity was demonstrated by correlations in the predicted direction between the SAPROF-YV and the SAVRY protective score (
Predictive Validity
Table 3 shows the AUC values for the SAPROF-YV, SAVRY Protective, and YLS/CMI scores for predicting any recidivism for the total sample, as well as separately for males and females, and for Black and White youth. Across the samples, the SAPROF-YV predicted reoffending significantly with a medium effect size (Rice & Harris, 2005) and with equal strength between Black and White youth. However, the SAPROF-YV illustrated a slight advantage in males vs. females. The SAVRY protective scale significantly predicted reoffense with medium strength for males and White youth, and with a small but significant AUC for the total sample. The SAVRY protective scale was not a significant predictor of reoffending for females or Black youth. Finally, the YLS/CMI predicted any reoffending with large AUCs for males and White youth, and medium AUCs for females and the full sample. Of note, the YLS/CMI predicted recidivism at chance levels for Black youth, and there was a significant difference in AUCs between Black and White youth (see Table 3). Thus, the SAPROF-YV demonstrated medium sized promotive (direct) effects across all subsamples whereas the SAVRY had only medium sized promotive effects for males and White youth; it did not have a promotive effect for females or Black youth.
ROC Analyses: SAPROF-YV, SAVRY Protective Scale, and YLS/CMI Total Scores Predicting Any Recidivism Across Groups
When violent recidivism was the outcome, for the total sample, the SAPROF-YV had a small but significant effect for reoffending (AUC = .58,
To test for incremental predictive validity of the SAPROF-YV alongside YLS/CMI total risk, we ran hierarchical logistic regressions (YLS/CMI entered into Block 1, SAPROV-YV entered into Block 2) for the total sample and the subgroups of male and female youth, and Black and White youth (see Table 4). We also included corresponding AUC values for Block 1 and Block 2 (using the predicted probability values). Across all Block 1 analyses, the YLS/CMI total risk score was a significant individual predictor of any recidivism in all samples except Black youth. Interestingly, across all Block 2 analyses, the only time the SAPROV-YV evidenced incremental predictive validity over the YLS/CMI was for the subsample of Black youth, though the confidence interval did contain 1. It is noteworthy that, in Block 2, neither the SAPROF-YV nor the YLS/CMI significantly predicted reoffense for female youth.
Logistic Regressions: Incremental Associations of Strength (SAPROF-YV) and Risk (YLS/CMI) For Any Recidivism Across Groups
Andrew F. Hayes Process Macro Version 4.1 (SPSS) was used to test whether the total SAPROF-YV score would evidence a protective effect against YLS/CMI scores in the prediction of any reoffending. We could only perform this analysis in the total sample due to sample size restrictions. A significant interaction between YLS/CMI and SAPROF-YV scores emerged (

Interaction Between YLS/CMI Total and SAPROF-YV Total for the Total Sample (
Discussion
To our knowledge, the SAPROF-YV is the only tool created solely to assess strengths among justice system-impacted youth. Results from this study lend support to the reliability and validity of this tool for the assessment of strengths among justice-impacted youth, while also highlighting some areas of departure from the structure as proposed by its authors. Subgroup analyses—in which male and female youth and Black and White youth were examined separately—indicated some areas of similarity across groups as well as distinct findings, which suggest that youth cannot be treated as a homogeneous group.
The SAPROF-YV had high internal consistency. Evidence of strong inter-rater reliability is consistent with findings of recent studies (Chu et al., 2020; de Vries Robbé et al., 2015; Kleeven et al., 2022; Li et al., 2019; Lovatt et al., 2022). Support for the tool’s convergent validity comes from a strong positive correlation between the SAPROF-YV and SAVRY strength total score, which is consistent with the findings of several studies (Christiansen et al., 2021; de Vries Robbé et al., 2015; Kleeven et al., 2022; Koh et al., 2022; Lovatt et al., 2022). The strong negative correlation with the YLS/CMI total risk score is consistent with Chu et al.’s (2020) finding and with the robust negative relationship between the SAPROF-YV and other risk measures, including the SAVRY (Christiansen et al., 2021; de Vries Robbé et al., 2015; Kleeven et al., 2022; Koh et al., 2022; Lovatt et al., 2022) and VRS-YV (Koh et al., 2022; Lovatt et al., 2022). Taken together, the results also add to evidence supporting the conception of strengths falling on a single risk-strength dimension rather than as a construct independent of risk.
This is the first study to examine the structural validity of the SAPROF-YV. CFA and EFA analyses indicated that a one factor solution best fit the data, and suggested the removal of three items—motivation for treatment, professional care, and court order—in addition to medication (which was not applicable for the majority of the sample). Li et al. (2019) also reported low variability in the medication, professional care, and court order items and excluded these items from their analyses. Lack of support for the structural validity of the four subscales is interesting in light of previous evidence of concurrent validity (e.g., Lovatt et al., 2022) and predictive validity for general recidivism (e.g., Kleeven et al., 2022; Lovatt et al., 2022), though psychometric support has been weaker for the External subscale. While the factor structure of an SPJ assessment tool does not speak directly to its validity, further examination of the SAPROF-YV’s structure with larger samples will allow for subgroup analyses and can help evaluate the usefulness of each of the SAPROF-YV items, as not all items may be equally useful in assessing youth’s strengths.
Structure aside, as expected, youth who reoffended had lower mean scores on the SAPROF-YV, both in the total sample and in the subgroups. Moreover, ROC analyses lent support to the predictive validity of the SAPROF-YV in relation to any recidivism. Our findings were consistent with the moderate relationships reported by de Ruigh et al. (2021), Kleeven et al. (2022) and Lovatt et al. (2022). They contrasted somewhat with Chu et al. (2020)’s large AUC and with Goodwin et al. (2022) and Koh et al. (2022), who found that the SAPROF-YV was a more modest predictor of any recidivism. The SAPROF-YV was a weak predictor of violent recidivism, which is consistent with Goodwin et al. (2022) and Koh et al. (2022) but contrasts with de Ruigh et al. (2021), Kleeven et al. (2022), and Lovatt et al. (2022), who found the tool to be a moderate predictor of violent recidivism. That said, consistent with our findings, other studies have found the tool to better predict any recidivism than violent recidivism (de Ruigh et al., 2021; Lovatt et al., 2022).
Researchers have proposed that the addition of strengths will improve the accuracy of risk assessment but the evidence for this assertion is mixed, with some studies finding the SAPROF-YV to contribute incrementally over risk measures, including the SAVRY (de Ruigh et al., 2021; Kleeven et al., 2022) and YLS/CMI (Chu et al., 2020), while others have found no incremental contribution of the SAPROF-YV over various risk measures (Goodwin et al., 2022; Koh et al., 2022; Lovatt et al., 2022). In the current study, in the overall sample—as well as the subsamples of males, females, and White youth—the SAPROF-YV was not a unique predictor of general recidivism alongside the YLS/CMI. However, in Black youth (for whom the YLS/CMI was a poor predictor of recidivism), the SAPROF-YV did contribute significantly to the prediction of reoffending. These results are consistent with the robust negative correlations (in the mid- to high: .6 range) between the SAPROF-YV and YLS/CMI discussed earlier, which suggest sizable overlap in what is being measured by the two instruments. It is noteworthy that the correlation was smaller—though still sizable—in the subgroup of Black youth (−.58). This finding suggests that there was somewhat less overlap in what was being measured between the SAPROF-YV and the YLS/CMI in this group, which is also consistent with the finding that the SAPROF-YV was a significant predictor of recidivism alongside the YLS/CMI for Black youth. These results also speak to the question of how strengths can best be conceptualized. The finding that, in most groups, strengths did not provide additional predictive power with respect to reoffending adds evidence to the argument that strengths and risks are poles of a continuum.
To explore whether strengths can be conceptualized as protective factors, we examined whether strengths would moderate the relationship between risk and reoffending. Our results indicated a protective effect only among lower risk youth; strengths were associated with reduced recidivism in this group, but not in moderate or high risk youth. Indeed, in the high risk group, it appeared that strengths increased the likelihood of recidivism, but this result is unreliable due to the tiny number of high risk/high strength youth in the sample. Shepherd et al. (2016) found a similar result among the lower risk youth in their sample of Australian males in detention: the SAVRY protective factors score was associated with nonreoffending, but the effect did not extend to higher risk youth. In contrast to the current findings, a protective effect was identified by Jones et al. (2016), who found that high levels of strength (on the YASI) improved outcomes for high-risk individuals, but not for low-risk youth.
Aside from the theoretical question of whether strengths can be understood as protective factors, the moderation finding is relevant for treatment planning: lower risk youth might benefit from incorporating strength-focused treatment into interventions that target criminogenic needs, while for higher risk youth, it may be that less weight should be afforded to strength factors until more immediate criminogenic needs have been addressed, or at the very least, strengths should not be the focus of service provision without an equally strong focus on criminogenic needs. Further research is needed to examine whether such differential practice would positively impact treatment success and reoffending outcomes. Unfortunately, the few studies examining moderation effects have produced inconsistent results (e.g., Chu et al., 2020; de Vries Robbé et al., 2013; Dolan & Rennie, 2008; Goodwin et al., 2022; Jones et al., 2016; Schmidt et al., 2011; Viljoen et al., 2020; Wanamaker et al., 2018). Thus, more research is needed before we can begin to understand how strength information can or should be incorporated into risk assessment and service planning.
Our results also highlight the importance of examining how assessment tools function in specific sub-groups of justice system-impacted individuals. Attention to the importance of gender has grown substantially over the last 15 years, and—more recently—researchers have begun to examine ethno-racial groups separately (e.g., Lovatt et al., 2022). When potentially high-stake decisions are made on the basis of these assessments, it is incumbent on clinicians to ensure that the instruments they use are reliable and valid in the population from which their clients come, which requires that researchers establish an adequate base of evidence. With respect to the functioning of the SAPROF-YV in male and female youth, and Black and White youth, there was evidence of strong convergent validity across groups. With respect to predictive validity, we have already touched on the fact that the SAPROF-YV (and the YLS/CMI and SAVRY) functioned differently in Black youth than in White youth. While the SAPROF-YV predicted reoffending with equal strength between Black and White youth, the YLS/CMI predicted recidivism at chance levels for Black youth, a finding that may be limited to this particular sample but that is nonetheless concerning. It was only in Black youth that the SAPROF-YV evidenced incremental predictive validity over the YLS/CMI—presumably because the latter was such a poor predictor of recidivism. In terms of gender differences, the AUCs for males and females were similar with respect to general recidivism, which contrasts with the findings of Goodwin et al. (2022) and Lovatt et al. (2022) that the SAPROF-YV had stronger predictive validity for male youth than female youth. When we examined the incremental predictive validity of the SAPROF-YV for female youth in the model that included both the SAPROF-YV and the YLS/CMI, neither measure significantly predicted reoffending. Together with these other recent studies, our results indicate a continued need to examine the SAPROF-YV (as well as other measures of risk and strength) within subgroups of youth. This need is particularly important for racialized and marginalized groups (e.g., Indigenous youth; Lovatt et al., 2022) who are overrepresented in the criminal justice system.
Results should be considered in light of the following limitations. First, the use of retrospective chart-review, while common for the investigation of new measures (Campbell et al., 2009; Gearing et al., 2006), makes it impossible to discern whether the absence of information about specific strength items represents the true absence of those characteristics or simply reflects missing information. Moreover, the SAPROF-YV was coded for a baseline score only. Given the dynamic nature of the items, regularly updated assessments (i.e., every 6 months) may improve the predictive validity of the measure. Reassessment of dynamic risk factors improves prediction (Lloyd et al., 2020); therefore, future research would benefit from evaluating strengths at multiple time points. In addition, because the SAPROF-YV and SAVRY were not completed as part of youth’s assessments, we do not have SPJ measures associated with these tools, instead analyzing total strength scores. While there is evidence of good agreement of SPJ ratings and scores (Dolan & Rennie, 2008; Schmidt et al., 2011), future studies would benefit from examining both types of measures in validating the SAPROF-YV. Similarly, we did not have data on the YLS/CMI strengths scale. In future research, it will be useful to examine this scale—along with the SAVRY strengths—in comparison with the SAPROF-YV, which is a lengthier, standalone measure of strengths.
While the analysis of subgroups—males and females, and Black and White youth—is a strength of the study, small subgroup sizes meant that we could not do separate incremental validity and moderation analyses on these subgroups, as well as to examine race and gender intersectionally; consistent with our above discussion of subgroup analyses, these are important directions for future studies. In addition, the number of youth with a violent reoffense precluded our ability to conduct ROC and regression analyses with violent reoffending as the outcome. In future research, it will be important to continue examining the role of the SAPROF-YV, in combination with a risk measure, as a predictor of violent reoffending as well as general recidivism.
The results from this study provide support for the psychometric properties of the SAPROF-YV, including strong inter-rater reliability and internal consistency, as well as evidence for convergent validity and predictive validity for general reoffending. These findings indicate that this measure can be used to reliably and validly to assess strengths among justice-impacted youth. Further research is needed to build on these results; in particular, it will be important for the moderation analysis to be investigated in additional samples before clear recommendations can be made regarding the use of the SAPROF-YV to enhance current risk assessment procedures. Our understanding of strengths and their relationship to recidivism is exceedingly underdeveloped compared with our understanding of the associations between risk/criminogenic needs and reoffending (Dickens & O’Shea, 2017). It is vital that further research help elucidate how strengths operate in relation to risk and reoffending to guide changes to policy and practice.
In addition to, or perhaps instead of, contributing to risk prediction, the authors of the SAPROF-YV have also suggested that this tool might enhance rehabilitation service planning and provision by highlighting areas of strength that can be used to increase motivation or engagement in services (de Vries Robbé et al., 2015). This aim aligns with the specific responsivity principle of the RNR framework (Bonta & Andrews, 2017). However, while it is a compelling idea, there is a dearth of research examining this position. An empirical investigation into the role of strengths in case management and treatment provision may help augment our understanding of effective rehabilitation and the cessation of offending (de Vries Robbé & Willis, 2017) and is therefore an important avenue for future research. Existing research suggests that probation officers find risk assessment tools that include strength factors (e.g., the SAVRY) helpful for guiding service and supervision recommendations (Guy et al., 2014) and, importantly, that such tools lead to a greater consideration of strength factors in relation to service recommendations (Vincent et al., 2012). There is preliminary evidence that strength factors function as specific responsivity factors (Finseth et al., 2022), that using strength factors to inform rehabilitation is valuable (Singh, Desmarais, Sellers et al., 2014), and that improvements in strength factors over time is linked to reduced recidivism and positive community outcomes (Coupland & Olver, 2020), but additional research is needed to further advance our understanding of the benefits of assessing strengths in the youth justice context.
Supplemental Material
sj-docx-1-cjb-10.1177_00938548231165286 – Supplemental material for Assessment of Strengths in Criminal Justice System-Impacted Youth: A Retrospective Validation Study of the SAPROF-YV
Supplemental material, sj-docx-1-cjb-10.1177_00938548231165286 for Assessment of Strengths in Criminal Justice System-Impacted Youth: A Retrospective Validation Study of the SAPROF-YV by Sonia Finseth, Michele Peterson-badali, Shelley L. Brown and Tracey A. Skilling in Criminal Justice and Behavior
Footnotes
Authors’ Note:
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
