Sage Journals: Discover world-class research

Abstract

Describing and understanding personality structure is fundamental to predict and explain human behavior. Recent research calls for large personality item pools to be analyzed from the bottom-up, as item-level analysis may reveal meaningful differences often obscured by aggregation. This study introduces and applies Taxonomic Graph Analysis (TGA), a comprehensive network psychometrics framework aimed at identifying hierarchical structures from the bottom-up, to an open-source 300-item IPIP-NEO dataset (N = 149,337). This framework addresses key methodological challenges that have hindered accurate recovery of hierarchical structures, including local independence violations, wording effects, dimensionality assessment, and structural robustness. TGA revealed a three-level structure composed of 28 first-level dimensions (facets), 6 second-level dimensions (traits), and 3 third-level dimensions (meta-traits). Although some dimensions aligned with the theoretical IPIP-NEO structure, there were considerable deviations including the emergence of Sociability, Integrity, and Impulsivity traits at the second-level and a novel Disinhibition meta-trait at the third-level. The overarching theme of our findings was a hierarchical structure that integrated empirical and theoretical findings that have been scattered across the personality literature, demonstrating TGA’s value to investigate hierarchical psychological constructs. This study contributes to discussions on personality taxonomy by providing a rigorous, data-driven perspective on the IPIP-NEO’s hierarchical structure.

Plain Language Summary

Describing personality structure is important to understand why people do what they do. Conventional approaches often rely on analyzing pre-established structures (e.g., facets), taking them at face value, to analyze up to and beyond the Big Few. These approaches potentially overlook how alternative groupings might form based on the relations between the items. When analyzing item-level relations, there are many methodological obstacles that must be overcome; otherwise, the accurate recovery of these item groupings can be obscured. This study introduces Taxonomic Graph Analysis (TGA), a network psychometrics framework that can build personality structures from the bottom-up. TGA addresses previous methodological challenges such as local independence violations (redundant or semantically similar items), wording effects (how item polarity affects responses), dimensionality assessment, and structural robustness. This framework was applied to the 300-item IPIP-NEO personality inventory (N = 149,337) where we uncovered a three-tiered personality structure composed of 28 facets, 6 traits, and 3 meta-traits but no general factor. Importantly, although there was alignment with the theoretical IPIP-NEO structure, there were considerable differences, like a novel Disinhibition meta-trait and the emergence of the Sociability, Integrity, and Impulsivity trait domains. These results integrate scattered empirical and theoretical findings across the personality literature, providing a more cohesive understanding of how the IPIP-NEO hierarchy is structured. Taken together, these results demonstrate TGA’s value for investigating complex psychological constructs and contribute to ongoing discussions about how personality can be defined and measured, with implications for both theory and practice.

Keywords

personality taxonomy network analysis hierarchical exploratory graph analysis

Describing and understanding hierarchical structures of psychological constructs, such as personality (Markon et al., 2005), intelligence (McGrew, 2009), and psychopathology (Lahey et al., 2021), is fundamental to advancing their theory and measurement (Kotov et al., 2017). Refining and confirming these structures ensures that subsequent models adequately capture the underlying conceptual complexity. Empirical investigations to establish these hierarchies, however, face many obstacles, such as the interrelations between constructs (Marsh et al., 2010), potential overlaps in content (e.g., jingle-jangle; Condon et al., 2020; Wulff & Mata, 2023), and other nuanced measurement issues (Achenbach, 2021; Bringmann et al., 2022). As a result, developing methodological approaches to effectively recover the hierarchical organizations of psychological constructs remains a challenge with significant implications for theory and practice.

The majority of efforts to develop and evaluate hierarchical structures have used top-down methods that impose a priori theoretical assumptions on the number and nature of dimensions (e.g., Goldberg, 2006; McCrae & Costa, 1985). Although these approaches can offer valuable insights, they often constrain findings to existing conceptual models which, in turn, limits opportunities to uncover new structures (Condon et al., 2020). With hierarchical or otherwise complex constructs, large item pools are often used to provide systematic measurement of psychological constructs (Clark & Watson, 2019). Yet large item pools are rarely examined at the item-level in a fully exploratory manner. Instead, research typically relies on predefined scales or factor structures, hindering our ability to organically identify the fundamental building blocks of psychological constructs (Irwing et al., 2024).

A growing number of researchers have called for personality to be explored from the “bottom-up”—to investigate relationships at the bottom of the hierarchy (i.e., items) and build up (Condon et al., 2020). Several researchers have answered these calls by analyzing large pools of items (Condon, 2022) and adjectives (Saucier & Iurino, 2020) that extend beyond the five-factor model (FFM) and Big Five traits (i.e., openness to experience, conscientiousness, extraversion, agreeableness, and neuroticism). These efforts have broadened the theoretical lens through which researchers view the personality hierarchy. Methodologically, however, they remain limited in their psychometric scope, often relying on conventional procedures that are known to work poorly with large item pools and overlooking measurement issues such as violations of local independence and wording effects (Hands & Everitt, 1987; MacCallum et al., 1999, 2001).

In light of the need for a comprehensive, bottom-up approach to assess the hierarchical structure of personality (Condon et al., 2020), we developed a network psychometrics framework called Taxonomic Graph Analysis (TGA). TGA goes beyond conventional approaches that identify hierarchical structures (e.g., bass-ackwards, factor analysis, hierarchical clustering, principal component analysis; Castro et al., 2021; Forbes, 2024; Forbes, Baillie et al., 2024; Goldberg, 2006; Irwing et al., 2024; Saucier & Iurino, 2020) by including methods to identify and mitigate local independence violations and wording effects, estimate the number of dimensions at each level of the hierarchy, and determine the robustness of the hierarchical structure. These methodological issues are discussed in detail first to highlight how they affect the recovery of hierarchical structures. Next, the TGA framework is introduced with specific attention to how it addresses these challenges using a suite of psychometric network methods that have each been individually validated through prior simulation studies. Afterward, TGA is applied to the 300-item IPIP-NEO as an empirical demonstration. Our results show that while TGA identifies a structure with notable similarities to the theoretical framework of the IPIP-NEO, it also reveals considerable deviations. These findings underscore the need for more comprehensive investigations into the hierarchical structure of personality using large item pools.

Methodological challenges for hierarchical structures

There are a number of important methodological challenges that hinder the development and validation of hierarchical structures (Clark & Watson, 2019), including, but not limited to, violations of local independence, wording effects, dimensionality assessment for each level of the hierarchy, and structural robustness. Common scale development practices start with a set of target constructs (e.g., personality trait domains) and develop each construct independently (e.g., Extraversion; Lambert & Newman, 2023). For example, the IPIP-NEO defines Extraversion as someone who is outgoing and enjoys interacting with the external world (Johnson, 2014). From this definition, Friendliness, Gregariousness, Assertiveness, Activity-level, Excitement-seeking, and Cheerfulness serve as narrow characteristics that target specific facets of Extraversion.¹ Within each facet, many items are developed to address specific nuances. This approach is then applied to every target construct, where items are written within a defined scope, and is a common procedure for scale development in personality (and psychology more broadly; Clark & Watson, 2019; DeVellis, 2007).

Similar to how these constructs are developed, conventional psychometric approaches validate each facet or trait one-by-one, taking their constituent items at face value (Achenbach, 2021). Classical test theory, for example, identifies items most strongly correlated with the overall sum score in their respective facet or trait (i.e., item-total correlations) and removes items with low correlations. In modern test theory, factor analysis and item response theory are typically conducted dimension-by-dimension in large item pools to evaluate how item parameters correspond to their underlying latent construct (DeVellis, 2007; Forbes, Baillie et al., 2024; Irwing et al., 2024; Reise & Waller, 2009). In both approaches, facets and traits are often “siloed” relative to each other, which prevents simultaneous evaluations of relations across the item pool.

Although this process aims to ensure homogeneous constructs (Burisch, 1984), the lack of cross-level evaluation can lead to numerous issues such as local independence violations, substantial cross-loadings, method effects, and the merging or collapsing of facets or traits, when all items are analyzed together (Marsh et al., 2010, 2013). An example from the IPIP-NEO includes the items “Am afraid to draw attention to myself” in the Self-consciousness facet of Neuroticism, “Don’t like to draw attention to myself” (reverse) in the Assertiveness facet of Extraversion, and “Dislike being the center of attention” in the Modesty facet of Agreeableness. Despite belonging to three different facets and traits, these items strongly correlate due to their shared content of seeking attention. Each item fits within its prescribed facet and trait, so when researchers evaluate them independently using conventional procedures, these cross-level relations remain hidden (only to reappear later as additional “semantic” dimensions or correlated residuals in item-level factor models; Marsh et al., 2010).

To date, there are two main approaches to estimate hierarchical structures from large item pools: hierarchical clustering and bass-ackwards (Goldberg, 2006; Forbes, Baillie et al., 2024, Forbes, Watts et al., 2024; but see Condon, 2022). Agglomerative hierarchical clustering is a bottom-up approach that develops a tree-like hierarchy by merging variables one-by-one (Ward, 1963). In contrast, the bass-ackwards approach establishes a hierarchy from the top-down by extracting the first principal component or factor that explains the maximum possible variance across all variables and continuing to extract dimensions until there is no meaningful variance to extract (Goldberg, 2006). Recent improvements on the bass-ackwards approach include removing redundant components, identifying statistical artifacts, and examining relationships among components across all levels (Forbes, 2024). Despite the recent improvements and empirical applications of these approaches in personality (Goldberg, 2006) and psychopathology (Forbes, Baillie et al., 2024; Forbes, Watts et al., 2024), several limitations and challenges remain unaddressed.

One important limitation is that both clustering and factor analytic methods often struggle to recover the correct simulated structure when item pools are large or dimension sizes vary (i.e., number of variables per dimension; Hands & Everitt, 1987; MacCallum et al., 1999, 2001; Milligan, 1981). Both conditions are likely to occur in hierarchical structures. Large item pools arise from the common psychometric practice of creating many variables per dimension to increase internal consistency (DeVellis, 2007). Variable dimension sizes may occur during the item refinement phase of scale development or if cross-dimension item interrelations have not been previously explored (e.g., attention-seeking).

A related limitation is the lack of supporting simulation evidence. Simulation studies allow researchers to evaluate how well different methods recover the underlying hierarchical structure by systematically varying parameters with known values (e.g., factor loadings, items per specific factor, number of specific and general factors; Jiménez et al., 2023). These studies are essential to determine whether a method is likely to work across diverse datasets or only under specific conditions (Siepe et al., 2024). Although hierarchical clustering and bass-ackwards have been applied empirically (Forbes, Baillie et al., 2024; Forbes, Watts et al., 2024), neither approach, to our knowledge, has been rigorously evaluated in their ability to recover simulated hierarchical structures.

A core methodological challenge in the identification of hierarchical structures and large items pools is the assumption of local independence. This assumption is central to latent variable models, stating that items are uncorrelated after accounting for latent variables (Chen & Thissen, 1997; Holland & Rosenbaum, 1986). Violations of local independence can happen for many reasons such as acquiescence and shared semantic content (Leising et al., 2024). These violations can cause problems ranging from minor to severe, including model misspecification (Montoya & Edwards, 2021), biased model parameters (Edwards et al., 2018), and inaccurate estimates of internal structure (Wood et al., 1996).

A fundamental limitation in latent variable approaches is that specifying the correct number of latent variables is required to accurately detect violations of local independence, yet local independence violations adversely impact dimensionality assessment (Flores-Kanter et al., 2021; Montoya & Edwards, 2021). The circular nature of this problem can lead to an impasse in exploratory situations. Indeed, although methods have been developed to detect these violations in the factor analysis (Ferrando et al., 2022), item response theory (Chen & Thissen, 1997; Edwards et al., 2018), and exploratory structural equation modeling (ESEM; Saris et al., 2009) frameworks, they all rely on the correct specification of the number of factors to work as intended (Christensen et al., 2023). If such violations are present and left unchecked, the validity of the estimated structure is questionable.

Another methodological challenge stems from the item content themselves. In personality research (and much of psychology), it’s common for items to be administered using a Likert scale (e.g., strongly disagree to strongly agree) with some items reverse-keyed to control for response biases (e.g., “Don’t like to draw attention to myself” in the Assertiveness facet of Extraversion). However, this practice can introduce wording effects, a form of systematic method variance that arises when people provide inconsistent answers to positively and negatively worded items measuring the same construct (Gu et al., 2015; Kam, 2018). These effects emerge from various response biases, such as carelessness, acquiescence, and item difficulty, particularly for negatively worded statements (Swain et al., 2008; Weijters et al., 2013). When people agree with both positively and negatively phrased items that are logically opposite, it indicates potential biases rather than true trait variance. In the context of latent variable models, wording effects can distort factor solutions by introducing artificial method factors (DiStefano & Motl, 2006), altering factor loadings (Arias et al., 2020), and affecting the estimation of correlations between substantive traits. Failing to account for wording effects may lead to inflated or deflated trait correlations depending on how method variance interacts with the factor structure (Nieto et al., 2021). If unmodeled, wording effects can obscure the true hierarchical organization of traits, emphasizing the need for appropriate modeling strategies to mitigate their impact (Schmalbach et al., 2020).

When violations of local independence and wording effects both occur, accurately recovering hierarchical structures becomes a formidable task. These problems, coupled with the lack of simulation evidence to recover dimensions in large item pools, may render most conclusions about empirically derived hierarchies to be speculative at best. Achenbach (2021, p. 65) summarizes these issues succinctly: “Despite the growing popularity of hierarchical dimensional models, their value may be undercut if researchers fail to deal scientifically with the many details to be mastered in properly constructing, testing, and applying such models.”

Taxonomic graph analysis framework

Our primary aim is to develop a network psychometrics framework that combines several recently developed methods, validated via extensive simulations, to identify hierarchical structures from the bottom-up. The TGA framework consists of seven primary steps: (1) assess and mitigate local independence violations, (2) assess and mitigate wording effects (if necessary), (3) estimate a network, (4) reach a consensus on the number of dimensions and item assignments, (5) determine the robustness of these dimensions and assignments, (6) compute network scores for the next level, and (7) estimate N-level dimensions (i.e., levels of the hierarchy; Figure 1). Steps 3–7 are repeated until unidimensionality, which is assessed for whether it is statistically supported (Revelle & Condon, 2025).

Figure 1.

Taxonomic graph analysis framework.

Step 1: Remove redundancies

The framework begins by addressing local independence violations by identifying and removing redundant items. Although network proponents have criticized latent variable models for their local independence assumption (Cramer et al., 2012), substantial shared variance over-and-above other relations in a network can lead to similar effects such as distorted dimension recovery (e.g., minor dimensions) and biased parameter estimates (e.g., centrality confounded by redundancy rather than true relative position; Fried & Cramer, 2017). Unique Variable Analysis (UVA; Christensen et al., 2023) was developed to assess and mitigate this issue using a network estimation method followed by the graph theoretic measure of weighted topological overlap (wTO; Nowick et al., 2009) to identify redundant nodes in the network. Recent simulation work has demonstrated that UVA is effective across data types and conditions, performing as well as or better than alternative methods like ESEM modification indices (Saris et al., 2009) and correlated residuals (Christensen et al., 2023; Ferrando et al., 2022). In contrast to conventional procedures, UVA can be applied before any dimensions are estimated, avoiding a circular impasse and making it ideal in exploratory situations.

Step 2: Mitigate wording effects

The second step is to evaluate wording effects by fitting a random intercept factor model where regular- and reverse-keyed items (without recoding) load onto a single latent factor with loadings fixed to one, enabling the estimation of an additional variance component that captures response biases and individual differences in scale usage (Maydeu-Olivares & Coffman, 2006). This model relaxes the assumption that all people use the response scale identically and accounts for variance that would otherwise distort dimensionality estimates (García-Pardina et al., 2024). The presence of wording effects is determined by model convergence. If the model converges, it suggests that wording effects are likely present with their magnitude reflected in the size of the loadings on the random intercept factor; if it fails to converge, wording effects are negligible or absent. When wording effects are detected, the residual correlation matrix, which has the variance attributed to the random intercept factor removed from the sample correlation matrix, is used for dimensionality estimation; otherwise, the sample correlation matrix is retained. Recent simulation research has demonstrated that the random intercept exploratory graph analysis (riEGA) provides more accurate dimensionality estimates than the random intercept parallel analysis in the presence of wording effects (García-Pardina et al., 2024). By explicitly modeling and removing the variance associated with wording effects before dimensionality assessment, riEGA increases the accuracy of dimension recovery and mitigates distortions introduced by response biases.

Step 3: Estimate network structure

After mitigating violations of local independence and wording effects, the network structure can be estimated. A common network estimation method is the graphical least absolute shrinkage and selection operator with extended Bayesian information criterion for model selection (commonly referred to as EBICglasso; Chen & Chen, 2008; Epskamp & Fried, 2018; Foygel & Drton, 2010; Friedman et al., 2008). The result of the EBICglasso algorithm is a sparse network where nodes (circles) represent variables and edges (lines) represent regularized partial correlations (Epskamp & Fried, 2018). Although there are many other network estimation methods that could be used (e.g., non-regularized methods; Williams et al., 2019), the EBICglasso has consistently demonstrated comparable or better performance over other methods for dimension recovery (Christensen et al., 2024; Golino & Epskamp, 2017; Golino et al., 2020).

Step 4: Reach dimension consensus

On the network structure, a community detection algorithm can be applied to identify communities (sets of densely connected nodes) that are statistically consistent with factors when data are generated from a latent factor model (Fortunato, 2010; Golino & Epskamp, 2017; Golino et al., 2020). The Louvain algorithm, which iteratively merges nodes into communities (Blondel et al., 2008), has been demonstrated to be one of the more accurate dimension recovery algorithms across a variety of data generating conditions (e.g., count, latent factors, Lancichinetti-Fortunato-Radicchi graphs; Christensen et al., 2024; Gates et al., 2016; Yang et al., 2016). A recent simulation study used the first pass of the Louvain algorithm to identify lower-order dimensions in data generated from hierarchical structures (hereafter referred to as lower order Louvain; Jiménez et al., 2023). When combined with factor scores to estimate higher-order dimensions with the algorithm, it was consistently as accurate or more accurate than parallel analysis for both lower- and higher-order levels and across conditions. One caveat of the Louvain algorithm is that its solution can depend on the initial order in which the nodes are passed to the algorithm. To address potential uncertainty in the algorithm’s solutions, Lancichinetti and Fortunato (2012) developed the consensus clustering approach to mitigate this issue. This approach applies a stochastic algorithm, like the Louvain, repeatedly to the same network N times (e.g., 1000) and attempts to establish a consensus based on the frequency with which two nodes are placed in the same community. A variant of this approach, that we call “most common” consensus clustering, uses the solution that appears most frequently across the N repetitions of the algorithm as the consensus (Golino & Christensen, 2025).

Step 5: Establish dimension robustness

The next step evaluates the consensus solution’s robustness using Bootstrap EGA (bootEGA; Christensen & Golino, 2021), which assesses the generalizability of dimensions through a resampling with replacement bootstrap procedure. This process applies EGA or riEGA (if wording effects are present) with the most common consensus method (using the same N repetitions) and lower order Louvain algorithm to each bootstrap replicate (typically 500 samples). This procedure generates a sampling distribution of the results, allowing each item’s placement into the original most common consensus solution to be computed as a replication proportion (i.e., item stability). Items with stability values below 0.75 are considered unstable and potentially indicate underlying psychometric issues such as weak loadings, local dependence, or strong multidimensionality (Christensen & Golino, 2021). These problematic items can be removed at the first-level, and Steps 3–5 should be repeated until achieving a stable and robust solution with all item stabilities at or greater than 0.75. Beyond the first-level, item stability is important to understand but removing entire dimensions would be less ideal and could limit the discovery of cross-dimension relations (e.g., interstitial traits; Blain et al., 2023; Lawn et al., 2023).

Step 6: Compute network scores

Once a robust consensus solution for communities and item assignments is established, network loadings and subsequent scores can be computed. Simulation studies have demonstrated that network loadings can accurately capture the same patterns as factor loading patterns when data are generated from a latent factor model (Christensen et al., 2025). Network scores, calculated by multiplying data by network loadings (Golino et al., 2022), can estimate factor correlations with comparable accuracy to exploratory factor analysis when data are generated from factor models (Christensen et al., 2025). A unique feature of network loadings (and subsequently scores) is that they do not require or depend on (factor) rotations to make their estimates accurate and interpretable.

Step 7: N-level communities

The network scores from each level can be used to compute the next level’s dimensions by repeating Steps 3–6 until reaching unidimensionality. One limitation of contemporary network psychometric methods is that if all nodes are connected to at least one other node, then community detection algorithms always identify at least one community, which could lead researchers to erroneously conclude that there is a single, overall dimension at the top of the hierarchy (Musek, 2007; Watts et al., 2024). Rather than concluding at a single dimension, a recently developed measure called unidim (Revelle & Condon, 2025) can be used to statistically evaluate whether one or more dimensions likely exist. If unidim indicates multiple dimensions are more likely, then the current level’s dimensions are the top of the hierarchy; otherwise, the final level is a single, general dimension.

Summary

Taken together, TGA provides a systematic, bottom-up framework based on network psychometrics to identify hierarchical structures. Each step of TGA addresses methodological challenges that have historically been obstacles to the accurate identification of hierarchical structures in psychology (i.e., siloed item analyses, violations of local independence, wording effects, dimensionality assessment for each level of the hierarchy, and verifying the robustness of the hierarchical structure). To date, TGA offers one of the most comprehensive approaches to investigate hierarchical constructs from the bottom-up.

Personality taxonomies

Personality reflects variations in thoughts, feelings, and behaviors that occur within and across people (Funder, 2009), reliably predicting a range of important work and life outcomes across cultures, time, and raters (Barrick & Mount, 1991; Kim et al., 2019; Ozer & Benet-Martínez, 2006; Roberts & DelVecchio, 2000; Soto, 2019). Personality is typically organized into hierarchical structures based on lexical patterns of covariation (Markon, 2009) that are consolidated around the “Big Few” such as the five- and six-factor models of personality prevalent today (John et al., 2008). The psycholexical approach, which examined how trait adjectives cluster based on covariation of natural language, led to the discovery of consistently replicable dimensions that are known today as the Big Five (Allport & Odbert, 1936; Cattell, 1943; Goldberg, 1990; Tupes & Christal, 1961). These Big Five have inspired many subsequent frameworks: lexically derived Big Five (Goldberg, 1990; McCrae & John, 1992), questionnaire-based Five Factor Model (NEO-PI-R; Costa & McCrae, 1992), circumplex models (AB5C; Hofstee et al., 1992), and the Big Six model with HEXACO (Ashton & Lee, 2020). A consensus on their content and structure remains elusive between these models and other frameworks (Baumert et al., 2017; Block, 2010; Christensen et al., 2019; Condon et al., 2020; Mõttus et al., 2020; Schwaba et al., 2020).

Above the trait level, there is a general consensus around two meta-traits, such as Alpha and Beta (Digman, 1997) or Stability and Plasticity (DeYoung et al., 2002), that reflect shared variance in lower-level traits, representing tendencies toward restraint and control or exploration and engagement, respectively. Alternative cross-lexical models include the Big Two, such as Dynamism and Social Self-Regulation (Saucier et al., 2014), which are similar to Getting Ahead and Getting Along (Hogan, 1982), and the Big Three supertraits (Dynamism, Affiliation, and Order; De Raad et al., 2014) that reproduce across languages. Some have even proposed a single, general factor of personality (Musek, 2007). Below the Big Few, structures vary by inventory: aspects serve as intermediaries between traits and facets (DeYoung et al., 2007), while facets represent narrower characteristics with no clear consensus on their number or content (ranging from twenty to sixty; Ashton & Lee, 2020; Irwing et al., 2024; Johnson, 2014; McCrae, 2015; Christensen et al., 2019; Schwaba et al., 2020). At the lowest level, there appears to be as many personality nuances (e.g., items) as there are stars in the sky (Condon et al., 2020).

Beyond typical trait constellations, alternate structures and dimensions of personality have been proposed that encompass individual differences related to, but not captured by, the Big Few (Block, 2010). These alternatives include maladaptive personality models, which capture tendencies toward disordered thoughts, feelings, and behaviors such as Detachment, Antagonism, Disinhibition, and Psychoticism (Krueger et al., 2012), competing three-factor models (e.g., PEN model of Psychoticism, Extraversion, and Neuroticism; Eysenck et al., 1985), and the Dark Triad framework which considers non-pathological personality traits of Narcissism, Machiavellianism, and Psychopathy (Paulhus & Williams, 2002). Finally, some facets within the Big Few structures have been considered traits themselves, such as Risk Propensity (Highhouse et al., 2022) and Impulsivity (DeYoung & Rueter, 2016; Whiteside & Lynam, 2001), making their location in the hierarchy uncertain.

IPIP-NEO personality hierarchy

Overall, there are many competing structures of personality with little consensus around the organization of each level in the hierarchy (Block, 2010; Mõttus et al., 2020). Clarifying the personality hierarchy can have substantial consequences for descriptive, predictive, and explanatory efforts (Baumert et al., 2017; Blum et al., 2021). Accordingly, there are increasing calls for refinement of the Big Few frameworks and facets with novel methodological approaches from the bottom-up (e.g., Castro et al., 2021; Condon et al., 2020; Roberts & Yoon, 2022; Thielmann et al., 2022). When attempting to assess a personality hierarchy from the bottom-up, a broad item pool is recommended (Condon et al., 2020), making the 300-item IPIP-NEO inventory an ideal starting point (Goldberg, 1999).

The IPIP-NEO has several advantages including a large data repository that contains over 300,000 observations (Kajonius & Johnson, 2019). In addition, it serves as an open-source proxy for the widely used NEO-PI-R (Costa & McCrae, 1992), which makes its relations closely aligned with the broader NEO-PI-R literature because of its similar hierarchical structure of 30 facets that form the Big Five. Finally, the IPIP-NEO has been recognized for having broad (but not exhaustive) coverage of the universe of personality content (Condon et al., 2020). Altogether, the IPIP-NEO framework offers a useful foundation on which to build a personality taxonomy from the bottom-up. Clarifying the IPIP-NEO hierarchy through novel methodological approaches can lead to more accurate measurement, substantive interpretations, and predictions at each level (Blum et al., 2021; Condon et al., 2020; Irwing et al., 2024; Mõttus et al., 2020).

Present research

Although the IPIP-NEO has a well-defined theoretical structure, its empirical structure has not been rigorously examined, to our knowledge, using exploratory methods on the entire item pool. TGA provides a systematic approach to identify hierarchical dimensions starting from the items, and accounting for methodological challenges that have historically impeded the accurate recovery of hierarchical structures. Such an approach opens the door to discover novel dimensions or alternative organizations that may not have been considered previously within the constraints of its item pool. Therefore, a primary goal of the present research is to exemplify the TGA framework by evaluating the hierarchical structure of the IPIP-NEO inventory from the bottom-up. A second goal is to clarify the extent to which the recovered hierarchy aligns with or deviates from our current understanding of how the item pool organizes into facets, traits, and meta-traits. By identifying areas of convergence and divergence, we can provide valuable insights into the structure of personality as operationalized with the IPIP-NEO and highlight possible refinements to its conceptual framework. These findings have the potential to enhance our understanding of the IPIP-NEO hierarchy, its relations to other personality frameworks, and may improve subsequent prediction and explanation efforts. Overall, our findings can contribute to the ongoing dialogue on personality structure and provide a foundation for further research to investigate the hierarchical nature of individual differences.

Method

Participants

We used archival personality data from Johnson’s IPIP-NEO repository (e.g., Kajonius & Johnson, 2019). The IPIP-NEO 300-item dataset has 307,313 cases and is available on their OSF (https://osf.io/tbmh5). In the original database, the reverse-keyed items were recoded in the direction of their theoretical facet. In order to perform TGA, the reverse-keyed items were recoded back to be in their original direction. The dataset was subset to only include respondents based in the United States of America (U.S.A.) between 19 to 69 years old for a working sample of N = 149,337. The constraint on the region and age range was intended to mitigate common sources of noninvariance such as age and culture (but see Beck et al., 2023, Olaru et al., 2019; Syed, 2024, for limitations that may exist beyond our constraint).

Measures

The IPIP-NEO was created by evaluating a large item pool (over 1,000 items) and constructing 30 facets that mirrored the facets of the NEO-PI-R (Goldberg, 1999). The IPIP-NEO structure contains 6 facets per FFM trait and 10 items per facet: Openness to Experience (Adventurousness, Artistic Interests, Emotionality, Imagination, Intellect, Liberalism), Conscientiousness (Achievement-striving, Cautiousness, Dutifulness, Orderliness, Self-discipline, Self-efficacy), Extraversion (Activity-level, Assertiveness, Cheerfulness, Excitement-seeking, Friendliness, Gregariousness), Agreeableness (Altruism, Cooperation, Modesty, Morality, Sympathy, Trust), and Neuroticism (Anger, Anxiety, Depression, Immoderation, Self-consciousness, Vulnerability). Although the facets that correspond between the IPIP-NEO and NEO-PI-R correlate strongly (on average, 0.73; Goldberg, 1999), many facets have different labels (e.g., Immoderation and Impulsivity, respectively). The ranges for Cronbach’s $α$ across facets in the traits are all acceptable: Openness to Experience (0.77–0.86), Conscientiousness (0.71–0.85), Extraversion (0.71–0.87), Agreeableness (0.73–0.82), and Neuroticism (0.77–0.88; Goldberg, 1999).

Statistical analysis

Missingness

Although the rate of missing data was small (0.4%), there were 183,923 total missing responses due to the large sample. Missing data were imputed by taking the rounded mean of each variable (e.g., 3.56 = 4; 2.21 = 2), as is appropriate when the missing data rate is minor (i.e., < 1–2%; Widaman, 2006).²

Taxonomic graph analysis

Step 1: Remove redundancies

UVA was iteratively applied to remove redundancies by estimating an EBICglasso network, computing wTO, and removing all but one item from redundant item sets that had wTO values greater than 0.20 (Christensen et al., 2023). UVA first estimated a network using the EBICglasso (Epskamp & Fried, 2018) on polychoric correlations and then computed wTO (Nowick et al., 2009) to quantify the extent to which each pairwise combination of nodes (items) overlap—that is, whether they share a connection (partial correlation) and are connected to the same nodes. To mitigate local dependence, all but one item from a redundant set was removed using the following heuristics: (1) for pairs, keep the item with the lowest maximum wTO across all other items; (2) for sets of three or more, retain the item with the highest mean wTO to the others in the redundant set. UVA was performed at the first level only.

Step 2: Mitigate wording effects

To account for potential wording effects, riEGA (Garcia-Pardina et al., 2024) was estimated using the sample polychoric correlation matrix of the items and the maximum likelihood estimator within a random intercept model. If the model fails to converge, it is assumed that wording effects are negligible, and the sample correlation matrix is used for network estimation. However, when the model converges, as observed in this study, the network is estimated using the residual correlation matrix after controlling for the random intercept factor. This approach was applied across all consensus runs to ensure congruent network estimation. riEGA was applied to the first-level only because wording effects are only relevant at the item-level.

Step 3: Estimate network structure

The EBICglasso (Epskamp & Fried, 2018) was applied by searching along 100 values of $λ$ , a hyperparameter used to set the magnitude of the penalty. Larger values are a larger penalty, leading to sparser (fewer edges) networks; smaller values are a smaller penalty, leading to denser (more edges) networks. The range of the $λ$ values was defined following standard practices (Epskamp & Fried, 2018): the maximum of the range was obtained using the maximum absolute zero-order correlation and the minimum of the range was obtained by multiplying the maximum by 0.10 (sometimes referred to by its argument in the code, “lambda.min.ratio”). A sequence of 100 values that range from these minimum to maximum values are obtained following logarithmic increments (many smaller values and fewer larger values). The EBIC (Chen & Chen, 2008), which selects $λ$ , also has a hyperparameter called $γ$ that adjusts the preference for model complexity from more complex (smaller $γ$ values) to more parsimonious (larger $γ$ values) models. $γ$ was set to 0.50 which is commonly used and the default across many software (Epskamp & Fried, 2018; Foygel & Drton, 2010). The EBICglasso is applied to either the residual or zero-order correlation matrix (depending on the presence or absence of wording effects) across the different consensus runs.

Step 4: Reach dimension consensus

The lower order Louvain algorithm (Jiménez et al., 2023) with the most common consensus clustering approach (Golino & Christensen, 2025) was applied to the EBICglasso network. The number of repetitions in the most common consensus approach was incrementally increased in powers of ten starting with 10² (100) up to 10⁶ (1,000,000) to reach a stable solution. At each increment, 100 applications of the most common consensus clustering approach were used to establish the stability of the most common solution with the goal of achieving the same solution across all 100 applications. The first-level achieved this aim at 10⁴ (10,000) repetitions (as well as 10⁵ and 10⁶) and both the second- and third-level achieved this aim for all repetition increments (i.e., 10²⁻⁶). For first-level analyses, some consideration should be given to communities with only two nodes, as traditional psychometric practices suggest that at least three items are required to adequately assess a dimension (Gorsuch, 1997). In these cases, UVA heuristics can be applied: retain the item with the lowest maximum wTO with all other variables.

Step 5: Establish dimension robustness

bootEGA (Christensen & Golino, 2021) using the resampling with replacement procedure was applied after a consensus solution was reached. The resampling with replacement procedure randomly draws an observation from the original sample and makes a copy of their responses in a replicate sample. This observation is placed back into the original sample (replacement) and then another observation is drawn from the original sample (meaning the same observation can be drawn more than once). These draws repeat (resampling) until the replicate sample has the same number of observations as the original sample. On this replicate sample, the same procedures that were performed on the original sample up to this step are applied to the replicate sample (e.g., riEGA, network estimation, most common consensus clustering using the largest number of repetitions in Step 4). This process was repeated 500 times to create a sampling distribution of the communities and item assignments. Using the 500 replicate results, item stability, or the extent to which each item was assigned to the community found in the empirical most common consensus solution (Step 4), is computed as a proportion. Following Christensen and Golino (2021), item stabilities less than 0.75 are considered unstable. For the first-level, if any items were unstable (item stability < 0.75), they were removed from the analysis and Steps 2–5 were repeated.

Step 6: Compute network scores

Network loadings and scores were computed following Christensen et al. (2025). Network loadings have different magnitudes relative to factor loadings but maintain interpretable effect sizes of small (0.20), moderate (0.35), and large (0.50). Network scores were computed by multiplying the standardized data by its respective community assigned loading only (Golino et al., 2022).

Step 7: N-level dimensions

The network scores were used to estimate the Pearson’s correlation matrix for the next level of the hierarchy and Steps 3–6 are repeated until a single dimension is identified. To evaluate whether the final level of the hierarchy was unidimensional, the unidim metric was used (Revelle & Condon, 2025). unidim is the product of two indices: $τ$ (of tau equivalence) and p_c (of congeneric fit). Although Revelle & Condon (2025) do not suggest cut-offs, their simulation studies suggest that unidim values greater than 0.90 typically correspond to strong evidence for unidimensionality and values below 0.50 tend to suggest clear multidimensional structures (Tables 3–5 in Revelle & Condon, 2025). These unidim values are used as our criterion: values around 0.90 or greater suggest unidimensionality and values around 0.50 or lower suggest multidimensionality.

Openness and transparency

All code, output, supplemental tables, and a link to the Johnson IPIP-NEO repository are available on the project OSF (https://osf.io/hwpa9). Analyses were conducted in R (version 4.3.1; R Core Team, 2023) using EGAnet (version 2.2.0; Golino & Christensen, 2025).³

Results

Procedural results

Over three passes with UVA, 46 locally dependent sets of variables were identified and resolved to reduce the dataset from 300 to 249 items.⁴ Next, using the residual correlation matrix from the riEGA, the EBICglasso network was estimated, and the lower order Louvain with most common consensus was applied for one million repetitions, reaching perfect agreement across 100 applications. In this application, 7 two-node communities were identified and one node from each were removed, resulting in 242 items.

Applying these same steps (except for UVA) to the 242 items, the next cycle had zero two-node communities; however, further inspection of the communities revealed one set of items that formed a music dimension (theoretical facet and trait in parentheses): “Dislike loud music” (Excitement-seeking of Extraversion), “Like music” (Artistic Interests of Openness to Experience), and “Do not like concerts” (Artistic Interests of Openness to Experience). Despite the semantic similarity of the item content, their broader association patterns differed: “Dislike loud music” was more related to Recklessness, “Like music” was more related to Artistic Interests, and “Do not like concerts” was more related to Gregariousness. Of the three, “Like music” was retained due to its association with its theoretically intended dimension (Artistic Interests) and the other two items were removed.

This process was repeated on the remaining 240 items. After a consensus was reached (“Like music” was now assigned to the Artistic Interests), the robustness evaluation revealed three items were unstable (stabilities < 0.75). These items were removed leaving 237 items. These steps were repeated through the robustness evaluation step two more times, removing twelve (225 remaining) and three (222 remaining) items, respectively. On the next cycle through these steps, 1 two-node community was identified and only one item was retained (following the UVA heuristics). The remaining 221 items were passed through these steps for the final time, confirming all dimensions and items within those dimensions were robust (all item stabilities > 0.75; M = 0.99, SD = 0.03, Range = 0.78–1.00). This final result spread the 221 items across 28 first-level dimensions (Table 1).

Table 1.

Item content nested in their first-, second-, and third-level dimensions.

Stability (82)		Plasticity (99)		Disinhibition (40)
Neuroticism (48)	Conscientiousness (34)	Sociability (58)	Openness to experience (41)	Integrity (14)	Impulsivity (26)
Anger (9)Get angry easily;Get upset easily (0.63)	Self-Discipline (4)Get to work at once;Waste my time (0.59)	Gregariousness (22)Make friends easily;Make people feel welcome (0.75)	Intellect (10)Like to solve complex problems;Have a rich vocabulary (0.46)	Fairness (5)Would never cheat on my taxes;Try to follow the rules (0.43)	Recklessness (5)Enjoy being reckless;Act wild and crazy (0.55)
Anxiety (18)Worry about things;Panic easily (0.49)	Work Ethic (5)Work hard;Plunge into tasks with all my heart (0.51)	Cheerfulness (9)Radiate joy;Have a lot of fun (0.40)	Introspection (6)Enjoy examining myself and my life;Indulge in my fantasies (0.45)	Manipulativeness (6)Use flattery to get ahead;Use others for my own ends (−0.43)	Cautiousness (10)Choose my words with care;Jump into things without thinking (−0.45)
Emotionality (7)Experience my emotions intensely;Seldom get emotional (0.41)	Determination (4)Go straight for the goal;Turn plans into actions (0.50)	Empathy (14)Anticipate the needs of others;Am passionate about causes (0.40)	Artistic Interests (7)Like music;See beauty in things that others might not notice (0.37)	Honesty (3)Listen to my conscience;Break my promises (0.39)	Excitement- Seeking (5)Love excitement;Like to visit new places (0.39)
Dominance (14)Take charge;Can’t stand confrontations (0.11)	Self-Efficacy (7)Complete tasks successfully;Excel in what I do (0.46)	Trust (3)Trust others;Believe in human goodness (0.29)	Adventurousness (9)Prefer variety to routine;Interested in many things (0.32)		Immoderation (6)Go on binges;Rarely overindulge (0.18)
	Orderliness (11)Like order;Avoid mistakes (0.32)	Attention-Seeking (7)Dislike being the center of attention;Dislike talking about myself (0.20)	Liberalism (9)Tend to vote for liberal political candidates;Believe that there is no absolute right or wrong (0.24)
	Calmness (3)Like to take my time;Let things proceed at their own place (−0.13)	Humility (3)Consider myself an average person;Look down on others (0.10)

Note. Selected items are representative of each dimension based on high loadings. The first numbers in parentheses indicate the number of items in the first-, second-, and third-level dimensions and the second numbers in parentheses represent the first-level dimension’s network loading on the second. All loadings for all levels are provided in the Supplemental Information. Network loadings of 0.20, 0.35, and 0.50, can be considered as small, moderate, and large, respectively (Christensen et al., 2025).

Network scores were computed, and the second level was established using these scores. At the second level, only one pass through the steps was necessary (consensus and robustness were achieved; all stabilities = 1.00), resulting in 6 second-level dimensions (Table 1). Similarly, network scores were computed, and a single pass was needed to establish 3 third-level dimensions (all stabilities = 1.00; Table 1). As expected, the fourth-level resulted in a single dimension according to the lower order Louvain algorithm. At this stage, the unidim metric was applied and there was no statistical support for unidimensionality (u = 0.49). The correlations between the 3 third-level dimensions further supported this conclusion as there was a negligible correlation between Plasticity and Disinhibition (r = 0.056). Stability was positively correlated with Plasticity (r = 0.278) and negatively correlated with Disinhibition (r = −0.365). The final structure resulted in 28 first-level (facets), 6 second-level (traits), and 3 third-level (meta-traits) dimensions (Table 1; Figure 2). Network figures, loadings, and correlations for each level are provided in the Supplemental Information.

Figure 2.

IPIP-NEO taxonomic network.

Mapping the theoretical structure to the empirical structure

To interpret and label the network dimensions at each level, several of the authors met to interpret the dimension content and label the dimensions accordingly. The authors aimed to keep the dimension interpretations in line with the existing theoretical IPIP structure, inventories (e.g., HEXACO; Ashton & Lee, 2007), and research (e.g., Impulsivity, Integrity; Laginess, 2016; Whiteside & Lynam, 2001); opting to keep the theoretical labels for dimensions that retained most of their theoretically aligned content and considering new labels for dimensions with new orientations. Network loadings were used to identify the key defining features of each dimension at every level (first-level loadings on the second-level dimensions are provided in Table 1). At the same time, GPT-4, a foundational language model (OpenAI, 2023), was used to augment the human decision-making process (i.e., Brynjolfsson, 2022) by providing it with detailed prompting and item-based descriptions of the communities. The authors met again to compare the sets of human and GPT-4 labels in order to finalize the dimension interpretation and labeling.

To compare the theoretical and empirical (TGA) facets (first-level) and traits (second-level), the reduced item set (221 items) was grouped according to the theoretical and empirical assignments, respectively. The first analysis depicts the proportion of items from each theoretical facet that were identified in an empirical facet (Figure 3). Dimensions were “mapped” within their trait and organized from most alike to least alike (based on proportions). To provide an example, the retained Adventurousness items (9 out of 10 possible) are considered. Eight of the nine items ( $\frac{8}{9}$ = 0.889) organized into a single dimension, leading to an identical empirical label of Adventurousness. Its last item sorted into the empirical dimension of Excitement-seeking resulting in the proportion $\frac{1}{9}$ or 0.111. In the second analysis, the items from the reduced item set were recoded to be keyed in a positive direction toward their respective theoretical and empirical facets, respectively. Afterward, mean scores for each facet were computed and then correlated using Pearson’s correlation (Figure 4).

Figure 3.

Proportion correspondence map of the first-level dimensions. Note. Theoretical first-level dimensions are on the y-axis with their theoretical second-level labels; empirical first-level dimensions are on the x-axis with their theoretical second-level labels. The values represent the proportion of items in the reduced item set for the theoretical first-level dimension that is represented in each empirical first-level dimension. The color represents whether the empirical first-level dimension is consistent (yellow) or inconsistent (purple) with the theoretical first-level dimension based on the second level dimensions (gray boxes).

Figure 4.

Correlation correspondence map of the first-level dimensions. Note. Theoretical first-level dimensions are on the y-axis with their theoretical second-level labels; empirical first-level dimensions are on the x-axis with their theoretical second-level labels. The values represent Pearson’s correlations, based on the reduced item set, between the mean of the first-level dimensions where items were keyed in the direction of the theoretical and empirical labels, respectively. The color represents whether the correlation was positive (blue) or negative (red). The opacity is related to the strength of the correlation. For readability, white text is used for correlations > |0.50| and black text otherwise.

Overall, Openness to Experience retained the highest proportion of items within its theoretical domain and facets, with the exception of Emotionality, which moved to Neuroticism and still kept a substantial proportion of its items (0.750; Figure 3). Although a substantial proportion of the Neuroticism and Conscientiousness items also remained within their respective domains, more than half did not align with their intended facets. The majority of Neuroticism’s Anxiety (0.900) and Vulnerability (0.875) items merged into a single dimension of Anxiety. Neuroticism gained a novel but weakly loading dimension of Dominance (Table 1), which had its highest proportion of items from Cooperation (0.857; Agreeableness) and Assertiveness (0.600; Extraversion). The Achievement-striving facet of Conscientiousness split almost evenly between Work Ethic (0.556) and Determination (0.333), and its Orderliness facet retained all of its items (1.000) as well as one or two items from the other Conscientiousness facets (except for Self-efficacy).

Sociability emerged mostly from a mix of Extraversion (Cheerfulness, Gregariousness, Friendliness) and Agreeableness (Sympathy, Altruism, Modesty, Trust) facets as well as with some additional items from Neuroticism (Self-consciousness). The Extraversion facets largely retained all of their items (1.000, 1.000, 0.800, respectively) whereas the Agreeableness facets retained the majority of their items (0.556, 0.667, 0.625, 0.750, respectively). Integrity arose from the two facets of Dutifulness (0.875 total; Conscientiousness) and Morality (0.777 total; Agreeableness), with their items spread across Integrity’s facets: Manipulativeness (0.250 and 0.444, respectively), Fairness (0.250 and 0.333, respectively), and Honesty (0.375 and 0.000, respectively). Impulsivity emerged from a smattering of traits with each of its respective facets representing a higher proportion from one theoretical trait: Recklessness (0.556; Extraversion), Excitement-seeking (0.333; Extraversion), Cautiousness (0.875; Conscientiousness), and Immoderation (0.857; Neuroticism).

The correlations between the theoretical and empirical facets revealed several noteworthy patterns (Figure 4). For the 16 empirical facets that retained a theoretical label, their correlations were substantial with their theoretical counterpart (M = 0.94, Range = 0.76–1.00). These facets were distributed across five empirical second-level dimensions: Openness to Experience (Liberalism, Artistic Interests, Adventurousness, and Intellect), Neuroticism (Anger, Anxiety, and Emotionality), Conscientiousness (Orderliness, Self-efficacy, and Self-discipline), Sociability (Cheerfulness, Gregariousness, Trust), and Impulsivity (Excitement-seeking, Cautiousness, and Immoderation).

The first-level dimensions with the weakest loadings on their respective second-level dimension—Dominance (Neuroticism), Calmness (Conscientiousness), and Humility (Sociability)—displayed more complex correlational patterns than the other first-level dimensions within their respective domains. Dominance showed a strong negative correlation with Self-consciousness (r = −0.43) despite Self-consciousness having small-to-moderate positive correlations with the other facets of Neuroticism: Anger (r = 0.23), Anxiety (r = 0.61), and Emotionality (r = 0.20). Additionally, Dominance had moderate-to-large positive correlations with Extraversion’s facets of Assertiveness, Activity-level, and Excitement-seeking (r’s = 0.61, 0.35, 0.37, respectively) whereas these facets had negligible-to-positive correlations with Anger (r’s = 0.01, 0.10, −0.01, respectively) and Emotionality (r’s = 0.03, 0.03, −0.01, respectively) and small-to-moderate negative correlations with Anxiety (r’s = −0.31, −0.10, −0.19, respectively). Calmness showed a strong negative correlation with Extraversion’s Activity-level (r = −0.79) whereas the remaining Conscientiousness facets had large positive correlations (r’s = 0.43–0.47). Finally, Humility was primarily associated with the theoretical Agreeableness facet of Modesty (r = 0.67), which was negatively associated with other Sociability facets of Cheerfulness (r = −0.22), Gregariousness (r = −0.32), and Attention-seeking (r = −0.85).

The correlation patterns across the first-level dimensions for the novel second-level dimensions—Sociability, Integrity, and Impulsivity—were largely consistent with the proportions observed in Figure 3. Sociability’s first-level dimensions showed moderate-to-large positive correlations for most of the facets in Extraversion and Agreeableness. Attention-seeking had the largest deviations with primarily negative correlations on the Agreeableness facets and especially Morality (r = −0.33) and Modesty (r = −0.86). Integrity had relatively consistent correlational patterns across the theoretical facets of Agreeableness and Conscientiousness, with the strongest correlations belonging to Morality and Dutifulness, respectively. The correlation strengths on these facets, for each first-level dimension of Integrity, were comparable suggesting that it equally relates to both of the Big Five traits. Impulsivity’s first-level dimensions had the most revealing correlation patterns of all empirical second-level dimensions, mirroring the proportions observed in Figure 3. All Impulsivity facets had moderate-to-strong correlations with at least a few Conscientiousness facets. Their correlational patterns differed, however, with respect to facets of Extraversion and Neuroticism where Recklessness and Excitement-seeking tended to have stronger correlations with Extraversion, and Cautiousness and Immoderation tended to have stronger correlations with Neuroticism.

Discussion

The present study developed a comprehensive psychometric network framework called Taxonomic Graph Analysis (TGA) to estimate hierarchical structures from the bottom-up. TGA offers a robust framework that addresses key methodological challenges in hierarchical personality assessment, including local independence violations, wording effects, dimensionality assessment, and structural stability, enabling a rigorous bottom-up investigation of psychological constructs without the constraints of traditional top-down assumptions. This framework was applied to a large, U.S.-based dataset (N > 145,000) containing responses from the open-source Johnson IPIP-NEO repository (https://osf.io/tbmh5/; Kajonius & Johnson, 2019). The initial 300-item pool was reduced to 249 items after handling local dependencies and 221 items after establishing a consensus of 28 stable first-level dimensions (facets). Based on these 28 first-level dimensions, we identified 6 second-level dimensions (traits), which were further grouped into 3 third-level dimensions (meta-traits). Our results did not support a single, fourth-level dimension (i.e., a general factor of personality; Musek, 2007). The structure identified by TGA shared some similarities with the theoretical IPIP-NEO structure but also had considerable deviations. The overarching theme of our results was that TGA identified a hierarchical structure that integrated empirical and theoretical findings that have been scattered across the personality literature. We highlight these findings and discuss TGA’s potential to advance future research aimed at developing hierarchical models in psychological research.

Overlooking the challenges addressed by TGA, such as local independence violations, wording effects, and the stability of the dimensions, would have resulted in a substantially different hierarchical structure. For example, applying the standard EGA approach to the full 300 item set would have resulted in 61 first-level communities with some representing split dimensions (Flores-Kanter et al., 2021; Wood et al., 1996) or minor dimensions due to redundancy (Ferrando et al., 2022). If local independence violations were handled, but wording effects were not, then some dimensions would split on the semantic polarity of the items (e.g., Empathy split into positively and negatively worded dimensions). After addressing local independence violations and wording effects, 26 additional items still needed to be removed due to structural instability (e.g., weakly loading items or items that loaded substantially on multiple dimensions). These findings highlight the importance of TGA to ensure that the recovered structure is not distorted by methodological artifacts, resulting in a clearer, more interpretable and stable hierarchy.

First-level structure

Applying TGA to the IPIP-NEO allowed items to freely associate and new dimensions to emerge from the bottom-up, with substantial departures from the theoretical structure. Relative to the existing structure, some items formed larger, broader dimensions involving the merging of theoretical facets (e.g., Gregariousness and Friendliness, Anxiety and Vulnerability); other items formed refined, narrower dimensions involving few items from a single theoretical facet (e.g., Honesty from Dutifulness, Calmness from Activity-level). Some items remained in their theoretical facet (e.g., Artistic Interests, Anger); other items formed a facet distinct from the IPIP-NEO facet structure (e.g., Determination, Recklessness). Of the 30 theoretical dimensions, only 16 emerged empirically (53.3%), with the proportion of theoretical items composing them varying considerably (from 0.333 to 1; M = 0.84). These results suggest that the theoretical facets and traits of the IPIP-NEO should not be assumed to align with their original structure and warrant further scrutiny. This finding is not surprising given that the 300-item IPIP-NEO had never been validated, to our knowledge, by analyzing the complete item pool simultaneously.

These results reiterate the challenges of ensuring that constructs remain homogeneous as item pools grow in the face of “bloated specifics” or “cheating by repeating” semantic variations (Cattell & Tsujioka, 1964; Reise et al., 2018). Additionally, conventional psychometric practices tend to analyze items in silos, assuming content composition is homogenous and overlooking much of the complexity in high-dimensional data (Achenbach, 2021; Condon et al., 2020). The identification of these cross-facet and cross-domain assignments can substantively inform researchers as to why certain facets and domains in the IPIP-NEO tend to be correlated (e.g., Openness to Experience and Agreeableness; Lawn et al., 2023), and why outcomes related to these theoretical dimensions are related to multiple different facets and traits (Mõttus, 2016). These results are particularly relevant in relation to the widespread practice of performing theoretical parceling (i.e., creating facet scores based on theory) to explore the second-order structure of personality (e.g., Costa & McCrae, 1995; Sanz-García et al., 2024). Although parceling can be advantageous under certain scenarios (Little et al., 2013), it can obscure, rather than clarify, the structure of the data if the complete item-level structure (analyzed simultaneously) has not been established empirically (Bandalos, 2002; Little et al., 2013; Marsh et al., 2013).

Second-level structure

At the second level, there were six dimensions identified that ranged from nearly identical to theory (Openness to Experience) to mostly a reorganization of content (Conscientiousness and Neuroticism) to the mixing of Extraversion and Agreeableness (Sociability; Blain et al., 2023) to the emergence of relatively novel dimensions not identified in the Big Five model (Integrity and Impulsivity; Laginess, 2016; Whiteside & Lynam, 2001). Only half of the second-level dimensions had content that represented the majority of one theoretical trait domain (i.e., Openness to Experience, Conscientiousness, and Neuroticism).

In the case of the Openness to Experience dimension, most empirical facets were consistent with their theoretical counterpart. The only notable change was that the Emotionality facet moved to Neuroticism. The Neuroticism dimension also merged two theoretical facets, Anxiety and Vulnerability, into a single dimension (Anxiety), and had all items belonging to the Depression facet removed completely due to local dependence and low stability. A novel dimension of Dominance emerged that loaded weakly, reflecting a confrontational social orientation (a mix of Extraversion and Agreeableness facets). Although not explicitly defined within the IPIP-NEO, Dominance has frequently appeared as a component of personality and group dynamics (Anderson & Kilduff, 2009). The empirical Conscientiousness dimension was largely composed of its theoretical items, however, there were substantial changes in their organization. Achievement-striving split into two dimensions reflecting Determination (goal-pursuit) and Work Ethic (DeYoung, 2015; Jayawickreme et al., 2019; Kanfer et al., 2017). A novel but weakly (negative) loading facet of Calmness was identified indicating a general penchant toward an easy-going and passive pace of life. The Self-efficacy and Self-discipline dimensions that emerged were partial representations of their theoretical facets while Orderliness was composed completely of its original items plus individual items from several other Conscientiousness facets.

The remaining three second-level dimensions—Sociability, Integrity, and Impulsivity—represent major departures from the traditional IPIP-NEO structure and FFM framework. Sociability represented a mix of Extraversion and Agreeableness, capturing the affiliative content (Gregariousness) that broadly characterizes Extraversion and the prosocial elements of Agreeableness (Empathy and Trust). The Extraversion side of Sociability also captured positive affect (Cheerfulness), resonating with the Enthusiasm aspect of the Big Five Aspect Scale; the Agreeableness side of Sociability consisted of Empathy, resonating with the Compassion aspect of the Big Five Aspect Scale (DeYoung et al., 2007). This finding corroborates recent suggestions of an interstitial trait of Affiliation, which blends these two aspects (Blain et al., 2023) as well as the affiliative role of empathy in interpersonal relationships (Ringwald & Wright, 2021). The label Sociability was selected to capture broader social engagement characteristics (e.g., Attention-seeking, Trust) that extend beyond its affiliative content.

Our findings establish Integrity as a distinct personality dimension that integrates elements from the theoretical facets of Agreeableness (Morality) and Conscientiousness (Dutifulness; Laginess, 2016). Integrity aligned primarily with the Honesty content of HEXACO’s Honesty-Humility factor, with an explicit dimension of Honesty and the dimensions of Manipulativeness and Fairness corresponding to HEXACO’s theoretical facets of Sincerity and Fairness, respectively (Ashton & Lee, 2007). Although Integrity overlaps conceptually with Honesty-Humility, the theoretical Modesty content aligned more closely with Sociability, reinforcing the idea that social self-presentation tendencies operate independently from moral character (Hart et al., 2023). Integrity captures moral or socially normative behaviors (Honesty and Fairness) versus more antisocial behaviors (Manipulativeness) associated with the Dark Triad traits of Narcissism, Machiavellianism, and Psychopathy (Howard & Manix, 2022; Howard & Van Zandt, 2020; Paulhus & Williams, 2002) and the Dark Factor of Personality (Moshagen et al., 2018). This association is further underscored by research on workplace psychopathy, which highlights dishonesty and manipulation as key drivers of unethical decision-making and counterproductive behaviors (Hart et al., 2023; Smith & Lilienfeld, 2013). Importantly, by emphasizing manipulation, this conception of Integrity extends beyond merely telling the truth or following rules—it represents the rejection of deceptive, self-serving behaviors (Miller & Schlenker, 2011).

The final second-level dimension identified was Impulsivity, which was defined in earlier frameworks as a component of Psychoticism (Eysenck et al., 1985) and more recently operationalized as a heterogeneous mix of traits often associated with facets of Conscientiousness, Extraversion, and Neuroticism (Whiteside & Lynam, 2001; Zuckerman et al., 1993). The theoretical dimensions of Impulsivity (Whiteside & Lynam, 2001) correspond to specific empirical facets where Premeditation is consistent with the facet of Cautiousness, Sensation-Seeking aligns with Excitement-Seeking and Recklessness, and Urgency is associated with Immoderation, supporting the view that Impulsivity is a multifaceted trait (de Vries et al., 2009; Sharma et al., 2014). This placement in the IPIP-NEO hierarchy raises questions about whether Impulsivity should be conceptualized as a broad trait domain rather than as a subordinate facet within existing frameworks (DeYoung & Rueter, 2016) and whether it may be part of a unique system that works in conjunction with other personality traits (Mullins-Sweatt et al., 2019).

Third-level structure

At the third level, three dimensions emerged that appeared to resemble conceptualizations of the Big Two or Three meta-traits (De Raad et al., 2014; DeYoung, 2006, 2015; Digman, 1997; Saucier et al., 2014). Two of the dimensions retained the labels of Stability and Plasticity (DeYoung, 2006) because they closely resembled the original concepts, despite Agreeableness not emerging as a dimension and its content being redistributed to different traits. In the reorganization of the IPIP-NEO components, Stability (Conscientiousness and Neuroticism) and Plasticity (Sociability and Openness to Experience), appear to align with their theoretical interpretation as an adaptive system of purposeful behavior reflecting motivational control in the pursuit of goals (Conscientiousness) that emerges from low emotional volatility (Neuroticism), and an orientation toward exploring novel internal (Openness to Experience) and external (Sociability) states in the pursuit of goals, respectively (DeYoung, 2006, 2015; Fleeson & Jayawickreme, 2015).

The final third-level dimension, Disinhibition, is a novel addition to the IPIP-NEO and Big Few personality hierarchies, and is composed of Integrity and Impulsivity, representing a broad dispositional tendency characterized by reduced behavioral and emotional regulation, integrating aspects of both externalizing and self-regulatory processes (Mullins-Sweatt et al., 2019). This structure aligns with prior hierarchical models of personality that have consistently identified Disinhibition versus Constraint as a major superordinate dimension spanning normal and maladaptive traits (De Raad et al., 2014; Eysenck et al., 1985; Markon et al., 2005). The integration of Impulsivity and Integrity under this overarching construct reflects a balance between a propensity for rash, sensation-seeking behavior and a susceptibility to ethical and self-control failures, suggesting that Disinhibition encompasses not only impulsive action but also a broader failure to regulate behavior in accordance with internalized social and moral standards (Joyner et al., 2021). This conceptualization is further supported by meta-analytic findings indicating that Disinhibition is closely tied to Conscientiousness but also incorporates elements of low Agreeableness, reinforcing its role as a dimension spanning externalizing and self-regulatory traits (Sharma et al., 2014). Additionally, recent research has emphasized that Disinhibition is an important predictor of a range of maladaptive behaviors, including risk-taking, rule-breaking, and interpersonal insensitivity (Mullins-Sweatt et al., 2019; Ro et al., 2023).

Despite mostly maladaptive connotations, moderate levels of Disinhibition may be adaptive for goal disengagement, acting as a switch that allows someone to stop old, ineffective strategies (Stability) and start new ones (Plasticity) in the pursuit of long-term goals (Clark & Watson, 2008). As people navigate environments in pursuit of their goals, they need to dynamically identify and employ strategies to overcome obstacles that challenge their goal pursuit (DeYoung, 2015; Wrosch et al., 2003). Disinhibition may be an important component of self-regulatory or cybernetic systems of motivated behavior (Carver & White, 1994; Elliot & Thrash, 2002; McNaughton & Gray, 2000; Higgins & Cornwell, 2016; Monni et al., 2020), such that an adaptive state of Disinhibition may be beneficial in situations where normative behaviors are no longer beneficial, allowing people to disengage from current strategies and search for new ones (e.g., Asch’s social conformity experiments, Milgram’s obedience to authority studies, and the Bystander effect; Hirsh et al., 2010; van den Bos et al., 2011). The convergence of Integrity and Impulsivity within this framework suggests that Disinhibition is not merely a reflection of momentary impulse control failures but rather a more pervasive meta-trait that influences behavioral regulation across multiple domains.

Limitations

Although the components of TGA have been thoroughly vetted through simulation studies, some with item pools as large as 180 items (Jiménez et al., 2023), none of the applications so far have included item pools and structures as large as that of the IPIP-NEO. Future simulation studies should evaluate the TGA methods under similar conditions. Our results are also based on a single personality inventory, risking results that have a mono-operation bias (Gallagher et al., 2020). Although the IPIP-NEO is a broad, open-access inventory based on the widely used NEO-PI-R (Costa & McCrae, 2008) and IPIP (Goldberg et al., 2006), future research should continue to investigate other self-report inventories and adjective-based approaches to evaluate item pools beyond those used in this study (including Big Few alternatives; Feher & Vernon, 2021). Similarly, researchers should use TGA in the exploration and evaluation of other hierarchical constructs related to personality, such as intelligence, attitudes, and psychopathology. Aside from the inventory, our data were restricted to a single country (U.S.A.), which may limit the generalizability of the results. Future work should continue contributing to open-source personality data so that researchers and practitioners have access to large, representative datasets that comprehensively sample the content space of personality so that researchers can continue exploring its structure.

Conclusion

The taxonomic structure of personality is the foundation on which subsequent prediction and explanatory models of personality are built (Baumert et al., 2017; Mõttus et al., 2020). Although most contemporary theories of personality start with the Big Few and work down, more recent calls have emphasized the need for more bottom-up, exploratory approaches to validate item-level structures that are often neglected in theoretical-driven approaches (e.g., Condon et al., 2020). To date, a core constraint on such analyses has been the limited availability of validated statistical methods to evaluate complex, hierarchical structures in this manner. This study introduces a promising statistical framework, TGA, that has the capability to capture the full complexity of personality and build from the bottom-up.

Supplemental Material

Supplemental Material - Revisiting the IPIP-NEO personality hierarchy with taxonomic graph analysis

Supplemental Material for Revisiting the IPIP-NEO personality hierarchy with taxonomic graph analysis by Andrew Samo, Luis Eduardo Garrido, Francisco J Abad, Hudson Golino, Samuel T McAbee, and Alexander P Christensen in European Journal of Personality.

Supplemental Material

Supplemental Material - Revisiting the IPIP-NEO personality hierarchy with taxonomic graph analysis

Footnotes

Acknowledgments

The first author would like to thank Christopher M. Gallagher for the introduction to EGA.

Author contributions

Andrew Samo: Conceptualization, data curation, formal analysis, methodology, validation, visualization, writing—original draft, and writing—review and editing. Luis Eduardo Garrido: Conceptualization, formal analysis, methodology, validation, writing—original draft, and writing—review and editing. Francisco J Abad: Formal analysis, methodology, validation, and writing—review and editing. Hudson Golino: Formal analysis, methodology, software, and writing—review and editing. Samuel T McAbee: Conceptualization, supervision, and writing—review and editing. Alexander P Christensen: Conceptualization, data curation, formal analysis, methodology, resources, software, supervision, validation, visualization, writing—original draft, and writing—review and editing.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Open science statement

All project materials are available on the Open Science Framework (OSF): .

ORCID iDs

Andrew Samo

Luis Eduardo Garrido

Francisco J Abad

Samuel T McAbee

Alexander P Christensen

Supplemental Material

Supplemental material for this article is available online.

Notes

References

Achenbach

T. M.

(2021). Hierarchical dimensional models of psychopathology: Yes, but…. World Psychiatry, 20(1), 64–65. https://doi.org/10.1002/wps.20810

Allport

G. W.

Odbert

H. S.

(1936). Trait-names: A psycho-lexical study. Psychological Monographs, 47(1), i–171. https://doi.org/10.1037/h0093360

Anderson

Kilduff

G. J.

(2009). Why do dominant personalities attain influence in face-to-face groups? The competence-signaling effects of trait dominance. Journal of Personality and Social Psychology, 96(2), 491–503. https://doi.org/10.1037/a0014201

Arias

V. B.

Garrido

L. E.

Jenaro

Martínez-Molina

Arias

(2020). A little garbage in, lots of garbage out: Assessing the impact of careless responding in personality survey data. Behavior Research Methods, 52(6), 2489–2505. https://doi.org/10.3758/s13428-020-01401-8

Ashton

M. C.

Lee

(2007). Empirical, theoretical, and practical advantages of the HEXACO model of personality structure. Personality and Social Psychology Review, 11(2), 150–166. https://doi.org/10.1177/1088868306294907

Ashton

M. C.

Lee

(2020). Objections to the HEXACO model of personality structure—and why those objections fail. European Journal of Personality, 34(4), 492–510. https://doi.org/10.1002/per.2242

Bandalos

D. L.

(2002). The effects of item parceling on goodness-of-fit and parameter estimate bias in structural equation modeling. Structural Equation Modeling, 9(1), 78–102. https://doi.org/10.1207/S15328007SEM0901_5

Barrick

M. R.

Mount

M. K.

(1991). The Big Five personality dimensions and job performance: A meta-analysis. Personnel Psychology, 44(1), 1–26. https://doi.org/10.1111/j.1744-6570.1991.tb00688.x

Baumert

Schmitt

Perugini

Johnson

Blum

Borkenau

Costantini

Denissen

J. J.

Fleeson

Grafton

Jayawickreme

Kurzius

MacLeod

Miller

L. C.

Read

S. J.

Roberts

Robinson

M. D.

Wood

Wrzus

(2017). Integrating personality structure, personality process, and personality development. European Journal of Personality, 31(5), 503–528. https://doi.org/10.1002/per.2115

10.

Beck

E. D.

Condon

Jackson

(2023). Interindividual age differences in personality structure. European Journal of Personality, 37(3), 257–275. https://doi.org/10.1177/08902070221084862

11.

Blain

S. D.

Weisberg

Y. J.

Condon

D. M.

DeYoung

C. G.

(2023). Affiliation: A consequential, interstitial trait. PsyArXiv. https://doi.org/10.31234/osf.io/8gf5t

12.

Block

(2010). The Five-Factor framing of personality and beyond: Some ruminations. Psychological Inquiry, 21(1), 2–25. https://doi.org/10.1080/10478401003596626

13.

Blondel

V. D.

Guillaume

J.-L.

Lambiotte

Lefebvre

(2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008

14.

Blum

G. S.

Baumert

Schmitt

(2021). Personality processes—From description to explanation. In Rauthmann

J. F.

(Ed.), The handbook of personality dynamics and processes (pp. 33–55). Elsevier. https://doi.org/10.1016/B978-0-12-813995-0.00002-9

15.

Bringmann

L. F.

Elmer

Eronen

M. I.

(2022). Back to basics: The importance of conceptual clarification in psychological science. Current Directions in Psychological Science, 31(4), 340–346. https://doi.org/10.1177/09637214221096485

16.

Brynjolfsson

(2022). The Turing trap: The promise & perial of human-like artificial intelligence. Daedalus, 151(2), 272–287. https://doi.org/10.1162/daed_a_01915

17.

Burisch

(1984). Approaches to personality inventory construction: A comparison of merits. American Psychologist, 39(3), 214–227. https://doi.org/10.1037/0003-066X.39.3.214

18.

Carver

C. S.

White

T. L.

(1994). Behavioral inhibition, behavioral activation, and affective responses to impending reward and punishment: The BIS/BAS scales. Journal of Personality and Social Psychology, 67(2), 319–333. https://doi.org/10.1037/0022-3514.67.2.319

19.

Castro

Ferreira

T. B.

(2021). Modularity of the personality network. European Journal of Psychological Assessment, 36(6), 998–1008. https://doi.org/10.1027/1015-5759/a000613

20.

Cattell

R. B.

(1943). The description of personality: Basic traits resolved into clusters. Journal of Abnormal and Social Psychology, 38(4), 476–506. https://doi.org/10.1037/h0054116

21.

Cattell

R. B.

Tsujioka

(1964). The importance of factor-trueness and validity, versus homogeneity and orthogonality, in test scales. Educational and Psychological Measurement, 24(1), 3–30. https://doi.org/10.1177/001316446402400101

22.

Chen

(2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. https://doi.org/10.1093/biomet/asn034

23.

Chen

W.-H.

Thissen

(1997). Local dependence indexes for item pairs using item response theory. Journal of Educational and Behavioral Statistics, 22(3), 265–289. https://doi.org/10.3102/10769986022003265

24.

Christensen

A. P.

Cotter

K. N.

Silvia

P. J.

(2019). Reopening openness to experience: A network analysis of four openness to experience inventories. Journal of Personality Assessment, 101(6), 574–588. https://doi.org/10.1080/00223891.2018.1467428

25.

Christensen

A. P.

Garrido

L. E.

Golino

(2023). Unique variable analysis: A network psychometrics method to detect local dependence. Multivariate Behavioral Research, 58(6), 1165–1182. https://doi.org/10.1080/00273171.2023.2194606

26.

Christensen

A. P.

Garrido

L. E.

Guerra-Peña

Golino

(2024). Comparing community detection algorithms in psychometric networks: A Monte Carlo simulation. Behavior Research Methods, 56(3), 1485–1505. https://doi.org/10.3758/s13428-023-02106-4

27.

Christensen

A. P.

Golino

(2021). Estimating the stability of psychological dimensions via bootstrap exploratory graph analysis: A Monte Carlo simulation and tutorial. Psych, 3(3), 479–500. https://doi.org/10.3390/psych3030032

28.

Christensen

A. P.

Golino

Abad

F. J.

Garrido

L. E.

(2025). Revised network loadings. Behavior Research Methods, 57, 114. https://doi.org/10.3758/s13428-025-02640-3

29.

Clark

L. A.

Watson

(2008). Temperament: An organizing paradigm for trait psychology. In John

O. P.

Robins

R. W.

Pervin

L. A.

(Eds.), Handbook of personality: Theory and research (3rd ed., pp. 265–286). Guilford Press.

30.

Clark

L. A.

Watson

(2019). Constructing validity: New developments in creating objective measuring instruments. Psychological Assessment, 31(12), 1412–1427. https://doi.org/10.1037/pas0000626

31.

Condon

D. M.

(2022). The SAPA Personality Inventory: An empirically-derived, hierarchically-organized self-report personality assessment model. PsyArXiv. https://doi.org/10.31234/osf.io/sc4p9

32.

Condon

D. M.

Wood

Mõttus

Booth

Costantini

Greiff

Johnson

Lukaszewski

Murray

Revelle

Wright

A. G. C.

Ziegler

Zimmermann

(2020). Bottom up construction of a personality taxonomy. European Journal of Psychological Assessment, 36(6), 923–934. https://doi.org/10.1027/1015-5759/a000626

33.

Costa

P. T.

McCrae

R. R.

(1992). Normal personality assessment in clinical practice: The NEO personality inventory. Psychological Assessment, 4(1), 5–13. https://doi.org/10.1037/1040-3590.4.1.5

34.

Costa

P. T.

McCrae

R. R.

(1995). Domains and facets: Hierarchical personality assessment using the Revised NEO Personality Inventory. Journal of Personality Assessment, 64(1), 21–50. https://doi.org/10.1207/s15327752jpa6401_2

35.

Costa

P. T.

McCrae

R. R.

(2008). The revised NEO personality inventory (NEO-PI-R). In Boyle

G. J.

Matthews

Saklofske

D. H.

(Eds.), The SAGE handbook of personality theory and assessment (Vol. 2, pp. 179–198). Sage Publications Inc.

36.

Cramer

A. O.

Van der Sluis

Noordhof

Wichers

Geschwind

Aggen

S. H.

Kendler

K. S.

Borsboom

(2012). Dimensions of normal personality as networks in search of equilibrium: You can’t like parties if you don’t like people. European Journal of Personality, 26(4), 414–431. https://doi.org/10.1002/per.1866

37.

De Raad

Barelds

D. P.

Timmerman

M. E.

De Roover

Mlačić

Church

A. T.

(2014). Towards a pan–cultural personality structure: Input from 11 psycholexical studies. European Journal of Personality, 28(5), 497–510. https://doi.org/10.1002/per.1953

38.

DeVellis

R. F.

(2007). Scale development: Theory and applications (3rd ed.). Sage Publications.

39.

de Vries

R. E.

de Vries

Feij

J. A.

(2009). Sensation seeking, risk-taking, and the HEXACO model of personality. Personality and Individual Differences, 47(6), 536–540. https://doi.org/10.1016/j.paid.2009.05.029

40.

DeYoung

C. G.

(2006). Higher-order factors of the Big Five in a multi-informant sample. Journal of Personality and Social Psychology, 91(6), 1138–1151. https://doi.org/10.1037/0022-3514.91.6.1138

41.

DeYoung

C. G.

(2015). Cybernetic Big Five Theory. Journal of Research in Personality, 56(3), 33–58. https://doi.org/10.1016/j.jrp.2014.07.004

42.

DeYoung

C. G.

Peterson

J. B.

Higgins

D. M.

(2002). Higher-order factors of the Big Five predict conformity: Are there neuroses of health? Personality and Individual Differences, 33(4), 533–552. https://doi.org/10.1016/S0191-8869(01)00171-4

43.

DeYoung

C. G.

Quilty

L. C.

Peterson

J. B.

(2007). Between facets and domains: 10 aspects of the Big Five. Journal of Personality and Social Psychology, 93(5), 880–896. https://doi.org/10.1037/0022-3514.93.5.880

44.

DeYoung

C. G.

Rueter

A. R.

(2016). Impulsivity as a personality trait. In Vohs

K. D.

Baumeister

R. F.

(Eds.), Handbook of self-regulation: Research, theory, and applications (3rd ed., pp. 345–363). Guilford Press.

45.

Digman

J. M.

(1997). Higher-order factors of the Big Five. Journal of Personality and Social Psychology, 73(6), 1246–1256. https://doi.org/10.1037/0022-3514.73.6.1246

46.

DiStefano

Motl

R. W.

(2006). Further investigating method effects associated with negatively worded items on self-report surveys. Structural Equation Modeling, 13(3), 440–464. https://doi.org/10.1207/s15328007sem1303_6

47.

Edwards

M. C.

Houts

C. R.

Cai

(2018). A diagnostic procedure to detect departures from local independence in item response theory models. Psychological Methods, 23(1), 138–149. https://doi.org/10.1037/met0000121

48.

Elliot

A. J.

Thrash

T. M.

(2002). Approach-avoidance motivation in personality: Approach and avoidance temperaments and goals. Journal of Personality and Social Psychology, 82(5), 804–818. https://doi.org/10.1037/0022-3514.82.5.804

49.

Epskamp

Fried

E. I.

(2018). A tutorial on regularized partial correlation networks. Psychological Methods, 23(4), 617–634. https://doi.org/10.1037/met0000167

50.

Eysenck

S. B.

Eysenck

H. J.

Barrett

(1985). A revised version of the psychoticism scale. Personality and Individual Differences, 6(1), 21–29. https://doi.org/10.1016/0191-8869(85)90026-1

51.

Feher

Vernon

P. A.

(2021). Looking beyond the Big Five: A selective review of alternatives to the Big Five model of personality. Personality and Individual Differences, 169(1), Article 110002. https://doi.org/10.1016/j.paid.2020.110002

52.

Ferrando

P. J.

Hernandez-Dorado

Lorenzo-Seva

(2022). Detecting correlated residuals in exploratory factor analysis: New proposals and a comparison of procedures. Structural Equation Modeling, 29(4), 1–9. https://doi.org/10.1080/10705511.2021.2004543

53.

Fleeson

Jayawickreme

(2015). Whole trait theory. Journal of Research in Personality, 56(4), 82–92. https://doi.org/10.1016/j.jrp.2014.10.009

54.

Flores‐Kanter

P. E.

Garrido

L. E.

Moretti

L. S.

Medrano

L. A.

(2021). A modern network approach to revisiting the positive and negative affective schedule (PANAS) construct validity. Journal of Clinical Psychology, 77(10), 2370–2404. https://doi.org/10.1002/jclp.23191

55.

Forbes

M. K.

(2024). Improving hierarchical models of individual differences: An extension of Goldberg’s bass-ackward method. Psychological Methods, 29(6), 1062–1073. https://doi.org/10.1037/met0000546

56.

Forbes

M. K.

Baillie

Batterham

P. J.

Calear

Kotov

Krueger

R. F.

Markon

K. E.

Mewton

Pellicano

Roberts

Rodriguez-Seijas

Sunderland

Watson

Watts

A. L.

Wright

A. G. C.

Anna Clark

(2024). Reconstructing psychopathology: A data-driven reorganization of the symptoms in the Diagnostic and Statistical Manual of Mental Disorders. Clinical Psychological Science, 13(3), 21677026241268345. https://doi.org/10.1177/21677026241268345

57.

Forbes

M. K.

Watts

A. L.

Twose

Barrett

Hudson

J. L.

Lyneham

H. J.

McLellan

Newton

N. C.

Sicouri

Chapman

McKinnon

Rapee

R. M.

Slade

Teesson

Markon

Sunderland

(2024). A hierarchical model of the symptom-level structure of psychopathology in youth. Clinical Psychological Science, 13(2), 21677026241257852. https://doi.org/10.1177/21677026241257852

58.

Fortunato

(2010). Community detection in graphs. Physics Reports, 486(3–5), 75–174. https://doi.org/10.1016/j.physrep.2009.11.002

59.

Foygel

Drton

(2010). Extended Bayesian information criteria for Gaussian graphical models. In: Advances in neural information processing systems, 23.

60.

Fried

E. I.

Cramer

A. O.

(2017). Moving forward: Challenges and directions for psychopathological network theory and methodology. Perspectives on Psychological Science, 12(6), 999–1020. https://doi.org/10.1177/1745691617705892

61.

Friedman

Hastie

Tibshirani

(2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9(3), 432–441. https://doi.org/10.1093/biostatistics/kxm045

62.

Funder

D. C.

(2009). Persons, behaviors and situations: An agenda for personality psychology in the postwar era. Journal of Research in Personality, 43(2), 120–126. https://doi.org/10.1016/j.jrp.2008.12.041

63.

Gallagher

C. M.

Samo

Shea

M. A.

Mcabee

S. T.

(2020). Distinguishing between instruments and constructs in Big Six research. European Journal of Personality, 34(4), 526–527.

64.

Garcia-Pardina

Abad

F. J.

Christensen

A. P.

Golino

Garrido

L. E.

(2024). Dimensionality assessment in the presence of wording effects: A network psychometric and factorial approach. Behavior Research Methods, 56(6), 6179–6197. https://doi.org/10.3758/s13428-024-02348-w

65.

Gates

K. M.

Henry

Steinley

Fair

D. A.

(2016). A Monte Carlo evaluation of weighted community detection algorithms. Frontiers in Neuroinformatics, 10, 45. https://doi.org/10.3389/fninf.2016.00045

66.

Goldberg

L. R.

(1990). An alternative “description of personality”: The Big-Five factor structure. Journal of Personality and Social Psychology, 59(6), 1216–1229. https://doi.org/10.1037/0022-3514.59.6.1216

67.

Goldberg

L. R.

(1999). A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. Personality Psychology in Europe, 7(1), 7–28.

68.

Goldberg

L. R.

(2006). Doing it all bass-ackwards: The development of hierarchical factor structures from the top down. Journal of Research in Personality, 40(4), 347–358. https://doi.org/10.1016/j.jrp.2006.01.001

69.

Goldberg

L. R.

Johnson

J. A.

Eber

H. W.

Hogan

Ashton

M. C.

Cloninger

C. R.

Gough

H. G.

(2006). The international personality item pool and the future of public-domain personality measures. Journal of Research in Personality, 40(1), 84–96. https://doi.org/10.1016/j.jrp.2005.08.007

70.

Golino

Christensen

A. P.

(2025). EGAnet: Exploratory Graph Analysis – A framework for estimating the number of dimensions in multivariate data using network psychometrics. R package version 2.2.0. https://doi.org/10.32614/CRAN.package.EGAnet

71.

Golino

Christensen

A. P.

Moulder

Kim

Boker

S. M.

(2022). Modeling latent topics in social media using dynamic exploratory graph analysis: The case of the right-wing and left-wing trolls in the 2016 US elections. Psychometrika, 87(1), 156–187. https://doi.org/10.1007/s11336-021-09820-y

72.

Golino

Epskamp

(2017). Exploratory graph analysis: A new approach for estimating the number of dimensions in psychological research. PLoS One, 12(6), Article e0174035. https://doi.org/10.1371/journal.pone.0174035

73.

Golino

Shi

Christensen

A. P.

Garrido

L. E.

Nieto

M. D.

Sadana

Thiyagarajan

J. A.

Martinez-Molina

(2020). Investigating the performance of exploratory graph analysis and traditional techniques to identify the number of latent factors: A simulation and tutorial. Psychological Methods, 25(3), 292–320. https://doi.org/10.1037/met0000255

74.

Gorsuch

R. L.

(1997). Exploratory factor analysis: Its role in item analysis. Journal of Personality Assessment, 68(3), 532–560. https://doi.org/10.1207/s15327752jpa6803_5

75.

Wen

Fan

(2015). The impact of wording effect on reliability and validity of the Core Self-Evaluation Scale (CSES): A bi-factor perspective. Personality and Individual Differences, 83, 142–147. https://doi.org/10.1016/j.paid.2015.04.006

76.

Hands

Everitt

(1987). A Monte Carlo study of the recovery of cluster structure in binary data by hierarchical clustering techniques. Multivariate Behavioral Research, 22(2), 235–243. https://doi.org/10.1207/s15327906mbr2202_6

77.

Hart

Kinrade

Lambert

J. T.

Breeden

C. J.

Witt

D. E.

(2023). A closer examination of the integrity scale’s construct validity. Journal of Personality Assessment, 105(6), 743–751. https://doi.org/10.1080/00223891.2022.2152346

78.

Higgins

E. T.

Cornwell

J. F. M.

(2016). Securing foundations and advancing frontiers: Prevention and promotion effects on judgment & decision making. Organizational Behavior and Human Decision Processes, 136, 56–67. https://doi.org/10.1016/j.obhdp.2016.04.005

79.

Highhouse

Wang

Zhang

D. C.

(2022). Is risk propensity unique from the big five factors of personality? A meta-analytic investigation. Journal of Research in Personality, 98, 104206. https://doi.org/10.1016/j.jrp.2022.104206

80.

Hirsh

J. B.

DeYoung

C. G.

Peterson

J. B.

(2010). Compassionate liberals and polite conservatives: Associations of agreeableness with political ideology and moral values. Personality and Social Psychology Bulletin, 36(5), 655–664. https://doi.org/10.1177/0146167210366854

81.

Hofstee

W. K.

de Raad

Goldberg

L. R.

(1992). Integration of the Big Five and circumplex approaches to trait structure. Journal of Personality and Social Psychology, 63(1), 146–163. https://doi.org/10.1037/0022-3514.63.1.146

82.

Hogan

(1982). A socioanalytic theory of personality. In Page

(Ed.), Nebraska Symposium on Motivation (pp. 55–89). Lincoln, NE: University of Nebraska Press.

83.

Holland

P. W.

Rosenbaum

P. R.

(1986). Conditional association and unidimensionality in monotone latent variable models. Annals of Statistics, 14(4), 1523–1543. https://doi.org/10.1214/aos/1176350174

84.

Howard

M. C.

Manix

K. G.

(2022). Assessing the shared facets of honesty-humility and machiavellianism. Journal of Individual Differences, 44(2), 1–6. https://doi.org/10.1027/1614-0001/a000384

85.

Howard

M. C.

Van Zandt

E. C.

(2020). The discriminant validity of honesty-humility: A meta-analysis of the HEXACO, Big Five, and Dark Triad. Journal of Research in Personality, 87(2), Article 103982. https://doi.org/10.1016/j.jrp.2020.103982

86.

Irwing

Hughes

D. J.

Tokarev

Booth

(2024). Towards a taxonomy of personality facets. European Journal of Personality, 38(3), 494–515. https://doi.org/10.1177/08902070231200919

87.

Jayawickreme

Zachry

C. E.

Fleeson

(2019). Whole Trait Theory: An integrative approach to examining personality structure and process. Personality and Individual Differences, 136(4), 2–11. https://doi.org/10.1016/j.paid.2018.06.045

88.

Jiménez

Abad

F. J.

Garcia-Garzon

Golino

Christensen

A. P.

Garrido

L. E.

(2023). Dimensionality assessment in bifactor structures with multiple general factors: A network psychometrics approach. Psychological Methods. https://doi.org/10.1037/met0000590

89.

John

O. P.

Naumann

L. P.

Soto

C. J.

(2008). Paradigm shift to the integrative big five trait taxonomy. In John

O. P.

Robins

R. W.

Pervin

L. A.

(Eds.), Handbook of personality: Theory and research (3rd ed., pp. 114–158). Guilford Press.

90.

Johnson

J. A.

(2014). Measuring thirty facets of the Five Factor Model with a 120-item public domain inventory: Development of the IPIP-NEO-120. Journal of Research in Personality, 51, 78–89. https://doi.org/10.1016/j.jrp.2014.05.003

91.

Joyner

K. J.

Daurio

A. M.

Perkins

E. R.

Patrick

C. J.

Latzman

R. D.

(2021). The difference between trait disinhibition and impulsivity—And why it matters for clinical psychological science. Psychological Assessment, 33(1), 29–44. https://doi.org/10.1037/pas0000964

92.

Kajonius

P. J.

Johnson

J. A.

(2019). Assessing the structure of the Five Factor Model of personality (IPIP-NEO-120) in the public domain. Europe's Journal of Psychology, 15(2), 260–275. https://doi.org/10.5964/ejop.v15i2.1671

93.

Kam

C. C. S.

(2018). Why do we still have an impoverished understanding of the item wording effect? An empirical examination. Sociological Methods & Research, 47(3), 574–597. https://doi.org/10.1177/0049124115626177

94.

Kanfer

Frese

Johnson

R. E.

(2017). Motivation related to work: A century of progress. Journal of Applied Psychology, 102(3), 338–355. https://doi.org/10.1037/apl0000133

95.

Kim

Di Domenico

S. I.

Connelly

B. S.

(2019). Self–other agreement in personality reports: A meta-analytic comparison of self- and informant-report means. Psychological Science, 30(1), 129–138. https://doi.org/10.1177/0956797618810000

96.

Kotov

Krueger

R. F.

Watson

Achenbach

T. M.

Althoff

R. R.

Bagby

R. M.

Brown

T. A.

Carpenter

W. T.

Caspi

Clark

L. A.

Eaton

N. R.

Forbes

M. K.

Forbush

K. T.

Goldberg

Hasin

Hyman

S. E.

Ivanova

M. Y.

Lynam

D. R.

Markon

Zimmerman

(2017). The Hierarchical Taxonomy of Psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology, 126(4), 454–477. https://doi.org/10.1037/abn0000258

97.

Krueger

R. F.

Derringer

Markon

K. E.

Watson

Skodol

A. E.

(2012). Initial construction of a maladaptive personality trait model and inventory for DSM-5. Psychological Medicine, 42(9), 1879–1890. https://doi.org/10.1017/S0033291711002674

98.

Laginess

A. J.

(2016). Mapping integrity in the domain of trait personality (Publication No. 3365) [Master’s thesis, Florida International University]. FIU Electronic Theses and Dissertations. https://digitalcommons.fiu.edu/etd/3365

99.

Lahey

B. B.

Moore

T. M.

Kaczkurkin

A. N.

Zald

D. H.

(2021). Hierarchical models of psychopathology: Empirical support, implications, and remaining issues. World Psychiatry, 20(1), 57–63. https://doi.org/10.1002/wps.20824

100.

Lambert

L. S.

Newman

D. A.

(2023). Construct development and validation in three practical steps: Recommendations for reviewers, editors, and authors*. Organizational Research Methods, 26(4), 574–607. https://doi.org/10.1177/10944281221115374

101.

Lancichinetti

Fortunato

(2012). Consensus clustering in complex networks. Scientific Reports, 2(1), 1–7. https://doi.org/10.1038/srep00336

102.

Lawn

E. C.

Laham

S. M.

Zhao

Christensen

A. P.

Smillie

L. D.

(2023). Where the head meets the heart: ‘Enlightened’ compassion lies between Big Five Openness/Intellect and Agreeableness. Collabra: Psychology, 9(1), Article 74468. https://doi.org/10.1525/collabra.74468

103.

Leising

Burger

Zimmermann

Bäckström

Oltmanns

J. R.

Connelly

B. S.

(2024). Why do judgments on different person-descriptive attributes correlate with one another? A conceptual analysis with relevance for most psychometric research. PsyArXiv. https://doi.org/10.31234/osf.io/7c895

104.

Little

T. D.

Rhemtulla

Gibson

Schoemann

A. M.

(2013). Why the items versus parcels controversy needn’t be one. Psychological Methods, 18(3), 285–300. https://doi.org/10.1037/a0033266

105.

MacCallum

R. C.

Widaman

K. F.

Preacher

K. J.

Hong

(2001). Sample size in factor analysis: The role of model error. Multivariate Behavioral Research, 36(4), 611–637. https://doi.org/10.1207/S15327906MBR3604_06

106.

MacCallum

R. C.

Widaman

K. F.

Zhang

Hong

(1999). Sample size in factor analysis. Psychological Methods, 4(1), 84–99. https://doi.org/10.1037/1082-989X.4.1.84

107.

Markon

K. E.

(2009). Hierarchies in the structure of personality traits. Social and Personality Psychology Compass, 3(5), 812–826. https://doi.org/10.1111/j.1751-9004.2009.00213.x

108.

Markon

K. E.

Krueger

R. F.

Watson

(2005). Delineating the structure of normal and abnormal personality: An integrative hierarchical approach. Journal of Personality and Social Psychology, 88(1), 139–157. https://doi.org/10.1037/0022-3514.88.1.139

109.

Marsh

H. W.

Lüdtke

Muthén

Asparouhov

Morin

A. J.

Trautwein

Nagengast

(2010). A new look at the big five factor structure through exploratory structural equation modeling. Psychological Assessment, 22(3), 471–491. https://doi.org/10.1037/a0019227

110.

Marsh

H. W.

Lüdtke

Nagengast

Morin

A. J.

Von Davier

(2013). Why item parcels are (almost) never appropriate: Two wrongs do not make a right—Camouflaging misspecification with item parcels in CFA models. Psychological Methods, 18(3), 257–284. https://doi.org/10.1037/a0032773

111.

Maydeu-Olivares

Coffman

D. L.

(2006). Random intercept item factor analysis. Psychological Methods, 11(4), 344–362. https://doi.org/10.1037/1082-989X.11.4.344

112.

McCrae

R. R.

(2015). A more nuanced view of reliability: Specificity in the trait hierarchy. Personality and Social Psychology Review, 19(2), 97–112. https://doi.org/10.1177/1088868314541857

113.

McCrae

R. R.

Costa

P. T.

(1985). Updating Norman’s “adequacy taxonomy”: Intelligence and personality dimensions in natural language and in questionnaires. Journal of Personality and Social Psychology, 49(3), 710–721. https://doi.org/10.1037/0022-3514.49.3.710

114.

McCrae

R. R.

John

O. P.

(1992). An introduction to the five-factor model and its applications. Journal of Personality, 60(2), 175–215. https://doi.org/10.1111/j.1467-6494.1992.tb00970.x

115.

McGrew

K. S.

(2009). CHC theory and the human cognitive abilities project: Standing on the shoulders of the giants of psychometric intelligence research. Intelligence, 37(1), 1–10. https://doi.org/10.1016/j.intell.2008.08.004

116.

McNaughton

Gray

J. A.

(2000). Anxiolytic action on the behavioural inhibition system implies multiple types of arousal contribute to anxiety. Journal of Affective Disorders, 61(3), 161–176. https://doi.org/10.1016/S0165-0327(00)00344-X

117.

Miller

M. L.

Schlenker

B. R.

(2011). Integrity and identity: Moral identity differences and preferred interpersonal reactions. European Journal of Personality, 25(1), 2–15. https://doi.org/10.1002/per.765

118.

Milligan

G. W.

(1981). A Monte Carlo study of thirty internal criterion measures for cluster analysis. Psychometrika, 46(2), 187–199. https://doi.org/10.1007/BF02293899

119.

Monni

Olivier

Morin

A. J. S.

Olivetti Belardinelli

Mulvihill

Scalas

L. F.

(2020). Approach and avoidance in Gray’s, Higgins’, and Elliot’s perspectives: A theoretical comparison and integration of approach-avoidance in motivated behavior. Personality and Individual Differences, 166, 110163. https://doi.org/10.1016/j.paid.2020.110163

120.

Montoya

A. K.

Edwards

M. C.

(2021). The poor fit of model fit for selecting number of factors in exploratory factor analysis for scale evaluation. Educational and Psychological Measurement, 81(3), 413–440. https://doi.org/10.1177/0013164420942899

121.

Moshagen

Hilbig

B. E.

Zettler

(2018). The dark core of personality. Psychological Review, 125(5), 656–688. https://doi.org/10.1037/rev0000111

122.

Mõttus

(2016). Towards more rigorous personality trait–outcome research. European Journal of Personality, 30(4), 292–303. https://doi.org/10.1002/per.2041

123.

Mõttus

Wood

Condon

D. M.

Back

M. D.

Baumert

Costantini

Epskamp

Greiff

Johnson

Lukaszewski

Murray

Revelle

Wright

A. G. C.

Yarkoni

Ziegler

Zimmermann

(2020). Descriptive, predictive and explanatory personality research: Different goals, different approaches, but a shared need to move beyond the big few traits. European Journal of Personality, 34(6), 1175–1201. https://doi.org/10.1002/per.2311

124.

Mullins-Sweatt

S. N.

DeShong

H. L.

Lengel

G. J.

Helle

A. C.

Krueger

R. F.

(2019). Disinhibition as a unifying construct in understanding how personality dispositions undergird psychopathology. Journal of Research in Personality, 80, 55–61. https://doi.org/10.1016/j.jrp.2019.04.006

125.

Musek

(2007). A general factor of personality: Evidence for the Big One in the five-factor model. Journal of Research in Personality, 41(6), 1213–1233. https://doi.org/10.1016/j.jrp.2007.02.003

126.

Nieto

M. D.

Garrido

L. E.

Martínez-Molina

Abad

F. J.

(2021). Modeling wording effects does not help in recovering uncontaminated person scores: A systematic evaluation with random intercept item factor analysis. Frontiers in Psychology, 12, 685326. https://doi.org/10.3389/fpsyg.2021.685326

127.

Nowick

Gernat

Almaas

Stubbs

(2009). Differences in human and chimpanzee gene expression patterns define an evolving network of transcription factors in brain. Proceedings of the National Academy of Sciences of the United States of America, 106(52), 22358–22363. https://doi.org/10.1073/pnas.0911376106

128.

Olaru

Schroeders

Wilhelm

Ostendorf

(2019). ‘Grandpa, do you like roller coasters?’: Identifying age–appropriate personality indicators. European Journal of Personality, 33(3), 264–278. https://doi.org/10.1002/per.2185

129.

OpenAI . (2023). ChatGPT (Mar 14 version) [Large language model]. https://chat.openai.com/chat

130.

Ozer

D. J.

Benet-Martínez

(2006). Personality and the prediction of consequential outcomes. Annual Review of Psychology, 57(1), 401–421. https://doi.org/10.1146/annurev.psych.57.102904.190127

131.

Paulhus

D. L.

Williams

K. M.

(2002). The Dark Triad of personality: Narcissism, Machiavellianism, and psychopathy. Journal of Research in Personality, 36(6), 556–563. https://doi.org/10.1016/S0092-6566(02)00505-6

132.

R Core Team . (2023). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://R-project.org

133.

Reise

S. P.

Bonifay

Haviland

M. G.

(2018). Bifactor modelling and the evaluation of scale scores. In Irwing

Booth

Hughes

D. J.

(Eds.), The Wiley handbook of psychometric testing (pp. 675–707). Wiley. https://doi.org/10.1002/9781118489772.ch22

134.

Reise

S. P.

Waller

N. G.

(2009). Item response theory and clinical measurement. Annual Review of Clinical Psychology, 5(1), 27–48. https://doi.org/10.1146/annurev.clinpsy.032408.153553

135.

Revelle

Condon

(2025). Unidim: An index of scale homogeneity and unidimensionality. Psychological Methods. https://doi.org/10.1037/met0000729

136.

Ringwald

W. R.

Wright

A. G. C.

(2021). The affiliative role of empathy in everyday interpersonal interactions. European Journal of Personality, 35(2), 197–211. https://doi.org/10.1002/per.2286

137.

Vittengl

J. R.

Jarrett

R. B.

Clark

L. A.

(2023). Disinhibition domain and facets uniquely predict changes in depressive symptoms and psychosocial functioning. Personality and Mental Health, 17(4), 363–376. https://doi.org/10.1002/pmh.1585

138.

Roberts

B. W.

DelVecchio

W. F.

(2000). The rank-order consistency of personality traits from childhood to old age: A quantitative review of longitudinal studies. Psychological Bulletin, 126(1), 3–25. https://doi.org/10.1037/0033-2909.126.1.3

139.

Roberts

B. W.

Yoon

H. J.

(2022). Personality psychology. Annual Review of Psychology, 73(1), 489–516. https://doi.org/10.1146/annurev-psych-020821-114927

140.

Sanz-García

García-Vera

M. P.

Sanz

(2024). Is it time to replace the Big Five personality model? Factorial structure of the NEO PI-R in a community sample of Spanish adults. The Journal of General Psychology, 151(3), 335–356. https://doi.org/10.1080/00221309.2023.2261136

141.

Saris

W. E.

Satorra

Van der Veld

W. M.

(2009). Testing structural equation models or detection of misspecifications? Structural Equation Modeling: A Multidisciplinary Journal, 16(4), 561–582. https://doi.org/10.1080/10705510903203433

142.

Saucier

Iurino

(2020). High-dimensionality personality structure in the natural language: Further analyses of classic sets of English-language trait-adjectives. Journal of Personality and Social Psychology, 119(5), 1188–1219. https://doi.org/10.1037/pspp0000273

143.

Saucier

Thalmayer

A. G.

Payne

D. L.

Carlson

Sanogo

Ole‐Kotikash

Church

A. T.

Katigbak

M. S.

Somer

Szarota

Szirmák

Zhou

(2014). A basic bivariate structure of personality attributes evident across nine languages. Journal of Personality, 82(1), 1–14. https://doi.org/10.1111/jopy.12028

144.

Schmalbach

Zenger

Michaelides

M. P.

Schermelleh-Engel

Hinz

Körner

Beutel

M. E.

Decker

Kliem

Brähler

(2020). From bi-dimensionality to Uni-dimensionality in self-report questionnaires. European Journal of Psychological Assessment, 37(2), 135–148. https://doi.org/10.1027/1015-5759/a000583

145.

Schwaba

Rhemtulla

Hopwood

C. J.

Bleidorn

(2020). A facet atlas: Visualizing networks that describe the blends, cores, and peripheries of personality structure. PLoS One, 15(7), Article e0236893. https://doi.org/10.1371/journal.pone.0236893

146.

Sharma

Markon

K. E.

Clark

L. A.

(2014). Toward a theory of distinct types of “impulsive” behaviors: A meta-analysis of self-report and behavioral measures. Psychological Bulletin, 140(2), 374–408. https://doi.org/10.1037/a0034418

147.

Siepe

B. S.

Bartoš

Morris

T. P.

Boulesteix

A.-L.

Heck

D. W.

Pawel

(2024). Simulation studies for methodological research in psychology: A standardized template for planning, preregistration, and reporting. Psychological Methods. https://doi.org/10.1037/met0000695

148.

Smith

S. F.

Lilienfeld

S. O.

(2013). Psychopathy in the workplace: The knowns and unknowns. Aggression and Violent Behavior, 18(2), 204–218. https://doi.org/10.1016/j.avb.2012.11.007

149.

Soto

C. J.

(2019). How replicable are links between personality traits and consequential life outcomes? The Life Outcomes of Personality Replication Project. Psychological Science, 30(5), 711–727. https://doi.org/10.1177/0956797619831612

150.

Swain

S. D.

Weathers

Niedrich

R. W.

(2008). Assessing three sources of misresponse to reversed Likert items. Journal of Marketing Research, 45(1), 116–131. https://doi.org/10.1509/jmkr.45.1.116

151.

Syed

(2024). Where are race, ethnicity, and culture in personality research? Personality Science, 5, 27000710241257348. https://doi.org/10.1177/27000710241257348

152.

Thielmann

Moshagen

Hilbig

Zettler

(2022). On the comparability of basic personality models: Meta-analytic correspondence, scope, and orthogonality of the Big Five and HEXACO dimensions. European Journal of Personality, 36(6), 870–900. https://doi.org/10.1177/08902070211026793

153.

Tupes

E. C.

Christal

R. E.

(1961). Recurrent personality factors based on trait ratings. USAF ASD technical report No. 61–97. U.S. Air Force.

154.

van den Bos

Müller

P. A.

Damen

(2011). A behavioral disinhibition hypothesis of interventions in moral dilemmas. Emotion Review, 3(3), 281–283. https://doi.org/10.1177/1754073911402369

155.

Ward

J. H.

(1963). Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association, 58(301), 236–244. https://doi.org/10.1080/01621459.1963.10500845

156.

Watts

A. L.

Greene

A. L.

Bonifay

Fried

E. I.

(2024). A critical evaluation of the p-factor literature. Nature Reviews Psychology, 3(2), 108–122. https://doi.org/10.1038/s44159-023-00260-2

157.

Weijters

Baumgartner

Schillewaert

(2013). Reversed item bias: An integrative model. Psychological Methods, 18(3), 320–334. https://doi.org/10.1037/a0032121

158.

Whiteside

S. P.

Lynam

D. R.

(2001). The five factor model and impulsivity: Using a structural model of personality to understand impulsivity. Personality and Individual Differences, 30(4), 669–689. https://doi.org/10.1016/S0191-8869(00)00064-7

159.

Widaman

K. F.

(2006). Best practices in quantitative methods for developmentalists: III. Missing data: What to do with or without them. Monographs of the Society for Research in Child Development, 71(3), 42–64. https://doi.org/10.1111/j.1540-5834.2006.00404.x

160.

Williams

D. R.

Rhemtulla

Wysocki

A. C.

Rast

(2019). On nonregularized estimation of psychological networks. Multivariate Behavioral Research, 54(5), 719–750. https://doi.org/10.1080/00273171.2019.1575716

161.

Wood

J. M.

Tataryn

D. J.

Gorsuch

R. L.

(1996). Effects of under-and overextraction on principal axis factor analysis with varimax rotation. Psychological Methods, 1(4), 354–365. https://doi.org/10.1037/1082-989X.1.4.354

162.

Wrosch

Scheier

M. F.

Miller

G. E.

Schulz

Carver

C. S.

(2003). Adaptive self-regulation of unattainable goals: Goal disengagement, goal reengagement, and subjective well-being. Personality and Social Psychology Bulletin, 29(12), 1494–1508. https://doi.org/10.1177/0146167203256921

163.

Wulff

D. U.

Mata

(2023). Automated jingle–jangle detection: Using embeddings to tackle taxonomic incommensurability. PsyArXiv. https://doi.org/10.31234/osf.io/9h7aw

164.

Yang

Algesheimer

Tessone

C. J.

(2016). A comparative analysis of community detection algorithms on artificial networks. Scientific Reports, 6(1), 1–18. https://doi.org/10.1038/srep30750

165.

Zuckerman

Kuhlman

D. M.

Joireman

Teta

Kraft

(1993). A comparison of three structural models for personality: The big three, the big five, and the alternative five. Journal of Personality and Social Psychology, 65(4), 757–768. https://doi.org/10.1037/0022-3514.65.4.757

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.67 MB

0.00 MB

1.76 MB