Abstract
Top management teams (TMTs) at most companies worldwide are male-dominated (Dezső, Ross, & Uribe, 2016), with female underrepresentation relatively intractable despite increased recent focus (Jeong & Harrison, 2017). Given the small, stagnating share of women at the top, research has explored barriers to mobility (Fernandez-Mateo & Fernandez, 2016; Helfat, Harris, & Wolfson, 2006) but devoted less attention to the nature of top-level positions women take. In particular, while social psychologists have probed the possibility that companies in crisis will more likely appoint women to executive positions (Ryan & Haslam, 2007), this glass cliff hypothesis has received less attention in management research. If the glass cliff exists, women would be relatively more likely than men to occupy risky executive positions set up for failure (Cook & Glass, 2014).
Anecdotal evidence appears to support the glass cliff hypothesis. Prominent cases include Anne Mulcahy, promoted to chief executive officer (CEO) with Xerox on the brink of bankruptcy, and Marissa Mayer, who became Yahoo's first female chief executive when the internet giant was badly struggling (for more examples, see Ryan et al., 2016). However, evidence beyond single cases is much less consistent (Dwivedi, Joshi, & Misangyi, 2018). Field studies examining the occurrence of the glass cliff in executive staffing have reported some supportive (e.g., Cook & Glass, 2014; Mulcahy & Linehan, 2014) and some contradictory evidence (e.g., Adams, Gupta, & Leeth, 2009; Bechtoldt, Bannier, & Rock, 2019). Similarly, a recent meta-analysis revealed no robust evidence for glass cliffs in the management domain, with substantial heterogeneity in effect sizes across studies (Morgenroth, Kirby, Ryan, & Sudkämper, 2020). These inconsistent findings fuel heated debate: Some experts argue the glass cliff is more of a myth than a real phenomenon (Bechtoldt et al., 2019), but proponents contend it “is a real and reasonably pervasive phenomenon” (Ryan et al., 2016: 449).
We argue that an extended theoretical perspective and a rigorous empirical strategy are required to explain inconsistent effects and advance the glass cliff debate. Theoretically, many field studies on the glass cliff have been more descriptive (
We take a different route by attributing glass cliffs not to the ascribed behavioral advantage of women in leading crisis firms but rather to the symbolic value of female appointments under crisis. Based on signaling theory (Connelly, Certo, Ireland, & Reutzel, 2011; Spence, 2002), we suggest that, regardless of whether female leaders are perceived to have a behavioral advantage in leading crisis firms (i.e., think crisis
Based on signaling theory's core rationale (Spence, 2002), that actors consider the symbolic value of their decisions, we suggest that under clearly defined conditions, crisis firms may appoint females to signal change to investors. Female executive appointments generally signal forward-thinking and a willingness to break with long-held practices (Miller & Triana, 2009)—characteristics that might otherwise be unclear or invisible to the market.
Signaling theory suggests that signaling decisions are shaped by the nature of the signaler, signal, and receiver (Connelly et al., 2011; Gomulya & Mishina, 2017). We theorize that all three facets determine if a female executive appointment is an attractive signal for a firm. The presence/absence of other female executives before the focal appointment should be a central characteristic of the signaler. We assume that appointment type (i.e., insider vs. outsider) is an aspect of the signal itself; and we categorize investors’ level of attention toward the firm as a receiver-related factor. Based on signaling theory's core tenets, we theorize that the signaling value of female crisis appointments is most pronounced (i.e., glass cliffs most likely) for firms with no existing female top managers, for insider appointments, and for firms receiving high investor attention.
Empirical limitations may further explain inconsistent results as past glass cliff studies left a potential endogeneity bias mostly unaddressed. In particular, omitted variables and reverse causality might have led to erroneous conclusions (Hill, Johnson, Greco, O’Boyle, & Walter, 2021). Most notably, a firm's crisis is likely endogenous to the appointed executive's gender: A firm's financial state possibly correlates with many unobserved confounding variables (e.g., firm culture, industry norms, predecessor network) also affecting the likelihood of female appointments. Consequently, these omitted variables likely bias the correlation between firm crisis and likelihood of female appointments. Moreover, the estimated correlation might be affected by reverse causality if gender diversity in the TMT affects the firm's financial health (Dezsö & Ross, 2012). Ignoring such issues leads to estimates without a causal interpretation that are higher/lower than or in the opposite direction to the true estimate; the bias may vary widely across studies. For a clean causal interpretation, crisis status must be randomly assigned to companies—impossible in a field setting. To address this randomization problem, we rely on a quasi-experimental regression discontinuity design that exploits random variance in crisis status assignment (Lee & Lemieux, 2010). We employ this identification strategy in a large self-constructed dataset of 26,156 executive turnovers from 2000 to 2016 in the United States. Moreover, we content analyze press releases on executive appointments for change-related language, aiming to capture the theorized signaling rationale of crisis appointments.
The study contributes to research in three notable ways. First, by integrating economic signaling theory (Spence, 2002) into the social psychology domain of glass cliff research, we gain more holistic understanding of the glass cliff. While Ryan and Haslam's (2007) seminal theorizing on glass cliffs identified the potential relevance of signaling efforts, they neglected core assumptions of signaling theory and their perspective has received scant empirical attention (for a recent scenario study, see Kulich, Lorenzi-Cioldi, Iacoviello, Faniko, & Ryan, 2015). By directly considering signaling theory, our work spotlights neglected moderators of the glass cliff and answers calls for context-sensitivity (Morgenroth et al., 2020; Ryan & Haslam, 2009).
Second, the study contributes to research on gender and upper echelons by delineating how and when the executive's gender might signal the firm's quality under crisis. Following Connelly et al.'s (2011) call to improve understanding of management phenomena by considering signaling, we move beyond the dominant behavioral perspective in upper echelons research (i.e., upper echelons theory; Finkelstein, Hambrick, & Cannella, 2009; Hambrick & Mason, 1984). Traditional upper echelons research emphasizes the effect of executive gender on strategic decisions and firm performance (Dezsö & Ross, 2012; Jeong & Harrison, 2017). We theorize on the symbolic value of leader gender, independent of actual leader behaviors, as an alternative to the behavioral perspective and elaborate on potential boundary conditions of the signaling effect. By considering the signaling value of executive gender and the three key components of signaling decisions (signaler, signal, receiver) as contextual factors, we also enrich the executive signaling literature, which has so far focused on the signaling function of experience and qualification (e.g., Gomulya, Wong, Ormiston, & Boeker, 2017; Zhang & Wiersema, 2009) and rarely tested all key components of signaling in one model.
Last, our study answers recent calls for more rigorous causal evidence in management research (Hill et al., 2021; Semadeni, Withers, & Trevis Certo, 2014) and leadership research (Adams, 2016; Antonakis, Bendahan, Jacquart, & Lalive, 2010). Leading researchers have encouraged more use of natural experiments, such as regression discontinuity designs, to establish causal effects in field settings. Regression discontinuity designs have a long tradition in economics and related disciplines and come closest to the gold standard of randomized experiments (Lee & Lemieux, 2010). Our study aims to inspire natural experiments to answer theoretically relevant causal questions in leadership and diversity research.
Theory and Hypotheses
The basic tenet of the glass cliff hypothesis is that firms will more likely appoint women to executive positions when in crisis (Ryan & Haslam, 2007: 553), with crisis defined “as any form of dramatic reductions in financial and/or reputational well-being that has an adverse bearing on the perceived state of the organization.” While some recent studies have moved away from this rigid definition and considered any drop in company performance as the stimulus for a glass cliff (e.g., Adams et al., 2009; Cook & Glass, 2014), we return to the original crisis conceptualization based on the theoretical reasoning that only a strong crisis pushes companies to incur the costs of signaling and fundamentally alter the long-held practice of male executive appointments. We thus define crisis status as a company's dramatically and unambiguously weak financial reputation.
While early research pointed to a myriad of mechanisms that may create glass cliffs (Ryan & Haslam, 2007), most theoretical work focuses on gender and leadership stereotypes as the central force (Bruckmüller & Branscombe, 2010; Cook & Glass, 2014; Ryan et al., 2011). Based on gender and leadership stereotypes studies (Heilman, 2001; Schein, 1973), glass cliff researchers argue that stereotypical male traits are generally in line with ideas about successful leadership, whereas presumed female traits tend to be incompatible with the successful leader prototype, resulting in a bias toward male leadership. Extending this role (in)congruency argument, they suggest that the conception of a good leader changes under crisis because greater emotional sensitivity and interpersonal skills are required to make difficult personal decisions (Cook & Glass, 2014; Ryan & Haslam, 2007). Female leaders with ascribed communal attributes are thus seen as more capable in times of crisis than men with ascribed agentic qualities (Bruckmüller & Branscombe, 2010; Cook & Glass, 2014).
We do not challenge the robust evidence on gender and leadership stereotypes and their discriminatory effect in hiring and promotion decisions (Cheung et al., 2016; Colella, Hebl, & King, 2017; Koenig, Eagly, Mitchell, & Ristikari, 2011); rather, we argue that the stereotyping logic may not fully explain the glass cliff phenomenon. In this context, the stereotyping mechanism rests on the idea that female leaders are appointed under crisis because their leader behavior is believed to enhance company performance. Even if we assume that the hiring committee may be subject to gendered leadership stereotypes, it seems unlikely that one woman with her perceived crisis leadership competences would be expected to single-handedly steer a company out of crisis. Morgenroth et al.'s (2020) meta-analysis further substantiates the incompleteness of stereotyping arguments in explaining the glass cliff. Finding that other disadvantaged groups for which female stereotypes do not exist (e.g., Black and Asian Americans) are appointed onto glass cliffs, the authors concluded that the stereotyping mechanism is not the only driver of glass cliffs (Morgenroth et al., 2020).
A plausible alternative is that female leaders are selected because appointing females to upper echelons increases a firm's value by altering market expectations of its willingness to change (Miller & Triana, 2009) and not because of the stereotypical conviction that female leader behavior increases firm performance. We suggest that the signaling aspect of a female appointment plays a key role in creating glass cliffs. Emerging research generally supports the relevance of signaling in top management contexts where stakeholders infer firm quality from the characteristics of board members or top managers (Certo, 2003; Zhang & Wiersema, 2009). Yet signaling logic has been largely neglected and remains underdeveloped in the context of female executive appointments. Moreover, the few studies touching on signaling logic in this context (e.g., Miller & Triana, 2009; Ryan & Haslam, 2007) neglect the role of the signaler, signal, and receiver as core aspects of signaling theory (Bergh, Connelly, Ketchen, & Shannon, 2014; Connelly et al., 2011). Thus, developing the signaling logic presents an opportunity for theory-guided inspection of potential moderators of glass cliffs, enabling more nuanced, contextualized predictions for when glass cliffs occur.
Signaling Through Glass Cliff Appointments
Companies in crisis may appoint a female executive to signal to investors their willingness to change. The incentives for sending positive signals include influencing stock performance, facilitating the provision of financial resources by capital markets, and enabling more time to reorganize (Kulich et al., 2015; Ndofor & Levitas, 2004). Female appointments signal a change from entrenched male status structures to forward-thinking and sensitivity to evolving social norms, which might be inconsistent with commitment to the status quo.
Using a top manager's gender to shape evaluations of a struggling firm is well-aligned with tenets of signaling theory (Connelly et al., 2011), which focuses on information asymmetry between two parties (Spence, 2002) and explains why parties (e.g., companies) deliberately communicate information to stress positive attributes. Observers find it difficult to ascertain a crisis firm's recovery prospects because of incomplete information about internal operations, the industriousness of management and employees, and reorganization efforts; by contrast, managers have better access to such information (Cohen & Dean, 2005; Xia, Dawley, Jiang, Ma, & Boal, 2016). This creates an information asymmetry in which observers (e.g., investors) use visible cues to make inferences about the firm's unobservable attributes (e.g., change efforts); thus, the company can use signals to indicate maximum commitment to getting back on track (Gangloff, Connelly, & Shook, 2016).
Signaling theory conceptualizes effective signals as those that are
Transferring the general logic of signal observability and signaling costs to the context of executive appointments under crisis, we suggest that female leader appointments are effective change signals: First, an executive's gender can be readily observed, and female executive appointments more often receive media coverage than do male executive appointments (Gaughan & Smith, 2016; Lee & James, 2007). Second, appointing a female executive is more costly for firms that internally want to keep the status quo than for firms committed to fundamental change. The higher cost of female appointments for change-reluctant firms is substantiated by research on identity concerns in economic decision-making (Akerlof & Kranton, 2005; Akerlof & Kranton, 2010). Studies have shown that the dominant male majority in executive suites resists female executive appointments and tends to enact exclusionary strategies (Dezső et al., 2016; Knippen, Shen, & Zhu, 2019; Zhu, Shen, & Hillman, 2014). Thus, for companies unwilling to break entrenched status structures, a female appointment is particularly costly as it counters the internal norm of status quo preservation; for change-willing firms, by contrast, internal norm violation is less pronounced.
It may also be more difficult to hire a suitable female candidate who anticipates her window-dressing function, particularly where executives are required to hold shares in the company, entailing personal financial commitment and, thus, financial risk (Korczak & Liu, 2014). Moreover, where women detect that the hiring firm is unwilling to change, they may demand higher compensation because of the difficulties in leading such a company (for a similar argument linking executive risk to compensation, see Hermalin & Weisbach, 2012). By contrast, a change-willing company may find it easier to hire a female leader and without high upfront compensation, as the female candidates may see better long-term prospects for the company. In sum, appointing a female executive can be seen as a credible signal to investors that the company is actively dealing with the crisis and radically deviating from the status quo (Kulich et al., 2015; Ryan & Haslam, 2007).
In glass cliff studies, initial evidence suggests the importance of signaling. In a recent scenario experiment, Kulich et al. (2015) found that crisis firms’ preference for female leadership candidates is grounded in the potential of female leaders to communicate change. In line with the theoretical arguments outlined above, we propose:
Moderators of Glass Cliff Appointments
While crisis status might increase the likelihood of female appointments as a signaling effort, the decision to signal plausibly has multiple determinants. As Connelly et al. (2011) argue, more accurate understanding of signaling decisions requires consideration of the signaler, signal, and receiver (see also Gomulya & Boeker, 2014). In our study, the presence of other female executives before the focal appointment is a signaler characteristic, the appointment type (insider vs. outsider) is an aspect of the signal, and the level of investors’ attention toward the firm represents the receiver's role. As we outline below, all three components might shape a crisis firm's decision to signal change through appointing females.
The signaler
We argue that the existing presence of one or more women in top management is a central attribute of the signaler (i.e., the crisis firm) that influences the incentive to appoint a female executive in response to a crisis. In general, appointing a woman into a top executive position can attract favorable attention for the firm (Kanter, 1977; Wright, Ferris, Hiller, & Kroll, 1995). Yet given the costs underlying a signaling effort, the firm considers the expected utility of a signal before deciding whether to send it (Bergh et al., 2014; Ndofor & Levitas, 2004). Specifically, crisis firms may perceive that female appointments have only marginal signaling value when there are already one or more female executives. In such a scenario, appointing a female executive is no longer a notable and costly deviation from the status quo. Moreover, this signal may have lost credibility as the first female appointment did not generate change or prevent the firm slipping into crisis. Indeed, Dezső et al. (2016: 100) note that “while firms gain legitimacy from having women in top management, the marginal value of this legitimacy declines with each woman.” Accordingly, we propose:
The signal
The availability of alternative signals may also influence a company's decision to appoint a female executive as a viable signal. Beyond a successor's gender, another particularly powerful signal in executive appointments—receiving wide attention in past research—is successor origin (Connelly, Ketchen, Gangloff, & Shook, 2016; Gangloff et al., 2016). Friedman and Singh (1989: 726) even contend that successor origin conveys “the clearest signal among the messages implicit in succession.”
We suggest that appointing an outsider is a relevant change signal, although this is unlikely to co-occur with a female appointment as a change signal. In general, past signaling research acknowledges that firms aim to send consistent and reinforcing signals (Connelly et al., 2011); hence, one may expect that crisis companies opt for the strongest change signal possible by appointing a female outsider. At the same time, crisis companies need to balance between signaling their change willingness through a notable deviation from the status quo and signaling their capability for successful change implementation.
While appointing a nonprototypical female outsider is the clearest signal in terms of change willingness, the signal is less powerful on the implementation dimension because of an outsider versus insider signaling trade-off: Outsiders typically stand for fundamental change but difficulties in the implementation process, while insiders potentially facilitate change implementation but stand for less fundamental change (Georgakakis & Ruigrok, 2017; Shen & Cannella, 2002). Outsiders are prized for their fresh perspectives and independence from existing networks and traditions (Gangloff et al., 2016; Shen & Cannella, 2002). Conversely, their lack of company-specific knowledge and their smaller networks inside a company may limit outsiders’ power to initiate and implement change (Berns & Klarner, 2017; Kotter, 1982; Zhang & Rajagopalan, 2010), especially in the unstable context of a company struggling financially (Berns & Klarner, 2017; Georgakakis & Ruigrok, 2017). When considering executive gender, appointing a female outsider in contrast with a prototypical male outsider may signal high change willingness but particularly weak change implementation likelihood, as the combination of outsider origin and gender difference from sitting male colleagues may result in a double outsider status, making the implementation of change more difficult for a leader (Georgakakis & Ruigrok, 2017).
By contrast, insider appointments are less representative of fundamental change; however, insiders can draw on company- and industry-specific knowledge, networks, and established relationships with employees to quickly and successfully initiate change (Berns & Klarner, 2017; Kotter, 1982; Zhang & Rajagopalan, 2010). Crisis firms appointing an insider may find a female hire a viable strategy to signal notable deviation from the status quo and quick change implementation—important when close to bankruptcy. In support of this argument, Georgakakis and Ruigrok (2017) found that outsider succession is only positively related to subsequent firm performance when a new CEO is similar sociodemographically to incumbent executives. Thus, we suggest that the marginal signaling value from appointing a female insider instead of a female/male outsider is negative or nonexistent. We propose:
The receiver
The firm's signaling decision might also be influenced by who will likely observe and interpret a signal. Signaling change can be an act of investor management, where signals received and positively interpreted by investors may restore their confidence in the company (Gangloff et al., 2016; Huang & Thakor, 2013). Whereas current owners and executives have access to extensive information about the firm's change efforts and existing internal resistance to change, investors have relatively little access to such insights (Cohen & Dean, 2005; Ndofor & Levitas, 2004). A crisis firm appointing a female executive may send a signal to investors from which the firm's change-willingness is inferred, thereby restoring trust in the company's potential.
Yet signaling efforts are unlikely to succeed when potential investors are not looking for a signal (Vergne, Wernicke, & Brenner, 2018). Accordingly, Connelly et al. (2011) suggest that receiver attention—the extent to which receivers vigilantly scan the environment for signals—is a key part of the signaling process. We predict that the level of investor attention directed to the firm might influence its decision to appoint a female executive for signaling purposes. Attention toward female leader appointments might vary based on a range of factors, such as company media exposure, industry visibility, and other concurrent public events (Chang, Milkman, Chugh, & Akinola, 2019). As each credible signaling effort entails costs, crisis firms will carefully evaluate the cost–benefit ratio of a signal (Bergh et al., 2014). Accordingly, we expect investor attention to moderate the link between crisis status and female executive appointments because crisis firms on which investors are focused will more likely signal change through female leader appointments.
Greater attention might also be important for overcoming the resistance of sitting male executives to appointing a female leader. The male majority, still found in almost all TMTs, will likely resist female leader appointments as a fundamental challenge to male leadership and status (Knippen et al., 2019; Zhu et al., 2014). In a company under higher scrutiny, decision-makers may feel greater pressure to engage in legitimacy-seeking behaviors (Chang et al., 2019), so male executives’ resistance is more likely overruled.
These predictions align with prior findings that public (e.g., investor) attention shapes upper echelons’ staffing decisions and composition, for instance by influencing executive turnover (Boivie, Graffin, & Pollock, 2012; Harrison, Boivie, Sharp, & Gentry, 2018), board gender composition (Knippen et al., 2019), and career outcomes for executives involved in corporate fraud (Naumovska, Wernicke, & Zajac, 2020). We thus propose:
Method
Data
We analyze 26,156 top manager turnovers in 3,883 U.S. public firms between 2000 and 2016. We source data from two comprehensive databases. Information on top management turnovers and board characteristics are derived from the BoardEx database. For our purpose, top managers include the CEO, chief financial officer (CFO), chief operations officer (COO), and chairperson: All four roles are clearly and consistently reported by BoardEx. Because the top manager roles are particularly powerful and visible, they are most likely used for signaling efforts. In focusing on these top managers, our approach is comparable to that of other top management studies analyzing executives at the strategic apex of an organization (e.g., Dezsö & Ross, 2012; Finkelstein et al., 2009). We obtained financial information, used to measure crisis status and control for firms’ financial situation, from Compustat North America. All main statistical analyses were conducted in Stata 14 SE.
As this study focuses on executive staffing, our main analyses only consider firm-year observations for which a turnover among top managers occurred (for a similar procedure, see Gupta, Mortal, Silveri, Sun, & Turban, 2020). 1 All industries other than financial services are represented in the sample, with the manufacturing sector most strongly represented (49% of firms), followed by the service sector (22%) and the wholesale and retail trade sector (12%). We excluded financial service firms because their unique asset structures affect the precision of the measure used to identify crisis status (Haleblian, McNamara, Kolev, & Dykes, 2012). Notably, the central results were unchanged when financial service firms were included.
Measures
Independent variable
The study's independent variable,
The Altman Z-score is a composite measure based on five financial ratios with complex interrelations: (1) working capital to total assets, (2) retained earnings to total assets, (3) earnings before interest and taxes to total assets, (4) sales to total assets, and (5) market value of equity to book value of total liabilities. A lower Z-score indicates higher bankruptcy risk. To facilitate interpretation, Altman (1968) defined three zones of discrimination: “crisis zone,” “gray zone,” and “safe zone.” A Z-score below 1.81 differentiates the “crisis zone” from the “gray zone”: A firm below this cut-off faces considerable risk of bankruptcy over the next 2 years. Several of the professional and popular financial market data providers (e.g., Altman Z-Score+, Finbox, GuruFocus, MarketInOut) highlight the zones of discrimination along with the Altman Z-score, and major financial websites prominently reference the zones of discrimination when discussing Altman Z as a bankruptcy indicator (e.g., Investopedia, Financial Express). For instance, Financial Express states under the headline “Your Money: Use Altman Z-Score to Sniff Out Bankruptcy Potential” that “if the Z score is less than 1.81, the firm is a bankruptcy candidate.” 2 Accordingly, all firms scoring below 1.81 have a highly salient crisis status. Given the widespread use of Altman Z-scores and the established cut-off value for the “crisis zone,” firms below this cut-off might plausibly feel particular pressure to react (a conceptual argument we empirically evidence later in “Validity of Regression Discontinuity Design and Robustness Checks”). Accordingly, the crisis cut-off is a clear, salient way to measure a firm's crisis status.
To causally estimate the impact of crisis status on female executive appointments, our identification strategy relies predominantly on comparing firms slightly above and below the crisis cut-off of 1.81 in the pre-appointment year. 3 While the number of firms with crisis status is only a small proportion of the total sample, it is sufficiently large to identify a causal effect at the threshold. For instance, 169 observations had an Altman Z-score within the ±1% interval around the crisis threshold, and 507 had a score within the ±3% interval. These numbers give our estimations sufficient power, and our sample size compares favorably with those of other regression discontinuity setups (Arvate, Galilea, & Todescat, 2018; Flammer & Bansal, 2017).
Dependent variable
The dummy-coded dependent variable for female executive appointments (
Moderator variables
We measured
We measured
Google search volume offers a valid measure of investor attention because searches are a “revealed attention measure” (Da et al., 2011: 1462), with searching for a company on Google means one is paying attention to it. Moreover, Google is the most widely used search engine, making it an unbiased source for search behavior (Ren, Hu, & Cui, 2019). 4 In support, studies show that Google search scores moderately correlate with traditional attention measures but also measure investor attention in anticipation of events that may not occur or are not (yet) covered by the media (Da et al., 2011; Drake et al., 2012).
To capture investor attention, we follow Da et al. (2011; see also Drake et al., 2012) by using the Google search frequency for stock ticker symbols (e.g., “AMZN” for Amazon.com Inc., “NFLX” for Netflix Inc.) instead of full company names, as stock symbols are largely unambiguous and investor-related (though using full corporate names produced similar results; see Appendix Table A1). Monthly data were requested for all queries by U.S.-based users via the Google Trends service (https://trends.google.com/trends). Data were only available from 2005 onward, so all analyses involving investor attention as a moderator have a restricted sample. We scraped data automatically using the “gtrendsR” package (Massicotte & Eddelbuettel, 2018) in the statistical environment R (R Core Team, 2015). Google normalizes trends data by default, setting to 100 the month with the most search queries in the requested time frame. Companies for which no searches were made during the study period (i.e., 0 values throughout) were recoded as missing (though retaining them in the sample produced similar results; see Appendix Table A1). We aggregated monthly search volumes into rolling windows of 12 months before each turnover in the analyses.
Covariates
We consider several covariates to test the robustness of our models. First, we include
To mitigate the impact of outliers, we winsorized all continuous variables at the 0.5% and 99.5% levels—a common practice in strategic management (e.g., Hill, Upadhyay, & Beekun, 2015) and finance research (e.g., Geiler & Renneboog, 2015).
Table 1 gives summary statistics for all variables this section describes. Of the 26,156 executives appointed during 2000–2016, only 7.4% were women, in line with the common conception that women rarely reach the TMT (e.g., Dezső et al., 2016; Jeong & Harrison, 2017). Of all sample turnover observations, 25.2% occurred in crisis firms, consistent with findings that poor performance only explains some departures (Finkelstein et al., 2009).
Summary Statistics
Estimation Challenges and Strategy
This paper aims to estimate the causal effect of a firm's crisis status on the likelihood of appointing female executives and the contingencies of this effect. As omitted variables and reverse causality may bias correlations between firm crisis and executive appointments, we employ the regression discontinuity design. Omitted variable bias may occur as crisis status is undoubtedly correlated with (unobserved) third variables, such as corporate culture, diversity management efforts, and firm vision, which may also influence executive appointments. Relatedly, reverse causality might be an issue because, according to upper echelons theory (Hambrick & Mason, 1984), the presence of female executives may influence company performance. Given such difficulties, a regression of firm crisis on female leader appointments would likely be biased, even when controlling for the above covariates, and not provide substantial insights into the causal direction of the effect.
A regression discontinuity design is a powerful way to address omitted variable bias and reverse causality by approximating the ideal setting of a randomized experiment in the field by introducing only mild assumptions (Lee & Lemieux, 2010; Sieweke & Santoni, 2020; for a plausibility test of the assumptions, see “Validity of Regression Discontinuity Design and Robustness Checks” below). To identify the causal effect, the design exploits exogenous variance in settings where a unit's score above or below a threshold on a continuous variable determines the unit's treatment status. Falling slightly above or below is akin to random assignment (Sieweke & Santoni, 2020).
The regression discontinuity design's core identifying assumption is fulfilled when falling slightly below or above the crisis threshold of Altman Z introduces random variation in crisis status. As described in the “Measures” section, the widespread use of Altman Z among investors and the salience of the zones of discrimination make the threshold of 1.81 highly consequential for firms: Falling below this threshold gives companies a visible label of looming bankruptcy, thereby applying pressure to regain trust. The as-if-random assignment around the crisis threshold is plausible because the multivariate Altman Z-score of five financial ratios with complex interdependencies and unique drivers of the single ratios (Almamy et al., 2016; Altman, 1968) avoids that companies have perfect control over their exact Altman Z-score. Thus, it is a matter of chance if a company falls slightly below or above the threshold of 1.81 separating crisis and noncrisis firms. For instance, whether a company's score is 1.80 (crisis zone) or 1.82 (gray zone) is as good as random (for supporting evidence, see “Validity of Regression Discontinuity Design and Robustness Checks” below). Accordingly, assignment of the crisis label at 1.81 is random variation, and there should be no systematic covariation with observable and unobservable confounders. Our design thus addresses central sources of omitted variable bias and reverse causality.
We employ a sharp regression discontinuity design (Imbens & Lemieux, 2008) to estimate the difference in female executive appointments between firms slightly below and above the crisis threshold. To test the moderation effects proposed in Hypotheses 2 to 4, we split the sample based on the moderation variable, estimate the effect for each subsample separately, and compare the treatment effect between subsamples to reveal the moderation pattern (Flammer, 2015).
In estimating the treatment effect, we rely on Calonico, Cattaneo, and Titiunik's (2014) bias-corrected RD estimator, which was shown to outperform alternative estimators. To account for within-firm dependence across appointments, we cluster standard errors at the firm level. We base treatment effect estimations on local linear polynomials for various bandwidths and do not control for higher degree polynomials. As Gelman and Imbens (2019) outline, local linear estimates are less noisy, less sensitive to the polynomial degree, and have better coverage of confidence intervals compared to higher degree polynomials.
Results
Test of the Direct Glass Cliff Effect
To test Hypothesis 1's prediction that crisis status positively affects the probability of appointing female executives, we first plotted the likelihood of appointing female executives against the lagged (

Female Executive Appointments Around Crisis Cut-Off
A formal test of Hypothesis 1 is reported in Table 2. Models 1 to 7 report estimated differences in female executive appointments between companies with and without crisis status (i.e., treatment effect) for different model specifications. Based on Imbens and Lemieux (2008), we tested the treatment effect for different bandwidths. Bandwidth choice is important in regression discontinuity designs because it involves a trade-off: Larger bandwidth offers greater power but also higher bias by including observations further from the threshold for which the as-if-random assumption is more questionable. Model 1 restricts the sample to the 169 appointments within the 1% interval around the crisis threshold. Firms with crisis status are 9.3 percentage points more likely to appoint a female executive than firms without crisis status (
Female Executive Appointments Around Crisis Cut-Off
We finally included all covariates from the
Tests of Glass Cliff Moderators
To test if the glass cliff effect is conditional on the existing presence of another female top manager (Hypothesis 2), the executive's origin (Hypothesis 3), or investor attention (Hypothesis 4), we performed subgroup analyses within the regression discontinuity framework.
We first reestimated the full model separately for companies with no female incumbent and those with at least one (Table 3, Model 1a & 1b). We find a significant positive crisis effect for firms with no female incumbent (
Female Executive Appointments Around Crisis Cut-Off With Sample Split According to Moderators
Next, we performed a similar set of analyses to examine executive origin as a moderator. To test Hypothesis 3, we estimated the full model separately for insider and outsider appointments (Table 3, Model 2a & 2b). We find a significant positive coefficient for insider appointments (
To test Hypothesis 4's prediction, we re-estimated the full model separately for companies above and below the median Google Trends score across all firms in the year before the appointment (for a similar procedure, see Flammer, 2015).
6
As displayed in Models 3a and 3b (Table 3), the effect of crisis status on female appointments is significant (
Validity of Regression Discontinuity Design and Robustness Checks
To assess the validity of our regression discontinuity design and the conclusions drawn from it, we conducted additional tests of the randomization assumption, noise, an alternative status threshold, sample selection bias, and generalizability.
Randomization assumption
The central identification assumption of our design is that having an Altman Z-score slightly above or below the crisis threshold is as good as random. Otherwise, the design cannot address omitted variable bias and reverse causality. We use two standard tests for the implications of this assumption (Eggers, Fowler, Hainmueller, Hall, & Snyder, 2015; Imbens & Lemieux, 2008; Sieweke & Santoni, 2020): (1) a test of the continuity of the Altman Z-score distribution around the crisis threshold, and (2) a test of preexisting differences in covariates between companies just above and below the threshold.
We conducted the McCrary (2008) test for the continuity of the Altman Z-score distribution. This test considers the smoothness of the density function of Altman Z-scores around the crisis threshold. A jump in the density function around the threshold would indicate companies’ ability to manipulate their Altman Z-scores, thus invalidating the as-if-random assumption around the cut-off. As Figure 2 shows, there is no evidence for a significant jump around the crisis threshold, with widely overlapping confidence intervals around the point estimates on each side. The null hypothesis of continuity of the Altman Z-score variable cannot be rejected (

Test for Continuity of the Altman Z-Score Distribution Around the Crisis Threshold
To test for preexisting differences in covariates between companies around the threshold, we conducted balance tests. If assignment of crisis status at the threshold is indeed random, we should find no differences between companies on either side. Table 4 lists the differences in means among all firms with an Altman Z-score within 3% of the threshold. The results do not indicate any significant difference between companies below and above the threshold. This is in line with the assumption of randomization around the cut-off.
Pre-Existing Differences in Covariates as a Function of Crisis Cut-Off
Placebo tests
An alternative explanation for the discontinuity at the crisis threshold could be that the interpretation of Altman Z-scores is so idiosyncratic that our model picks up noise rather than a change in crisis status. To rule out that our design measures noise, we artificially shift the cut-off threshold in steps of 0.5 Altman Z-score units. Estimates are reported in Figure 3. None of the alternative cut-off values (0.31, 0.81, 1.31, 2.31, 2.81, 3.31) yields a significant increase (or decrease) in female appointments. For all but the original cut-off value of 1.81, coefficients are insignificant (

Estimated Change in Female Appointments at Placebo Thresholds and the Actual Crisis Threshold (1.81)
We also considered if the glass cliff occurs not only at the threshold separating the “crisis zone” from the “gray zone” (Z = 1.81) but also at the second available threshold between the “gray zone” and the “safe zone” (Z = 2.99). This would indicate that a glass cliff occurs whenever a firm is downgraded, countering our hypothesis that crisis status drives the glass cliff effect. To test this possibility, we performed a placebo test re-estimating all specifications in the original analyses using the Altman Z-score threshold of 2.99 as the cut-off. No significant treatment effect is found at the alternative threshold (full results reported in Appendix Table A2), providing additional evidence that the glass cliff effect is primarily driven by crisis status and not by other salient status changes.
Sample selection
Although our set-up addresses omitted variable bias and reverse causality as the central sources of endogeneity discussed in the literature (Certo, Busenbark, Woo, & Semadeni, 2016), sample selection may bias our estimates because we can only observe our outcome (i.e., gender of newly appointed executives) for the firms that had a TMT vacancy. The extent of empirical issues created by nonrandom samples is usually less severe than some scholars suggest (Certo et al., 2016). Nonetheless, we addressed this potential bias in an additional analysis by implementing a Heckman selection correction. Nonturnover observations were retained to predict inclusion in our final sample by retirement of prior executive plus all model and control variables (Mitra, Post, & Sauerwald, 2021). From the selection model, we calculated a correction factor (inverse Mills ratio) that we include with all other covariates in our final turnover-only sample as a control (Flammer & Bansal, 2017). The treatment effect when controlling for potential covariates and selection effects remains significantly positive (
Generalizability
Because the regression discontinuity design exploits as-if-random assignment around the cut-off to establish causality, the causal estimate is based only on observations around the cut-off. These companies may not be representative of firms far away from the cut-off, potentially limiting the generalizability of our findings (Flammer & Bansal, 2017). We inspected this issue by comparing the characteristics from Table 4 for companies close to and far away from the cut-off. For each characteristic, we compared the mean in the 3% bandwidth around the cut-off to the mean for all firms outside the 3% bandwidth (and calculated the corresponding
Supplementary Tests
Although the results are robust and the moderation effects in line with our suggested signaling mechanisms, we could not directly test if crisis firms aimed to signal change to investors by appointing female executives. To further understand the mechanism underlying our findings, we conducted supplementary analyses.
We captured firms’ signaling efforts in a more direct way by analyzing press releases, which firms use to deliberately communicate information to a wider audience (Zavyalova, Pfarrer, Reger, & Shapiro, 2012). Specifically, we hand-collected press releases on all executive appointments for firms in the 3% bandwidth around the threshold and coded the extent to which the press releases conveyed change. 8
To obtain press releases on executive appointments in the 3% bandwidth around the threshold, we used the Nexis Uni database (www.nexisuni.com). We collected data from firm press releases disseminated via Business Wire and PR Newswire, two leading distribution services on which prior studies have relied to obtain a representative set of press releases (e.g., Graffin, Carpenter, & Boivie, 2011; Zavyalova et al., 2012). For the 507 turnovers within the 3% bandwidth, we obtained 136 relevant press releases.
To code the extent of change-related signals, we content-analyzed for change-oriented words with the quanteda package (Benoit et al., 2021) in R. We used the dictionary compiled and validated by McClelland, Liang, and Barker (2009), which other top management studies have applied to measure change orientation (Post, Lokshin, & Boone, 2020).
To test if firms with crisis status used more change-oriented framing than noncrisis firms for female appointments but not for male appointments, we adapted our regression discontinuity set-up by using the change-orientation measure as the outcome and keeping Altman Z as the running variable with the crisis threshold of 1.81. We then estimated the model once for the female appointments subsample and again for the male appointments subsample.
As predicted, crisis firms (compared to non-crisis firms) used significantly more change-oriented wording in their press releases when appointing a female executive (
Discussion
The glass cliff hypothesis has received much attention from researchers and the public, but findings have been inconsistent. This study draws on signaling theory (Connelly et al., 2011; Spence, 2002) to develop a more context-specific glass cliff model. We theorize that crisis firms use female executive appointments to strategically signal fundamental change efforts. Based on the signaling rationale, we contend that the crisis effect on female appointments should vary based on the nature of the signaler, signal, and receiver. Whereas past field research has primarily relied on correlational designs, we use a regression discontinuity design to provide a clean causal estimate of the crisis effect on female appointments, employing a large-scale sample of 26,158 executive appointments by U.S. firms during 2000–2016. Our findings support the theoretical model and reveal a significant positive effect of firm crisis status on female appointments. We also find more change-related framing of female appointments in press releases by crisis firms compared to noncrisis firms. Further supporting our theoretical signaling arguments, we find that the glass cliff is context-dependent and occurs only when companies have no existing female top managers, when firms appoint an insider rather than an outsider, and when the executive appointment receives high investor attention.
The theoretical model and empirical findings make three notable contributions to the literature. First, the study primarily contributes to glass cliff research. By developing novel theoretical insights into the signaling mechanism and contextual factors underlying the glass cliff phenomenon, the study explains past inconsistent findings. Glass cliff research has relied strongly on stereotyping arguments (think crisis
Moreover, the study provides much-needed additional evidence on the glass cliff from real-world organizations. There has recently been a considerable increase in use of scenario experiments with students or working adults making a hypothetical selection decision (e.g., Kulich et al., 2015; Rink, Ryan, & Stoker, 2013; Ryan et al., 2011). As demonstrated by early glass cliff work (Haslam & Ryan, 2008), scenario experiments are a good way to get an idea off the ground, develop general understanding of a phenomenon, and take steps toward establishing causality (Aguinis & Bradley, 2014). However, the limitations of this specific research design and skepticism toward its use outside social psychology (Lonati, Quiroga, Zehnder, & Antonakis, 2018) may explain reservations over purported evidence for the glass cliff in other scientific domains (i.e., strategic management and economics). In particular, scenario experiments have been criticized for potential demand effects and inherently low external validity (Gloor, Gazdag, & Reinwald, 2020; Lonati et al., 2018); it is questionable whether real-world top executive promotions with multiple stakeholders and complex selection procedures result in a similar glass cliff effect to that found in simplified hypothetical cases. Our work takes the logical next step in glass cliff research by complementing scenario experiments with causal evidence from the field.
Second, this study adds to the growing research stream on the signaling of top managers. Behavioral theorizing on TMTs—most prominently upper echelons theory (Hambrick & Mason, 1984)—has traditionally focused on how top managers’ attributes or the TMT's composition influence executive behaviors, in turn influencing organizational outcomes (Finkelstein et al., 2009). More recent studies have considered signaling theory and examined visible attributes of top managers that can boost companies’ public reputation (e.g., Gomulya & Boeker, 2014; Zhang & Wiersema, 2009). Such research has, for instance, studied how CEO attributes signal the quality of firms’ financial statements (Zhang & Wiersema, 2009) and investigated how the composition of a firm's TMT can signal legitimacy that, in turn, affects investor decisions (Higgins & Gulati, 2006). Considering signaling through staffing decisions is also in line with research showing that equity evaluations depend on both financial and nonfinancial information (Certo, 2003; Trueman, Wong, & Zhang, 2000). Our study extends signaling research in the context of TMTs by considering how companies use top management gender to signal change during a crisis. The valuation of female executives is also in line with prior research demonstrating that female executives can benefit from their minority status and receive higher compensation than male colleagues (Hill et al., 2015; Leslie, Manchester, & Dahm, 2017). We also contribute to signaling theory by studying how signaling decisions can change based on the nature of the signaler, signal, and receiver. This contrasts with the predominant focus on the quality of the signaler in most signaling research on top management (Connelly et al., 2011; for an exception, see Gomulya & Mishina, 2017).
Last, this study contributes to efforts in management research to tackle endogeneity bias. Today, management researchers can draw on a solid domain-specific literature base spanning critiques and reviews of existing research designs and best-practice recommendations for designing field studies as quasi-experiments that mimic the gold standard of randomized experiments (e.g., Antonakis et al., 2010; Hill et al., 2021; Semadeni et al., 2014). Yet despite regression discontinuity designs being praised as “a much closer cousin of randomized experiments than other competing methods” (Lee & Lemieux, 2010: 289), they are rarely applied in management research (for exceptions, see Brzykcy & Boehm, 2021; Flammer, 2015). Part of the knowledge–action gap may be attributed to the difficulties of translating general recommendations on quasi-experimental designs into specific designs to address pressing management questions: Quasi-experimental designs require a new way of thinking about potential sources of exogenous variance in an endogenous predictor. Thus, our work may inspire future research to use innovative identification strategies and contribute to more robust management science.
Limitations and Future Research
The study has some notable strengths as it incorporates extensive field data with a clean causal identification strategy and supplementary evidence on the proposed mechanisms to improve understanding of the glass cliff effect. Still, some shortcomings generate avenues for future research. First, the regression discontinuity design is not without limitations. Because it exploits the as-if-random assignment around the cut-off to establish causality, the generalizability of our findings may be restricted to firms near the cut-off. While we did not find marked differences in central attributes between firms near to and far from the cut-off (see Appendix Table A3), readers should extrapolate our findings with caution.
Second, our study looks at glass cliff appointments for the four most powerful and visible top manager roles (i.e., CEO, CFO, COO, chairperson) but does not perform specific subanalyses for CEOs. CEOs have traditionally attracted prime attention among business scholars and the wider public, and we control for differences in top manager roles in our analyses (for a similar approach, see Bechtoldt et al., 2019). Still, given the high visibility of CEO appointments, additional subanalyses for CEO positions have value. Yet the small number of female appointments to CEO positions further exaggerated by the regression discontinuity design's focus on turnover observations in a small bandwidth around the crisis threshold precluded any CEO-specific analyses and resulted in convergence issues of the models. At the same time, the robust glass cliff effect found in our sample, with a relative overrepresentation of non-CEO appointments, suggests that non-CEO appointments serve notable signaling purposes in times of crisis. Because past work has largely overlooked the signaling role of non-CEO appointments, we encourage future research in this area.
Third, this study treats a crisis as one overall construct and does not differentiate between crisis types. The regression discontinuity at the crisis cut-off allowed us to identify the effect of perilous financial status on a firm's likelihood of appointing female leaders, as postulated by glass cliff theory, but our identification strategy precludes differentiating between different forms of crisis. It is theoretically plausible that signaling through female appointments is most attractive when the crisis requirements match the stereotypical attributes of female leaders. For instance, a firm in a reputation crisis after financial misconduct may be more likely to signal trustworthiness via female appointments, whereas a firm whose crisis stems from competitive attacks may be more likely to appoint male executives to signal dominance and strength. We encourage future research to investigate this possibility.
Another potential opportunity for future research is to study the consequences of the glass cliff for the careers of female top managers. While initial research suggests that the relative riskiness of glass cliff positions leads to higher dismissal rates for female CEOs (Gupta et al., 2020), relatively little is known of the conditions in which female top managers are likely to succeed as leaders. Past research suggests that women receive higher wages than men when the widespread adoption of diversity goals in organizations creates opportunities for high-potential women (Leslie et al., 2017). Such an environment could plausibly help female top managers make career progress and avoid falling off the glass cliff.
Future research could also investigate whether this paper's signaling arguments can explain the emergence of a glass cliff for other demographic groups. Researchers have recently observed that glass cliffs may also exist for persons of color (Cook & Glass, 2014; Gündemir, Carton, & Homan, 2019). Yet Morgenroth et al. (2020) note that the stereotyping mechanism, as the traditional rationale, may fall short in explaining race- or ethnicity-based glass cliffs because the fit between crisis leadership and female stereotypes does not apply to all underrepresented racial/ethnic groups. By contrast, the signaling mechanism we propose may be more generalizable as every notable deviation from the status quo in executive appointments can signal change to the market. Accordingly, we encourage future studies to replicate our work focusing on race/ethnicity.
Practical Implications
Ryan and Haslam (2007: 550) conclude from the glass cliff effect that women are disproportionally bound to fail and may thus face a “second wave of discrimination.” Yet designing strategies to prevent the emergence of the glass cliff is a difficult endeavor: For single appointments, it is not always apparent whether a position is a glass cliff or an honest attempt to respond to public calls to increase female representation in the executive suite. Nonmarket mechanisms, like gender quotas, could be considered to prevent crisis firms from intentionally appointing women for signaling purposes. Such quotas would increase the general presence of women in top leadership positions, thereby weakening the signaling function of female appointments for crisis firms, as women in top management would be the norm, not the exception. However, appointing females beyond the quota might still signal “intent to change,” such that women remain relatively more likely than men to be placed in risky positions. Thus, mirroring the wider debate on the advantages and disadvantages of gender quotas (Hughes, Paxton, & Krook, 2017; Leslie, 2019), it remains unclear if gender quotas could avoid the creation of risky glass cliff positions by effectively reducing the attractiveness of the signal for firms.
Hence, before contemplating such quotas for TMTs, companies could first work toward making staffing processes more transparent and formalized. Decisions on top executive appointments are made behind closed doors and are often nonroutine and informal, which increases the chance of signaling decisions undermining the selection process (Glass & Cook, 2016). In a similar vein, individuals involved in executive staffing (i.e., members of boards and staffing committees) should be made aware of the potential for a glass cliff, relevant contextual factors, and underlying mechanisms (i.e., signaling), enabling them to actively counter tendencies to use female leadership appointments to signal change in times of crisis. We do not advocate banning signaling aspects from staffing decisions. Rather, companies should not rely on overly broad, demographic categories but instead consider nuanced, individualized information about potential new leaders’ real competencies. Thereby, companies would combat the glass cliff and ensure they select the best-qualified candidate for the job, instead of a candidate who primarily serves a signaling goal.
