Abstract
Introduction
Financially motivated unethical acts such as corruption, money laundering, or scams typically have tremendous consequences for individuals and societies at large. For example, in 2019 alone, U.S. citizens reported losing more than $667 million to so-called imposter scams. Therein, scammers pretend to be calling from a family member, the government, a romantic interest, or a well-known business with an emergency and request payments. While law enforcement can assist in some cases, the money most often remains lost (Vaca, 2020). Unethical behaviors and tendencies are a pervasive problem in society, negatively affecting individuals and organizations across contexts.
Reflecting the widespread occurrence and consequences of unethical behaviors and tendencies, researchers from various disciplines have aimed to reveal and understand their driving factors. For example, research on situational factors fostering unethical behaviors suggests that individuals are less likely to act unethically when being monitored (e.g., Gneezy et al., 2018; Lilleholt et al., 2020), when punishment is severe and likely (e.g., Thielmann & Hilbig, 2018), or when country-level prevalence of rule violations is low (e.g., Gächter & Schulz, 2016). Research on interindividual differences related to unethical behaviors suggests that younger individuals are typically more likely to act unethically for a financial gain than older individuals and that men tend to be more likely to act unethically for a financial gain than women (Gerlach et al., 2019). Furthermore, different personality dimensions such as honesty-humility (e.g., Heck et al., 2018), the dark factor of personality (
Concerning other person characteristics, it has been proposed recently that the endocrine system might play an important role in fostering unethical behaviors and tendencies (e.g., Lee et al., 2015). Corresponding evidence is mixed, however. Herewith, we contribute to this stream of research by investigating whether women’s steroid hormones such as testosterone, cortisol, estradiol, and progesterone, which fluctuate naturally across different days and systematically across their menstrual cycle, as well as women’s fertility (i.e., conception probability), are linked to various factors representing unethical behaviors and tendencies.
Testosterone, Cortisol, and Unethical Behaviors and Tendencies
Steroids that function as hormones, such as testosterone, cortisol, estradiol, and progesterone, play an important role in various physiological processes, including the development and maintenance of reproductive function, glucose metabolism, and stress response (Viau, 2002). Testosterone and cortisol have been suggested as key hormones predicting unethical behaviors more broadly (Lee et al., 2015). More precisely, higher testosterone was reported to interact with lower cortisol in predicting antisocial-approach strategies or income-generating crime (Armstrong et al., 2022; Mehta et al., 2017; Popma et al., 2007). While the evidence for this dual-hormone hypothesis is rather mixed (Dekkers et al., 2019; Grebe et al., 2019), one study reported that higher testosterone levels are associated with more financially motivated unethical behavior (i.e., lying about the own performance in solving matrices for a financial gain), but only when cortisol levels were high (Lee et al., 2015). In contrast, two studies found that testosterone administration is rather linked to less financially motivated unethical behaviors (Henderson et al., 2018; Wibral et al., 2012).
Further unethical cognitions and behaviors, such as behavior in economic games or cheating in committed relationships, may be influenced by hormone levels. One study reported that testosterone administration increased reciprocity in a trust game (Boksem et al., 2013), whereas a different study found that testosterone levels were associated with generous behavior toward distant others, but not with prosocial intentions and behavior (Wang et al., 2022). Also, it has been reported that men with higher testosterone levels were more often sexually unfaithful in committed relationships (Klimas et al., 2019), but this association remains unclear in women.
Relations between personality characteristics that are closely related to unethical cognitions and behaviors, such as aversive traits (e.g., Schröder-Abé & Fatfouta, 2019) or honesty-humility (Ścigała et al., 2020; Zettler et al., 2020), and both testosterone and cortisol are mixed. In a study by Pfattheicher (2016), for instance, testosterone was positively related to higher levels of grandiose narcissism, but there were no significant relations to Machiavellianism and psychopathy. Furthermore, cortisol was not related to any of the three characteristics and did also not interact with testosterone in predicting them. Other studies revealed mixed findings on whether baseline levels of cortisol are related to grandiose narcissism (Reinhard et al., 2012; Wardecker et al., 2018). Moreover, a recent meta-analysis suggests that previously reported findings linking aversive personality characteristics to interacting levels of testosterone and cortisol might not be robust (Dekkers et al., 2019).
The Relation Between Ovarian Hormones, Conception Probability, and Unethical Behavior and Tendencies
Another line of research suggests that ovarian hormones also regulate unethical behavior and tendencies. For example, women with higher average estradiol levels reported a higher likelihood for opportunistic mating, that is, they reported a higher likelihood to engage in unfaithful behavior in committed relationships, such as having a one-night stand or affair with a man who is not their primary partner (Durante & Li, 2009). Furthermore, it has been suggested that (financially motivated) prosocial behavior may be influenced by hormonal contraceptives altering ovarian hormone levels (Strojny et al., 2021) or may vary systematically with hormonal fluctuations across women’s ovulatory cycles (Eisenbruch & Roney, 2016; Lucas & Koff, 2013). Women’s ovulatory cycles are approximately 28 days long (with strong intraindividual and interindividual variations), and women are only fertile (i.e., elevated conception probability) for up to 6 days per cycle. Different cycle phases and different conception probabilities are marked by substantial hormonal changes across the cycle. High conception probability, also known as the fertile window, is characterized by higher levels of estradiol and lower levels of progesterone, whereas levels of estradiol are lower and levels of progesterone increase during the luteal phase, when conception probability is low (between ovulation and menstrual onset; Roney & Simmons, 2013). Different cycle phases are sought to be related to different cognitions and behaviors which respond to specific needs women have during that cycle phase (Fessler, 2003).
In line with this assumption, previous research has argued that in the luteal phase, when progesterone levels are high, the female body prepares for pregnancy, which leads women to behave more prosocial to foster relationships (Sellitto & Kalenscher, 2022; Stenstrom et al., 2018). Other research argued that, when fertile, women try to maximize the chances for successful reproduction by retaining financial resources, for example, through lower offers in the ultimatum game or reduced investment in the trust game (Eisenbruch & Roney, 2016; Lucas & Koff, 2013), which is strongly negatively linked to, for example, honesty-humility (Thielmann et al., 2020). Importantly, other studies reported effects in the opposite direction, with higher prosocial orientation in the follicular phase (Anderl et al., 2015), or no evidence for a link between estradiol or progesterone and prosocial orientation (Ranehill et al., 2018; Strojny et al., 2021; Wang et al., 2022).
Summary and Aims of the Current Study
In summary, mixed results and methodological variability (e.g., self-reports or behavioral measures) in previous research make it hard to know whether there is a biological basis (likely) regulating unethical behaviors and tendencies and to disentangle within-subject (e.g., cycle shifts or states) and between-subject (e.g., average hormone levels or traits) effects. Some of these mixed results might be due to variability in statistical power, cycle-phase determination, or hormone measure validity. Research on the effects of steroid hormones, for example, received criticism because many of the previous studies were (considered to be) underpowered and did not assess hormone levels directly (Gangestad et al., 2016; Grebe et al., 2019). Furthermore, recent research casts doubt on the validity of salivary immunoassays that are usually used to analyze hormone levels, suggesting the use of liquid chromatography tandem mass spectrometry (LC-MS/MS) as a more valid alternative (Arslan et al., 2023; Bae et al., 2016; Welker et al., 2016). Together, there is a need for large-scale studies in this research area aiming at replicating and extending previous studies while overcoming methodological shortcomings. Tackling this, we conducted an exploratory study with the aim to provide information about whether steroid hormones are related to different unethical cognitions and behaviors in women (who are generally underrepresented in this stream of research), as well as whether such relations vary intraindividually or interindividually. To this end, we investigate the relation between testosterone, cortisol, estradiol, and progesterone with different measures of unethical behaviors and tendencies: self-report state and trait measures (honesty-humility, narcissism,
Methods
Open data, analysis script, and material are provided on the Open Science Framework (https://osf.io/p4qmb/). Preregistration of a larger data collection covers the sample characteristics, the procedure, and the measures (https://osf.io/dwcsm/) but no hypotheses and analyses for the study at hand. Thus, we do not make specific predictions, and results from this study should be interpreted as exploratory. All participants signed a written consent form, and the local ethics committee approved the study protocol (no. 225).
Participants and Recruitment
Our participants had to fulfill the following preregistered criteria to take part in the study: female, between 18 and 35 years of age, and naturally cycling, that is, no hormonal contraception for at least 3 months; no expected switch to hormonal contraception while in the study; no current pregnancy or breastfeeding; no childbirth or breastfeeding during the previous year; not taking hormone-based medication or anti-depressants; and no endocrine disorders. A total of 282 heterosexual female participants were recruited at a German university. Out of these, 257 participants (aged 18–35 years,
Procedure
All participants took part in an introductory session in the laboratory, followed by an online questionnaire to be filled out at home, and four consequent individually scheduled testing sessions in the laboratory. In the introductory session, participants received detailed information about the general procedure, duration of the study, and compensation. A research assistant explained the luteinizing hormone (LH) ovulation tests and checked the inclusion criteria. The average cycle length and the dates of the last, the penultimate, and the next menstrual onset were assessed to plan the dates of the next sessions. After the introductory session, participants completed an online questionnaire (at home) assessing demographic information, personality characteristics, relationship information, and health variables. The online questionnaire was administered using the open-source software formr (Arslan et al., 2020).
The following four testing sessions were computer-based laboratory sessions and took place across different phases of the ovulatory cycle, scheduled based on backward counting and the observed LH test surge. All participants completed two sessions in their mid-to-late follicular phase (when expected being fertile) and two sessions in their expected luteal phase (in the mid-luteal phase and premenstrual). Scheduling was validated via LH test results and a follow-up to the next menstrual onset. The starting session for each participant depended on their current cycle phase at the introductory session and their personal schedule. Of all participants who finished all sessions, 134 participants started with the first session in their follicular phase, and 123 started in the luteal phase.
To control for possible effects of diurnal changes in hormone levels, all sessions were scheduled in the second half of the day (between 12 pm and 6 pm). At each testing session, when arriving at the lab, participants first completed a screening questionnaire, assessing their eligibility and some control variables for saliva sampling (Schultheiss & Stanton, 2009). Then, they completed the questionnaires including the items to assess state honesty and the mind game. Immediately after completing the mind game (~15 minutes after arriving at the lab), saliva samples were collected via passive drool. The timing of saliva sample collection was chosen on purpose to give the participants the opportunity to adapt to the lab situation and to avoid any influence of, for example, stressors or social situations on salivary hormone concentrations before coming to the lab. At the same time, this timing helped to avoid any influence of playing the mind game on hormone concentrations due to the time steroid hormones in saliva need to react to a specific event (which is considered to be about 10–30 minutes), as they first need to transfer from blood to saliva (e.g., Del Giudice et al., 2011; Kirschbaum et al., 1993; Schultheiss & Stanton, 2009).
All data in the lab were assessed using the open source framework Alfred (Treffenstaedt & Wiemann, 2018). Besides the tasks described in the current study, participants had to complete other tasks as a part of a larger investigation (see https://osf.io/dwcsm/?view_only=e042065672934440950259c86a68eb1b). All different tasks were fully randomized between participants and sessions. Upon completion of all sessions, participants received a payment of 60€ or course credit.
Measures
Hormone Measures
We collected four saliva samples from each participant in the laboratory (one per testing session). Contamination of saliva samples was minimized by asking participants to abstain from eating, drinking (except plain water), smoking, chewing gum, or brushing teeth for at least 1 hour before each session. The samples were stored at −80°C directly after collection until shipment on dry ice to the Kirschbaum Lab at the Technical University of Dresden, Germany (one freeze-thaw cycle), where progesterone, testosterone, and cortisol were assessed via liquid chromatography mass spectrometry (LCMS, Gao et al., 2015). Because the lab had no valid protocol for LCMS analysis of estradiol levels, the samples were reanalyzed for estradiol using the highly sensitive 17β-estradiol enzyme immunoassay kit (IBL International, Hamburg, Germany). The lab reported that their procedures yield coefficients of variation (a reliability measure) < 11%. We excluded outliers ≥3
We centered all hormone values on their subject-specific means and scaled them afterwards (i.e., divided them by a constant), so that the majority of the distribution for each hormone varied from −0.5 to 0.5 to facilitate calculations in the linear mixed models (Figure S1 in the Supplemental Material). This is a common procedure to isolate effects of within-subject changes in hormones and to facilitate calculations in linear mixed models. Importantly, this procedure did not change any findings compared to analyses with untransformed or log-transformed hormone values. We averaged hormone levels for each participant to obtain a baseline hormone level for the between-subjects analyses but included single values from each session for the within-subjects analyses.
Conception probability
Participants’ conception probability (see Supplemental Table S1) was assigned based on highly sensitive (10 mIU) LH urine ovulation test strips from MedNet GmbH (Münster, Germany). Participants started LH testing after menstruation and continued until a positive test result was followed by 2 days with negative tests (as suggested by Roney, 2018). Participants were provided with a minimum of 10 LH tests each and provided daily pictures of the tests to the investigators. Based on LH test results (e.g., negative tests only) or irregular cycles (e.g., >40 days, <20 days), 22% of all participants (
Honesty-Humility
Honesty-humility was assessed once in the online questionnaire filled out at home between the introduction and the first testing session using the 10 respective items from the German version (Moshagen et al., 2014) of the HEXACO-60 (Ashton & Lee, 2009). This measure includes items referring to different facets of honesty-humility (with three items of each fairness and sincerity and two items of each greed avoidance and modesty), but facets are typically not considered when using the HEXACO-60. Responses regarding the items were made on five-point scales from 1 = “
Dark Factor of Personality
Narcissistic Admiration and Rivalry
We assessed the two-dimensional structure of narcissism using the 18-item version of the Narcissistic Admiration and Rivalry Questionnaire (Back et al., 2013) in the online questionnaire. This questionnaire includes nine items for assessing narcissistic admiration and nine items for assessing narcissistic rivalry. Each of the two dimensions contains three items for three facets: grandiosity, strive for uniqueness, and charmingness for narcissistic admiration and devaluation, strive for supremacy, and aggressiveness for narcissistic rivalry. The same response scale as for honesty-humility was used. All nine items for each subscale were averaged to create subscale scores for narcissistic admiration and narcissistic rivalry, and McDonald’s omegas were good (ωadmiration = .84, ωrivalry = .87).
Unfaithfulness in Committed Relationships
In the online questionnaire, participants were asked “
Self-Serving Economic Dishonesty
Self-serving economic dishonesty was measured via the mind game paradigm (see Schild et al., 2019) in each testing session in the laboratory (i.e., four times per participant). That is, participants were instructed to choose a number between one and eight in private. Then, a random number between one and eight was displayed on a screen, and participants were asked if the displayed number matches the number they chose. If the numbers matched, participants received a bonus incentive of 2€. This scenario gave them the opportunity to cheat to gain more money, by reporting that the numbers matched even if they did not. The probability of economic dishonesty was then estimated as described in the study of Moshagen and Hilbig (2017). This approach corrects the observed proportion of win responses for the expected percentage of winners (participants should have experienced matched numbers in 12.5% of all trials) and, thus, allows for an unbiased estimation of cheating responses.
Feeling Honest
In each testing session (i.e., four times per participant), participants were asked to indicate whether the statement “
Statistical Analyses
All analyses were done with the statistic software R 4.1.0 (R Core Team, 2021). The R packages used including the versions can be found in Supplemental Note d). All statistical tests were two-tailed. For between-subjects analyses, we computed correlation coefficients between continuous variables, such as self-reported personality characteristics or salivary hormone levels (averaged within participants between sessions), and linear models whenever interaction effects were modeled. We computed logistic regression models for categorical outcomes (unfaithfulness in committed romantic relationships). Within-subjects analyses were done by fitting multilevel linear (continuous outcomes) or logistic (economic dishonesty) models with a random intercept for participant ID to account for the nested structure of the data (repeated measures for each testing session).
Results
Between-Subjects Effects
Hormones and Personality Dimensions
Average cortisol levels were significantly negatively related to
Means, Standard Deviations, and Correlations With Confidence Intervals.
Hormones and Unfaithfulness in Committed Relationships
Logistic regression models showed no significant relation between average hormone levels (Supplemental Table S6), their interactions (testosterone × cortisol and progesterone × estradiol, respectively), and unfaithfulness in committed relationships (Table 2), providing no compelling evidence for a link between steroid hormones and unfaithfulness in committed romantic relationships.
Logistic Regression Analyses of Self-Reported Unfaithfulness in Romantic Relationships.
Hormones and Behavioral Dishonesty
We tested whether steroid hormone levels were linked to behavioral dishonesty, that is, whether higher or lower average hormone levels were related to cheating more or less often in the mind game across sessions. Average progesterone was significantly related to the number of reported wins across sessions (
Regression Analyses of Economic Dishonesty.
Within-Subjects Effects
Hormones, Conception Probability, and State Honesty
Next, we investigated within-subjects effects or changes in hormone levels between testing sessions and whether these fluctuations are linked to self-reported or economic dishonesty. For this purpose, we fitted multilevel models which included hormone levels and random effects for each individual as predictors of state honesty ratings. We were also able to investigate participants’ conception probability as an indicator for fertility in the particular testing session. Our results suggest that neither hormone levels (Supplemental Table S9) nor their interactions (testosterone × cortisol and progesterone × estradiol; Supplemental Table S8) or conception probability (Supplemental Table S9 in the Supplementary Material) significantly predicted state honesty ratings.
Hormones, Conception Probability, and Economic Dishonesty
We fitted multiple models which included hormone levels and random effects for each individual. To estimate the relation between the proportion of dishonest individuals in the mind game and the hormone levels, a modified logistic regression model was used (see Moshagen & Hilbig, 2017). Neither hormone levels nor their interactions (testosterone × cortisol and progesterone × estradiol) did significantly predict the proportion of dishonest individuals (Supplemental Tables S10 and S11), with confidence intervals, especially for the interaction effects, being very wide, suggesting that these effects are too complex for the data structure.
Self-Reported Personality and Behavioral Measures
Finally, we tested whether we could replicate previously reported links between self-reported and behavioral dishonesty to validate these relations in our data. Self-reported honesty-humility was significantly negatively related to average economic dishonesty (β = −0.23, 95% CIs = [−0.35, −0.11],
Discussion
To better understand the biological basis that potentially regulates unethical behaviors and tendencies, we conducted a well-powered study that tested within- and between-subjects effects of a broad set of steroid hormones and both self-reported and behavioral measures of unethical behaviors and tendencies. We tested these associations in women, as (a) this gave us the opportunity to test links of unethical behaviors or tendencies and ovarian hormones (next to testosterone and cortisol) as well as of women’s conception probability as a marker of fertility and (b) as previously reported links between unethical behaviors or tendencies and testosterone or cortisol were often only tested in men, leading to less clarity whether reported effects apply to women as well.
Our exploratory results suggest no compelling evidence for most of our investigated relations between within- or between-subjects steroid hormone levels or conception probability that were assumed to be linked to a broad range of measures of unethical behaviors and tendencies. We found some significant negative relations between average cortisol levels and both
As a validation check for our used measures, we replicated well-known patterns of results regarding relations across different self-report and behavioral measures of dishonesty. For example, the results that honesty-humility and
Interpretation of Mixed Findings
The reported links between average cortisol and both
Different methods likely explain these differences in results. Some of our measures clearly differed from those used in previous studies (although they are very likely intercorrelated) that, for example, relied on self-reports of previously committed crime or measured prosocial rather than antisocial tendencies. Most previous studies did not use a repeated-measures design, being unable to investigate within-subject effects (although some cycle studies interpreted their findings as such, despite a between-subjects design, e.g., Anderl et al., 2015; Eisenbruch & Roney, 2016; Lucas & Koff, 2013). However, a within-subjects design comes with the risk of lacking variability in repeated measures, as participants might remember their responses in previous sessions and repeat these responses in following sessions. This might especially be relevant for categorial measures (such as the mind game). Nevertheless, within-subjects variability (
Furthermore, methods of hormone assessment differed between studies: While most previous studies relied on immunoassays that have been criticized for questionable validity (e.g., Arslan et al., 2023; Schultheiss et al., 2019), we analyzed hormone levels (except estradiol) using the current gold standard method LC-MS/MS. Another reason for non-replicability might be the differences in test power. Moreover, it is possible that effects are smaller than expected or more complex and only occurring under specific conditions that we did not assess (i.e., behavior in other economic games, such as the ultimatum game investigated in Eisenbruch & Roney, 2016). A final reason for our findings might be that, put simply, neither average nor fluctuating hormones play an important role in regulating unethical behaviors and tendencies.
Strengths and Limitations
Strengths of the current study include the relatively large sample size, the within-subjects design, and the multiple averaged hormone assays that provide a better baseline measure of hormones than a between-subjects measure as well as single measures (Stern et al., 2022). Furthermore, we did not only rely on self-reported dishonesty but also combined these measures with behavioral observations (i.e., the mind game) and analyzed hormone levels using the current gold standard method LC-MS/MS.
Limitations include the fact that our study did not involve an actual social situation (Baumeister et al., 2007), although hormones evolved to regulate social behavior, which might explain the large share of our null-findings. Future studies should employ social situations with a higher ecological validity to investigate whether links between steroid hormones and unethicality emerge in social situations.
As another potential limitation, we had to rely on immunoassays for estradiol, as there was no valid protocol for the superior method LC-MS/MS. Salivary immunoassays have recently been criticized for lacking validity, as they may not correspond to conception risk or show the expected peak in the fertile phase (Arslan et al., 2023). Thus, other methods to analyze estradiol levels, that were (at that time) unfortunately unavailable at the lab where our samples were analyzed, would have been preferable, and we cannot rule out that respective different analyses would have led to different results.
Reflecting on strengths and limitations, our sample consisted of women, allowing us to investigate effects of conception probability or ovarian hormones, but a sample including all genders would be desirable for future studies. Nevertheless, it should be pointed out that previous studies did not report any differences between men and women (e.g., Lee et al., 2015), and the dual-hormone hypothesis makes no gender-specific assumptions. Still, there might be sex-specific hormonal effects that should be investigated in future studies.
Another potential limitation is that we did not assess all data in the same setting at the laboratory, but participants completed an online self-report questionnaire assessing the personality characteristics as well as cheating in a committed relationship at home. We cannot preclude that different settings influenced responses, for example, because participants felt more anonymous while filling the online questionnaire, which made them respond more or less honest. Overall, participants might not have honestly answered questions regarding unfaithfulness given that this is a sensitive topic. The lack of a continuous scale of unfaithfulness (i.e., asking participants how often they have cheated on their partner) further limits the significance of our findings and should be addressed by future studies.
Finally, the fact that neither hypotheses nor analyses of the current article were preregistered is a limitation with regard to the interpretability of our results. More precisely, the current investigation is purely exploratory, the significant effects were of small magnitude, their confidence intervals barely excluded zero, and the found (small) effects would not remain significant when controlling for multiple testing. The latter might be a problem of lacking test power for detecting small to very small effects in our data. To test if the found effects are real and replicable, preregistered replication studies with a larger sample size are needed.
Conclusion
Unethical behaviors and tendencies can have important consequences for peoples’ lives, leading scholars to investigate mechanisms regulating them. Our exploratory results suggest no compelling evidence that the steroid hormones testosterone, cortisol, estradiol, and progesterone play an important role in regulating dishonesty, unfaithfulness in committed relationships, prosociality, or related personality characteristics in women, although we report some small links between average cortisol or progesterone levels and the personality dimensions
Supplemental Material
sj-docx-1-psp-10.1177_01461672231199961 – Supplemental material for Revisiting the Relation Between Steroid Hormones and Unethicality in an Exploratory, Longitudinal Study With Female Participants
Supplemental material, sj-docx-1-psp-10.1177_01461672231199961 for Revisiting the Relation Between Steroid Hormones and Unethicality in an Exploratory, Longitudinal Study With Female Participants by Julia Stern, Christoph Schild and Ingo Zettler in Personality and Social Psychology Bulletin
Footnotes
Declaration of Conflicting Interests
Funding
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
