Abstract
Introduction
Studies have shown that women perform worse than men on financial literacy tests, report lower financial confidence, and are less likely to participate in financial markets (Al-Bahrani et al., 2020; Almenberg & Dreber, 2015; Arellano et al., 2018; Cwynar et al., 2019; Fonseca & Lord, 2020; Grohmann & Schoofs, 2021; Ooi, 2020; Preston & Wright, 2024; Cwynar et al., 2025). This phenomenon – commonly referred to as the gender gap in consumer finance – perpetuates gender inequality and remains largely unexplained by observable differences between women and men such as age, education or income (Bucher-Koenen et al., 2017; Cupák et al., 2018; Potrich et al., 2024; Robson & Peetz, 2020). Researchers suggest that it may result from less observable cultural factors related to social gender roles and the ensuing stereotypes (Aristei & Gallo, 2022; Bottazzi & Lusardi, 2021; Driva et al., 2016; Preston et al., 2023; Rink et al., 2021; Tinghög et al., 2021).
In this article, we examine one such factor, which is a primary manifestation of culture – language. Its role in the gender gap in consumer finance is not well understood. Language encodes socially imposed gender roles and stereotypes (Goodhew et al., 2022; Johansson, 2005; Shoham & Lee, 2018). Extensive literature indicates that the language used by women differs from that used by men due to cultural rather than biological factors (Meier et al., 2020; Pennebaker et al., 2003; Piersoul & Van de Velde, 2023). Male linguistic expression tends to be more direct, instrumental, impersonal and socially distant, often focusing on object characteristics and neutral topics. In contrast, female linguistic expression is generally more elaborate, warmer, more interpersonal, and frequently involves references to psychological and social processes, with a tone that is more polite, complimentary, and considerate (Biber & Burges, 2000; Mulac et al., 2001; Newman et al., 2008; Park et al., 2016; Thomson et al., 2001). Additionally, there are gendered expressions and words tied to stereotypes about social gender roles (e.g.,
Although the role of language in shaping consumer financial choices has been recognized, particularly through framing studies (Bayer & Ke, 2013; Florence et al., 2022; Jin et al., 2017), the impact of language on the gender gap in consumer finance is only beginning to emerge (Boggio et al., 2015, 2020). Language, and words specifically, are considered “strategic instruments of influence” (Farrow et al., 2018, p. 580), capable of shaping cognition, attitudes, and behavior by enhancing identity and social belonging (Farrow et al., 2018; Phillips & Boroditsky, 2003). By using specific words or invoking specific metaphors, language signals what is compatible with a person’s gender and the resulting social norms, encouraging or discouraging a particular activity. Nowadays, finance is still stereotyped as a masculine domain, associated with active, sometimes aggressive, pursuit of livelihoods and material prosperity (Allen & Gervais, 2017; Tinghög et al., 2021; Von Hippel et al., 2015). The domain-specific masculine language might contribute to the gender gap in consumer finance by not giving women a sense of belonging and resulting in their withdrawal from the domain of finance.
Identifying the linguistic causes of this gap is necessary for developing expedient solutions to narrow or even eliminate it. Confirming language as a factor in this detrimental phenomenon could significantly improve financial education programs. Goodhew et al. (2022) empirically show that gender stereotypes are pervasive in general language use, that is even when communication is aimed at large and diffuse audiences, and does not assume interaction – as is the case with many financial education materials (book content, websites, leaflets, podcasts etc.). Ample empirical evidence indicates that even minor linguistic alteration can lead to significant behavioral changes (see the literature indicated in Gaucher et al., 2011). Thus, altering the linguistic design of financial education materials could be a cost-effective strategy for policymakers to reduce gender inequality (Farrow et al., 2018), offering a low-cost solution to financial education, which is considered expensive and economically inefficient (Willis, 2008).
The lack of research on gender differences in language use in consumer finance, and the potentially serious socioeconomic implications of the existence of such differences, prompted us to fill this gap. Therefore, the purpose of this article is to investigate whether women use written language differently than men when referring to financial issues. In undertaking this study, we posed three questions: (Q1) Does the written language of women differ from that of men in the domain of consumer finance? (Q2) If it does, what are the differences? (Q3) Are these possible differences in line with what is known about women’s and men’s language use from wider trends observed in other content (subject) domains, or are they specific to the consumer finance domain? To answer these questions, we conducted three studies using language corpora (Polish) on 10 financial terms (cost, credit, insurance, interest, investment, money, pension, profit, saving, tax) commonly used in formal and informal consumer finances communication. These corpora were analyzed separately using machine learning (natural language processing), statistical, and linguistic methods to identify gender differences.
Our study makes an original contribution to two strands of literature. First, it advances the use of textual and machine learning analysis in financial phenomena. While such analyses have been applied in corporate finance (see an overview in Li, 2020), their application to consumer finance is emerging (Levantesi & Zacchia, 2021). Our study is the first to use machine learning to examine gender-specific language in consumer finance contexts.
Second, it contributes to the literature on gender differences in financial language. While general gender differences in language use are well-documented, little research has explored these differences within specific domains, such as consumer finance, which possess distinct linguistic characteristics. Previous research by Boggio et al. (2015) showed that metaphors on retail investor websites predominantly draw from masculine conceptual domains (such as physical activity, game, war/conflict, etc.). However, this study focuses exclusively on metaphors, using a very small (totaling around 15,000 words) and undifferentiated linguistic corpus (topics related only to investing) and very simple analytical methods (expert identification of metaphors and links between gender and the use of these metaphors).
The second study which identifies gender differences in financial language use (Ben-Shmuel et al., 2024), although rigorously designed and conducted, uses only a qualitative evaluation of a linguistic corpus from only one source (social media) created by individuals with (presumably) above-average financial expertise (finfluencers).
In contrast, our study uses three language corpora, which are large and diverse – both in terms of the nature of communication (e.g., parsimonious and concise statements, constrained by word limits, on Twitter vs. the unrestricted and unlimited written expression in a questionnaire survey) and in terms of subject matter (the full range of issues representing consumer finance: from day-to-day cash management to savings to taxes and insurance). Second, the content we analyzed was produced largely by non-expert women and men (ordinary consumers of both sexes). Third, our study uses a rich array of analytical methods, both statistical and linguistic, as well as those representing natural language processing. As a result, we were able to study very different aspects of language, both those manifested in the surface and deep structures of the language: formal, semantic, related to pragmalinguistics, etc. Hence, our findings offer new insights into the gender gap in consumer finance and suggest potential avenues for addressing this disparity, including the need to rethink how financial literacy and behavior indicators should be developed.
Literature Review
Theoretical Framework
The study of language’s role in finance is an emerging interdisciplinary field lacking a well-established theoretical foundation. Farrow et al. (2018, p. 558) suggest that behavioral economics is uniquely positioned to explore this area, emphasizing various psychological processes through which language can shape cognition and behavior in the domain of finance. One key process is the impact of language on identity. By focusing on identity, we can better understand how language influences financial cognition and behavior, offering a unified theoretical framework that integrates existing partial approaches. Accordingly, we adopt social identity theory as the primary theoretical foundation for our analysis.
Social identity theory, developed by social psychologists (Tajfel & Turner, 1979), has been effectively applied in economics to explain economic behavior (Akerlof & Kranton, 2000). The theory suggests that individuals are categorized as either “female” or “male,” each associated with specific behavioral norms and socially prescribed gender roles, which give rise to stereotypes – generalized and simplified beliefs about different social groups. These stereotypes are not merely descriptive but often prescriptive or proscriptive, and deviating from them can result in negative consequences such as anxiety, discomfort, conflict, and reduced well-being. To avoid these costs, individuals tend to conform to gender-specific behavioral norms.
Language, the central focus of this article, both reflects and reinforces social categorization (Newman et al., 2008). Social identity theory posits that individuals derive a sense of self from their group memberships, which strengthens adherence to group norms, including linguistic markers of social categories. In a socially “group-structured world” (Goodhew et al., 2022), distinct groups – such as females and males – use language to differentiate between in-group and out-group members, much like subcultures employ slang or jargon (Farrow et al., 2018). Social gender roles were likely encoded in language in the ancient past and began to be expressed in language by human ancestors due to inherent gender differences in reproduction and the resulting division of labor and specialization (Johansson, 2005). Because linguistic structures are highly durable (Mavisakalyan & Weber, 2018), language as a vehicle for cultural expression has preserved social norms and belief systems associated with gender roles until today.
Since finance remains stereotyped as a male-dominated field (Allen & Gervais, 2017; Tinghög et al., 2021; Von Hippel et al., 2015), the language used in this domain is likely to reflect this socially resonating gendered division. For instance, men’s language in financial contexts may convey confidence, expertise, and a sense of agency, reinforcing their perceived affiliation with (belongingness to) the financial world, in line with the mechanisms discussed by Gaucher et al. (2011) and theoretical expectations stemming from social identity theory. Conversely, associating finance with men may contribute to women’s feelings of alienation and lack of belonging, diminishing their confidence and perceived influence in financial matters (Gaucher et al., 2011; Kricheli-Katz & Regev, 2021).
Empirical Evidence
Gender and Language Use in General
Research on the relationship between gender and language use dates back to Lakoff (1973). The literature on this topic indicates statistically significant, though often small, differences in linguistic production between women and men, observed across various linguistic levels, including phonetic, lexical, and morphosyntactic. Contemporary linguistic, sociological, and discourse studies confirm that women and men use language differently in both speech and writing, albeit with modest distinctions (Meier et al., 2020; Newman et al., 2008; Park et al., 2016; Piersoul & Van de Velde, 2023; Plug et al., 2021; Van der Velde et al., 2015).
Empirical evidence suggests that women tend to use more affiliative speech, whereas men are more likely to adopt assertive speech patterns (Leaper & Ayers, 2007). Women’s language is generally warmer, more sympathetic, and polite (Thomson et al., 2001), while men’s language tends to be colder, more hostile, and less personal (Park et al., 2016). Additionally, women use more words related to psychological and social processes, whereas men focus more on object properties and impersonal topics, despite having a similar overall word count (Newman et al., 2008).
Women are also more likely to use language relationally, whereas men use it primarily for informational purposes (Biber & Burges, 2000; Newman et al., 2008; Park et al., 2016; Tannen, 1994), a distinction first noted by Lakoff (1973). Since emotions play a key role in relationship-building, women’s language is often described as more tentative, attentive to the interlocutor, and rich in features that foster emotional connection (Leaper & Robnett, 2011; Mulac et al., 2001; Newman et al., 2008; Park et al., 2016). This so-called “affective” or “emotional” style is reflected in various linguistic markers, including sentence-initial adverbs, uncertainty verbs, modal auxiliary verbs, emotion words, and intensifying adverbs.
One example of men’s greater emphasis on information is their more frequent references to quantities and locations (Mulac & Lundell, 1986), as well as their focus on objects, in contrast to women’s tendency to focus on people (Newman et al., 2008; Park et al., 2016). More broadly, women’s language can be characterized as personal and interpersonal, whereas men’s language is more impersonal and socially distant.
Gender and Language Use in Finance
Boggio et al. (2015) examined a linguistic corpus (English, Italian, and Dutch) from websites targeting retail investors, focusing on the frequency of metaphors representing different (feminine and masculine) conceptual domains. They found that the vast majority of the metaphors were drawn from masculine source domains such as, for instance, physical activity, war, game, and farming.
In a series of studies, Gaucher et al. (2011) showed that job advertisements for male-dominated occupations (including accounting and financial management) contained more masculine words and that inserting more masculine language into job advertisements made them less appealing to women and led participants to perceive the jobs as male-dominated. They concluded that “masculine wording in job advertisements leads to less anticipated belongingness and job interest among women, which (…) likely perpetuates gender inequality in male-dominated fields” (Gaucher et al., 2011, p. 119).
In an experimental study, Boggio et al. (2020) found that using language that women can identify with in describing the financial tasks increased attention, interest, and consistent responses between participating girls. A recent study by Ben-Shmuel et al. (2024) found that financial advice varies by gender. Male finfluencers tend to emphasize quantitative aspects, utilizing numbers and graphs to communicate their messages, whereas female finfluencers are more inclined to incorporate narratives and personal stories, connecting financial topics to broader life goals and personal experiences.
The empirical findings presented in this section are promising; however, this research has certain limitations that our study successfully addresses (see the Method section for details). The study of Boggio et al. (2015) focuses solely on metaphors, relying on a small and undifferentiated linguistic corpus of approximately 15,000 words, limited to investment-related topics. The analysis employs simple methods, involving expert identification of metaphors and their associations with gender. In turn, the study of Ben-Shmuel et al. (2024) is based on a qualitative analysis of linguistic data from a single source – social media – produced by individuals presumed to have above-average financial knowledge (finfluencers).
Nevertheless, building on the theoretical framework outlined in section 2.1 and the emerging empirical evidence discussed in section 2.2, we propose the following main hypothesis for this study:
H1: In the domain of consumer finance, women and men use language differently.
Drawing on the findings of Ben-Shmuel et al. (2024), which demonstrate that gendered language use in finance aligns with broader trends reported in linguistic and discourse studies (e.g., more personal, experience-based, and narrative style in women vs. more technical, argumentative, and informational style in men), we propose the following additional hypothesis:
H2: Gender differences in language use within the consumer finance domain reflect the general patterns observed across other subject areas.
The Present Research
General Information
Three distinct studies were conducted to examine gender differences in language use in consumer finance: (a) a computer-assisted web interview (CAWI) survey on a sample of 1,068 individuals representative for the population of adult Poles in terms of sex, age, size of residence and region (Study 1), (b) a study of over 77,000 tweets retrieved from platform X (Twitter) (Study 2), and (c) an online survey on a purposive sample of 358 students (Study 3). In each study, language corpora were collected and the analyses were conducted in a uniform manner.
The studies were approved by the WSEI University Bioethics Committee. They were carried out in accordance with relevant guidelines and according to the principles expressed in the Declaration of Helsinki. Except Study 2 (in which we collected anonymized data from the X platform), before data collection all participants were informed about the study protocol and gave their consent to take part in the study. With the exception of Study 1, which took place on a paid research panel, participants did not receive compensation.
We chose to utilize a diverse linguistic corpora to capture universal patterns, independent of the medium’s characteristics, rather than focusing on patterns specific to one medium. The linguistic corpus derived from Twitter (Study 2) is peculiar, primarily due to the limitations on the number of words one can use. It also serves as a forum for exchanging opinions and expressing views. The same features are shared by the language corpus collected in Study 3, which was stylized on Twitter. In contrast, the survey questionnaire we prepared for the CAWI survey (Study 1) allowed for unrestricted and unlimited written responses, without the involvement of interaction with other individuals. For these reasons, the use of language in these distinct media can be (and certainly is) at least somewhat different. We were keen to take this into account.
The language corpora from Study 1, Study 2, and Study 3 were analyzed using the same set of 10 keywords (henceforth referred to as 10-KW) covering the thematic range of consumer finance issues indicated in the relevant literature (Dew & Xiao, 2011; Huston, 2010; Remund, 2010). This consistent vocabulary ensured that we were examining the same research objects across studies. Our goal was to investigate the (written) language used by consumers of both sexes around these keywords.
The word selection was a four-stage process with six research team members from various disciplines (economics and finance, psychology, sociology), including two women: (a) literature review; (b) collecting literature-based proposals from team members; (c) checking the frequency of proposed words in Google search engine; (d) selecting the top 10 words from those proposed. We considered the frequency of proposed words in Google searches as a measure of each word’s importance in consumer communication. The final selected keywords are: cost, credit, insurance, interest, investment, money, pension, profit, saving, tax.
Figure 1 synthesizes the set-up of our research. First, we selected 10 keywords for the consumer finance domain (10-KW). We then used them in three studies of written language from two media that differ strongly in the constraints imposed on communication mode and language (upper and lower parts of Figure 1). Studies 2 and 3 were related, as we explain in detail in Section 3.4.1. Study 3 was designed to verify the accuracy of artificial intelligence classification algorithms for recognizing differences in financial language between men and women in Study 2 (in a linguistic corpus extracted from platform X, which is devoid of gender tags). For this reason, in Figure 1 we present them at the same level, that is, in the lower part of the figure.

Study design.
To make this article easier to navigate, we briefly introduce here the structure of the Method and Results section. The Method section continues with the following:
1) In Sections 3.2, 3.3, and 3.4, we present the assumptions of each of our three studies (Study 1, Study 2, and Study 3) in more detail (each of these sections is divided into two smaller sections: Study design and Data)
2) In Section 3.5, we explain the nature of the analyses performed, dividing this section into sub-sections on the domains represented by the methods used (machine learning, statistics, linguistics).
In turn, the Results section is divided into three parts presenting the results of Study 1, Study 2, and Study 3. Each of these three parts in turn consists of the results of applying statistical and linguistic analyses (machine learning analyses were used as pre-processing to enable statistical and linguistic research).
Study 1
Study Design
In Study 1, the 10-KW set prompted respondents to generate free-form written statements regarding: (i) associations with the 10-KW (e.g.,
Data
The survey was conducted on a quota-based sample (1,068 respondents) representative of the population of adult Poles in terms of sex, age, size of the place of residence, and region (voivodeship). Data were collected on one of the most populous research panels in Poland (Ariadna Panel –https://panelariadna.com/) between February 14 and 22, 2024 using computer-assisted web interviews (CAWI). Participants comprised 561 females (52.5%) and 507 males (47.5%), with a balanced age distribution, primarily 55 years and older (40.8%) and 35 to 44 years (20.9%) (Table 1).
Participants of the Study 1.
Educational attainment varied, with most participants having an upper secondary or post-secondary education (52.5%). Engineering, manufacturing, and construction were indicated as the field of education by 17.7%, and 16.9% pursued business, administration, and law. Nearly half were married for the first time (48.2%), while singles made up 17.2%. In terms of employment, 45.6% were full-time employees, and 29.7% were retired. Residence was predominantly in villages (38.6%), followed by towns with populations of 20 to 100 thousand (19.9%), and cities with 101 to 500 thousand inhabitants (16.9%).
Pearson chi-square tests revealed significant differences between women and men in educational background (χ2 = 120.930, df = 11,
Study 2
Study Design
We collected textual data via API of the platform X (still Twitter at that time) from September 7, 2022 to January 6, 2023. We gathered 343,709 tweets using 10 hashtags (equivalent to our 10-KW): cost, credit, insurance, interest, investment, money, pension, profit, saving, tax. After removing duplicates and short messages (less than three words), we ultimately worked with a corpus of 77,088 tweets. The distribution of hashtags in the cleaned corpus is as follows: credit – 10,537; money – 9,919; profit – 9,589; tax – 9,371; cost – 8,919; investment – 8,060; pension – 7,489; insurance – 5,246; interest – 4,425; and saving – 3,542.
However, the language corpus extracted from platform X has the property that posts (tweets) do not have gender tags. Therefore, in the next step we created a dictionary of gender-specific linguistic endings in Polish, where the conjugation of verbs and adjective endings often indicate the writer’s gender. Using the endings, we identified 828 tweets by women and 2,588 tweets by men.
Having labeled the training material by gender endings, we used artificial intelligence methods to identify a larger number of male and female posts (though not all, as identification by word endings is not exhaustive in determining the author’s gender). To this end, we employed three Polish language models: HERBERT, Polish_BERT, and XLM. After quantitative and qualitative verification, the Polish_BERT returned the best results. In total, we identified exactly 10,000 posts by women and 11,997 posts by men. We then conducted statistical and linguistic analyses on these data.
To verify the accuracy of the classification model, we tested it on data collected in Study 3 (see section 3.4), which was designed to replicate the communication properties of the X platform, but at the same time the gender of the post authors was known to us. As indicated in section 3.4, in Study 3 we collected data from 268 females and 85 males. It should be emphasized that it is quite difficult to achieve a good classification result on such an imbalanced dataset, hence the necessity for class weighting. It turned out that the best model is Polish_BERT with an accuracy of 77% (Herbert 37%, XLM 18%).
Study 3
Study Design
Study 3 aimed to verify the accuracy of the artificial intelligence classification algorithms to recognize financial language differences between women and men in the linguistic corpus extracted from the X platform which is devoid of gender tags (see Study 2 in section 3.3; as we indicated there, the highest accuracy – provided by the Polish_BERT model – was 77%).
To achieve this, we developed a survey questionnaire to gather content similar to posts from platform X. The process consisted of several stages:
Statement creation: Two individuals (one male and one female) each formulated 10 short statements (stylized as platform X posts) using a different word from the pool of 10-KW.
Psychological verification based on the patters indicated in Newman et al. (2008).
Linguistic cleansing: A linguist/communicologist on the team removed any gender-marking characteristics from the statements.
Survey preparation: The 20 statements (10 produced by a male and 10 by a female) were added to a survey questionnaire, along with other questions about consumer finances, prepared by two team members. This questionnaire was distributed online to students from several Polish universities.
Participants answered demographic questions and formulated responses (re-tweets) to the statements. The automated online questionnaire displayed two pools of 10 statements: one from a woman, one from a man, in random order, without indicating the author’s gender. Participants responded to at least three tweets from each pool. The choice of tweets, the number of responses (more than three), and the length of each response were up to the participants, who could stop responding at any time without explanation.
Data
Data were collected in November 2023 using the platform of the market research company BEW.EDU.PL. The questionnaire was distributed to students through the universities’ educational platforms. The survey link was opened by 1,260 individuals. Out of these, 902 provided their sociodemographic information (gender, age, and native language skills), and 358 native speakers completed the questionnaire fully. We rejected five responses due to unintelligible random entries, leaving us with data from 353 participants (268 females and 85 males).
Table 2 reports distribution of the sample in terms of the key sociodemographic variables. The mean age of the participants was 28.19 years (
Participants of the Study 3.
The initial pool of words from these 353 individuals totaled 61,289. The shortest re-tweet was a single one word, indicating the respondent stopped participating; the longest was 557 words, with a median length of 155 words. Our final language corpus included 588 frequently repeated words, each used at least twice, after filtering out stop-words and non-dictionary or nonsensical terms.
Analysis
In each Study (Study 1–3) collected data underwent the same set of analyses: machine learning processing, followed by statistical analysis, and then linguistic analysis.
Machine Learning Analysis
To analyze the text data (linguistic corpora) collected in Studies 1 to 3, we employed a range of natural language processing techniques using Python libraries including sklearn, nltk, gensim, and spacy. The analysis consisted of several stages. Initially, we cleaned the data and lemmatized the text corpus using the publicly available Polish language model, “pl_core_news_lg” (GNU GPL 3.0), which has 500,000 unique vectors across 300 dimensions. Given that Polish has highly inflectional nature, lemmatization reduced words to their roots. We then removed common stop-words and punctuation, which carry little meaning (equivalent to “the,”“at,”“on” in English). Graphical elements such as emojis were excluded from the analysis to maintain consistency across data sources.
Next, we counted the frequency of words in the lemmatized corpus, providing insights into the differences in language usage between women and men regarding financial matters. We then conducted
The identified bigrams and trigrams were visualized as networks using Python libraries such as networkx, matplotlib, and seaborn. Although
Statistical Analysis
We identified the words most frequently used by women and by men – both in separate corpora produced by women and men, and in one aggregate corpus combining the two. We then determined which words showed statistically significant frequency differences between the sexes, examining the individual sub-corpora and the combined corpus.
We analyzed 10 most common words in each sub-corpora, setting this cut-off threshold (10 words) for statistical reasons, as less frequent words did not provide sufficient statistical differentiation power. We used the chi-square coefficient to compare the frequency of use of each word by women and men. With the significance level of .05, the minimum acceptable test power of 0.8, and the expected effect size of 0.5, the minimum sample size was
In Study 1, in addition to strictly linguistic questions, we asked respondents a question to learn how important the chosen financial terms (10-KW) are to their everyday functioning (see the section 3.2.1 for the wording of the question). A five-point Likert scale ranging from
Linguistic and Communicative Analysis
To analyze biolects, that is, the way of speaking that is characteristic of a particular biological group, often based on gender or age and their interpretation of the world, including finance, we first applied traditional generative grammar tools to identify similarities and differences in the deep and surface structures of the content produced by women and men (Wołkowski, 2010, pp. 145–161). We focused on: (i) thematic-remedial structures, examining syntagmatic connections within sentences and whole responses, (ii) valences and semantic roles of the statement components (Fillmore, 1969), and (iii) how generative models in the deep structure influence surface linguistic constructions (Chomsky, 1982, 2010).
Secondly, we analyzed the extracted connections, constructions and linguistic units using textological, ethnolinguistic (referring mainly to the methods of cultural semantics), and cognitive linguistics methods (de Beaugrande & Dressler, 1990). This comprehensive approach allowed us to reconstruct the linguistic image of finance highlighting similarities and differences between women and men in key interpretative categories such as categorization, axiology, onomastics, and metaphorization (Bartmiński, 2012; Nowak, 2002; Tokarski, 2014). The methods of ethnolinguistics/cultural and cognitive linguistics helped us interpret conceptual metaphors and contexts (Lakoff & Johnson, 2003) and assess whether only masculine metaphors appear (Boggio et al., 2015).
Third, from a pragmatic perspective, we evaluated the intentions and effects of participants’ language production, assigning single sentences and whole statements to different types of speech acts (Searle, 1987). We described differences and similarities in the verbal responses of women and men regarding finance, influencing the differentiation of the locus (formal structure) and illocution (intentionality) of these acts. This analysis allowed us to categorize statements into types such as assertions, directives (command or request), commissives (committing to future actions or expressing intentions to perform certain actions), expressives (expression or evocation of emotional states), or, finally, constatives (stating facts, beliefs, or truths).
The Polish Language
Studies 1 to 3 were conducted in Polish, which influenced the study design and might have impacted the findings as well. The Polish language, belonging to the West Slavic group, has evolved over 1,000 years, influenced by Proto-Slavic and other Slavic languages, distinguishing it from English, French, and German. Notable differences lie in its intricate grammar, rich grammatical categories, and forms, and heterogeneous semantics shaped by both intra-linguistic factors and external contexts such as culture, politics, and society.
Contemporary Polish is characterized by:
Grammatical and semantic gender markers, more abundant than in English.
A deliberate mid-20th-century decision of the Polish authorities to neutralize masculine gender leading to fewer distinctions between genders in terms like professions’ names, unlike English.
Patriarchal cultural norms and administrative choices favoring linguistic masculinity in public discourse.
Preference for masculine forms in conveying professionalism and seriousness.
Consequently, the “femininity” of Polish language manifests predominantly in the underlying structures of communication (public, official, didactic, scientific, etc.), emphasizing valuation, emotionalization, outcome description over processes, and less abstractness and typicality/ordinariness in narrative.
Results
Study 1
Statistical Analysis
Women used more words (Me = 37) than men (Me = 35;
As mentioned in section 3.5.2, we asked respondents a question allowing us to learn how important the chosen financial terms (10-KW) are to their everyday functioning. Table 3 presents the importance of these terms based on the responses to the survey question “
Respondents’ Answers to the Question “You Feel That You Are Finding Your Way Well in Finance When You Sort Out the Following Things ….”
Mean in range between 1 –“
Respondents most often reported that they felt that they were finding their way well in finance when sorting out money, saving, cost, and interest (Table 3). Conversely, the least often reported were credit, investment, tax, and pension. The most pronounced gender differences were noted for five tested terms: “investment” (
Linguistic Analyses
Differences in financial language between women and men appear across almost every keyword studied.
Money
Both genders identify money primarily a means of payment. However, men showed a need for detailed categorization referring to cash, currency, banknotes, coins, change, and the slang terms like dough, rocks, dust, and many more. Women do not use such terms and focus on money’s presence, associating it with wealth, whereas men discuss the lack of money, that the need for credit, and insufficiency (generally, in female language production, “money is there,” while in male language production, “money is missing”).
Additionally, women discuss money in the context of everyday life, usually positively aligning with common sense and in accordance with a communication strategy known as
Pension
Both genders emphasize importance of knowing what their lives will be like upon retirement. The verb “to know” is highly ranked in the frequency for both sexes, reflecting a rational, specialized approach. In men, the verb “to know” ranks first, while in women – second. Women’s most frequent term is “oldness (old age),” combined with the noun “time” and the verb “to rest.” In contrast, men’s frequent terms include “work” and “money,” introducing financial categories and scaling pensions with words like “minimum” or “average,” emphasizing a specialized paradigm and competences. Women do not scale pension, but instead create an axis of valuation using terms like “small”/“low” and “high.” This reflects the difference in perspectives: male linguistic production is more process-oriented while female – more result-oriented. Men also connect pension to government actions and contextual, macroeconomic factors, often pointing out the low pensions of women. Women’s economic contextualization is more private (daily) focused, without indicating the higher pensions of men.
Cost
Men approach costs rationally and are process-oriented as reflected in their frequent use of the verb “to know” and other imperfective verbs like “to spend” or “to grow.” Women often use nouns like “a cost” and “an expense,” along with perfective verbs like “to spend,”“to incur,” and “to increase.” Once again, men tend to categorize and detail costs precisely, for example, auto, fuel, VAT, energy, gas, etc., while women discuss the experienced effects of costs, using terms like “pricey.”
Investment
The linguistic richness of women’s statements about investments is notable, with 114 communicatively independent words compared to men’s 97. Women often evoke their affective state, linking investment to values and emotions (e.g., responsibility, security, uncertainty, good). This emotional dimension is largely absent in men’s statements. As with the other 10-KW, men tend to categorize investment types in detail, from deposits to cryptocurrencies, listing 23 types in total.
Credit
Both men and women associate credit with negative concepts. Women link credit to risk, stress, and negativity, while men associate it with fear, exploitation, and deception. Men view credit more fearfully than women, using phraseologisms like “crutch” (“ball and chain”) and portraying credit institutions negatively as exploitative or usurious. These associations are absent in women’s statements. Women also see credit as a means of achieving goals and as a financial boost, whereas men often see it as last resort and a tool for consumer exploitation (fraud, thievery).
Interest
Both genders view interest negatively, akin to credit. Women perceive interest more negatively than credit, while men see both equally negatively, though distinguishing between harassment (interest) and usury (credit). Both genders recognize the connection between interest and state institution, as indicated by references to Marek Belka (former Prime Minister of Poland and Minister of Finance) who introduced the capital gains tax (so-called
Saving
The interpretation of saving differs the least between genders due to its common usage and metaphorical richness (imagery) in general language. According to the Modern Dictionary of the Polish Language (www.wsjp.pl), “saving” has four meanings, including financial saving, saving time (declaratively important for women),
Tax
Tax, along with credit and interest, forms the most negatively interpreted triad of the 10-KW. Both genders interpret tax negatively, but women focus on paying tax, while men categorize and explain taxation system. Men mention “CIT” (corporate income tax) and other specific terms, while both genders refer to “VAT” (value added tax), “PIT” (personal income tax), and excise taxes. Only women mention “toll” (“myto” in Polish). Women also connect tax to the budget, and see it as a societal contribution, whereas men view tax as “coercion,”“fraud,”“deception,” and an individual burden.
Insurance
Women frequently mention “life insurance” and “casualty insurance,” while men focus more on the institutional nature of insurance (e.g., “PZU”– Powszechny Zakład Ubezpieczeń– Poland’s largest insurance company), “peace of mind” and associated costs. Men also mention “MTPL” (motor third party liability) policy more often than women. Women’s language associates insurance with “fear” and “illness.” Men also view insurance as a “scam” (this term also appears in women’s corpus), “(forced) tribute,”“coercion,” and a “thief.” Once again, men’s language about insurance is self-centered and express belief in their invulnerability, while women’s language is sociocentric and narrative, emphasizing responsibility, which appears in their corpus but not in men’s.
Profit
Men focus on the process of calculating profits, using words like “positive,”“result,”“difference,”“plus,”“minus,” and “balance.” Women associate profit with more emotional and random concept, such as “reward,”“surplus,”“win,”“good,”“joy,” and “satisfaction.” Women describe the result of profit with terms like “effect” and “commodity,” which are absent from men’s language.
Study 2
Statistical Analysis
The three recurring words that were used significantly more often by women in the sub-corpora centered around the 10-KW were the verbs “to want” (chi-square ranging from 4.167 (
Compared to women, men significantly more frequently used: (i) professional language terms such as “net” (in the sense of the opposite of “gross”; in the sub-corpus related to “profit”; χ2 = 18.591;
Linguistic Analyses
In analyzing the language corpus extracted from platform X, we faced two challenges. First, the platform itself is a communicologically and sociologically specific medium. In Poland, its users are primarily aged 18 to 34 (55.5%) and predominantly male (74%). Tweets about economics and finance rank 10th in popularity, viewed by 41% of users (IAB Polska, 2023). Second, identifying gender differences in the language of tweets on financial topics is complicated by technical constraints and Polish language rules. The 280-character limit on tweets encourage concise and direct communication – a style traditionally attributed to men. This limitation often forces women to adopt a similar style, reducing the visibility of gender-specific linguistic features. Additionally, Polish language rules allow to use masculine forms as generic, further blurring gender distinctions in language.
Despite these limitations, we identified key linguistic differences in how women and men linguistically interpret financial phenomena. These differences manifest not necessarily in the vocabulary but in the use of word ambiguity highlighting and hiding a different ordering of semantic features of the same lexical unit. For instance, men frequently use the verb “to grow” in tweets about money and saving, appearing 12 times and related terms (e.g., “growing,”“growth,”“increase,”“stop” and “decrease” or “fall”) appearing 24 times. This reflects a view of finance as a dynamic process influenced by external factors (politics, economy). Men’s language sees financial activities as processes with time varying outcomes, emphasizing growth and decline.
In contrast, women rarely use the verb “to grow” in the same context. When they do, it often refers to outcomes (again, result-oriented perspective), rather than processes, aligning finance with the metaphor
Men’s statements are agentive, focusing on action and instruments leading to financial goals. In contrast, women’s language is experiential and dative, highlighting how financial activities affect them emotionally and personally. Safety and health are important themes in women’s financial communication, almost non-existent in men’s. Additionally, the genres of speech differ: women’s posts are often narratives about everyday life, while men’s are mini-lectures, news stories, or instructions, aiming to showcase expertise.
Study 3
Statistical Analysis
The analysis of word frequency in the language corpora revealed relatively few differences between women and men. In some sub-corpora such as those related to “profit,”“tax,”“cost,”“insurance,” and “investment,” no differences were noted. High frequencies of both “money” and “happiness” in both women and men were found. However, it must be noted here that some part of the linguistic corpus resulted from automatic re-tweeting by the study participants. For example, when our post mentioned that money does not give happiness, participants overwhelmingly expressed their opinions whether money gives happiness or not. This could be a reason for high prevalence of these two terms.
Interestingly, no words were significantly more frequently used by men in the sub-corpora produced in relation to various 10-KW. However, several words occurred significantly more often in women’s posts. The verb “to have” was more frequent in the women’s corpus for “pension” (χ2 = 6.587;
In the combined corpus of all responses, only one significant difference emerged: men used the verb “to have” more frequently than women (χ2 = 3.892;
Linguistic Analyses
From a grammatical perspective, generative linguistics connects the formal structures of language with its semantics. Beyond the obvious differences due to the gender inflection in Polish, we observed other characteristics of gendered language use. Women’s statements often include suppositional mode and conditional sentences (
Men’s statements, in contrast, frequently feature indicative mode and causal or manner sentences (“therefore,”“as”/“this way”). There are more professional terms and fewer emotional terms in men’s bigrams and trigrams (e.g., “stress,” common among women, is rare among men). Active semantic valences are more frequent in men’s statements, with men assigning themselves agentive roles (
From the perspective of cultural linguistics and cognitivism, women’s interpretation of the financial world utilizes different conceptual metaphors than in men – these are mainly the typical metaphors such as “money/finances are something valuable” (“lose,”“compound”) and “credit is something dangerous” (“be afraid of credit,”“be careful with credit”); centers around the individual rather than finance itself, evident in the frequent use of the noun “person”; treats communication about finances primarily as communication, indicated by verbs of speak and think: “say,”“suppose,”“assume.”
In contrast, men’s interpretation includes ordering and rational spatial metaphors, with idealized cognitive models (ICMs) like center-periphery, source, way, and space (“even”–“uneven,”“high”–“low,”“source of income,”“something will be nibbled on the side” (“nibble” as a metaphor for food/eating/consumption), “option,”“catch on to something”); conceptualizes finances as essential and varying by context; with metaphors related to movement and parameters (e.g., “arrives,”“flows down” (“streams down”), “everything goes to the current,”“I did not move anything in this direction,”“stable,”“trimmed,”“weigh” (“burden,”“load”), “went 1,480 zloty,”“additional burden”); and views finances not just as communication but as concrete actions and tasks to be accomplished (e.g., “to solve,”“to cope,”“to know”).
Pragmalinguistic analysis reveals that women’s speech acts are propositional, less categorical, and often contain questions. Their assertions emphasize the individuality and subjectivity rather than universal or objective judgments. Men’s speech acts, however, are more constatives, aiming to establish or define reality, with greater illocutionary force, confirming the rationality and expertise of the speaker.
Discussion
The purpose of this research was to examine whether the language used by women regarding financial matters differs from that used by men. The results indicate that such linguistic differences do exist, positively answering our first research question (Q1) and confirming H1 hypothesis, and likely have implications for consumer finance phenomena such as the gender gap in finance. These findings align with previous studies that have shown that financial language is gendered (Ben-Shmuel et al., 2024), particularly at the level of metaphors (Boggio et al., 2015), and that the language may be linked to the gender gap in financial literacy (Boggio et al., 2020).
The differences in language use identified between women and men are numerous. Some of these differences are known from broader trends observed in other content (subject) domains. However, we have identified some novel differences that are likely unique to the language used in the finance domain, which answers our third research question (Q3). Gender differences in language production identified in our studies offer opportunities for designing policy interventions. These interventions could target language change in measuring consumer finance phenomena (e.g., financial literacy and behavior) and developing financial education materials. However, before discussing these potential applications, we will outline the most important differences in how women and men use language when dealing with financial matters, which answers our second research question (Q2).
First, men perceive financial phenomena as processes they both understand and control, presenting their statements as informative and authoritative while positioning themselves as experts. This tendency is reflected in their frequent use of professional terminology (or at least stylized as such). In contrast, women are more likely to focus on financial states rather than processes, conveying personal experiences and specific cases through colloquial language, with specialized terms appearing less frequently.
Second, women tend to use more figurative, narrative, sociocentric, and humanistic language, often referring to social aspects and expressing a higher degree of emotional intensity. In contrast, men are more likely to use technical and impersonal language, prioritizing precision and striving to establish structure, order, and categorical distinctions between phenomena (objects).
Third, women and men tend to ascribe slightly different meanings to key terms in consumer finance (saving, investing, and borrowing). For women, credit is often perceived as a means of achieving goals and providing financial support, whereas men are more likely to view it as a last resort for financing and a tool for consumer exploitation. Similarly, women more frequently associate saving with purchasing and day-to-day financial management, while men link it to future financial planning and the transfer of resources across time periods. Regarding investment, women tend to interpret the concept more broadly, often extending beyond the financial domain, whereas men typically adhere to the professional product categorizations established within the investment industry.
Fourth, men often perceive financial instruments and related institutions as exploitative, expressing feelings of being used, manipulated, cheated, or robbed. This is reflected in their frequent use of terms such as “(forced) tribute” or “exaction” when referring to taxes and their association of “government” with “costs.” In contrast, women tend to view manipulation and fraud as inherent aspects of life experience, assessing financial matters through a moral lens of good and evil without expressing surprise.
Fifth, men position themselves at the center of financial processes, viewing themselves as active agents who control or evaluate financial outcomes – often framed as winning or losing, in line with the game metaphor. Their focus is primarily on performance and results, as reflected in the higher frequency of “bottom-line”-oriented nouns such as “profit” and “loss” in men’s linguistic corpora. In contrast, women are more likely to perceive themselves as experiencers within the financial domain, viewing themselves as “inhabitants” subject to its rules and events rather than as agents shaping financial processes. Women tend to assess financial matters through the lens of emotions and values, as evidenced by the more frequent use of language related to personal experiences. For instance, words such as “accident” in the context of “insurance” and phrases like “to make a living” in relation to “cost” appear more frequently in women’s linguistic corpora.
Many of the differences we identified align with findings from previous studies on language corpora outside the financial domain, confirming our H2 hypothesis. Our study corroborates that women tend to use language in a more personal and relational manner, often referencing social and psychological phenomena as well as their own experiences. In contrast, men are more likely to use language in an informational, instrumental, technical, argumentative, and impersonal way. These patterns are well-documented in prior linguistic, discourse, and sociological research (Biber & Burges, 2000; Newman et al., 2008; Park et al., 2016; Thomson et al., 2001). In this regard, our findings are also consistent with the recent study by Ben-Shmuel et al. (2024, p. 1), which examined social media content produced by finfluencers (“Men emphasize quantitative aspects, whereas women incorporate narratives and personal stories”). These findings are also consistent with psychological research on gender differences in financial beliefs, attitudes, and behaviors. A recent scoping review by Sesini et al. (2023) found that men tend to perceive money as a symbol of success, status, power, and prestige. In contrast, women are more likely to view money in a way that reflects their tendency to consider not only their own needs but also those of others, such as household members (relational and personal aspect).
However, our study yielded several new and original findings. A particularly notable result, which is not widely documented in the literature on gendered language use, is the pronounced tendency of men to use language that signals agency (active semantic value), whereas women’s language more often reflects an “experiencer” stance (passive semantic value). In addition, men exhibit a greater propensity to use professional terminology and linguistically construct themselves as experts in finance. This patterns may stem from deeply ingrained stereotypes, as we hypothesized, that frame finance as a male domain, creating social pressure for men to identify with and demonstrate efficacy in financial matters – hence the strong linguistic manifestation of agency observed in our data. How the (stereotypical) beliefs in implied ability, activity, and agency of women and men are used in the language of advertising messages that shape children’s future choices and perpetuate gender stereotypes and inequities has been convincingly shown by Hourigan (2021).
These novel findings confirm that the language used in the domain of finance reflects the stereotypical perception of this field as male-dominated. As a result, men’s language in financial contexts conveys greater confidence, expertise, and a sense of agency, reinforcing their perceived affiliation with the financial world. This pattern aligns with the mechanisms outlined in social identity theory and discussed by Gaucher et al. (2011).
Implications and Future Research
There are many important implications from our study: for financial institutions, financial educators and advisors, and individual consumers. As for financial institutions, banks and other financial intermediaries should tailor their communications regarding financial products (e.g., loans, saving plans, investment instruments) to different audiences. Women may be more responsive to relational, narrative-based communications, while men may prefer precise, technical language. Women may also be more receptive to communications that emphasize security, while men may engage more with content that emphasizes control, efficiency, and expertise.
Regarding financial educators and advisors, institutions should ensure that financial training does not perpetuate or reinforce stereotypes, but rather empowers both men and women, taking into account their different financial perspectives and needs. Financial advisors should recognize gender differences in financial perceptions. Men may prefer expert, analytical advice, while women may appreciate a more personalized, experience-based approach.
When it comes to individual consumers, understanding that gender norms shape language can help individuals critically evaluate their financial attitudes and behavior and avoid the influence of stereotypes. Consumers can become more aware of the way they use language in the finance domain, increasing their chances of avoiding the pitfalls and cognitive biases resulting from language. Recognizing that finance is often perceived as male-dominated, women can proactively build confidence in financial matters, while men can benefit from incorporating relational and social considerations into their approach to finances.
Our results also point to many interesting and important directions for future research. First, this study underscores the need for further investigation into how language shapes financial behavior, particularly in digital spaces such as social media and financial education. The next logical step is to investigate whether manipulating language in financial education and diagnostic tests (assessing financial literacy and behavior) leads to differential outcomes for women and men. Building on our findings presented in this article, we have developed a financial literacy test, a scale to measure sound financial behaviors, and financial education materials in three language versions: feminized, masculinized, and gender-neutral. A forthcoming longitudinal randomized controlled trial (Cwynar et al., 2025) will assess the impact of these language versions. We hypothesize that reactions will vary based on language compatibility with gender, which could inform the development of unbiased financial literacy and behavior scales, as well as the content of financial education programs.
Second, future research should examine how other factors (e.g., culture, socioeconomic status, education) interact with gender in financial discourse. Third, our findings can help future researchers develop gender-sensitive financial policies, educational materials, and intervention programs to reduce gender gaps in financial confidence, literacy, and decision-making. Future researchers will face the challenge of “de-masculinizing” (or more universally: “de-genderizing”) financial language. Although Boggio et al. (2020) have shown that linguistic alignment of educational material with gender yields positive results, we believe in the need of universal language that avoids gender biases. The key question for future research is how to achieve this effectively.
Limitations
Future research should also aim to overcome the limitations inherent in our study. First, our research was conducted exclusively on Polish, so its findings may not directly apply to other languages. While we believe the patterns we identified are universal, further research in different linguistic contexts is necessary to confirm this. It is crucial to investigate whether our findings apply to other languages, particularly those with different level of gender-marking, such as Spanish (heavily gendered) on the one hand, and Finno-Ugric languages (genderless) on the other – see Corbett (2008).
Second, our study focused solely on written language, lacking data on spoken language. Therefore, we cannot confirm or refute our findings in relation to spoken discourse. Third, our linguistic corpus was built around 10 selected financial keywords (10-KW). Exploring the sensitivity of such a corpus to changes in keyword selection would be an interesting avenue for future research.
We used three research samples with varying distributions of socio-demographic characteristics. In Study 1, we used data from a quota-based sample representative of the adult Polish population, which allows us to generalize the conclusions of this study. The adult population, however, is characterized by a preponderance of older individuals. The design of our study was, however, to balance this predominance with data mainly from the younger part of the population (Studies 2 and 3). Nevertheless, the results from these research samples with distinctly different age structures converge. However, we consider it necessary and interesting to check in future studies whether age does not significantly differentiate the linguistic production of women and men in the financial domain.
Last, the reliability of gender identification in a linguistic corpus extracted from platform X (Twitter) poses significant challenges. Utilizing machine learning methods for gender labeling warrants further investigation, which we intend to explore in detail in a forthcoming article (Błaszczykowski et al., 2025).
