Abstract
Keywords
Introduction
Since the beginning of the 21st century, the discourse surrounding social responsibility has emerged as a paramount focal point within the sphere of social organizations but also within the corporate sector (Corvo et al., 2021). Some European guidelines, namely those issued by the European Commission (2014a), have reinforced the importance of this topic in society, mandating that large entities start reporting on their environmental and Social Impact (SI) within their management reports. Among those European guidelines, two of them can be highlighted. The EU Sustainable Finance Disclosure Regulation (SFDR) 2019/2088 aims to integrate ESG disclosure and reduce information asymmetries in the financial services sector (Bengo et al., 2022). Regulation EU 2020/852 establishes a common language for sustainability and aligns criteria for determining sustainable economic activities. In particular, article 8 focuses on the disclosure of information about financial products that respect and meet socio-environmental characteristics, including the methodology used to measure these characteristics (Bengo et al., 2022).
This involves the acknowledgment of value beyond monetary value and corresponds to a paradigmatic change towards a broad approach of an enterprise’s activity. The proliferation of organizations dedicated to addressing complex social issues that have been historically neglected or inadequately addressed by public entities explains the emphasis on SI and the growing interest on social entrepreneurship and its impact (Kah and Akenroye, 2020). This is also the case of the academic-scientific community that presented different studies on the subject (Bernardino and Santos, 2020).
This paper aims to conduct a benchmark of Social Impact Assessment (SIA) models, highlighting their characteristics, strengths, and weaknesses. The objective is twofold: first, to assist professionals in the field in selecting the most suitable model based on their specific needs. Second, to determine if a single method encompasses the most important characteristics outlined in literature.
There are many SIA models, with different assumptions, leading to results that become incomparable between organizations or between types of beneficiaries. Drawing upon the literature review, three core characteristics have been identified as essential to a robust approach to social impact assessment: the quantitative dimension, comparability, and exhaustiveness. These characteristics reflect fundamental requirements consistently emphasized in the field for conducting effective and credible evaluations. Collectively, these characteristics serve as foundational pillars for comprehensive and methodologically sound impact measurement.
(1)
(2)
(3)
The paper is organized into the following sections: the second section presents a review of the literature on the subject, including methodologies for SIA, their main characteristics, and limitations; section three outlines the methodological options employed in the research; section four presents the results and discusses the findings. The final section considers the results in the context of the existing literature, culminating in the conclusion of the study.
Review of the literature
Humanity faces increasingly complex and interrelated challenges concerning production and consumption systems, including climate change, biodiversity loss, social inequality, and the overexploitation of natural resources. In this context, sustainable entrepreneurship emerges not only as a driver of value creation but also as a catalyst for systemic change. It plays a critical role in promoting environmental resilience, advancing inclusive and equitable economic development, and fostering innovation that contributes to the achievement of long-term sustainability goals (Matzembacher et al., 2019).
In the second half of the 20th century, managers began to assume the responsibility of balancing the maximization of their profits with the creation of social value in the communities in which they operate (Carroll, 2009). Managers have begun to recognize that high Environmental, Social, and Governance (ESG) performance is associated with better financial returns (Bengo et al., 2022). The discourse surrounding companies’ intentions regarding Corporate Social Responsibility (CSR) began, delineating perspectives spanning from profit-centric orientations to those prioritizing the generation of social value (Maas and Liket, 2011).
Companies observe a transformation in the discourse surrounding SI and evaluation methodologies, signifying an evolution in both expectations and the linguistic frameworks employed (Arvidson and Lyon, 2014). Enterprises promptly recognized that the concept of value extends beyond conventional financial goals (Corvo et al., 2021). Therefore, a shift has been observed from the Single Bottom Line (economic value) paradigm to a Multiple Bottom Line approach that encompasses considerations across economic, social, and environmental dimensions (Maas and Liket, 2011).
The Multiple Helices framework also helps to understand the multidimensional nature of social impact. Originating from the Triple Helix model, which emphasizes collaboration between academia, industry, and government, later expansions such as the Quadruple and Quintuple Helix add the roles of civil society and the environment, respectively. These models underscore that sustainable social impact is not generated in isolation but emerges through dynamic interaction among key societal sectors (Peris-Ortiz et al., 2016).
From the perspective of social organizations and companies engaged in socially responsible or green innovation projects, the exigency of complex social and environmental challenges – combined with competition for financial and human resources – drives the need for greater efficiency, effectiveness, and impact orientation. While social organizations often lead in addressing unmet societal needs, an increasing number of private firms are integrating social and environmental objectives into their business models, aligning with ESG criteria and sustainable innovation frameworks (Bernardino and Santos, 2020).
In summary, there is a growing recognition that attentiveness to social issues can yield favorable effects on corporate financial performance and a growing number of social investors and mutual funds use social criteria to evaluate their investments (Epstein and Roy, 2001).
The emphasis on SI has experienced an escalation, driven by the promulgation of policy documents, legislative initiatives, and an intensified focus on achieving the well-known sustainable development goals (Kah and Akenroye, 2020). The academic-scientific community has also demonstrated a growing interest in social entrepreneurship and its impact on society, resulting in a significant increase in publications that explore the SI of social entrepreneurship initiatives (Bernardino and Santos, 2020).
SIAs (SIA) have emerged as a discrete discipline within the broader field of impact assessment, concentrating on the social dimensions of sustainable development and evaluating the ramifications of planned interventions on communities (Arce-Gomez et al., 2015). Measuring SI must evolve into an integral component of an organization’s operations and decision-making processes (European Commission, 2015).
SIA spans a spectrum ranging from qualitative, subjectively derived measures to multidimensional quantitative metrics (Öncer, 2019). Measurement must be relevant, useful, simple, natural, certain, understood and accepted, transparent, well-explained and supported by evidence. SI should cover the impact on communities, both short and long-term, and distinguish between direct and indirect SI (European Commission, 2014b). SIA must offer a thorough evaluation of SI, encompassing diverse dimensions such as demographics, individual well-being, community welfare, employment dynamics, and job satisfaction. Furthermore, SIA needs to be exhaustive in addressing the various stakeholders involved, encompassing evaluations of the impact on employees, suppliers, customers, and the broader local community (Miller et al., 2007). SIA aims to consider qualitative and quantitative aspects, addresses proportionality and relevance, and adapts to the evolving nature of social enterprises (European Commission, 2014b).
Evaluation serves as a valuable tool for leaders to demonstrate the positive impact of interventions (Lee et al., 2021). Demonstrating the value created and being accountable for actions and performance are crucial aspects for social initiatives (Bernardino and Santos, 2020). Assessments serve as a tool to identify, measure, and demonstrate the net benefits or damages resulting from an organization’s activities, as well as the effects on its stakeholders (Barby et al., 2021).
Recent organizations tend to place greater value on evaluating results, realizing benefits such as greater effectiveness and the efficient use of resources (Bernardino and Santos, 2020), visibility, the increase of the likelihood of attracting financial support for the project (Lee et al., 2021), and future funders and donations (Arvidson and Lyon, 2014) and attraction of employees and volunteers (LeRoux and Wright, 2010). The positive effects on employee motivation, retention and recruitment, cost savings and increased revenue, customer attraction and retention, reputation, and employer attractiveness are clear (Weber, 2008). Among the main motivations for measuring impact are the improvement in demonstrating transparency, responsibility, and organizational legitimacy to stakeholders (including investors) (Lall, 2019).
Firms undertaking green innovation initiatives benefit from SIA models by systematically evaluating the environmental and social outcomes of their projects. These models support the identification of long-term value creation, enhance sustainability reporting, and provide measurable evidence of impact to regulators, investors, and customers. By integrating SIA into their innovation processes, companies can align their strategies with global sustainability goals and strengthen their competitive advantage in environmentally conscious markets (Liu et al., 2024).
The field of SIA has evolved over time, with contributions from the academia, international organizations, and professionals, leading to the development of multiple methodologies for impact assessment (Esteves et al., 2012). There has been an emergence of new models related to sustainability, including models focused on ESG performance, as well as assessments aligned with the United Nations Sustainable Development Goals (SDGs) (Corvo et al., 2021). Various metrics, including cost-benefit analysis, ranking systems, and social accounting and audit, are employed to fulfil the diverse information requirements of different stakeholders. The choice of the metric depends on the specific information needs and objectives of each stakeholder (European Commission, 2015). In the following paragraphs, a selection of some of the most renowned methodologies is presented while Table 1 offers a comparative overview highlighting their main characteristics, strengths, and limitations.
Overview of the key strengths and limitations of the most widely recognized methodologies.
Scientific articles within the nonprofit sector predominantly concentrate on case studies detailing the implementation of Social Return on Investment (SROI). SROI analysis involves calculating the financial and social value of a project, determining the relationship between benefits and costs, and using financial proxies to estimate the social value of non-tradable goods (Öncer, 2019). These types of economic models may not fully capture the nature and full impact of the activities (Antadze and Westley, 2012).
The Balanced Scorecard (BSC) is presented as a performance measurement tool that balances short-term financial goals with long-term strategic objectives and that considers both financial and non-financial measures (Öncer, 2019).
In a Cost-effectiveness analysis (CEA), the costs associated with each intervention are measured and compared against their respective outcomes or benefits. This methodology may not capture intangible impacts (Antadze and Westley, 2012).
Cost-benefit analysis (CBA) is based on monetary valuation and has methodological and conceptual limitations when applied to non-monetary goods (Antadze and Westley, 2012).
The well-known Theory of Change is a framework that explains how activities lead to a series of outcomes, ultimately achieving the desired impact. The flow includes resources, activities, outputs (points of contact with beneficiaries), results (changes achieved in the lives of beneficiaries), and impact (the extent to which results are attributable to the specific activities) (European Commission, 2014b).
The creation of countless models involving measurement processes shows the extent to which there is a clear need to evaluate this social reality, which appears to be complex and multidimensional. An abundance of confusing initiatives has thus contrasted with the need for a simplified and effective measurement system (Barby et al., 2021). The European Commission and many other organizations have started efforts to develop methodologies for measuring SI, but no definitive solution has yet been found (Costa and Pesci, 2016). In other words, despite the multiple efforts, there is a lack of widespread adoption of SIAs in evaluating projects and no methodology has become a universal consensual reference (Burdge, 2002), even though that common logic is clearly needed (Costa and Pesci, 2016).
There is no single measure that is applicable for the whole sector of nonprofit organizations and able to capture changes adequately (Pennerstorfer and Rutherford, 2019). Besides, there is also a lack of clear classification and comparison of these SIA models (Grieco et al., 2015).
First, the lack of a common language makes it difficult to debate and adopt SIA models among professionals (Corvo et al., 2021). There is no universal definition of the concept of SI, with variations in terminology observed between academic fields. The conceptual nature of SI requires flexibility and the consideration of multiple perspectives, stakeholder engagement and context-specific approaches (Lee et al., 2021). Although the term “impact” in the social sector lacks a consistent definition, it is associated with lasting changes in people’s lives and it addresses the root causes of social problems (Ebrahim and Rangan, 2014).
Burdge and Vanclay (1996) state that by SI we mean the consequences to human populations of any public or private actions that alter how people live, work, play, relate to one another, organize to meet their needs, and generally act as members of society. Latané states that by SI we mean any of the great variety of changes in physiological states and subjective feelings and emotions, cognitions and beliefs, values, and behavior, that occur in an individual, human or animal, as a result of the real, implied, or imagined presence or actions of other individuals. Freudenburg states that SI refers to impacts (or effects, or consequences) that are likely to be experienced by an equally broad range of social groups as a result of some action. For Gentile, SI is the wider societal concern that reflects and respects the complex interdependency between business practice and society (Maas and Liket, 2011).
The International Association for Impact Assessment (IAIA), considered the leading global authority on impact assessment best practices, defends that SI Assessment are “intended and unintended social consequences, both positive and negative, of planned interventions (policies, programs, plans, projects) and any social change processes invoked by those interventions” (IAIA, n.d.).
On the other hand, the European Commission has a different definition regarding SI: “The reflection of social outcomes as measurements, both long-term and short-term, adjusted for the effects achieved by others (alternative attribution), for effects that would have happened anyway (deadweight), for negative consequences (displacement), and for effects declining over time (drop-off)” (European Commission: Directorate-General for Employment, Social Affairs and Inclusion, 2014).
For Impact Europe (formerly European Venture Philanthropy Association—EVPA), SI is “the attribution of an organization’s activities to broader and longer-term outcomes, which are in turn defined as the changes, benefits, learnings, or other effects (positive or negative, both long and short term) that result from an organization’s activities” (Impact Europe, n.d.).
On the other hand, Social Value International (SVI) states that Impact is the difference between the outcome for participants, considering “what would have happened anyway, the contribution of others, and the length of time the outcomes last” (Social Value International, 2015).
Regardless of the different perspectives that are observed, there are common elements between them, such as the focus on the long-term changes and on the needs of society with regard to environment, human rights, economic development, education, citizenship, and health (Matzembacher et al., 2019).
Second, the evaluation of social performance is challenging due to the difficulty of establishing causal links between different interventions (Ebrahim et al., 2014).
Third, the interdisciplinary nature of SI measurement: different paradigms and levels of measurement are considered in the academic debate, where there is a clear diversity of beliefs and preferences for measurement methods (Molecke and Pinkse, 2017). Measuring SI must consider the richness and complexity of the construct, including multidimensional approaches. For example, assessing the impact of a youth development program may require indicators that go beyond participation rates or academic improvement. It might also involve capturing changes in self-esteem, sense of belonging, or aspirations for the future—dimensions that are subjective, and evolve over time. Similarly, a community housing initiative may be evaluated not only by the number of homes built, but also by changes in residents’ sense of safety, neighborhood cohesion, and access to services. Addressing such complexity and multidimensional reality often requires a combination of quantitative indicators (e.g., changes in income levels or employment) and qualitative methods (e.g., interviews, case studies, or life stories) to truly capture the depth and nuance of social change (Rawhouser et al., 2019).
Fourth, a conceptual challenge, the oversimplification of complex issues: SI is difficult to measure as it is abstract and not easily captured by metrics, involving a complex and abstract phenomenon (Antadze and Westley, 2012). SI results from emotional actions, help, kindness, aimed at protecting the most vulnerable. For this reason, SI is associated with psychological or emotional concepts, such as “well-being,” “personal fulfilment,” “autonomy,” “faith in the goodness of others,” and other similar ones. These are concepts seen as “qualitative,” in which the generalization of the group is avoided in favor of the centralization of the single individual. There is, therefore, a certain resistance in measuring this “qualitative” concept, for fear of approaching this enormous emotional dimension, often considered not visible, in a reductive way. This resistance is reinforced by the fact that social areas do not commonly interconnect with analytical areas.
Fifth, the challenge of proportionality: in measuring impact there is a challenge of costs that is not compatible with most dimensions of social organizations. Measuring impact requires specialized knowledge (European Commission, 2014b), longitudinal studies and resource allocation that may be beyond the capabilities of operating organizations and funders (Ebrahim and Rangan, 2014). Some social entrepreneurs consider SI as immeasurable and consider the cost and effort of implementing formal methodologies to be unrealistic (Molecke and Pinkse, 2017).
But these challenges are also faced by governments in assessing social value due to limited resources, public funding constraints, and growing social needs (Corvo et al., 2021).
Sixth, the challenge of time: the importance of considering the time intervals in which activities lead to results, especially when results manifest outside the observation period. It is therefore crucial to measure the long-term impacts of activities designed to benefit society (Rawhouser et al., 2019).
Seventh, social organizations frequently adhere to conventional management methods and may have established structures rooted in tradition. Thus, the introduction of SI measurement and evaluation processes can create internal discomfort and resistance within organizations. Even if organizations understand these assessments as enabling them to secure funding, assessments can threaten autonomy, create anxiety, and challenge organizational values.
Eighth, the potential problem of double counting and cumulative effects (Arce-Gomez et al., 2015) when measuring indirect social effects in an evaluation. Certain social effects may be counted multiple times in different parts of the assessment (Zimdars et al., 2018).
Ninth, clear importance is given to comparability and consistency in measuring SI, where the need for a harmonized approach must consider diversity between sectors. Social enterprises struggle to determine whether the numbers represent marginal or breakthrough success due to the absence of a point of comparison (Molecke and Pinkse, 2017).
Tenth, SIA methods must consider a more comprehensive and inclusive measurement, considering all stakeholders throughout the entire measurement process (Costa and Pesci, 2016). Community engagement helps organizations address problems and opportunities, obtain valuable feedback, improve community relations, and enhance performance in areas of community interest (Epstein and Roy, 2001). Involving citizens in evaluating results leads to reducing conflicts (Arce-Gomez et al., 2015) and to improving their lives (Soler-Gallart and Flecha, 2022). Disadvantages and challenges of participatory approaches include the need for clarification, management of diverse stakeholders, and determination of realistic SI (Arce-Gomez et al., 2015).
Other challenges include limitations in terms of incomplete data and the need to improve access to data (Antadze and Westley, 2012) and the disconnection between metrics and the reality on the ground (Molecke and Pinkse, 2017). Given the diversity of challenges presented, Table 2 provides a consolidated overview by grouping them into thematic categories, allowing for a clearer understanding of the main conceptual, methodological, operational, and contextual barriers to social impact measurement.
Classification of key challenges in measuring social impact.
Different entities may require different methods tailored to their specific activities, objectives, and impact aspects they need to measure (Maas and Liket, 2011). Impact investors are looking for uniform measurement standards, but a balance between uniformity (ensuring comparability) and relevance is crucial, enabling specific purposes while enabling sectoral analysis.
Social entrepreneurs often rely on a combination of elements from several methodologies, rather than fully committing to a specific formal methodology for measuring SI. The idea of a universal “gold standard” is often criticized (Costa and Pesci, 2016). Thus, social entrepreneurs turn to “bricolage,” a combination of material and ideal practices, allowing to overcome the limitations of formal methodologies and be more aligned with their own understandings and experiences (Molecke and Pinkse, 2017).
The variety of models and the absence of a single model that suits all organizations challenge managers and academics. Taking a step behind, classification systems have been proposed to help organizations understand the main characteristics of models. They allow the identification of macro categories of models based on several variables, offering more personalized options for social entrepreneurs (Grieco et al., 2015).
Dimensions such as purpose, deadline, orientation (input or output), duration of time (short or long term), perspective (micro, meso, or macro), approach (process methods, impact methods, monetization methods), and generalization of application (multi-sector vs. single sector) (Rawhouser et al., 2019) are described as factors that differentiate methods. These characteristics influence the purpose, the focus, and the results of measurement. Some methods can assess impacts prospectively, focusing on the expected outcomes of planned programs. On the other hand, Öncer (2019) identifies seven analysis variables for mapping SI assessment models: data typology, impact typology, purpose, model complexity, sector, period, and entity. Studies focus on various aspects of SIA models, including their measurability, purpose, data viability, sectoral applicability, function (e.g., monetization, evaluation, management), and usability of tools to measure SI (Corvo et al., 2021).
Materials and Methods
The work will be segmented into two sections: (1) mapping and classifying SIA models; and (2) mapping the indicators related to these models. In the latter phase, efforts will be made to harmonize indicator nomenclature through a combination of manual and automated methods.
The study type comprises a Literature (Narrative) Review followed by a classification of methodologies and a listing of corresponding indicators.
The databases employed for sourcing methodologies encompassed Google Scholar, Scopus, ResearchGate, and EBSCO Host. The investigation utilized a set of keywords, including “social impact” AND “measurement” OR “assessment’ OR “analysis” OR “metrics” OR “analytics” OR “evaluation.”
Criteria for inclusion and exclusion of methodologies
Employing a comprehensive literature review, a classification of the existing SIA models was carried out. The inclusion criteria encompassed methodologies utilized by social organizations, private enterprises, and public entities for measuring impact on individuals, thereby excluding assessments on entities and on organizations themselves. It also considered both internal impact (on employees, on customers, and on suppliers) and external impact (on the community).
There were excluded from the mapping methodologies (1) based on formulas or generalized statistical models not specialized in the social sector (

Selection of methodologies and indicators for inclusion in the study, based on the established inclusion and exclusion criteria.
Dimensions of the classification table
Therefore, a classification table was created mapping the following dimensions, for each methodology: Methodology name, Brief explanation, Type of methodology, Formula, Result, Formula details, Result details, Areas, Stakeholders, No. of fixed indicators, No. of example indicators, Pre-defined indicators, Type of organization, Strengths, Weaknesses, Impact indicators (example), Observations and Source (Table 3).
Classification table structure: dimensions and fields.
The type of methodology is divided into seven categories:
While overlaps may exist (e.g., a standard that also includes a list of indicators), categorization was based on the
Indicators mapping
Simultaneously, a supplementary classification table was compiled containing a mapping of indicators utilized by the methodologies under consideration.
Indicators serve as indispensable tools in measurement, furnishing quantifiable benchmarks that monitor progress and provide insights into the efficacy of initiatives. They guide informed decision-making and facilitate continuous improvement efforts (Pennerstorfer and Rutherford, 2019).
Indicators were excluded according to the following criteria:
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
Applying these exclusion criteria, the initial pool of 7197 indicators was narrowed down to 1310. The final list included 1361 indicators, as some of the 1310 indicators were broken down into multiple indicators (Figure 1).
The mapping included the following identification: Methodology name, Stakeholder, Indicator area, and Indicator name. Only indicators associated with the previously mapped methodologies were included, that is, there were excluded isolated indicators that organizations use in their reports.
Indicator harmonization
After the establishment of the classification table incorporating indicators aligned with the mapped methodologies, a harmonization process ensued. This procedure entailed harmonizing the nomenclature and concepts of the numerous indicators under scrutiny. Failure to undertake this harmonization would have stopped progression to the subsequent phase: descriptive analysis.
The harmonization process employed two distinct methodologies: manual and automatic. Manual harmonization confers the advantage of assimilating pre-existing knowledge accrued during the investigation, albeit it remains susceptible to human error and bias. Conversely, the automatic methodology, drawing upon thousands of published databases within the domain, offers the benefit of impartiality. Nevertheless, its reliance on an automated approach may not invariably align with the specificity of this problem. The overarching objective is to place alongside these two methodologies and integrate their outcomes into a unified mapping, facilitating subsequent dimensional aggregation. The results obtained manually and through automatic techniques were compared and compiled into a single harmonization, regarding their name, the stakeholder they refer to, and the impact area they pertain to.
To automate the harmonization of indicators, a Text Mining model was employed.
Specifically, Topic Modeling, a technique within the domain of Natural Language Processing (NLP), was deployed. Topic modeling, an unsupervised learning method, endeavors to discern abstract “topics” or themes within the corpus of text, based on probabilistic latent semantic analysis (Nikolenko et al., 2017). Specifically, the BERT (Bidirectional Encoder Representations from Transformers), the state-of-the-art natural language processing model introduced by Google in 2018, was utilized. To obtain textual representations in a vector space, the BERT model was utilized as a foundational methodological framework. BERT was selected due to its widespread adoption and recognition as a state-of-the-art model for several years. Its ability to generate contextualized representations of textual content provides a more accurate semantic understanding—particularly valuable for analyzing the nuanced and domain-specific language of social impact indicators (Arora et al., 2020).
Subsequently, the Uniform Manifold Approximation and Projection (UMAP) technique was employed for dimensionality reduction, allowing the transformation of high-dimensional textual data into a lower-dimensional space while preserving the underlying structure and relationships. UMAP was employed as it more effectively preserves both local and global data structures. This characteristic ensures that semantically similar indicators remain closely positioned after projection, while dissimilar indicators remain distinct—an essential property for meaningful topic modeling (Wang et al., 2021).
Additionally, Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) served as an advanced clustering algorithm to discern meaningful groupings within the reduced-dimensional space, thereby facilitating the identification of latent patterns and associations inherent within the textual data corpus. HDBSCAN was selected for its high robustness to noise and outliers. These characteristics make it especially well-suited for clustering high-dimensional, unstructured textual data, particularly in contexts where the number of underlying topics is not known in advance (Campello et al., 2013).
BERT, grounded in the Transformer architecture, undergoes pre-training on numerous volumes of text data (3.3 billion words) via unsupervised learning tasks such as Masked Language Modeling (MLM) and next-sentence prediction, alleviating the unidirectionality constraint (Devlin et al., 2018). The Transformer architecture is based on the mechanism of self- attention, which allows the model to weigh the importance of different words in a sentence during processing (Vaswani et al., 2017). This conceptually simple and empirically powerful bidirectional approach (left-to-right language model, that is, capturing contextual information from preceding and following words in a sentence) (Vaswani et al., 2017) enables it to encapsulate rich contextual information by representing each word within a sentence based on its surrounding context (Devlin et al., 2018).
Specifically, the study utilized the MiniLM model, a refined iteration derived from the BERT model, to enhance its ability in identifying semantically similar sentences. Developed by Microsoft, MiniLM employs various techniques to reduce the number of parameters compared to BERT, thereby enhancing its scalability and enabling fast performance across resource-constrained environments (Wang et al., 2020).
As previous referred, two sub-models were instrumental in this endeavor:
UMAP (Uniform Manifold Approximation and Projection): a novel developed method for dimensionality reduction that exhibits nonlinear properties. Its primary objective is to reduce high-dimensional datasets into lower-dimensional representations while retaining both the local and global structural characteristics of the original data (McInnes et al., 2018). UMAP distinguishes itself through its commendable computational efficiency, allowing for swift execution even with substantial datasets. Moreover, it offers a degree of adaptability through Parameter Tuning, enabling users to adjust the balance between preserving local and global structures according to specific requirements. Notably, UMAP demonstrates robust scalability and adeptness in managing extensive datasets, owing to its implementation of an approximate nearest neighbor search algorithm, a feature that facilitates the handling of datasets comprising millions of data points.
HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise): a density-based clustering algorithm tailored for high-dimensional data. HDBSCAN constructs a hierarchy of clusters based on varying density levels, subsequently identifying clusters with the highest stability as the final clusters (Malzer and Baum, 2020). This methodology is particularly suited for datasets characterized by irregular shapes and varying densities (Asyaky and Mandala, 2021). HDBSCAN utilizes the following parameters:
• min_samples = the minimum number of neighbours requisite for a core point, with higher values leading to the exclusion of more points as noise/outliers (DBScan component of HDBScan). Hence, HDBSCAN is able to identify clusters of points that are surrounded by a large amount of noise or outliers.
• min_cluster_size = minimum size a final cluster can attain, with larger values resulting in larger clusters (H component of HDBScan).
Several values for these parameters were tested, ultimately leading to the decision to create one model with a low value (5) and another with a high value (50), integrating both sets of results.
The employment of dimensionality reduction and clustering models assumes paramount significance in harmonizing the extensive array of analyzed indicators, facilitating the reduction of the original data’s dimensionality. Moreover, the grouping of analogous concepts facilitates the translation of apparently disparate concepts sharing common ideas.
The classification tables were created in Microsoft Excel. The automated Harmonization methods were developed in Python version 3.11. The figures were generated using Excel (Figure 2), Python (Figure 3), Venngage (Figures 4–7), and Excalidraw (Figure A1).

Distribution of methodologies, by type.

Distribution of methodologies based on the stakeholders they target.

Distribution of indicator areas concerning the stakeholder

Distribution of indicator areas concerning the stakeholder

Distribution of indicator areas concerning the stakeholder

Distribution of indicator areas concerning the stakeholder
The original data presented in the study (both Python Code and Methodologies and Indicators datasets) are openly available in https://github.com/ [The full link was not provided to maintain the anonymity of the article, but the information was included in a different file submitted.].
Results and discussion
Examples of the most well-known and analyzed methodologies in the literature include the Social Return on Investment (SROI), the London Benchmarking Group (LBG), the B Impact Assessment, the Balanced Scorecard (BSc), the Social Costs-Benefit Analysis (SCBA), and the Robin Hood Foundation Benefit-Cost Ratio. The full list of the analyzed methodologies is presented in the Appendix (Table A1).
Methodologies classification: Quantitative dimension, comparability and exhaustiveness
284 methodologies were mapped with the previously identified dimensions. 140 methodologies were excluded based on the exclusion criteria. As a final list, 144 methodologies were mapped. The SIA models classification allowed to obtain 7197 indicators (Figure 1). To our current knowledge, Corvo et al. (2021) (Corvo et al., 2021) have analyzed the largest number of methodologies in a single article, examining 98 methodologies. This record is followed by Grieco et al. (2015) (Grieco et al., 2015), who analyzed 76 methodologies. Therefore, this study surpasses the number of methodologies mapped and compared in scientific articles, totaling 144 methodologies.
A total of 74 out of the 144 methodologies (51%) are considered Quantitative in nature. The remaining methodologies were distributed as being Qualitative (
As a specific formula, it refers to a metric that is obtained through a set of mathematical operations that are not typically used such as average or percentage (Figure 2).
Only 42 of the 144 (29%) methodologies are aimed to all stakeholders, while the rest are only aimed to some of them (such as, for example, employees and customers or the community in general). 125 (87%) of the methodologies are applicable to the community in general, 77 (53%) measure the impact on organizations or companies globally, 105 (73%) measure the impact on employees, and 67 (47%) measure the impact on customers or suppliers (Figure 3).
Of the 74 quantitative methodologies, 15 were exhaustive, that is, meeting all stakeholders. Of these 15, only 4 include pre-defined indicators, a fundamental characteristic for comparability. However, 3 of these 4 are only applicable to companies and not to non-profit organizations. The remaining methodology does not allow applicability to any area of intervention: it only includes indicators relating to the economic growth and innovation of organizations.
Thus, no SIA model was found that simultaneously exhibits the three fundamental characteristics: quantitative nature, exhaustiveness, and comparability.
Indicators mapping and harmonization
Regarding the analysis at indicator level, of the 7197 indicators initially listed, 5887 were excluded. Of the 1310 indicators that remained from the screening according to the exclusion criteria, some were divided into multiple ones when they represented different concepts, resulting in a final list of 1361 indicators (Figure 1).
The indicators underwent harmonization regarding their nomenclature, initially through manual methods and subsequently via Text Mining techniques, as delineated in the methods section. The results obtained manually and through automatic techniques were compared and compiled into a single harmonization, regarding their name, the stakeholder they refer to, and the impact area they pertain to.
The 1361 indicators from the final list were distributed according to the stakeholder they represented and their impact areas, having obtained the distribution shown in Figures 4 through 7. The size of the circles represents the proportion of indicators that each area includes, compared to the others. The number of indicators presented (N) in each circle refer to the count of unique indicators within each area and each stakeholder. They totalize 153 unique indicators.
Conclusion and future directions
Since the onset of the 21st century, the discussion regarding social responsibility has become a central focus not only for social organizations but also within the corporate sector (Corvo et al., 2021). The creation of numerous models for measuring impact underscores the clear necessity to evaluate this complex and multidimensional social reality (Barby et al., 2021).
A universal measure that comprehensively captures changes across the nonprofit sector remains elusive (Burdge, 2002; Pennerstorfer and Rutherford, 2019). Moreover, there exists a deficiency in the clear classification of these models for assessing SI (Grieco et al., 2015).
The literature outlines numerous challenges associated with measuring SI. First, a historical reason: there has not been a progressive integration of the strengths of each model into a single one, but rather a new methodology independent of the previous ones has always been created, which does not allow for the evolution and robustness of a single model. Second, institutions of transnational responsibility (such as the European Commission) have difficulty in adopting an official measurement or even supporting the creation of a reference in this area (Costa and Pesci, 2016). In part, this difficulty arises from the lack of a consensual definition of SI (Corvo et al., 2021). Without this prior harmonization work, it is not possible to progress towards a single model. It will be very important to accept a definition for the concept of SI that is as comprehensive from a theoretical point of view as it is based on concrete and specific characteristics of each reality. Third, a conceptual reason: SI results from emotional actions, help, kindness, aimed at protecting the most vulnerable. SI is associated with psychological or emotional concepts, such as “well-being,” “personal fulfilment,” “autonomy,” “faith in the goodness of others,” and other similar ones. These are concepts seen as “qualitative,” in which the generalization of the group is avoided in favor of the centralization of the single individual. There is, therefore, a certain resistance in measuring this “qualitative” concept, for fear of approaching this enormous emotional dimension, often considered not visible, in a reductive way (Antadze and Westley, 2012). Fourth, different entities may require different methods tailored to their specific activities, objectives, and impact aspects they need to measure. Social entrepreneurs often rely on a combination of elements from several methodologies, rather than fully committing to a specific formal methodology for measuring SI (Costa and Pesci, 2016).
These challenges give rise to significant concerns: on one hand, the importance of considering all stakeholders, realizing that SI does not result only from specific actions from social sector entities within the community, it is also a consequence of the daily activity of institutions on employees, suppliers, and customers (Exhaustiveness) (Costa and Pesci, 2016; Miller et al., 2007). On the other hand, the importance of using analytical knowledge to transform emotional concepts into indicators that can be observed and, consequently, into quantitative variables (Quantitative dimension) (Grieco et al., 2015). Also the importance of measuring every dimension, otherwise there is no comparability (in space, in time, by type of promoting entity, by type of action, by type of beneficiary) (Molecke and Pinkse, 2017). Lastly, the importance of creating observation and collection methods that can be relatively quick and financially accessible, making the model more operational and capable of being applied by a maximum number of users (Proportionality) (European Commission, 2014b). To this end, it is very important to know the statistical indicators available, nationally and internationally, to avoid specific collections through surveys, although these must always exist. Some common sense is also necessary in the decision regarding the total number of indicators because although the definition of SI is multidimensional, especially if we consider it applied to several stakeholders, the number of indicators for each sub-dimension identified must be reduced to a minimum.
Given the challenges outlined, there is a clear and pressing need: developing robust classification systems that enable an understanding of the strengths and weaknesses of each model, and help professionals select the methodology that best aligns with their specific needs. Given this pressing need, our study first focused on this classification. Our analysis revealed that no SIA model simultaneously presents the three fundamental characteristics: quantitative nature, exhaustiveness, and comparability. This prompts inquiry into whether the absence of a universal reference stems from a pragmatic perspective, necessitating methodologies tailored to individualized contexts, or from the continued absence of a comprehensive model that integrates all fundamental attributes.
As a prospective path for further research, the proposition persists in interrogating the preceding inquiry. Should it be discerned that the absence of a universal methodology stems from the absence of one possessing the delineated attributes, there arises a pressing necessity to attempt toward its creation. Interviewing organizations spanning diverse maturity levels and operational domains would be instrumental in elucidating this inquiry.
This study presents as its main limitations, on one hand, the absence of the environmental component. This was deemed as an exceedingly comprehensive, multidimensional, and specific domain warranting a dedicated investigation. On the other hand, the exclusion of methodologies assessing the impact on organizational structure (but on individuals) may render this study somewhat restrictive. Nevertheless, once again, the intricacy and specificity of this domain necessitates an independent investigation.
This study represents an advancement in academic practice as it, on one hand, encompasses the most extensive array of models documented in any single academic work to date. On the other hand, it delineates fundamental characteristics for an impact measurement methodology and identifies those characteristics that, while highly significant, are absent from any existing methodology concurrently.
In summary, this study not only presents a comprehensive analysis of diverse methodologies for measuring SI but also underscores the imperative for future research to integrate essential yet currently absent criteria, thereby enhancing the efficacy and applicability of impact measurement frameworks in addressing complex societal challenges.
