Abstract
Keywords
Introduction
In the field of civil war recurrence, as elsewhere in the broader social sciences, an ever-growing amount of methodologically robust empirical data underpin a wealth of individually rigorous but cumulatively inconclusive studies. Fragmented findings reflect competing theoretical explanations, multiple causal mechanisms, different definitions and operalizations of concepts (Neudorfer et al., 2025), competing measurements, variety in the case studies and/or the contextual factors considered. As a result, it is becoming increasingly challenging to develop and test comprehensive hypotheses, formulate general theories and produce coherent policy recommendations.
In this paper, we offer a novel framework that combines the methodological strengths of existing multi-methods research designs. It allows researchers to
The MSMMF builds on three significant methodological advancements in the social sciences to provide three sets of innovations. First, valuable mixed-methods research designs have emerged to overcome a persistent divide between qualitative and quantitative approaches in fields where, ‘because of the [their] different fundamental assumptions, it is very difficult for in-depth studies of individual cases to communicate meaningfully with claims about mean causal effects across a large set of cases’ (Beach, 2020: 163). Despite extensive progress, existing mixed-methods frameworks often remain skewed towards one end of the methodological spectrum, emphasizing either qualitative or quantitative techniques and reproducing ‘a larger methodological divide than commonly understood’ (Beach, 2020: 164). They also typically focus either on
While recognizing the usefulness and value of existing methodological tools, we propose that some research questions demand different research frameworks that combine the strengths of existing qualitative, quantitative and mixed-methods approaches, to
Second, the expansion and refinement of computational social sciences present unique opportunities for identifying general trends and themes in high-dimensional datasets, in which variables by far outnumber observations (Grimmer et al., 2021). The MSMMF adds machine learning (ML) into mixed-methods research designs. Previously overlooked in multi-methods research, we show that supervised ML can be combined with qualitative research techniques to develop robust hypotheses in fragmented and contradictory research fields. These hypotheses can subsequently be tested through appropriate statistical methods and explored through in-depth qualitative case studies.
Third, academics have increasingly engaged with policymakers, practitioners and potential research users to disseminate their research findings, among others, through in-person talks, workshops, consultations and podcasts. Policy engagement has been encouraged by funding bodies and research quality assessment exercises in countries such as the United Kingdom and Germany. We propose embedding engagement with practitioners and users at various stages of research, transforming them from recipients to research participants. In our field, we define practitioners as individuals who actively engage in the practical application of conflict prevention, mediation and peacebuilding. They are potential research users because they can employ our research findings to inform their activities, but they can also contribute to identify and refine research questions and evaluate the relevance of insights (Bobo et al., 2024). The MSMMF recognizes the crucial contribution that practitioners can make to academic insights (and
The potential to harness the strength of different mixed-methods approaches and to combine them with ML and systematic user engagement into the MSMMF became apparent when we started researching the question:
In this article, we aim to make the MSMMF more widely accessible, to contribute to the advancement of research methodologies, and to offer a new resource to researchers seeking a comprehensive and adaptable approach for their own questions. We proceed as follows. First, we map the field of civil war recurrence, explaining our interest in exploring the question:
Why do some peace processes bring an end to large-scale conflict-related violence while others do not? Identifying a methodological need
The field of civil war recurrence has been growing rapidly. Most of the existing research focuses either on a few case studies (e.g., Call, 2012) or on a comparative investigation of highly specific variables among the wide range of potential factors explaining civil war recurrence (e.g., Bara et al., 2021; Loyle and Appel, 2017; Pushkina et al., 2022; Ta-Johnson et al., 2022). This generates a wealth of data and a variety of (often conflicting) findings that do not comprehensively explain why some conflicts relapse into large-scale violence and others do not.
Three partly contradictory arguments emerge from the existing literature. Some scholars suggest that civil wars resume after peace accords primarily because of contextual factors. For example, a considerable body of research suggests that adverse economic conditions lead to a resumption of violence (e.g., Collier et al., 2008; Walter, 2004). Others highlight the characteristics of the previous conflict as reasons for civil war recurrence (e.g., Nilsson and Svensson, 2021). A third group of scholars explain civil war recurrence or non-recurrence focusing on peace accords, either examining how agreements are achieved or emphasizing what provisions they contain. For example, Quinn et al. (2013) find that third-party mediation has little long-term effect on the sustainability of peace, while Gurses et al. (2008) argue that mediation increases the prospects of longer-lasting peace. In terms of peace agreement provisions, the key debates concern the impact of provisions for power sharing (e.g., Hartzell and Hoddie, 2015; Horowitz, 2014; McGarry and O’Leary, 2004), territorial self-governance (e.g., Hale, 2004; Neudorfer et al., 2022; Weller, 2009) and transitional justice (e.g., Druckman and Wagner, 2019; Duursma, 2020; Leib, 2022).
As a whole, this growing body of literature presents widely divergent findings. While highly valuable for identifying the possible factors explaining the recurrence and non-recurrence of civil war, existing publications do not capture the overall picture of the complex settings of civil wars, or the impact of the interaction of multiple factors on the likelihood of civil war recurrence. As a result, a fundamental question remained unanswered:
We argue that this divergence partly exists because some studies focus on specific selections of explanatory factors, some focus on a small number of highly specific case studies, others rely on fundamentally different datasets and/or sources and yet others – perhaps most importantly – rely on single

Multi-Stage Mixed-Methods Framework at the intersection of different mixed-method approaches.
To address this methodological need, we built on existing mixed-method research designs to create the MSMMF, a novel framework that enables the combination of the two main stages of social science research: hypothesis development; and hypothesis testing. We do not want to eliminate the methodological and epistemological divisions which are a valuable part and parcel of the social sciences. However, we hope that our efforts will enable significant advancements in fields where existing theories and evidence are fragmented and contradictory (such as civil war research), and to provide accurate policy recommendations to interested policymakers and practitioners.
The MSMMF: Purpose and design
Figure 1 compares the key features of our novel MSMMF with those of existing
To ensure a genuine combination of

A step-by-step guide to the Multi-Stage Mixed-Methods Framework.
Stage I aims to develop robust, empirically grounded hypotheses. Steps 1–3 are essential to evaluate the suitability of the MSMMF for each specific research question. These steps are traditionally employed to map existing findings and identify generalizable patterns that can be formulated into hypotheses in the civil war literature.
Steps 4–7 describe the application of the MSMMF. Given the space limitations of a single journal article, we can only briefly map the four steps necessary to apply the MSMMF. In our application section below, we briefly describe the methods we chose to employ in our research project, with dedicated boxes focusing on practitioner engagement (Box 1); machine learning (Box 2); regression and survival analysis (Box 3); and congruence analysis and process-tracing (Box 4). In our application we did not carry out an original qualitative comparative analysis (QCA) study, therefore, we have not included a dedicated box on QCA. The boxes aim to summarize a method’s objectives, applications and suggestions for further reading, but are not exhaustive explanations of the individual methods (and combinations of methods) available to researchers at each step of the MSMMF.
As Figure 2 shows, researchers can employ a combination or a selection of qualitative and/or quantitative techniques at all stages of research. This flexibility aims to enhance researchers’ agency in choosing the approach(es) for each stage of research and how to sequence and combine multiple research methods. These techniques include ML (computational social science, aimed at pattern recognition), pilot case studies (qualitative research to identify potential causal mechanisms), regression analysis (traditional statistics aimed at pattern testing), QCA (aimed at pattern recognition) and congruence analysis and process-tracing (to explore and illustrate causal mechanisms).
The research question and available data will largely guide which methods are most appropriate for each individual research project. For example, to develop hypotheses (Step 5), ML is particularly suitable for datasets with low numbers of observations and high number of variables (generally, if the number of observations divided by 30 is smaller than the number of explanatory variables). If both the number of observations and the number of variables is low, QCA is particularly suitable for hypothesis development. Hypothesis development through pilot case studies may be more or less feasible depending on highly volatile context-specific socio-political conditions, especially in conflict settings. For all methods applied in the MSMMF, we trust that researchers will adhere to common practices in political science, including considerations of significance levels, model fit, truth tables and ethical research practices (see also the methods boxes for further details).
Due to the application of multiple methods concurrently, Step 5 of the MSMMF can result in three possible outcomes. Ideally, all the methods employed will identify similar or compatible hypotheses, allowing researchers to retain and test them in the next stage. Alternatively, different methods might identify diverging hypotheses. Such diversity is not problematic at the hypothesis development stage, so we recommend retaining all hypotheses for testing in Stage II of the MSMMF. Finally, the methods employed at Stage I may lead to extremely contradictory hypotheses. In this case, we recommend retaining all of these – however contradictory – hypotheses for hypothesis testing, as even less robust findings at Stage I will ultimately strengthen the rigour and explanatory power of the research at Stage II.
Step 6 in Figure 2 lists the methods that can be combined or selected to test and explore the hypotheses. For Stage II, we strongly suggest employing
Finally, Step 7 brings together the evidence to formulate overarching conclusions. Moran-Ellis et al. (2006) provide a useful illustration of different approaches to combine and synthesize findings that may guide researchers at this stage. Depending on individual preferences and research questions, integration of methods, integration of analysis and/or integration of theories may be appropriate to individual research projects (Moran-Ellis et al., 2006: 47–49). We recommend, at this stage, to thoroughly triangulate findings from the multiple research methods employed at Step 6 through the consultation with practitioners during dedicated user workshops.
The MSMMF embeds consultations with practitioners throughout the research process. These are signposted in Figure 2 as ‘user workshops.’ While we label these consultations as ‘workshops,’ their specific format may vary from project to project, and may encompass surveys, interviews, informal communications, etc. (see Box 1). In our case, alongside a series of bespoke workshops, we created a dedicated advisory board, and consulted the members at pivotal stages of research when decisions had to be made on case selection, hypothesis development, testing and dissemination of findings. Our engagement activities focused on individuals interested in employing evidence-based practices for peacebuilding, mediation and conflict prevention, but the population of users might vary depending on the specific research question. We identified interested practitioners through our professional networks, the existing literature and the text of peace agreements. We selected them based on their expertise and potential insights, but also ensured a diverse participation in our events to give voice to different perspectives (geographical, professional and gender). In total, we consulted approximately 100 practitioners, including officials from governments and international organizations, mediators, personnel working in one of the 11 countries in the Conditions of Recurrence Dataset (CoR-D, see also Figure 3R), practitioners in non-governmental organizations, civil society actors and academics. Our project had full ethical approval from our university’s research governance committee, and all participants consented for discussions to be included in our outputs without attributing individual remarks (Chatham House Rule). These sessions provided opportunities to transfer knowledge between academics and practitioners through an iterative and long-term process of repeated engagements, and ensured the relevance and visibility of our project.
Box 1. Practitioner and user engagement.
Encompassing a variety of activities to engage research users at different stages of research projects, including presentations, workshops, consultancy, secondment, briefings and dialogue, commissioning of research, submission of evidence, educational content, games and simulations (Bobo et al., 2024).

Universe of cases of the Conditions of Recurrence Dataset (for hypothesis development).
Application and findings
Our research project on civil war recurrence provided a real-world context for the development and application of MSMMF. In
We then carried out a review of the existing literature to identify the different arguments and explanatory factors considered in previous studies, as well as potential gaps (
In
One of the key innovations of the MSMMF is the systematic and repeated engagement with users and practitioners at all stages of research. Through a
In
For other research questions, where relevant large datasets already exist, it may be possible to employ an existing dataset for hypothesis development and testing, with three main stipulations. First, the dataset needs to be adaptable for ML, regression-based analysis (inferential statistics) and/or QCA, depending on the research question and on researchers’ methodological preference. The choice of method(s) in Step 5 of the MSMMF is not predetermined, but depends on the research question, on existing datasets and/or the availability of resources to build one. Different estimation methods might require slightly different datasets: QCA is better suited for datasets with a limited number of observations and variables (Marx, 2006); inferential statistics requires a random sample (Agresti, 2018); and ML needs a dataset large enough to be split into training and testing sets (James et al., 2013, see also Figure 4).

How to split existing datasets for the Multi-Stage Mixed-Methods Framework.
Second, a dataset (or subsection of a dataset) needs to be employed for the hypothesis development stage (DSg in Figure 2) and a
Third, when employing supervised ML at Stage I of the MSMMF, DSg should be further split into a training and testing dataset as per best practice in ML. The sequential splits are visually represented in Figure 4.
In
Box 2. Machine learning (ML).
Encompassing lasso and sparse regression, classification and regression trees, boosting and support vector machines. In supervised ML the dependent variable has already been labelled by humans. Unsupervised ML automatically groups observations without human intervention (James et al., 2013).
We carried out supervised ML in the form of decision trees (classification and regression trees or Classification and Regression Trees Analysis (CART), James et al., 2013) to identify factors consistently associated with our outcome of interest (the non-recurrence of civil war). Figure 5 summarizes the steps involved in applying CART in our own research. As a method, CART offers the advantage of being easily interpretable, making it accessible to both academics and practitioners without extensive statistical training. CART operates through recursive partitioning, a process that repeatedly divides data into groups that are as homogeneous as possible. In so doing, the algorithm identifies variables that best predict an outcome. The quality of the factors identified in the training dataset is then evaluated by using these factors to predict the dependent variable in the testing dataset. Good training models will have a high percentage of correctly predicted observations in the testing dataset. This procedure might be familiar to scholars working with maximum likelihood regression analysis, where the fit of an inferential statistical model is tested by examining the correctly predicted observations.

How to carry out Classification and Regression Trees Analysis.
In our case, CART identified UN leadership, and peace agreement provisions for the inclusion of women in post-conflict societies and for plural justice mechanisms as accurately predicting non-recurrence of civil war in the training dataset. These three factors also had a high predictive accuracy when employed to predict non-recurrence in the testing dataset. Based on supervised ML, we therefore identified three possible explanatory variables which appeared associated with the end of civil war in peace processes that experienced prior conflict recurrence: UN leadership of the mediation process; provisions to include women in post-conflict societies; and provisions for plural justice (visually represented in Figure 6).

Results of our Classification and Regression Trees Analysis.
Because of the COVID-19 pandemic, we could not travel and carry out fieldwork to develop hypotheses concurrently with supervised ML. Instead, we conducted remote expert interviews and online focus group discussions on all of the 14 peace processes in CoR-D. We also did not carry out an original QCA study, but instead we cross-referenced our CART findings with two recent QCA-based studies on the characteristics of resilient peace accords: Fontana et al. (2021b); and Pushkina et al. (2022). Our CART results were broadly consistent with their emphasis on the beneficial impact of third-party involvement in peace processes, and of the inclusion of formerly marginalized groups in peace accords. Finally, we convened a
In sum, in Stage I of the MSMMF, and especially in Steps 4–5, we developed hypotheses on the factors associated with the end of recurrent civil war: the CART suggested that UN-led mediation, provisions to include women in post-conflict societies, and plural justice provisions are associated with the end of recurrent civil war. However, these insights are not generalizable beyond the 14 peace processes in CoR-D and provide few insights on a specific causal mechanism, that is,
At Stage II of the MSMMF, we recommend testing and exploring hypotheses in parallel through quantitative and qualitative methods (
Box 3. Regression and survival analysis.
Survival analysis is used to estimate the time until an event – such as the recurrence of war – occurs. Cox survival analysis makes no assumptions about the hazard over time. This means that the hazard of failure could be increasing and then decreasing, or decreasing and then increasing, or remaining constant over time.
Regression analysis deals with continuous or categorical outcomes and is therefore appropriate to answer questions such as ‘will fighting resume after a peace accord?’ With a binary dependent variable, it is advisable to use maximum likelihood estimation. For interval-level variables, it is advisable to apply ordinary least squares regression.
Survival analysis deals with time-to-event data and is typically used for questions such as ‘how long does peace last?’.
In our case, we used a separate dataset for hypothesis testing due to the highly specific nature of CoR-D (which encompasses the whole universe of cases). This enabled us to accommodate the strong demand from practitioners to be able to test our findings on more cases. We selected the dataset of Political Agreements in Internal Conflicts (PAIC) (Fontana et al., 2021a) which best fulfilled our needs with respect to country coverage, time frame and existing control variables, but also captured more peace processes than CoR-D, including situations where negotiated settlements immediately led to the cessation of extensive violence. Following existing studies on peace agreements (Hartzell and Hoddie, 2007), we tested our hypotheses through Cox proportional hazard regression analysis. Our analysis confirmed that UN leadership and provisions for the inclusion of women in post-conflict society are associated with the end of civil war globally. Conversely, provisions for plural justice have no robust and significant relationship with the end of conflict-related violence beyond the 14 peace processes in CoR-D. To test robustness, we repeated the analysis by only including accords that experienced at least one previous relapse into conflict. These observations encompassed agreements addressing recurrent conflicts where no stable settlement was achieved, agreements concluded after 2015 (such as Colombia’s
In parallel with quantitative hypothesis testing, we engaged in qualitative research on selected case studies (
Box 4. Congruence analysis and process-tracing.
Process-tracing investigates the workings of the mechanism (s) that contribute to producing the outcome of interest by tracing the theoretical causal mechanism(s) linking explanatory factors and an outcome of interest. They are typically presented as ‘a stepwise test of each part of a causal mechanism, especially in the theory-testing variant’ (Beach and Pedersen, 2013: 5–6).
To identify suitable case studies, we ran a standard multivariate regression analysis of CoR-D, including the factors identified in our hypothesis development (
We subsequently carried out congruence analysis to determine
By applying the MSMMF to answer the question
Conclusion
In this article, we outlined a new methodological framework to combine hypothesis development and hypothesis testing into a coherent and robust multi-method research design: the MSMMF. The MSMMF contributes to methodological advancement in the social sciences in three respects. First, it combines the strengths of existing mixed-method approaches in a novel way, providing researchers with a rigorous framework to formulate theoretically sound hypotheses and derive empirically grounded findings. This enables researchers to tackle extant questions in fragmented and contradictory research fields. Second, the MSMMF pilots the use of supervised ML (computational social science) for hypothesis development, taking multi-methods designs into the machine-learning age. Third, the MSMMF embeds iterative opportunities for engagement with practitioners at all stages of the research process. This enables researchers to address questions of high policy relevance while ensuring that the insights of practitioners inform research (and
Despite the benefits of the MSMMF, there are some limitations to the framework and its applicability. Due to the variety of methods employed, the MSMMF is best suited to large research teams with diverse methodological expertise. While collaborative work has become increasingly common, the challenge of working across methodological cultures is well documented, so the successful application of the MSMMF requires cooperative and mutually supportive research teams. Alternatively, the MSMMF is suited to a researcher able to acquire an eclectic methodological expertise: methods training is a standard component of graduate programmes and additional expertise can be acquired through a variety of methods schools.
The MSMMF can also appear resource-intensive and time-intensive. However, its costs may be minimized by employing pre-existing datasets rather than generating new ones, where suitable ones are available. For example, there is a wealth of datasets on territorial self-governance (Neudorfer et al. 2025), and a lively but inconclusive debate on the impact of federalism, decentralization and autonomy on civil war occurrence and duration (Fontana et al., 2021a). The MSMMF could be employed to synthesize all existing findings and data into robust hypotheses, and then testing and exploring them systematically to advance the debate.
Finally, the MSMMF is not a silver bullet for tackling and answering all research puzzles. Social science is continuously evolving, and we see the MSMMF as contributing to this wider evolution by enabling a more joined-up, comprehensive research process. No methodological framework, including the MSMMF, should or could claim to solve all research questions. However, the MSMMF uniquely combines the strengths of case-based and variance-based mixed-methods approaches, embeds supervised ML and maps engagement with practitioners at all stages of research. Its added value is most apparent when tackling contradictory and fragmented research fields with a high policy relevance. The MSMMF may be less suited for more specific research questions that address the relationship between one specific factor and one specific outcome of interest, where researchers may use other existing research designs. Our application, however, shows that for research questions focusing on causes-of-effects, where there is a multiplicity of contrasting theories, data and empirical findings, the benefits of the MSMMF outweigh its costs.
In this paper we summarized how – through the MSMMF – we explored an important but still not conclusively answered question:
