Abstract
Keywords
Introduction
Scholarship at the intersection of migration studies and international relations has been increasingly interested in the transnational activity of diasporas in world politics (Collier and Hoeffler 2000; Østergaard-Nielsen 2001; Shain and Barth 2003; Kapur 2005; Adamson and Demetriou 2007; Orjuela 2008; Koinova 2009; Brinkerhoff 2016; Marinova 2017). Yet analysis that integrates relevant literature in systematic ways through the rigorous coding of large-scale migration data is still largely missing. This paper revolves around two major questions: How can mixed-methods techniques contribute to the evolution of a relatively new research program, such as the study of diaspora mobilizations in conflict processes? How can these techniques identify data patterns that could help establish meaningful analytical categories and factor in multi-sited complexity?
This Research Note presents a rich theoretically informed coding procedure that systematizes the analyses of four researchers working on a large-scale research project on conflict-generated diasporas in multiple European host-countries linked to conflicts in multiple countries of origin. Initially facing scattered literatures that could not adequately cover the complexity of diaspora mobilizations in breath and variability, we introduced a comprehensive coding of qualitative data followed by quantitative correspondence and cluster analyses, and isolated the profiles of diaspora entrepreneurs as an important analytical category. Such profiles allowed us to not confine the notion of diaspora to that of a group, but to highlight individual agency, which is much needed in scholarship and practice but difficult to arrive at without access to large-scale data.
This paper briefly reviews migration and international relations scholarship relevant to diaspora mobilizations and crucial to our integrative analysis. Further, we discuss how we conducted semi-structured interviews with diaspora political entrepreneurs in Germany, Sweden, and the Netherlands in 2012–2016. Interviews were systematically coded on the basis of 11 codebook sections, subjected to inter-coder procedures, and analyzed through cluster and correspondence analyses. We conclude by reviewing the impact of these early procedures on subsequent research products—a cross-national survey and a comparative book—that advanced the research program in a coherent way.
Diasporas in World Politics
At the outset of our investigations, several lines of research provided varying insights into diaspora activities in conflict, post-conflict, and development. A large-N World Bank study argued that if an intra-state conflict draws resources from an affluent US-based diaspora, conflict is perpetuated (Collier and Hoeffler 2000). These results turned contradictory, as a new quantitative study showed that large diasporas can significantly reduce conflict risks (Collier, Hoeffler and Söderbom 2008). Case studies of Armenian, Chechen, Kosovar, Sikh, Somali, and Tamil intra-state conflicts found that conflict resolution was resisted when diasporas sponsored rebel factions (Lyons 2006; Adamson and Demetriou 2007). Diasporas can also affect the purchase and smuggle of weapons, provide sanctuary for rebels, and draft soldiers from diaspora circles (Byman et al. 2001). Other scholars have argued that diasporas can have both positive and negative influences on conflict dynamics with their remittances, philanthropy, human capital, and policy influences (Smith and Stares 2007; Orjuela 2008; Koinova 2011; Brinkerhoff 2011). It eventually became clear that research needed to better understand the conditions and context-shaping behaviors of diasporas. More researchers started calling for the use of comparative and quantitative methods.
For their part, development studies focused on diasporas more generally, and beyond the conflict-generated ones discussed here. Diasporas send much-cherished remittances back home, constituting 10–26 percent of a fragile state's GDP (World Bank 2017). Remittances maintain households during warfare and rebuild lives, housing, and infrastructure in the aftermath. Diasporas also invest in various enterprises (Smart and Hsu 2004), buy diaspora bonds (Leblang 2010), and make philanthropic contributions (Sidel 2003; Brinkerhoff 2008). Diasporas act upon political opportunities and constraints by mobilizing structures available to them locally and globally (Østergaard-Nielsen 2001; Smith and Stares 2007).
Diaspora entrepreneurs are important in such movements. We acknowledge the growing scholarship on them from a business perspective (Zapata-Barrero and Rezaei 2020), yet focus on its political dimensions, especially on mobilizations in conflict processes. We define diaspora entrepreneurs as “formal and informal leaders in a diaspora community, associated with migrant, religious, and other identity-based institutions, … acting autonomously as activists, businessmen, politicians, … who actively make public claims with a homeland-oriented goal. These are political and social entrepreneurs, even if some of them may have a business background. What unites them is a strong commitment to a cause related to their original homeland” (Koinova 2021: 11).
Diaspora entrepreneurs can bridge structural gaps through “brokerage” (Adamson 2013), “in-between advantage” (Brinkerhoff 2016), and their socio-spatial positionality in different global contexts (Koinova 2017). They can mobilize more or less contentiously through: (a) Different
Delving deeper, we aimed at combining five relevant aspects of diaspora mobilization discussed separately by existing scholarship: (1) Diaspora group characteristics; (2) host-state migration integration regimes; (3) host-state foreign policies towards the home-state; (4) critical events in the homeland; and (5) homeland policies towards diasporas abroad. Considering these domains through common methodological procedures allowed us to extract from semi-structured interviews with diaspora entrepreneurs a variety of identity-based, contextual, and behavioral “traits” that shape their “profiles.” We discuss these domains below and in more detail in the codebook.
For In Regarding Regarding
None of these five theoretical domains could single-handedly or coherently explain diaspora mobilizations during conflict. Hence, one needs to examine the domains concurrently, consider how mobilizations occur in context, and eventually isolate profiles of diaspora entrepreneurs shaped by these contextual dimensions.
Collecting and Coding Diaspora Interviews
Given the growing number of large migration projects involving multiple individual researchers working on a common theme, 1 this Research Note demonstrates how a specially designed coding system can help analyze interview transcripts through a mixture of qualitative and quantitative methods to identify data patterns across researchers’ sub-projects. In our case, the emerging patterns revealed “profiles” of diaspora entrepreneurs. We went through three phases, as indicated in Table 1.
Investigation and Coding Process.
During the
Interviewees were selected through purposive and snowball sampling to represent active diaspora members who are involved in claim-making and organizing of others about homeland-based political and social projects. Respondents were sampled from different networks, to avoid selection bias. Some of the interviewees belonged to diaspora organizations, while others were wealthy individuals who did not need diaspora organizations to endorse them. Still, others held jobs that brought them in frequent contact with the diaspora community, such as shopkeepers, restaurant owners, and hairdressers. This open definition of diaspora entrepreneurship captured a multitude of people who made claims about their original homelands, or mobilized resources for specific projects.
In the
The
This selection elucidates relationships concerning the
The codebook and coding protocols were applied progressively to the forty interviews and improved accordingly. Coders were initially instructed to assign up to two codes to each conversational unit. Intercoder discussions during a pilot phase revealed that these two codes should not be hierarchized (into master and secondary codes), as this made it more difficult to code in subsequent treatments. Eventually, a “negotiated agreement” approach was adopted, so that the two coders could “code a transcript [independently], compare codings, and then discuss their disagreements in an effort to reconcile them and arrive at a final version in which as many discrepancies as possible have been resolved” (Campbell et al. 2013: 305). The final codebook (see Appendix 1) underwent twelve drafts to refine the existing categories.
Analyzing Interview Data
We used successively
CA, our primary (quantitative) analytical method, is a type of factorial analysis focused on categorical variables (Benzecri 1992; Le Roux and Rouanet 2004; Greenacre and Blasius 2006). CA can be counterintuitive to scholars used to testing hypotheses but is helpful at establishing systematic empirical patterns (here diaspora entrepreneurs’ profiles) within a relatively new research program dominated by case studies and some comparative studies, where causal relationships have not been tested yet. CA can cluster personal characteristics, mobilization pathways, and entrepreneurs’ experiences, and establish specific patterns of contextual entanglement.
CA centers on individuals with attributes (e.g., “being a woman,” “having migration experience of political violence,” “mobilizing for one's homeland independence”). CA does not capture ties between individuals connected in a network as social network analysis does. We did not investigate such ties. The merit of CA is different: To capture how the above-mentioned attributes are tied to individuals with similar attributes, within various groups and contexts in the same social field, and to calculate the mathematical distance between cases (here diaspora individuals in Europe linked to conflict and postconflict polities in its neighborhood). CA also has the specificity of progressing inductively and acquiring robustness through
CA has proven fruitful through its application to multifactorial phenomena, such as the social construction of tastes (Bourdieu 1984), social reproduction in education (Bourdieu and Passeron 1990), social structures and political hierarchies (Hjellbrekke et al. 2007), structure and stability of political elites (Bühlmann, David and Mach 2012), and evolution of electoral constituencies (Blanchard 2007). Ragazzi's (2009) work is an exception in having used CA to study aspects of diaspora politics, most notably how sending-states reach out to diasporas abroad.
We followed CA's three usual steps: (1) Calculate (hidden) factors that summarize optimally the information from the original dataset; (2) assess and interpret factors by means of maps that determine distance between cases and codes, and joint statistical outputs, where each factor is described through a contrast between its extremes; and (3) use the calculated factors to elaborate a classification of interviewees characterized by cross-tabulations and prototypical individuals.
Our analysis is based on the transcripts of forty interviews, stratified by diaspora group and host-country. CA is appropriate for our analysis, as statistical representativity is less important than the richness of cases, the structure of similarities between them, and how all cases give shape to the field of diaspora mobilizations. On average, each interview was divided into thirty-nine conversational moves and attributed thirty-two unique codes, chosen from a list of 138 provided in Appendix 1. Codes refer to values of variables, mostly categorical, often binary. A few exceptions are ordinal, such as age range, or public opinion about immigration, with three nominal values (
Figure 1 shows twenty extracted factors with a fairly smooth decrease in explanatory power (variance). Such a vast and diverse corpus is fully multifactorial. Axes 1, 2, and 3 account for 21 percent of the explained variance, and appear to structure the experience of diaspora entrepreneurs. This relatively low variability is the rule more than the exception in CA. This reflects the fact that many aspects of the multivariate combinations of codes are not summarized by the clustering. Nevertheless, CA extracts the largest contrasts within the space of codes produced by the coded corpus.

Distribution of factors extracted by means of correspondence analysis.
Axis 1 contrasts those diaspora entrepreneurs who are reasonably well “segmentally assimilated” (Portes and Zhou 1993) into the host-society with those who make homeland-oriented claims based on political grievances. This does not mean that a person who is segmentally assimilated cannot potentially make homeland-oriented claims based on political grievances, but that the opposition between these two characteristics is structuring axis 1. This is the first and largest axis, yet it exhausts only part of the total explained variance and may not be structuring for other axes. Axis 2 contrasts individuals shaped mostly by experiences in the original homeland with those engaged in host-state institutions or transnationally. Axis 3 (Appendix 2) contrasts individuals who are predominantly self-employed and engaged mostly in communal activities with those based in migrant-dominated areas and more actively involved in their host society. Overall, the main three axes contrast personal experiences of migration and settlement with balancing identity between home and host-country. This means that some respondents engage better than others with the host-land, and diaspora mobilizations are constrained in some cases, or more enabled in others.
Exposing Clusters and Profiles of Diaspora Entrepreneurs
An examination of the axes reveals the main differences between diaspora entrepreneurs. However, these remain abstract from the reality of the respondents. To present respondents’ specific characteristics more concretely, we further cluster them. The concept of statistical clustering is similar to other approaches to build typologies but differs insofar as it relies on precise calculations (its algorithm). Clustering aims to maximize the similarities between cases belonging to the same cluster and to minimize them between those belonging to distinct clusters.
Here cluster analysis is based on the three main axes detailed above, plus some minor ones, grouping sets of interviews with similar combinations of properties over the range of 138 codes. All interviews are examined in pairs, with the most similar clustered first. All remaining interviews and clusters are examined in the same way, and those most similar are clustered again. Eventually, each small branch of the “clustering tree” merges into a bigger branch. The process moves from the forty distinct interviews (left side of Figure 2), towards the whole population (right side). 4

Clustering tree. The 40 interviews on the left-hand side are aggregated progressively from left to right, from pairs with the most similar codes to most dissimilar. The horizontal axis materializes dissimilarity between interviews: The more similar two interviews, the shorter the fork that pairs them.
Labeling the clusters associates human characteristics with the statistical modeling results, and creates diaspora entrepreneur profiles based on a combination of extracted “traits,” mobilization modes, and contextual embeddedness. A “constrained” diaspora entrepreneur has relatively restricted access to host-institutions, yet still lobbies them through media, protests, and boycotts. A “contented” diaspora entrepreneur has a more positive outlook of host-country institutions, yet is more limited in mobilizing. An “enabled” diaspora entrepreneur has soft powers based on economic resources and relies highly on education to engage with their original homeland, whereas a “discontented” diaspora entrepreneur believes conditions in the host-country and its public opinion to be negative or divided and demonstrates strong transnational activism.
Cluster 1, or the constrained diaspora entrepreneur, is the largest: It contains nineteen interviews, gathering 44 percent of the codes. It occupies the lowest third of the clustering tree (Figure 2), and is over-represented in the Netherlands and among Armenians and Palestinians (Table 2). This cluster derives codes nearly exclusively from six codebook sections (perception of host-countries, access to host-institutions, participation in host-country, types of claims, activism modes, and events triggering mobilization). This is a striking result, as no statistical rule explains why characteristic codes for a cluster would be so selective regarding the code sections.
Diaspora Entrepreneurs and Their Mobilizations.
Cluster 1, the constrained diaspora entrepreneur wishes they had more access to host-institutions. They advance claims associated with religion, genocide recognition, and less frequently gender and human rights. They define themselves by their ability to mobilize, struggling to do so at times because of moderate host-country support or limited personal involvement in host-country institutions. They are not strongly linked with their homeland, as if marooned between it and their host-country, and can't access either's valuable resources.
Cluster 2, the contented diaspora entrepreneur, is the third-largest. It contains eight interviews, gathering 21 percent of the codes (middle block, Figure 2), and is slightly over-represented in Sweden and among Palestinians (Table 2). In contrast to Cluster 1, it is mostly defined by two sections of the codebook: Personal attributes and experience with migration and integration. Dominant codes are, in decreasing order of magnitude, having transited through the host-country (and/or possibly a third country), leaving home due to violence or economic reasons, being a woman, and having attended university, mainly in the homeland. These individuals have close friends among co-ethnics and host-country nationals, have a good command of both homeland and host-country languages, and want to integrate themselves in the host-country. Contented diaspora entrepreneurs share quite positive views on many aspects of integration, including knowledge of the host-country, social and institutional resources that can enable them to act and mobilize, and support from local institutions. However, they don't mention much about the kinds and modes of mobilization they favor. They are less politically active.
Cluster 3, the enabled diaspora entrepreneur, is the smallest in our dataset, grouping four interviews at the top of Figure 2, and gathering 6 percent of all codes. It is over-represented in Germany (Table 2). An enabled diaspora entrepreneur is well-settled in the host-country, mostly through the host-land mainstream. Education and other soft means of influence are crucial to their role as diaspora leaders. They talk much more than other respondents about diaspora educational activities aimed at the community (language classes, job-training schemes) and specific youth programs in the diaspora. They establish and maintain links between citizens of the host-country, home-country, and other locations around trade, partnerships, educational initiatives, and other activities, and are often engaged with homeland advocacy through social media. They describe tight relationships with the homeland, which seems to translate to their own successful involvement. Yet they also report disagreements with homeland institutions, a sign that they cannot reach all their objectives as diaspora leaders.
Cluster 4, of the discontented diaspora entrepreneur, is the second-largest in our dataset, with nine interviews and 29 percent of all codes (Figure 2). It is over-represented in Germany and among Albanians (9/9 respondents) (Table 2). This cluster centers on homeland relations with the diaspora. These are seen as weak when the homeland is not consistently engaged with its diaspora, but also as strong when homeland officials frequently consult diaspora leaders, appoint them to government positions, or seek their investments. Respondents indicate strong relations with homeland parties and extraterritorial electoral campaigns, as well as committed to remittances to support their families. A discontented diaspora entrepreneur views conditions and public opinion in the host-country as negative and divided, with the host-country discriminating against the diaspora and migrants more generally. Respondents stress calling upon the homeland to politically change, reform, and democratize its institutions and are discontent about the inability to temporarily or permanently return to political and economic conditions there.
Table 2 demonstrates how our methodology integrated aspects of various literatures, listing some elements of these and how they relate to specific diaspora entrepreneur profiles in context.
From Interview Profiles to Survey Design and Comparative Causal Analysis
The early methodological work in this large-scale migration project established a common codebook informed by data in individual research projects, and later provided the foundations for a
The
The survey respondents were “regular” diaspora members, not diaspora entrepreneurs who organize others; therefore, direct operationalization of profiles was neither intended nor possible. Yet, by using questions relevant to diaspora entrepreneurs, we were able to compare different categories of individuals within the same country-group and across groups. The sound empirical grounds enabled us to consider many factors already identified as having an impact on diaspora mobilization. Inspired by the early findings, we asked in the survey how diaspora members are being simultaneously embedded in different contexts and about the resulting political effects. Our survey explored how, for example, diaspora members are more or less integrated in their host-countries, while maintaining translocal ties with their places of origin, engaging politically in host-land and homeland elections, responding to critical events in the homeland or another country, campaigning for genocide recognition, or supporting independence movements and military interventions.
The second item that drew on this early methodological analysis is a
First, insights about the data-driven profiles were refined by identifying and prioritizing a diaspora entrepreneur's
Furthermore, by refining insights from the original profiles, other contextual factors were singled out, such as host-land foreign policies, critical events, and other homeland-based influences. These became further IVs in the typological theory. Since the profiles also signified that diaspora entrepreneurs can act more or less contentiously, the category “contention,” important theoretically, was singled out as being analytically important and a dependent variable (DV) for the typological theory. Conceptually, this category was further unpacked into three nominal values: contentious, non-contentious, and dual-pronged mobilizations. IVs and DV were then introduced in a thorough process-tracing analysis, resulting in nine causal pathways that repeated themselves across different cases of conflict-generated diasporas linked to contested states. In contrast to the profiles, which captured data-driven patterns of multi-sited embeddedness, the typological theory took the analysis further. It used causal pathways to show
Concluding Remarks
This Research Note presents a novel contribution to scholarship at the intersection of migration studies and international relations in several ways. First, it presents a possible blueprint for conducting integrative analysis of dispersed theoretical streams in a relatively new research program, especially relevant for large-scale projects that must process the output of several researchers and their sub-projects simultaneously. As Horvath and Latcheva (2019) rightly observe, while migration research has used different methods—interviews, ethnographic analysis, text analyses, and surveys—very few research projects have integrated qualitative and quantitative methods as a coherent whole. Challenges emerge from migration being a transnational phenomenon, reflected in growing critiques of methodological nationalism (Wimmer and Glick-Schiller 2003), essentialism, and the need to consider multi-sited approaches (Beauchemin 2014; DAMR 2021; Fauser 2018).
This Research Note speaks directly to these concerns from the vantage point of one large research project. It seeks to provide a way to de-essentialize diasporas by analyzing transcripts featuring individual behaviors, and to use theoretically informed grounded coding in conjunction with quantitative methods to extract data patterns and produce meaningful analytical categories. Here diaspora entrepreneur profiles factor in multisited embeddedness, providing insights for a later generalization through cross-sectional survey research and comparative causal analysis. Other publications from this large-scale project have also benefitted from the initial systematic coding procedures. Three journal special issues have featured the importance of diaspora connectivities to different global contexts, on topics of conflict and post-conflict reconstruction, sending-states’ diaspora engagement, and transitional justice (Koinova 2018b, Koinova and Tsourapas 2018, Koinova and Karabegovic 2019). The early coding procedures were helpful in integrating the analysis of diaspora mobilizations across different global contexts and in selecting which factors to potentially prioritize for the analysis. Once the sound fundamentals of a new research program are established, the analytical endeavor could then move on to new data gathering, including longitudinal data, and creating causal models that factor in how contexts condition behaviors over time (see Kupchik, Highberger and Bear 2022).
Second, we have presented an under-utilized CA method. Coupled with cluster analysis it helped to identify meaningful analytical categories. Sometimes wrongly regarded as mere exploratory and descriptive, these tools are strong in the ways they spot structuring relationships between multiple attributes, here derived from thousands of lines of interviews, and analyzed in deep detail through a rich and structured coding grid, closely informed by theory. This comprehensive set of codes is intended to make best usage of information conveyed by respondents. CA also identifies factors that could be relevant for hypothesis testing. Without such careful analytical work, further quantitative analyses could suffer from omitted variable bias, feature variables without potential causal impact, and identify wrong hypotheses to test.
On a final note, our paper could be useful for policy-makers. This paper paints a much more nuanced picture than thinking of diaspora entrepreneurs as conflict mongers or peace-makers: individual diaspora entrepreneurs are preoccupied with different aspects of homeland-oriented activism. Some are content and enabled to influence transnational politics related to their original homelands; others feel more discontent or constrained. Policy-makers need to start thinking about the variety of diaspora entrepreneurs and their specific inclinations to engage in homeland-oriented activism.
Supplemental Material
sj-pdf-1-mrx-10.1177_01979183231172103 - Supplemental material for Mixed Methods in an Evolving Research Program on Diaspora Entrepreneurs and Their Mobilizations
Supplemental material, sj-pdf-1-mrx-10.1177_01979183231172103 for Mixed Methods in an Evolving Research Program on Diaspora Entrepreneurs and Their Mobilizations by Maria Koinova, Philippe Blanchard and Ben Margulies in International Migration Review
Supplemental Material
sj-docx-2-mrx-10.1177_01979183231172103 - Supplemental material for Mixed Methods in an Evolving Research Program on Diaspora Entrepreneurs and Their Mobilizations
Supplemental material, sj-docx-2-mrx-10.1177_01979183231172103 for Mixed Methods in an Evolving Research Program on Diaspora Entrepreneurs and Their Mobilizations by Maria Koinova, Philippe Blanchard and Ben Margulies in International Migration Review
Footnotes
Acknowledgments
Declaration of Conflicting Interests
Funding
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
