Abstract
The rapid introduction of machine learning in social science brings together researchers with different ways of thinking about and doing science. This brings new ambiguities and potential clashes. Hofman et al. (2021) published recommendations to help scholars integrate machine-driven prediction practices with explicit goals of testing generalizability and developing new methods, categorizing model types and granularities, and adopting open science practices from the computer sciences. As the Hofman team’s recommendations were published in
Before proceeding, I draw the reader’s attention to the current revolution taking place in social science where the use of machine learning is one of the fastest growing trends. For obvious reasons, this journal’s focus on the intersection of computers and social science places it in the center of this revolution. Figure 1 displays the number of publications mentioning some form of machine learning in comparison to all publications in social science in general and in this journal (“

Figure 1. Key trends in computational social science, 1986–2020.
Understanding and Categorizing Computational Social Science Now
The Hofman team proposes a four-category scheme to distinguish modeling approaches:
Descriptive and Explanatory Modeling (a.k.a. Social Science)
As Hofman et al. (2021) point out (their Figure 1), these models try to explain how changes in conditions affect outcomes in a given situation and are developed predominantly through logic and experimental design. They tend to have goals of causal inference, theory development, and constructing and testing formal models (as in mathematical sociology). Effective use of explanatory models requires careful consideration of the data-generating process: whether there is random assignment, and whether all confounding and colliding pathways are accounted for, before developing tests or drawing conclusions. This control knowledge can only come from theory and prior experience with the subject of study.
Whether a model is descriptive or explanatory depends mostly on researchers’ prior expectations. Without assumptions or specific hypotheses to test, the work is descriptive; that is, it aims to uncover whether there is an association of
The advantages of explanatory modeling are primarily scientific. Such models advance the categorization and description of human societies, behaviors, structures, and processes. Ideally, they better educate students and the general public about how and why things are the way they are, and provide information to assist policy-making. Because these models tend to represent specific theories of a narrow range of social or behavioral processes, they are tested on data reflecting unique times, places, contexts, and especially sources. Thus, their explanatory “power” tends to be low, for example, regression coefficients and
A major drawback of explanatory modeling is haphazard deployment. Scholars rely on null-hypothesis significance testing (NHST) and often selectively report coefficients that have asterisks (
Fortunately, the open science movement and shifts toward meta-science are helping bring these issues to light. Also, perhaps driven by some influence from computer scientists and their predictive modeling approaches, explanatory modelers are increasingly running many models, testing robustness, and considering replication or meta-analysis to ensure that a theory (explanation of something) passes the scrutiny of many data sets and specifications and that a reported “effect” should be judged on other criteria such as relevance rather than simply being non-zero (Freese & Peterson, 2018; King, 1995; Stahel, 2021).
Predictive Modeling
This category covers essentially all forms of machine learning, sometimes also known as “algorithmic modeling.” The approach is generally a-theoretical and pays little attention to causal mechanisms or to explaining anything. It is widely applied in computer science and in the private sector, for example, to predict online behaviors, sell products, or improve investment decisions. Its use in social science has nonetheless grown exponentially (see Figure 1). These models seek to exploit all known information from a given source of data, including meta-data and contextual data, to predict an outcome. A model is trained on one subset of the available data, and the preferred algorithm is then tested on a different, held-out subset. If the predictive power is high, the model is deemed acceptable. This makes for easy judging criteria, unlike with explanatory models, where theoretical discussions, causal logic, consideration of previous literature, and various statistical tests and fit statistics are used simultaneously to decide whether a model is useful.
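The train-then-test workflow just described can be sketched in a few lines. This is a minimal toy illustration, not any specific study’s pipeline: the data, the 80/20 split, and the threshold “model” are all invented for demonstration.

```python
# Minimal sketch of the predictive-modeling workflow: fit on one subset of
# the data, then judge the model solely by its accuracy on a held-out subset.
# All data and modeling choices here are hypothetical toy choices.
import random

random.seed(0)

# Toy data: y = 1 when x > 0.5, with 10% label noise.
data = []
for _ in range(1000):
    x = random.random()
    y = int(x > 0.5)
    if random.random() < 0.1:
        y = 1 - y
    data.append((x, y))

random.shuffle(data)
train, test = data[:800], data[800:]  # 80/20 train/test split

def accuracy(threshold, rows):
    """Share of rows correctly classified by the rule 'predict 1 if x > threshold'."""
    return sum(int(x > threshold) == y for x, y in rows) / len(rows)

# "Training": pick the threshold that maximizes accuracy on the training subset.
train_acc, threshold = max((accuracy(t / 100, train), t / 100) for t in range(101))

# "Testing": the model is judged only by accuracy on the held-out subset.
test_acc = accuracy(threshold, test)
print(f"threshold={threshold:.2f}  train={train_acc:.2f}  test={test_acc:.2f}")
```

With 10% label noise, held-out accuracy lands near 0.9, illustrating the single judging criterion the text describes: if out-of-sample accuracy is high, the model is accepted.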
In social science, being able to predict an outcome is of little use unless it serves goals of classification or theory development. Thus, predictive modeling has entered the social sciences mostly in service of explanatory modeling. It can accomplish tasks that humans cannot, for example, qualitative coding of topics or events that would require too many human coders, or coding data far faster than humans can. The advantages can be monumental: scholars could track the spread of the SARS-CoV-2 virus and public sentiment across the world daily thanks to predictive modeling. 1 This demonstrates how predictive modeling could contribute to an active social science with real-time data and results.
The major drawback of predictive modeling is that the factors driving predictive accuracy are more or less a black box. Another drawback is data availability. Human behaviors and outcomes can be predicted with accuracy, but only when large data sets with thousands of variables are available, and there is rarely so much information outside of specific surveys at specific moments. Thus, having a powerful and accurate machine algorithm is useless most of the time, as large-scale surveys are rare and expensive and sensitive public information is not freely available. Other drawbacks are general replication issues; some of these are similar to those already well known in explanatory modeling (Breznau, 2021a; Campion et al., 2020; Hendriks et al., 2020; Janz, 2015; Open Science Collaboration, 2015), but some are unique to predictive modeling (Kapoor & Narayanan, 2021). For example, certain steps in the process are completely out of the researchers’ hands, so that identical starting code and routines produce different results in the presence of different software choice layers or graphics cards (GPUs) inherent to the software or computer being used (Vijayakumar & Cheung, 2019; Villa & Zimmerman, 2018).
Still more concerns relate to the environmental impact of computer energy consumption in ever-larger predictive models (Bender et al., 2021) and evidence that humans can often predict outcomes just as well as machine learning algorithms in sociological and psychological studies (Christodoulou et al., 2019; Dressel & Farid, 2018; Salganik et al., 2020; Saveski et al., 2021). One poignant example demonstrated that a human-specified model and a machine algorithm were roughly identical at predicting unemployment spells, but the machine algorithm relied on 10,000 variables while the human’s logistic regression needed only four (McKay, 2019). If models generated by trained experts can perform just as well, then they are preferable because they use fewer degrees of freedom, require less computing power, contribute less to climate change, and are more cost-effective in data requirements (e.g., compare the cost of a survey with four versus 10,000 questions!).
Natural language processing in machine learning raises serious critical-race and inequality issues. When machines code things in lieu of humans, they can reproduce existing social biases and further disadvantage already disadvantaged groups. The technical language used to categorize people can be coded with negative affect; for example, “Black” can be identified with negative sentiment contra “White,” and this can lead to racial biases and harms from machine algorithms (Gebru, 2019). Thus, when policy makers or law enforcement use biased algorithms, they reinforce bias (Janssen et al., 2020). The same has been shown for phrases that describe persons with disabilities. Hutchinson et al. (2020) demonstrated that such phrases are coded by a (well-trained) machine as having high levels of “toxicity” (a negative-affect sentiment); for example, “I am a person with mental illness,” “I am a deaf person,” and even “I will fight for people who are deaf” would all receive a high degree of toxicity in machine language processing. If used to monitor or censor social media, such algorithms could disadvantage mentally ill persons and mental-illness support or advocacy groups.
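The mechanism behind this failure mode can be made concrete with a toy lexicon-based scorer. The “toxicity” lexicon below is entirely invented for illustration; it only mimics the bias Hutchinson et al. (2020) document, in which identity terms inherit negative scores from biased training data, so that a supportive sentence about deaf people scores as more “toxic” than an otherwise identical sentence.

```python
# Toy illustration of how lexicon-based scoring can encode social bias.
# The lexicon is invented for demonstration only; real systems learn such
# scores from biased training data rather than from a hand-written table.
biased_lexicon = {
    "fight": 0.6,    # genuinely negative-leaning word
    "deaf": 0.7,     # identity term wrongly scored as toxic
    "mental": 0.5,   # identity-related terms wrongly scored as toxic
    "illness": 0.5,
}

def toxicity(sentence: str) -> float:
    """Mean lexicon score over the words of a sentence (0.0 if none match)."""
    words = sentence.lower().replace(".", "").split()
    return sum(biased_lexicon.get(w, 0.0) for w in words) / len(words)

supportive = "I will fight for people who are deaf."
neutral = "I will fight for people who are tall."
print(toxicity(supportive))  # higher, despite being a supportive statement
print(toxicity(neutral))
```

The two sentences differ by one identity term, yet the supportive one scores higher, which is exactly how an algorithm deployed for content moderation could disproportionately flag disability-advocacy speech.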
Integrative Modeling
The Hofman team foreshadows this approach as a potential new trend in social science. Integrative models would combine explanatory and predictive approaches in a single study. The single study might involve many smaller modeling steps, but these would all contribute collectively to an integrative model. Hofman et al. (2021) define an integrative outcome as one that “[t]ests a claim both for causality and predictive accuracy” (p. 185) and could “help to formulate predictively accurate causal explanations” (p. 184). The Hofman team provides two examples. One is from Athey et al. (2011), who develop an explanatory model of bidding behavior in an auction and use it to predict outcomes that are then tested against the actual outcomes. The other comes from coordinate ascent algorithms that iteratively alternate between predictive and explanatory models; in particular, this involves manipulating some aspect of the subjects while under study to help better explain the outcomes (Agrawal et al., 2020). In principle, such models should provide benefits greater than explanatory or predictive models done in isolation because they can predict the “magnitude and direction of individual outcomes under changes or interventions” (Hofman et al., 2021, table 2).
Because of the technical barriers to predictive modeling and the risks of inappropriate usage of explanatory and predictive modeling in isolation, it is possible that integration will bring even less reliable outcomes. As Lazer et al. (2009) point out, most social science methods were developed to handle snapshots of data. This means that methodological developments are needed to keep pace with machine learning approaches and larger data sets with ongoing sampling. It is already a monumental achievement to analyze networked data with 10,000 nodes (and a potential 50 million network ties); it is another thing altogether to do this with 10,000 nodes over 10,000 days (a potential 500 billion transactions across those daily ties). The technical skills and computing power needed to achieve integrative modeling are a serious concern and should be weighed against the potential benefits and the new enthusiasm of social scientists to jump on the artificial intelligence bandwagon.
Another barrier is that social scientists are unlikely to have integrative goals. Studying a time- and place-specific phenomenon may mean that predictive accuracy on out-of-sample data is irrelevant because the interest is only in that particular moment. Moreover, when bringing in new data, it is very likely that the data-generating model has changed, and this would require rethinking the theory rather than trying to maximize predictive accuracy. Again, a lack of data also precludes many integrative goals. For example, Altaweel (2021) developed a predictive natural language processing algorithm to classify cultural objects advertised on
Currently, all articles published in
There are exceptions in the broader literature, and these exceptions will likely grow as a function of knowledge and discussion of best practices regarding machine learning, especially if social scientists heed the recommendations of the Hofman team. When deployed with high technical skill, integrative modeling could identify explanatory and causal mechanisms that researchers simply cannot see under normal circumstances. In random forest algorithms, machines might help to identify combinations of variables that stand out as predictors of an outcome, or reveal an otherwise suppressed relationship to an outcome after testing all other possible combinations, thus ruling out “luck” or the random chance that a scholar arrived at such a result (Molina & Garip, 2019). Such a combination of variables might constitute a meaningful subgroup in a given society (Brand et al., 2021).
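A minimal sketch of this use of random forests follows. The data are simulated rather than drawn from any cited study: the outcome depends only on an interaction of two variables, mimicking a meaningful “subgroup,” while two further variables are pure noise. The forest’s importance scores then surface the interacting pair as candidates for theoretical attention.

```python
# Sketch of using a random forest to surface which variables (and their
# combinations) predict an outcome, in the spirit of Molina & Garip (2019).
# The data are simulated: y depends on an interaction of x0 and x1
# (a subgroup-style effect); x2 and x3 are pure noise.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.random((2000, 4))
y = ((X[:, 0] > 0.5) & (X[:, 1] > 0.5)).astype(int)  # interaction-defined subgroup

forest = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
importances = forest.feature_importances_

# The forest ranks the interacting variables far above the noise variables,
# flagging x0 and x1 as a candidate subgroup for explanatory follow-up.
for name, imp in zip(["x0", "x1", "x2", "x3"], importances):
    print(f"{name}: {imp:.3f}")
```

The importance scores are only a screening device: identifying *why* the flagged combination matters remains an explanatory, theory-driven task.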
Currently, the social science I am familiar with has goals of description and explanation. Studies use machine learning in one stage to define a variable to use in their main explanatory model. They use
Integrative Lessons
Overall, the Hofman team demonstrates that social scientists (explanatory modelers) and computer scientists (predictive modelers) can learn from each other’s procedural differences. For example, the shift to open science leads social scientists to embrace methods that insulate against analytical flexibility (Nosek et al., 2018), while computer scientists use crowdsourcing, such as the “common task framework,” to achieve larger modeling goals (Breznau, 2021b). 2 Cross-integration of these practices could help both types of science become more reliable, hack-proof, reproducible, and generalizable in scope. Social science gains are already emerging in “many analysts” studies, which mimic the crowdsourcing competitions of computer scientists but pursue goals of explanation, not just developing a better (meta-)algorithm (Botvinik-Nezer et al., 2020; Breznau et al., 2021; Silberzahn et al., 2018). At the same time, if predictive models were preregistered and peer reviewed, it could improve their efficiency, for example, by avoiding redundant testing of models on the same data subsets, which introduces bias loops and possibly overstates predictive accuracy. This would in turn benefit modelers who try to use prediction to serve explanatory goals but may not be as skilled as computer scientists at predictive modeling. Peer review of preregistrations could greatly reduce shoddy machine learning research practices.
The Hofman team’s recommendations come at a critical moment when more and more researchers are employed to do computer science in service of social science goals. These researchers will struggle if they pursue only predictive modeling. In the end, social science is about explanation, and this requires theory. In fact, it is social scientists who can teach computer scientists that prediction itself requires basic assumptions, and assumptions are the building blocks of theory. For example, knowledge and assumptions about human sentiments are necessary before supervising a machine to arrive at usefully coded sentiments (Watanabe & Zhou, 2020). Goals of theoretical explanation can help resolve the reproducibility crisis currently facing social science (Gervais, 2021), if not the ethical crises facing computer science. Social scientists often try to maximize
It was my intention in this communication to raise awareness among computational social scientists about the risk-reward trade-offs of integrating predictive modeling. As such, I would argue that the Hofman team’s “Summary of Suggestions” (p. 187) should be a standard reference for integration in the new post–machine learning social science era we have just entered, because it calls on social scientists to (1) integrate explanatory and predictive modeling with explicit goals of testing generalizability and developing new methods, (2) clearly label contributions by model type and granularity, and (3) standardize open science practices across the social and computer sciences. Underlying the many benefits of these goals is the possibility of improving social science through better theory production. First, generalizability and new (better) methods improve the quality of theory and theory testing. Second, clearly delineating a model and its level of granularity in a way that is interpretable by another social scientist is an exercise in reflective logic; spending more time logically reflecting on a model gives scholars an opportunity to better develop their theory. Third, open science practices remove barriers and promote a more robust and reliable social science. With fewer barriers there are more opportunities for theoretical testing and development, and with more robust findings social scientists will spend less time recycling poorly supported findings and theories.
