Abstract
Introduction
Background
Precision public health (PPH) is a new area of public health that supports the growth of precision medicine (PM) by leveraging improvements in the latest technology and information gained from Big Data (BD) to deliver more tailored public health initiatives in the communities. 1 In short, PPH is defined as an approach to implementing the right intervention for the right population at the right time. 1 According to experts, PPH is capable of improving the ability to prevent disease, promote health, and reduce health disparities in populations by applying emerging methods and technologies for measuring disease, pathogens, exposures, behaviors, and susceptibility in populations; and developing policies and targeted implementation programs to improve health. 2
This application involves the use of single or multiple data sources with large volumes of data known as BD3,4 and utilizes methods such as machine learning5–7 and artificial intelligence.8,9 The combination of data sources such as omics data, clinical data, social data, patient-generated health data, environmental data, and demographic data can provide a complete perspective of a patient or community.8–10 The application of PPH and BD would be able to improve the accuracy in select fields of public health which includes disease surveillance, risk prediction, targeted intervention, and understanding of disease. 11 This advancement in health-related technology was further catalyzed by the recent COVID-19 pandemic. Various studies have shown the significant impact of PPH approaches in managing the COVID-19 pandemic, especially in understanding the novel disease, 12 clinical management,13,14 disease spread and burden,15–17 population behavior,18–20 and the effectiveness of strategies implemented.21–23 However, the application of PPH possesses several challenges and limitations. First, there is limited knowledge and theory on PPH which thus limits our understanding of it. 24 Secondly, the availability of data sources and resources to cater to PPH applications. Current technology limits the extent to which the large volume of data could be utilized in public health fields, especially in middle- and low-income countries. 25 Furthermore, various ethical and privacy issues should be addressed such as data security, individual right to access personal information, and accessibility of vast amounts of data by various stakeholders. 2
In this scoping review, we aimed to explore the literature on the PPH approaches in outbreak management. Outbreak management is defined as coordinated and systematic approaches taken to control and mitigate the spread of infectious/communicable disease or other harmful agents within a specific population or geographical region. 26 Hence, this review can provide insights into the various types of health-related data which could be utilized in various stages of outbreak management. The findings may be used as a policy guideline for the implementation of PPH at various stages of healthcare systems, especially in outbreak management.
Methods
Review questions
How does the PPH approach complement the conventional outbreak management practice in responding to infectious disease outbreaks?
How could various types of nonconventional health-related data complement outbreak management?
What are the barriers and challenges of implementing the PPH approach in outbreak management?
Eligibility criteria
A preliminary search was performed in major databases such as PubMed, Web of Science (WoS), the Cochrane Database of Systemic Review, and Joanna Briggs Institute (JBI) Evidence Synthesis to identify any new or ongoing reviews on the study topic and no reviews or studies were found.
The present review used the PCC (Population, Concept, and Context) framework proposed by the JBI. The JBI scoping review is a widely used reviewing method that incorporated other known frameworks proposed by Arksey and O’Malley, and enhancement proposed by Levac, Colquhon, and O’Brien. 27 The eligibility criteria were justified based on the article type, language, population, concept, and context (Table 1).
Article eligibility criteria.
PRISMA extension for Scoping Reviews (PRISMA-ScR)
In our study, we employed the PRISMA-ScR framework as a guiding tool to ensure transparency and methodological rigor in the conduct and reporting of our scoping review. By adhering to PRISMA-ScR guidelines, we systematically identified, selected, and synthesized relevant literature pertaining to our research question, following a predefined protocol.
The adherence to the PRISMA-ScR checklist was robust, with 20 out of the 22 relevant criteria fully met in the conduct of this scoping review (Appendix 1). This dedication to adherence underscores our commitment to transparency and methodological rigor in reporting the review process and findings. While two criteria specific to systematic reviews were not applicable to our scoping review methodology, the overall compliance with PRISMA-ScR guidelines enhances the credibility and reproducibility of our review.
In our study, although the research protocol was developed and followed throughout the conduct of the scoping review, it was not formally published due to financial constraint. Despite the protocol not being published, rigorous adherence to a predefined protocol was maintained throughout the study process to ensure consistency and transparency in the conduct of the scoping review.
Search strategy
The search strategy aimed to locate published studies. The search was conducted in PubMed, WoS, and Science Direct databases using the keywords “precision public health” and “communicable disease” (Table 2). The search strategy was conducted based on index terms and adapted for each database and information source. For instance, the PPH is closely related to BD and data-driven analytics while outbreak management, as defined previously, involves the management of communicable diseases. Hence, these keywords were included in the search terms. The PPH topic was not explored using Medical Subject Headings (MeSH) terms due to the highly specific and novel nature of the topic. No appropriate MeSH terms are available to accurately capture the concept. Hence, relying solely on keywords search yielded more relevant results. The reference list of the selected studies was screened as an additional source of evidence. We included publications in the English language in this study. We retrieved studies conducted between the 1st of January 2017 and the 1st of January 2023. This limitation was relevant to ensure that the review captures the most recent and relevant literature on PPH. Limiting the search to recent years helps to focus on current trends, developments, advancements in the PPH-related field.
Search strategy.
Selection of studies
Following the search, all identified citations were grouped and added to the EndNote version X9 year 2018 (by Clarivate Analytics, PA, USA) and duplicates were removed. Two review authors (EGR and RKS) independently screened the eligibility of the studies. Studies with titles and abstracts that did not fit the eligibility criteria were excluded from the study. Following that, the retrieval and assessment of the full text were performed, and only relevant studies were kept using predefined inclusion and exclusion criteria. Any disagreement between the review authors was resolved by further discussion with the third review author (FMH) who acts as an arbiter. Studies were included if they describe the PPH approach, including BD application in communicable disease or outbreak management.
Data extraction
Data extraction from the studies was performed independently by two of the review authors (EGR and RKS) using a data extraction tool developed by the reviewers. The researchers first identified the key variables or pieces of information that needed to be extracted from each study to address the research question and objectives of the scoping review. These variables were selected based on their relevance to the topic under investigation and the specific aims of the review. For instance, in terms of types of data, the researchers established inclusion criteria based on the types of data sources relevant to the research question and objectives. The inclusion criteria were determined based on the scope of the review and the types of data sources commonly used in PPH applications.
Once the relevant variables were identified, the researchers developed a structured data extraction tool to systematically capture these variables from each included study. The tool was designed to ensure consistency and completeness in data extraction across all studies.
The data extracted included specific details as follows:
Author(s) name. Publication year. Type of study, including cross-sectional, cohort, experimental, and review. Disease studied includes specific infectious/communicable disease such as dengue and influenza or combination of diseases. Type(s) of data source, namely traditional health-related data (for example, medical records, genetic data, and surveillance data) and non-conventional data (for example meteorology data, zoonotic data, and Internet data). Type(s) of PPH application, explaining the approach in disease management, for instance, predicting risk, allocation of resources, and signal detection. Findings, the outcomes of the study. Type(s) of limitations/barriers the researcher identified during the study was conducted.
The summary of the articles is presented in Table 3.
Article summary.
Data analysis
The key information from the selected studies was systematically extracted, including data as mentioned previously. This step ensured that relevant data were captured in a structured manner for further analysis.
The extracted data were then analyzed thematically to identify common themes, patterns, and trends across the literature. This involved organizing the findings into meaningful categories or concepts that captured the essence of the research findings. With regards to types of PPH application, we constructed a table based on the review by Dolley et al. on the role of BD in PPH as the use of BD is vital in implementing the precision of public health strategies. 11 These roles include performing disease surveillance and signal detection, predicting risk, targeting interventions, and understanding disease.
Moreover, rather than relying on quantitative measures of evidence, the synthesized findings were presented in a narrative format. This approach allowed for a qualitative interpretation of evidence, highlighting key themes, relationships, and areas of consensus or contention within the articles.
The synthesized findings were interpreted in the context of the research question and objectives of the scoping review. Any implications, limitations, or gaps identified in the literature were discussed, providing insights into the state of knowledge on the topic and guiding future research directions.
Patient consent
In the context this scoping review, patient consent was not required as the synthesis of existing literature does not involve direct interaction with human subjects or their personal information.
Results
Search outcome
The systematic search process (Figure 1) involved a comprehensive search of relevant literature sources, including PubMed, WoS, and Science Direct databases, to identify articles addressing the research question. The initial search yielded a total of 1058 articles, with PubMed contributing 332 articles, WoS contributing 626 articles, and Science Direct contributing 100 articles. Following the removal of duplicates, 714 unique articles remained for further evaluation.

Flowchart of preferred reporting items for systematic reviews and meta.
The screening phase commenced with the assessment of article titles to determine their suitability for inclusion in the scoping review. During this process, 647 articles were excluded due to a variety of reasons diverging from the research questions. Among these exclusions were studies focusing on non-communicable diseases such as cancer, Parkinson’s disease, and diabetes mellitus, which did not directly address infectious/communicable disease outbreaks or the application of PPH approaches in outbreak management. Additionally, articles centered on the development of decision-making algorithms or artificial intelligence modeling were excluded as they were not directly related to PPH strategies or outbreak management. Furthermore, studies related to BD application in non-health-related sectors such as agriculture and economy were deemed outside the scope of the review, which specifically targeted health-related data and outbreak management. Finally, case studies lacking generalizability or potential impact of PPH approaches on outbreak management aspects were also excluded, indicating a lack of relevance to the research questions. These exclusions were made to ensure that the articles included in the scoping review were aligned with the defined research objectives, focusing specifically on the role of PPH in outbreak management.
Subsequently, abstract screening was conducted on the remaining 67 articles. After careful evaluation, an additional 37 articles were excluded due to no information on potential PPH application in infectious/communicable or outbreak management were stated (exclusion criteria in Table 1). The remaining 30 articles underwent a thorough full-text assessment to ascertain their eligibility for inclusion in the scoping review. Among the full-text articles, one was excluded as it did not pertain to the specific application of PPH, while two articles were excluded due to the absence of outcome measures related to the research question. This meticulous selection process resulted in a final set of 27 articles that met the eligibility criteria and were deemed appropriate for inclusion in the descriptive synthesis of the scoping review.
Study demographics
Table 4 shows the demographic of studies included in the review. Most of the articles were published in the year 2020 (
Study demographics.
Principal findings
PPH applications
The reviewed articles were classified based on the five themes of PPH application in outbreak management as mentioned in methods chapter. On top of that, we found that two articles were focused on resource allocation and optimization which was included in the theme of PPH application. Some of the reviewed articles contained more than one type of PPH application. Most of the review articles were focused on the PPH approach in performing disease surveillance and signal detection (
Data sources in the PPH application
From the article analysis, we found that various data types were used in implementing PPH in outbreak management, either as a single data type or in combination. The widely used data in PPH application was surveillance or epidemiological data (51.9%) such as national infectious disease surveillance data,28–30 local region dengue incidence data,31,32 history of disease occurrence data, 33 and COVID-19 morbidity and mortality rate. 34 Meanwhile, environmental or geographical data (44.4%) were also often used in studies such as daily local weather data which includes mean humidity and temperature,31,35 wastewater surveillance for the COVID-19 virus, 36 and urban topography data. 37 Similarly, Internet queries and social media data (37.0% and 22.2%, respectively) such as Google search trends or Baidu search indexes on specific diseases,30,33,38–42 Weibo posting index, 38 and Twitter trends.18,41 Additionally, personal health-related data (3.7%) such as personal insurance claim data, 28 and personal social-related data (11.1%) such as consumer purchase data, 43 human footprint data, 44 and school calendar patterns 42 were utilized as well. We also found that genomic data44,45 and various clinical data (25.9%) such as patient medical records,29,41 laboratory data, 44 and online medical appointment or consultation data 30 were included in some of the studies. Another important data type that is crucial in PPH implementation is global positioning data which could be retrieved such as mobile positioning terminals, 46 street or satellite images, 47 and global positioning systems (GPS) coordinates data. 48 Lastly, entomology or zoology data were also utilized in outbreak management such as vector egg density index in the dengue outbreak, 49 bird species surveillance in the Japanese Encephalitis outbreak, 44 and mosquito infection growth rate in the West Nile virus outbreak. 50
Gaps and challenges in PPH application
Our synthesis revealed several noteworthy limitations and barriers associated with PPH application in outbreak management. The quality and availability of data emerged as the most prevalent limitation mentioned in the reviewed articles (
Discussion
Principal findings
To date, several articles have been published regarding the role of PPH and BD in the public health field.51–55 However, to our knowledge, this is the first review focusing specifically on the PPH application in outbreak management. The results from this scoping review will aim to address the application of PPH in outbreak management. We aim to describe the types of PPH approaches applied in outbreak management as well as the limitations and challenges encountered throughout the implementations.
PPH applications in outbreak management
Disease surveillance involves systematic collection, analysis, and interpretation of health-related data to determine early signals of potential disease outbreaks. PPH leverages various sources of data to improve the accuracy and timeliness of disease surveillance and monitoring by including not only currently available conventional disease surveillance systems, but also integrating other non-conventional data sources such as social media and Internet query data,18,30,33,34,38–42 and environmental data.31,35–37,56 For instance, Chen et al. (2019) found that the number of Internet query data in search engines and social media increased as early as four weeks before the epidemic occurrence. 38 Similarly, Li et al. (2022) developed a dengue prediction framework by integrating multiple environmental data and Brazil’s national historical dengue data which accurately forecast the weekly dengue cases five weeks ahead. 56 The ability of PPH approaches to forecast an outbreak earlier than the actual occurrence provides adequate time to establish effective prevention and control measure strategies to reduce or nullify the possible impact due to outbreak occurrence.
The risk prediction potential of the PPH approaches involves the adaptation of predictive analytics models to identify specific populations or regions with a higher risk of disease outbreak occurrences. These predictive models integrate various risk factors, such as biological data, 44 socio-demographic data, 44 environmental alterations,35,37 and population behavior patterns 48 to generate specific precise risk scores or assessments. For instance, Yoon et al. (2022) conducted a study to determine the risk of highly pathogenic avian influenza (HPAI) in Korea using deep learning computation. The authors utilized livestock surveillance program data, local meteorological reports, and the movement of livestock vehicles in the interior of the poultry farm to predict the risk of HPAI virus spread. They found that livestock vehicle movements were strongly associated with the HPAI virus spread during an outbreak. 48 Meanwhile, in Italy, Ippoliti et al. (2019) attempted to develop a risk map of West Nile Disease (WND) by adopting a data-driven spatial clustering approach using topographical, environmental, and climate factors. 35 They found the Bluetongue vector was strongly associated with high mean temperature, dry environments, flattened and sunny surfaces, and low vegetation. Based on this information, they were able to develop a much more precise risk map that could facilitate decision-makers to prioritize preventive measures and resource allocations more effectively.
The fundamental application of PPH is the right intervention for the right population at the right time. 2 Therefore, targeted intervention is an essential component of PPH to optimize the outcomes of the intervention. This method is tailored based on individual or population-level characteristics which involve more than one type of data. For instance, a review published by Moller et al. (2018) showed that various types of consumer purchase data, such as loyalty card data, bank card data, and history of transactions were able to facilitate outbreak management. 43 More tailored intervention could be provided to specific exposed populations based on the outbreak investigation. On top of that, the review also reported that consumer purchase data were used in generating hypotheses, analytical epidemiology, tracing back the source of an outbreak, and supporting existing evidence or hypothesis.
The other key application of PPH in outbreak management is the capability to optimize the allocation of limited healthcare resources, such as manpower, medical supplies, and infrastructures. Via utilization of data-driven simulations and models, PPH endeavors to identify resource gaps, anticipate future resource needs and optimize the resources allocation to regions or populations with the highest disease burden or vulnerability. The evidence of such advantages was mentioned by Minetto et al. (2020) in a study conducted in the United States of America to develop a framework to map the flow of vehicles over the area of interest with the combination of publicly available data sources, such as satellite images and OpenStreetMap, and machine learning-based vehicle detection method. 47 The author reported that the proposed framework was able to provide stable and accurate vehicle counts in most regions of interest. This information helps the decision makers to COVID-19 control strategies based on the different region of interest groups and their expected behavior. On the other hand, the adoption of PPH also allows for more reliability and accuracy in measuring the effectiveness of an intervention or strategy. For instance, during the COVID-19 pandemic, China implemented five phases of public gathering striction strategies to control COVID-19 transmission. 46 Nei et al. (2020) conducted a study to evaluate the impact of these strategies using mobile terminal positioning data as the source of information. The application of mobile terminal positioning data in the difference-in-difference (DID) model provided information on the number of people present at a specific time duration at designated points of interest. The author was able to show the effectiveness of public gathering restrictions by measuring the number of people presented at specific places during a specific time.
The concept of PPH application of leveraging data and advanced analytics allows us to gain insights into the underlying causes, mechanisms, and dynamics of diseases. Integration of various types of data from multiple sources, such as genomic data, clinical data, social data, environmental data, and social determinants of health allows PPH to provide a much more accurate understanding of disease etiology and progression.1,11 The recent COVID-19 pandemic is a perfect example of how quickly we were able to understand the nature of the disease. A study was conducted in two different cities in China to understand the nature of COVID-19 using environmental and national surveillance data. 57 The author found that the incubation period of COVID-19 disease varies, and the longest incubation period was seen if maintained mean temperature at 10 to 15 degrees Celsius (50 to 59 degrees Fahrenheit). This evidence of association played an important role in the prediction of the transmission and evolution cycle of the COVID-19 outbreak in China. Meanwhile, a study was conducted in Uganda to identify clusters of unexplained, influenza-negative severe acute respiratory infections (SARI) by applying spatiotemporal analysis using SARI surveillance data and genomic data. 45 The analysis identified nine statistically significant clusters accounting for 10.4% of unexplained cases. The method, applied in real-time to routinely collected surveillance data, able to mitigate the outbreak of preventable viral disease by targeted vaccination strategy, capable to signal outbreak investigation of unusual clusters of pathogens and resource allocations to higher risk areas (hotspots).
Despite having several proof-of-concept applications of PPH in outbreak management, there are still many unexplored aspects in this approach. As tailored or data-driven approaches had been well established in other sectors such as finance, economy, and education, there is potential of adopting these applications into healthcare system particularly in outbreak management. For instance, the utilization of data-centric strategies within health marketing such as tailored communication and engagement, seamlessly measuring success, and better insight of the target population.58–60 These strategies facilitate the customization of messaging and communication directed towards distinct demographic segments, yielding substantial advantages. 58 Moreover, data analytics offers the capacity to furnish immediate and dynamic insights into the performance of a campaign. 60 This real-time understanding empowers agile adjustments and refinements to endeavors and stratagems. On top of that, utilizing data encompassing demographic attributes, geographical disposition, as well as health-related behaviors and attitudes, affords the capacity to formulate campaigns of heightened efficacy and discerningly allocate focus to paramount health imperatives.58,59
Furthermore, incorporating robotic technology to enhance the effectiveness of public health strategies, especially outbreak management will be the next groundbreaking approach. This unexplored technology in public health have shown tremendous benefits in clinical practices.61–63 Some possible advantages of robotic applications in outbreak management includes enhancing data collection,64,65 smarter prediction of outbreak progression,64,65 act as a health educator,64,66 remote checkups using biosensors,65,67 automated logistic and delivery systems, 64 as well as surveillance and preventive strategies in highly hazardous locations. 67 However, these possible applications warrant further research and concept development in public health discipline.
Types of data in PPH applications
The PPH approaches are dependent on the large volume of information from conventional and non-conventional data sources. One of the important health-related data sources that are readily available is molecular profile data. This data are information regarding the features of molecular profiles, such as genes, proteins, chemical compounds, carbohydrates, and metabolites which help us to provide us a better understanding of relationships between molecules of an organism. The research of such interactions is critical in the discipline of PPH. 68 This research can provide numerous discoveries, such as determining whether there are distinct variants of a disorder and identifying characteristics that may contribute to the effectiveness of a possible treatment.45,69 In a disease outbreak, the use of such data can provide accurate information regarding the different variants that exist within the same species of pathogen which could provide a better understanding of the pathogenesis and virulence of the pathogen. This information is vital to establish effective intervention and prevention measures during the outbreak.29,70
Another health-related data that is vital in PPH approaches is clinical data. Clinical data includes diagnostic procedures such as Computerized Tomography scans, electrocardiograms, and x-rays. In addition, certain forms of clinical data may be acquired and controlled entirely by patients through the use of health equipment such as digital scales, wearables, and digital sphygmomanometers. 11 However, such clinical data resources and applications are not routinely supervised or recorded in healthcare institutions or patients’ Electronic Health Records (EHR). 10 Clinical data may be obtained from various sources, including EHRs, administrative data, claims data, patient registries, questionnaires, and clinical studies data. 71 Such data could provide valuable information in the surveillance of population at risk,29,45,72 determining disease burden 28 and forecasting the occurrence of future disease outbreak. 41
Meanwhile, social data, nonconventional data is simply defined as information made publicly available on social networking sites, such as a user’s location, network of friends, and language. 73 Extending this approach, the use and interpretation of specific facts are inextricably linked to social relationships such as behavioral data. This would include social networking sites like Facebook, Twitter, and Google (for example, browsing history and Google Trends), as well as data on social engagements. These data are capable to provide additional information on disease surveillance38,74 and complement conventional data to provide more accurate information. 40 For example, Deiner et al. (2016) examined the incidence of diagnosed conjunctivitis using Twitter and Google search data. 74 They found a strong correlation between seasonal conjunctivitis and Google search data and a moderate correlation with Twitter data. However, caution should be exercised when utilizing social data since many of these study findings may not be reproducible. 10
On the other hand, environmental data is associated with information obtained about the environment in which individuals and communities live, such as water quality, 36 climate,35,57 landscape suitability, 44 global positioning satellite, 48 geographical information system.35,56 Several studies have shown evidence of the importance of incorporating environmental data into outbreak surveillance and management help to improve the accuracy of predicting the risk of a disease outbreak,44,48 targeted surveillance 35 and forecast of a disease outbreak 56 which led to early preparedness and interventions. The types of data that are applicable in the PPH approach are limitless and depend widely on the availability and accessibility of such data sources that have the possibility to further enhance the effectiveness of outbreak management.
Tackling the gaps and challenges in PPH approach in outbreak management
The most prevalent limitation was data quality and availability. This finding underscores the significant challenge posed by poor quality or inadequate data, fragmented data, and non-standardized data collection methodologies. These factors lead to ineffective implementation of PPH strategies in outbreak management. For instance, the limitation on the information on COVID-19 surveillance availability in the public domain to identify contact and infected data leads to delays in outbreak investigation and prevention strategies. 57 Data quality has been a major concern in utilizing social media data whereby the noisy nature of data could reduce the accuracy of analytic outcomes. 18
Secondly, due to the complexities associated with integrating data from diverse sources, such as national healthcare systems, public health surveillance systems, and meteorological monitoring systems. Variations in data formats, compatibility between different data systems, and restriction in seamless data exchange mechanisms contribute to the challenge. For instance, developing novel BD streams commonly uses public domain data sources such as disease surveillance data, Internet queries, and environmental data. 52 However, some data such as genomic data and entomological data are underutilized most likely due to limited access to designated personnel only. 41 Many studies attempted to integrate various data sources into single-stream BD. However, due to the different data systems and the absence of seamless data exchange mechanisms, the variation of data type in the BD stream is limited.29,34,56
Meanwhile, insufficient infrastructure and outdated technological resources contribute to the limited implementation of PPH in outbreak management. Moreover, limited access to necessary tools and expertise further constrains the applicability of PPH in outbreak management, such as COVID-19 forecast whereby manual collection of wastewater samples may lead to short-term heterogeneity of viral RNA abundance. 36 Additionally, resource constraints, such as inadequate budget allocation and accessibility to the latest technologies can compromise the ability to predict an outbreak incidence and disease burden. For instance, the estimation of conjunctivitis incidence using Internet query data is more saturated in urban regions with adequate Internet access than the rural communities. 39
Privacy concern is one of the main barriers, especially upon involving personal data such as personal health records, mobile surveillance data, social media data, and genomic data. Issues about the collection, use, and sharing of sensitive health and social data, such as insurance claim data 28 and consumer purchase data 43 can encounter public resistance and legal challenges. Therefore, policymakers should prioritize data security and discretion regardless of the data type, sources, storage, and accessibility. 75
On the other hand, public health agencies face challenges in addressing health disparities and ensuring equitable access to healthcare services, and implementing interventions across diverse population groups. 45 These challenges are often seen in low- and middle-income countries where the margin of health disparities and inequity is very high compared to high-income countries. 45 Furthermore, within a country setting, inequity is seen between urban and rural regions, especially in infrastructure and health service availability and limited information on public health surveillance. 76
The cost incurred to possess the latest technologies limits the advancement in public health systems, especially in low- and middle-income countries.45,47 Similarly, lesser resources, including trained personnel, competing priorities, and allocation discrepancies impede the successful implementation of PPH approaches in outbreak management. 36
Lastly, public acceptance and trust emphasize the importance of garnering public acceptance and establishing trust in PPH approaches. As mentioned by Nei et al. (2020), there is a possible risk of overestimation of the effect of the study, in this case, the effectiveness of public intervention using mobile positioning data, because the acceptance of the public towards the evaluation process was not considered. 46 On the other hand, Ma et al. (2022) were concerned that the search index data is highly sensitive to media coverage leading to instability that could propagate into the prediction of an outbreak. 40
Addressing barriers and challenges in PPH necessitates a multifaceted and comprehensive approach encompassing scientific, technological, ethical, and social strategies. Data collection and integration obstacles, such as the lack of high-quality and interoperable data sources, can be mitigated by investing in data infrastructure, fostering data sharing and collaboration, and implementing advanced integration techniques.77,78 Meanwhile, in terms of privacy and ethical concerns associated with sensitive health data, a robust data governance framework is essential, emphasizing data anonymization, informed consent, and encryption for secure storage.79,80 Similarly, navigating regulatory and policy uncertainties involves advocating for adaptive regulations that keep pace with innovation while safeguarding individual rights. 81 Effective communication and education strategies should be tailored for various stakeholders, and success stories should be highlighted to underscore the value of precision interventions. 82
PPH approaches can exacerbate health disparities if not implemented equitably. Therefore, overcoming health disparities that might arise from precision approaches requires inclusivity, diversity in data collection, and interventions that consider socioeconomic factors. 83 Meanwhile, health literacy challenges such as, relying on individuals’ understanding of their health data, which may require a high level of health literacy, can be addressed by developing user-friendly tools and educational resources that simplify complex health data interpretation. 84 On top of that, cultural beliefs and societal norms can influence individuals’ willingness to engage with precision health initiatives. It is necessary to tailor strategies that align with community preferences and values, engaging community leaders and influencers to promote acceptance and adoption.85,86
Meanwhile, to manage the rapid pace of technological advancements, collaboration between researchers, technologists, and public health professionals is crucial to ensure responsible integration of emerging technologies like AI and genomics.87,88 Moreover, rigorous evaluation methods and collaborations between researchers, clinicians, and data scientists are vital to validate the impact of tailored interventions. On top of that, it is challenging to implement PPH as it can be resource-intensive, especially in resource-limited settings. Hence, managing resource allocation demands prioritization of cost-effective interventions, potentially through public-private partnerships.89,90
In essence, conquering the barriers and challenges of PPH necessitates a harmonious blend of these strategies, facilitated by collaboration between diverse stakeholders, including researchers, public health experts, policymakers, and communities. This collective effort is pivotal in unlocking the full potential of precision interventions to enhance outbreak management and population health.
Strengths and limitations
Our scoping review possesses several limitations, Firstly, we limited our interest to the studies that contained PPH application outcomes in outbreak management. Therefore, studies involving predictive model development and application of artificial intelligence or deep learning framework without any outcome measurements were not included in our scoping review. Secondly, we only included published English language medium articles. There are possibilities of articles with successful and effective implementation of the PPH approach that were not published or in different languages. Furthermore, we focused on publications published in the past six years only to retrieve studies related to current practice, hence missing out on the publications before the timeline. We may also have limited access to more publications due to the search term that we developed. Lastly, the review does not develop and promote practical recommendations as the methodology did not involve methodological evaluation and evidence rating levels. However, this review could help to identify the advantages and gaps in the PPH approaches in outbreak management and the need for future systematic reviews.
Conclusion
In conclusion, PPH applications in outbreak management utilize a wide range of data sources and analytical techniques to enhance disease surveillance, investigation, modeling, and prediction. Through the integration of various types of data, such as genomic data, personal data, Internet query data, environmental data, and global positioning data, authorities can gain valuable insights into disease occurrence, spread, and impact. Modeling and prediction techniques, including metapopulation models, risk mapping algorithms, and machine learning frameworks, enable proactive interventions, targeted resource allocation, and evidence-based decision-making. Additionally, sentiment analysis of social media data provides valuable insights into public perceptions and sentiments, enabling authorities to address concerns, combat misinformation, and foster public engagement. By leveraging these tools and approaches, PPH contributes to more effective and efficient outbreak management, ultimately reducing the burden of infectious diseases on populations. Meanwhile, the limitations and challenges in the application of PPH approaches in outbreak management emphasize the need to strengthen the surveillance systems, promote data sharing and collaboration among relevant stakeholders, and standardize data collection methods while upholding privacy and ethical principles. By enhancing data quality and availability, decision- and policymakers can make informed decisions, implement a targeted intervention, and effectively control and mitigate outbreaks using limited and effective resource allocation strategies.
