Introduction
Predicting hospital admissions from the emergency department (ED) is critical for optimizing hospital resource allocation, improving patient outcomes, and reducing the burden on healthcare systems.1–4 With over 145 million ED visits annually in the United States, a substantial proportion of patients require hospitalization, making timely and accurate predictions a key priority for hospital management and clinical care teams.5,6 Traditionally, hospital admission decisions have been guided by clinicians’ assessments based on a combination of patient demographics, vital signs, clinical history, and presenting complaints.3,4,7 However, these methods are subjective and can vary widely between clinicians and institutions. Recent advances in machine learning offer the potential to improve admission predictions by integrating complex patterns from both structured clinical data and unstructured text data.8–10
Structured data, such as patient demographics, vital signs, and medical history, have long been used to support clinical decision-making.3,4 However, these data provide only a partial picture of a patient's condition. Unstructured data, including free-text chief complaints and clinical narratives, offer additional context that may not be captured by traditional structured data alone.11–14 The challenge lies in effectively combining these disparate data types to produce reliable and actionable predictions.11,12
In recent years, machine learning models such as gradient boosting classifiers (GBC) have demonstrated strong performance in predictive analytics using structured data.15–17 These models iteratively build decision trees, refining predictions with each iteration to capture complex relationships between variables. However, unstructured data, which often hold valuable clinical information, have been more challenging to integrate into machine learning workflows. Advances in natural language processing (NLP) techniques, particularly transformer-based models, such as the Generative Pre-trained Transformer 2 (GPT-2),18,19 offer a promising approach for processing and extracting insights from unstructured text.
The objective of this study was to develop and evaluate models for predicting hospital admissions from the ED by leveraging both structured and unstructured data. Specifically, we aimed to compare the performance of models using structured data alone, unstructured data alone, and a combined approach that integrates insights from both data types. We hypothesized that a model incorporating both structured clinical information and unstructured free-text descriptions would outperform models that rely on only one data type. By utilizing a combined machine learning approach, this study seeks to enhance hospital admission predictions, ultimately contributing to better decision-making in ED settings.
Methods
Data source and study population
This study is a secondary analysis of data collected as part of the 2021 National Hospital Ambulatory Medical Care Survey—Emergency Department (NHAMCS-ED).20 NHAMCS-ED is a nationwide survey conducted by the U.S. Centers for Disease Control and Prevention (CDC) to provide insights into healthcare utilization and delivery patterns across emergency departments in the United States. The dataset includes detailed records of emergency department visits from a representative sample of hospitals during the calendar year 2021. Our analysis focused exclusively on adult patients aged 18 years and older. After excluding pediatric cases, the final study sample consisted of 13,115 adult patients. The NHAMCS-ED dataset contained both structured and unstructured data, which were used to predict hospital admissions.
Data collection and variables
Structured data included variables related to patient demographics, visit characteristics, and clinical information. Demographic information encompassed age, sex, and race/ethnicity, while visit characteristics included factors such as arrival time, mode of arrival (e.g., ambulance or private transport), whether the visit was a follow-up, and whether the patient had been seen within the previous 72 hours. Clinical data included vital signs such as temperature, heart rate, systolic and diastolic blood pressure, respiratory rate, and pulse oximetry, along with reported pain levels. We also incorporated the Emergency Severity Index (ESI), a five-level triage tool that prioritizes patient care based on the severity of the patient's condition. In addition, the structured data captured patients' medical history, including chronic conditions such as Alzheimer's disease, chronic obstructive pulmonary disease, diabetes, and coronary artery disease. Other factors included the patient's type of residence (private home, nursing home, homeless, or other) and insurance type. Information about injuries, trauma, poisoning, and adverse effects of medical treatment was also considered. Missing values in the structured data were handled using median imputation to reduce bias.
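As a minimal illustration, the median-imputation step described above can be sketched with scikit-learn's SimpleImputer; the vital-sign values below are hypothetical stand-ins for the NHAMCS-ED structured variables:

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Hypothetical structured data (e.g., temperature, heart rate, systolic BP)
# with missing entries encoded as NaN.
X = np.array([
    [98.6,  80.0,  np.nan],
    [np.nan, 110.0, 120.0],
    [99.1,  np.nan, 135.0],
])

# Each missing value is replaced by the median of its column,
# mirroring the median imputation applied to the structured data.
imputer = SimpleImputer(strategy="median")
X_imputed = imputer.fit_transform(X)
```

Median imputation is robust to the skewed distributions common in vital-sign data, which is one reason it is often preferred over mean imputation in clinical datasets.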
Unstructured data used in this study were derived from free-text fields in the 2021 NHAMCS-ED dataset, which capture chief complaints and reasons for injury. These fields were documented by healthcare providers or staff during patient encounters in emergency departments and reflect provider-documented information based on patient-reported concerns. The NHAMCS documentation does not specify the precise methods used for recording this data (e.g., verbal reporting transcribed by staff or provider interpretation). As such, these entries should be regarded as semi-structured data reflecting provider-modified patient-reported information. This distinction is important for understanding the context and variability of the unstructured data used in our predictive modeling.
Statistical analysis
Descriptive statistics were used to summarize the structured variables of the study population, stratified by hospital admission status. Categorical variables were presented as frequencies and percentages, and comparisons between admitted and non-admitted patients were conducted using chi-square tests. To further explore the association between structured variables and hospital admission, multivariable logistic regression models were developed to estimate odds ratios (ORs) for each variable. To address missing values in the structured data, median imputation was applied prior to conducting multivariable analyses to minimize potential biases. Statistical significance was assessed for all analyses at a threshold of p < 0.05.
Predictive models
Three predictive modeling approaches were implemented: one using structured data alone, one using unstructured data alone, and a combined model integrating both. The first approach used a Gradient Boosting Classifier (GBC) for structured data. GBC is an ensemble learning technique that builds decision trees sequentially, with each subsequent tree correcting the errors made by the previous trees. This method aggregates the output of weak learners (i.e., shallow decision trees) to form a strong predictive model, effectively capturing complex interactions between variables.
For unstructured data, we employed a Generative Pre-trained Transformer 2 (GPT-2) model, a transformer-based deep learning model for natural language processing tasks. GPT-2 was fine-tuned to predict hospital admissions from the free-text descriptions of patient complaints and injuries. The model was pre-trained on large corpora of general text and then adapted to this specific task by retraining its final layers on the NHAMCS-ED dataset. This allowed GPT-2 to recognize patterns in the free-text data predictive of hospital admission.
For the combined model, we integrated structured clinical data and unstructured text data into a unified feature matrix. Structured data, comprising demographic, clinical, and visit-related variables, were preprocessed by applying median imputation to handle missing values. Unstructured text data, including chief complaints and reasons for injury, were tokenized using the GPT-2 tokenizer, with padding and truncation applied to a maximum sequence length of 128 tokens. Contextualized embeddings for the text data were generated using a pre-trained GPT-2 model, with the hidden state vector of the first token from the model's last layer extracted as the embedding. The structured data and GPT-2-generated embeddings were concatenated into a single feature matrix, enabling simultaneous utilization of insights from both data types. A Gradient Boosting Classifier was trained on this combined feature set to predict hospital admissions.
To optimize the performance of the GBC, we employed a grid search over its hyperparameters: the learning rate (0.01, 0.05, 0.1, and 0.2), the number of estimators (50, 100, 150, and 200), and the maximum tree depth (3, 5, 7, and 10). The grid search was conducted within a 3-fold cross-validation framework inside each fold of the outer 5-fold cross-validation. The evaluation metric for both inner and outer folds was the area under the receiver operating characteristic curve (AUC). This setup ensured a robust selection of hyperparameters for each fold while minimizing overfitting and bias. A table summarizing the tested hyperparameters and their ranges is provided in Table S1.
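The nested cross-validation setup can be sketched with scikit-learn; the dataset is synthetic, and the grid below is a reduced subset of the full grid described in the text, kept small so the sketch runs quickly:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score

# Synthetic stand-in for the NHAMCS-ED feature matrix.
X, y = make_classification(n_samples=300, n_features=10, random_state=42)

# Subset of the grid above (full grid: learning_rate {0.01, 0.05, 0.1, 0.2},
# n_estimators {50, 100, 150, 200}, max_depth {3, 5, 7, 10}).
param_grid = {
    "learning_rate": [0.05, 0.1],
    "n_estimators": [50, 100],
    "max_depth": [3, 5],
}

# Inner 3-fold grid search scored by AUC, nested inside an outer
# 5-fold cross-validation, matching the setup described in the text.
inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=42)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
search = GridSearchCV(
    GradientBoostingClassifier(random_state=42),
    param_grid, cv=inner_cv, scoring="roc_auc",
)
outer_scores = cross_val_score(search, X, y, cv=outer_cv, scoring="roc_auc")
mean_auc = outer_scores.mean()
```

Because hyperparameters are re-selected inside every outer fold, the outer AUC estimates are not biased by the tuning process, which is the point of the nested design.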
Model training and evaluation
The models were trained and evaluated using 5-fold cross-validation, which splits the dataset into five equal parts. In each fold, one part was used as the testing set while the remaining four parts were used for training. This process was repeated five times, and the performance results were averaged across all folds. The cross-validation method was chosen to ensure robust evaluation and to minimize overfitting.
We assessed model performance using multiple metrics, including accuracy, precision, sensitivity (recall), and specificity. Accuracy measured the proportion of correctly predicted hospital admissions, while precision focused on the proportion of true positive predictions among all positive predictions. Sensitivity measured the model's ability to correctly identify patients who were admitted, and specificity assessed the model's ability to correctly identify patients who were not admitted. Additionally, we determined the optimal decision threshold for each model using the ROC curve,21,22 which allowed us to balance sensitivity and specificity to maximize predictive performance.
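Threshold selection from the ROC curve can be illustrated with Youden's J statistic, one common criterion for balancing sensitivity and specificity (the paper does not specify its exact rule); the data below are synthetic:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_curve
from sklearn.model_selection import train_test_split

# Synthetic imbalanced cohort (~20% "admitted"), standing in for ED data.
X, y = make_classification(n_samples=400, weights=[0.8, 0.2], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

clf = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)
probs = clf.predict_proba(X_te)[:, 1]

# Youden's J = sensitivity + specificity - 1 = tpr - fpr; the
# threshold maximizing J balances the two error rates.
fpr, tpr, thresholds = roc_curve(y_te, probs)
best = np.argmax(tpr - fpr)
threshold = thresholds[best]

y_pred = (probs >= threshold).astype(int)
sensitivity = np.mean(y_pred[y_te == 1])
specificity = 1 - np.mean(y_pred[y_te == 0])
```

Shifting the threshold below the default 0.5 is typical for imbalanced outcomes like admission, trading some specificity for higher sensitivity.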
Ethical considerations
This study utilized publicly available data from the National Hospital Ambulatory Medical Care Survey—Emergency Department, conducted by the U.S. Centers for Disease Control and Prevention. The dataset is fully anonymized and does not contain any personally identifiable information. According to the CDC's guidelines for data use, specific permissions are not required for secondary analyses of these data. Additionally, this study was reviewed and approved as IRB exempt by the University of Pittsburgh Institutional Review Board under protocol STUDY24120115. As such, this study complies with ethical standards for research involving publicly available, de-identified datasets. All data processing and analyses were conducted in accordance with ethical guidelines for secondary data analysis, ensuring compliance with privacy and confidentiality standards.
Results
A total of 13,115 adult patients were included in the study, of whom 2264 (17.3%) were admitted to the hospital and 10,851 (82.7%) were discharged from the emergency department (ED). The demographic and clinical characteristics of these patients are presented in Table 1. The gender distribution showed that 51.5% of admitted patients were female, compared to 54.5% among non-admitted patients.
Demographic and clinical characteristics of emergency department patients categorized by hospital admission status.
Note: The variables “Respiratory Rate,” “Temperature,” “Pulse Oximetry,” “Heart Rate,” “Payment Type,” “Seen Within Last 72 Hours,” and “Episode of Care” have missing data proportions ranging between 5% and 10%. The variables “Arrival Time,” “Patient Residence,” “Arrival by Ambulance,” “Systolic Blood Pressure,” “Diastolic Blood Pressure,” and “Visit Related to Injury/Trauma, Overdose/Poisoning, or Adverse Effect of Medical/Surgical Treatment” have missing data proportions of less than 5%.
Figure 1(a) and Figure 1(b) present forest plots of odds ratios (ORs) with 95% confidence intervals for various factors associated with hospital admission. The forest plots indicate that age, medical history, and arrival by ambulance were strong predictors of hospital admission; for example, patients aged 40–65 had higher odds of admission (OR = 1.23, 95% CI: 1.07–1.42).

(a) Forest plot of odds ratios with 95% CI (log scale). (b) Forest plot of odds ratios with 95% CI (log scale).
The performance of the three predictive models (structured data only, unstructured data only, and combined structured and unstructured data) is shown in Figure 2. The combined model outperformed both single-source models in predicting hospital admission, achieving the highest accuracy (75.8%), precision (39.5%), and sensitivity (75.8%). In comparison, the structured data model achieved an accuracy of 73.8%, a precision of 36.6%, and a sensitivity of 70.8%, while the unstructured data model had an accuracy of 64.6%, a precision of 27.7%, and a sensitivity of 65.1%. ROC analysis showed that the combined model had the highest area under the curve (AUC), indicating better overall discrimination between admitted and non-admitted patients. Specificity was 74.4% for the structured data model and 64.6% for the unstructured data model, compared with 75.8% for the combined model.

Mean ROC curves for the three classification models predicting hospital admission from the emergency department.
Discussion
In this study, we developed and evaluated machine learning models to predict hospital admissions from the ED using both structured and unstructured data. Our findings demonstrate that integrating structured clinical data with unstructured text data significantly improves prediction accuracy compared to models relying on either data type alone. The combined model, which utilized a gradient boosting classifier for structured data and a fine-tuned GPT-2 model for unstructured data, showed superior performance with an accuracy of 75.8%, precision of 39.5%, and sensitivity of 75.8%. These results highlight the potential of machine learning models in enhancing decision-making and optimizing resource allocation in ED settings.
Several factors were found to be significant predictors of hospital admission. Age, medical history, and arrival by ambulance were strongly associated with higher odds of admission, consistent with previous studies. Patients aged 65 years and older, for example, were much more likely to be admitted, as were those with chronic conditions such as chronic kidney disease, coronary artery disease, and diabetes mellitus type II. Interestingly, our model also identified race/ethnicity and insurance type as important predictors. Black patients and those covered by Medicaid/CHIP had lower odds of admission compared to White patients and those with private insurance, raising important questions about potential disparities in healthcare access and decision-making.

Vital signs, such as heart rate, blood pressure, and oxygen saturation, were also significant in predicting hospital admissions. Patients with abnormal values, particularly those with heart rates exceeding 90 beats per minute or systolic blood pressure below 80 mm Hg, were more likely to be admitted. These clinical markers are routinely used in triage and assessment, reaffirming their importance in early prediction models. The incorporation of these variables into the GBC model likely contributed to its strong performance in distinguishing between admitted and non-admitted patients.

The addition of unstructured data, such as chief complaints and reasons for injury, added a valuable layer of information to the prediction models. While structured data provide objective, quantitative measures, unstructured text captures the nuances of patient symptoms and physician assessments that may not be fully reflected in vital signs or medical history. The GPT-2 model, which processed this unstructured data, was able to extract meaningful patterns from free-text descriptions, contributing to the overall performance of the combined model.
Our results align with previous research showing the utility of structured data in predicting hospital admissions.10,23,24 However, the integration of unstructured data, particularly using advanced NLP techniques like GPT-2, represents a novel contribution to the field. Previous studies have demonstrated the potential of NLP in extracting clinical insights from text data, but few have integrated structured and unstructured data to predict hospital admissions. Our study extends this body of work by showing that models incorporating both types of data can outperform traditional structured-data models.
A related study by Lequertier et al.25 focused on predicting length of stay in acute and emergency care settings using a deep neural network. Their work highlights the effectiveness of embedding-based representations for structured administrative data in predictive modeling tasks. While their approach used deep learning to achieve high performance on a multiclass classification problem, our study builds on this idea by integrating GPT-2 embeddings from unstructured text data with structured clinical data to predict hospital admissions as a binary classification problem. This distinction underscores complementary approaches to leveraging advanced feature representations for different healthcare applications. Furthermore, both studies emphasize the potential for machine learning to improve resource management in emergency care settings, while also pointing to the need for robust generalizability across different datasets and healthcare systems.
The findings from this study have several implications for clinical practice. First, integrating structured and unstructured data into predictive models can enhance the accuracy of hospital admission predictions, helping ED clinicians make more informed decisions. By improving the early identification of patients requiring hospitalization, hospitals can better allocate resources, reduce wait times, and optimize bed management, ultimately improving patient outcomes. Second, our findings highlight the need for healthcare systems to invest in infrastructure that supports the use of advanced machine learning and NLP techniques. Many EDs already collect large volumes of structured and unstructured data, but few fully leverage this information for predictive analytics. Implementing systems that can process both data types in real time could significantly enhance decision-making capabilities in the ED. Finally, the identification of potential disparities in admission likelihood, particularly among different racial/ethnic groups and insurance types, raises important ethical and policy considerations. While machine learning models can help standardize decision-making, care must be taken to ensure that these models do not perpetuate existing biases in the healthcare system. Future research should focus on addressing these disparities and developing models that promote equitable healthcare access.
This study has several limitations that should be considered when interpreting the results. First, the NHAMCS-ED dataset, while comprehensive and nationally representative, is cross-sectional in design. This limits the ability to evaluate temporal trends in hospital admission patterns or outcomes following ED visits. Future studies employing longitudinal data could provide deeper insights into the dynamics of admission processes over time. Second, the unstructured data used in this study, such as chief complaints and reasons for injury, were provider-documented rather than directly reported by patients. This documentation often involves varying levels of interpretation, refinement, and transcription, which may differ across institutions and providers. Such variability introduces a potential source of inconsistency, impacting the reliability and generalizability of the results. Additionally, the NHAMCS-ED dataset does not provide metadata on how these free-text fields were recorded, limiting the ability to fully characterize and standardize the nature of these entries. Third, while the GPT-2 model was fine-tuned on ED-specific data, its generalizability to other clinical environments or populations has not been established. Differences in healthcare systems, documentation practices, or patient demographics in other settings may necessitate additional model validation and refinement to ensure applicability. Fourth, the study lacked access to detailed clinical notes and physician assessments, which are not included in the NHAMCS-ED dataset. While unstructured text data were incorporated into the analysis, more granular documentation of provider decision-making processes could further enhance model performance. Future research should explore integrating comprehensive clinical documentation alongside structured data to create more robust predictive models. 
Fifth, although the combined model demonstrated improved predictive performance, the sensitivity (75.8%) indicates room for further enhancement. Incorporating additional variables, such as social determinants of health or longitudinal follow-up data, could potentially improve model sensitivity and overall accuracy in predicting hospital admissions. Finally, the cross-sectional nature of the NHAMCS-ED dataset, combined with the semi-structured nature of the unstructured data, highlights the need for caution in generalizing these findings. Future efforts should focus on addressing these limitations to refine predictive models and better support decision-making in EDs.
Conclusion
In conclusion, this study demonstrates that combining structured and unstructured data using machine learning models can significantly improve hospital admission predictions from the ED. The integration of GBC for structured data and GPT-2 for unstructured text data provides a robust approach to leveraging the full range of clinical information available during ED visits. These findings highlight the potential of machine learning to transform decision-making in the ED, ultimately contributing to better patient outcomes and more efficient hospital operations.
Supplemental Material
sj-docx-1-dhj-10.1177_20552076251331319 - Supplemental material for "Machine learning-driven prediction of hospital admissions using gradient boosting and GPT-2" by Xingyu Zhang, Hairong Wang, Guan Yu and Wenbin Zhang in DIGITAL HEALTH.
