Abstract
In response to the challenges of multimodal data analysis in disaster events, this study proposes a two-stage technical framework of “feature alignment and evidence fusion”. In the feature alignment stage, a proxy-based cross-modal contrastive learning framework (PMCL) is constructed, which achieves cross-modal feature collaboration through a bimodal Transformer encoder, proxy-sample collaborative optimization, and geometric constraints. In the fusion and decision-making stage, a Cross-modal Enhanced Fusion Network (CEFN) is built that addresses semantic uncertainty through Dirichlet distribution parameterization, projection-distance evaluation, and an adaptive fusion mechanism. Experiments show that PMCL achieves 85% accuracy on a crisis multimodal information classification dataset, 30% higher than the single-modality baseline. On the same task, CEFN’s accuracy exceeds the second-best model by 1.16%, and its conflict loss function keeps performance degradation within 3.34% even when 100% of the samples are inconsistent. In addition, PMCL’s multimodal pre-training initialization strategy improves model accuracy by 7.1%. This study provides an efficient and interpretable technical solution for disaster emergency response, with practical significance for multimodal data-driven intelligent disaster-reduction decision-making.
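To make the evidence-fusion idea behind CEFN concrete, the sketch below shows the standard subjective-logic formulation of Dirichlet-parameterized evidence and a reduced Dempster-style combination of two modalities; it is a minimal illustration of that general technique, not the authors’ CEFN code, and the function names (`opinion`, `fuse`) and example evidence values are hypothetical.

```python
# Minimal sketch: Dirichlet-parameterized evidence and two-modality fusion
# (reduced Dempster's combination rule from subjective logic).
import numpy as np

def opinion(evidence):
    """Map non-negative class evidence to belief masses and an uncertainty mass."""
    k = evidence.shape[-1]
    alpha = evidence + 1.0              # Dirichlet parameters: alpha_k = e_k + 1
    s = alpha.sum(-1, keepdims=True)    # Dirichlet strength S
    belief = evidence / s               # b_k = e_k / S
    u = k / s                           # u = K / S (uncertainty mass)
    return belief, u

def fuse(evidence_a, evidence_b):
    """Combine two modalities' evidence; conflicting mass inflates uncertainty."""
    b1, u1 = opinion(evidence_a)
    b2, u2 = opinion(evidence_b)
    k = evidence_a.shape[-1]
    # Conflict: mass the two views assign to different classes.
    conflict = (b1.sum(-1, keepdims=True) * b2.sum(-1, keepdims=True)
                - (b1 * b2).sum(-1, keepdims=True))
    scale = 1.0 / (1.0 - conflict)
    b = scale * (b1 * b2 + b1 * u2 + b2 * u1)   # fused belief masses
    u = scale * (u1 * u2)                       # fused uncertainty
    s = k / u                                   # recover Dirichlet strength
    return b * s                                # fused evidence e_k = b_k * S

# Example: an image view confident in class 0 and a text view that mildly
# disagrees; the fused evidence still favors class 0 but with more uncertainty.
img_evidence = np.array([[9.0, 1.0, 0.5]])
txt_evidence = np.array([[2.0, 3.0, 0.5]])
print(fuse(img_evidence, txt_evidence))
```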
