Multimodal Machine Learning Approaches in Predictive Healthcare Analytics: A Comprehensive Survey

Vavekanand, Raja and Kumar, Teerath and Kumar, Sanjai and Kumar, Ganesh and Laghari, Asif Ali (2026) Multimodal Machine Learning Approaches in Predictive Healthcare Analytics: A Comprehensive Survey. Archives of Computational Methods in Engineering. ISSN 11343060

Full text not available from this repository.
Official URL: https://www.scopus.com/pages/publications/10503478...

Abstract

This survey explores the application of multimodal machine learning techniques in predictive healthcare analytics. By integrating various data modalities, such as medical imaging, clinical text, time-series signals, and structured tabular data, these approaches aim to emulate clinical reasoning and enhance diagnostic and prognostic accuracy. Across studies, multimodal ML consistently outperformed unimodal baselines, with intermediate fusion employed in 60 of cases and achieving average AUC improvements of 5�12 over single-modality models. Oncology and neurology emerged as the leading domains, where combining imaging with genomic and cognitive data significantly improved cancer survival and Alzheimer�s detection. Despite progress, key challenges persist, including modality misalignment (23), missing data (18), and limited external validation (12). Recent trends highlight transformer-based cross-modal attention, self-supervised learning for data-scarce settings, and hybrid fusion architectures in critical care. While multimodal ML offers clear clinical advantages, regulatory constraints and interoperability gaps continue to hinder deployment. This survey contributes to the existing literature by providing a comprehensive synthesis of multimodal ML applications across healthcare domains. It documents comparative fusion strategies, modelling approaches, and empirical performance outcomes. The paper�s primary contribution lies in identifying intermediate fusion as the most effective integration strategy and revealing systematic gaps in external validation and model transparency that must be addressed for clinically trustworthy multimodal systems. © The Author(s) 2026.

Item Type: Article
Additional Information: Cited by: 0; All Open Access; Hybrid Gold Open Access
Uncontrolled Keywords: Data integration; Diagnosis; Health care; Interoperability; Learning algorithms; Learning systems; Machine learning; Medical imaging; Modal analysis; Self-supervised learning; Clinical analytic; Fusion strategies; Machine learning approaches; Machine learning techniques; Machine-learning; Multi-modal; Multimodal machine learning; Predictive healthcare; Tabular data; Time series signals; Predictive analytics
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 15 Apr 2026 02:10
Last Modified: 15 Apr 2026 02:10
URI: https://khub.utp.edu.my/scholars/id/eprint/20559

Actions (login required)

View Item
View Item