International Journal of Emerging Research in Science, Engineering, and Management
Vol. 2, Issue 3, pp. 239-245, March 2026.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Hybrid Ensemble Machine Learning Algorithm for Heterogeneous Health Care Data Analytics
M Manivannan
K Thrishitha
K Sanjay
Shaik Mahammad Rafi
Shaik Dileep
Aman Kumar Singh
Department of CSE, Siddartha Institute of Science and Technology, Puttur, India.
Abstract: Modern healthcare environments generate large volumes of heterogeneous data from multiple sources such as electronic health records, laboratory reports, medical imaging systems, and wearable sensors. Traditional single-model machine learning techniques often struggle to effectively analyse such diverse datasets due to differences in structure, scale, and data quality. To address these challenges, this paper proposes a hybrid ensemble machine learning framework for heterogeneous healthcare data analytics and disease prediction. The proposed system integrates multiple learning algorithms using ensemble strategies such as bagging, boosting, and stacking to improve prediction accuracy, robustness, and generalization performance. The framework supports multimodal data integration and incorporates feature selection and preprocessing techniques to handle noise, missing values, and class imbalance commonly present in healthcare datasets. Experimental analysis demonstrates improved reliability and predictive capability compared with conventional standalone models. The proposed approach provides an efficient decision-support framework for clinical risk prediction, disease diagnosis, and intelligent healthcare analytics in real-world medical environments.
Keywords: Hybrid Ensemble Learning, Heterogeneous Healthcare Data, Disease Prediction, Multimodal Data Analytics, Clinical Decision Support Systems.
References:
- N M. R. Hasan and J. Li, “Privacy-Preserving collaborative diabetes prediction in heterogeneous health care systems: algorithm development and validation of a secure federated ensemble framework,” JMIR Diabetes, vol. 11, p. e79166, Dec. 2025, doi: 10.2196/79166.
- B. Wang, W. Li, A. Bradlow, E. Bazuaye, and A. T. Y. Chan, “Improving triaging from primary care into secondary care using heterogeneous data-driven hybrid machine learning,” Decision Support Systems, vol. 166, p. 113899, Nov. 2022, doi: 10.1016/j.dss.2022.113899.
- V. Kumar, C. Prabha, D. Gupta, S. Juneja, S. Kumari, and A. Nauman, “Multi-model machine learning framework for lung cancer risk prediction: A comparative analysis of nine classifiers with hybrid and ensemble approaches using behavioral and hematological parameters,” SLAS TECHNOLOGY, vol. 33, p. 100314, Jun. 2025, doi: 10.1016/j.slast.2025.100314.
- A. Alotaibi, “Ensemble Deep Learning Approaches in Health Care: a review,” Computers, Materials & Continua/Computers, Materials & Continua (Print), vol. 82, no. 3, pp. 3741–3771, Jan. 2025, doi: 10.32604/cmc.2025.061998.
- M. I. Arias, L. Cadavid, and J. D. Velásquez, “Advancing healthcare analytics: a thematic review of machine learning, health informatics, and real-world data applications,” Journal of Biomedical Informatics, vol. 171, p. 104934, Oct. 2025, doi: 10.1016/j.jbi.2025.104934.
- Z. Gharibi, “An ensemble learning approach for predicting hospital stay in transplant patients,” Healthcare Analytics, vol. 9, p. 100444, Dec. 2025, doi: 10.1016/j.health.2025.100444.
- M. A. Shuheil et al., “Machine learning and artificial intelligence in perovskite quantum dot electroanalysis: From data-driven synthesis to intelligent sensing interfaces,” Talanta Open, vol. 13, p. 100623, Feb. 2026, doi: 10.1016/j.talo.2026.100623.
- K. M. M. Uddin, A. Chowdhury, Md. M. R. Druvo, M. T. Jaima, Md. T. A. Bhuiyan, and Md. M. Islam, “Improving diagnostic accuracy for PCOS: A hybrid machine learning architecture with feature selection, data balancing, and explainable AI techniques,” Results in Control and Optimization, vol. 22, p. 100647, Dec. 2025, doi: 10.1016/j.rico.2025.100647.
- D. Amilo, K. Sadri, E. Hincal, and M. Hafez, “A hybrid machine learning and Fractional-Order dynamical framework for Multi-Scale prediction of breast cancer progression,” Computer Modeling in Engineering & Sciences, vol. 145, no. 2, pp. 2189–2222, Jan. 2025, doi: 10.32604/cmes.2025.070298.
- E. Sarathkumar and R. S. Jayasree, “Machine learning-integrated lateral flow assays: Unlocking the future of intelligent point-of-care sensing,” TrAC Trends in Analytical Chemistry, vol. 193, p. 118478, Sep. 2025, doi: 10.1016/j.trac.2025.118478.
- A. Ahmadi, M. Fakhimi, and C. Magnusson, “Hybrid modelling using simulation and machine learning in healthcare,” Computers & Operations Research, vol. 185, p. 107278, Sep. 2025, doi: 10.1016/j.cor.2025.107278.
- U. Islam, G. Mehmood, A. A. Al-Atawi, F. Khan, H. S. Alwageed, and L. Cascone, “NeuroHealth guardian: A novel hybrid approach for precision brain stroke prediction and healthcare analytics,” Journal of Neuroscience Methods, vol. 409, p. 110210, Jul. 2024, doi: 10.1016/j.jneumeth.2024.110210.
