Vitalpath: A Cardiovascular Risk Assessment Framework Using Bayesian-Optimized Ensembles and SHAP

Authors

Yash Choudhary

Department of Computer Science & Engineering, R.D. Engineering College, Ghaziabad (India)

Raja Singh

Department of Computer Science & Engineering, R.D. Engineering College, Ghaziabad (India)

Vaibhav Sharma

Department of Computer Science & Engineering, R.D. Engineering College, Ghaziabad (India)

Ravindra Chauhan

Department of Computer Science & Engineering, R.D. Engineering College, Ghaziabad (India)

Article Information

DOI: 10.51584/IJRIAS.2026.11040004

Subject Category: Computer Science

Volume/Issue: 11/4 | Page No: 72-81

Publication Timeline

Submitted: 2026-04-04

Accepted: 2026-04-09

Published: 2026-04-24

Abstract

Heart disease is one of the leading factors of death in whole world. Yet, predicting it early remains a huge setback. Doctors face two main problems in hospitals: patient files usually contain empty fields and complex AI models act like black boxes, making them hard to trust for users. VitalPath AI is designed to solve these issues in efficient and reliable manner. First, we tackle data gaps using the MICE algorithm. This lets us fill in missing patient details without throwing away valuable data. Next, we use SMOTE to balance the dataset classes and level the playing field, ensuring the model learns fairly and prevents any model from generating biased outcomes. Instead of just guessing settings, we used Bayesian Optimization to hunt down the optimal configurations for several machine learning models. When evaluated on the UCI Heart Disease dataset, AdaBoost came out on top by ending up attaining an AUC-ROC score of 0.963. This performance metrics surpassed what we originally hoped for, though accuracy alone isn't the only objective for this work to accomplish. To make the model easy for doctors to trust, we need transparency. That’s why we integrated SHAP, which breaks down exactly why each prediction was made, letting doctors and patients see if factors like cholesterol or chest pain drove the decision.

Keywords

Bayesian Optimization, Cardiovascular diseases

Downloads

References

1. Shah, P., Shukla, M., Dholakia, N. H., & Gupta, H. "Predicting cardiovascular risk with hybrid ensemble learning and explainable AI." Scientific Reports, (2025). 15(1), Article 17927. [Google Scholar] [Crossref]

2. Rao, C.; Li, M.; Huang, T.; Li, F. "Stroke Risk Assessment Decision-Making Using a Machine Learning Model: Logistic-AdaBoost." CMES-Comput. Model. Eng. Sci., (2024) 139, pp. 699–724. [Google Scholar] [Crossref]

3. Almazroi, E. A. Aldhahri, S. Bashir and S. Ashfaq. "A Clinical Decision Support System for Heart Disease Prediction Using Deep Learning," in IEEE Access, (2023). vol. 11, pp. 61646-61659. [Google Scholar] [Crossref]

4. Khan, M. A., & Algarni, F. "A Healthcare Monitoring System for the Diagnosis of Heart Disease in the IoMT Cloud Framework Using Explainable AI." IEEE Access, (2022). vol. 10, pp. 63583-63593. [Google Scholar] [Crossref]

5. Akiba, T., Sano, S., Yanase, T., Ohta, T., & Koyama, M. "Optuna: A Next-generation Hyperparameter Optimization Framework," in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (2019). [Google Scholar] [Crossref]

6. Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. "CatBoost: unbiased boosting with categorical features," in Advances in Neural Information Processing Systems (NeurIPS), (2018). vol. 31. [Google Scholar] [Crossref]

7. Lundberg, S. M., & Lee, S. I. "A Unified Approach to Interpreting Model Predictions," in Advances in Neural Information Processing Systems (NeurIPS), (2017). vol. 30. [Google Scholar] [Crossref]

8. Chen, T., & Guestrin, C. "XGBoost: A Scalable Tree Boosting System," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (2016). pp. 785-794. [Google Scholar] [Crossref]

9. S. van Buuren and K. Groothuis-Oudshoorn, "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, (2011). vol. 45, no. 3, pp. 1–67. [Google Scholar] [Crossref]

10. Janosi, A., Steinbrunn, W., Pfisterer, M., & Detrano, R. "Heart Disease Data Set." UCI Machine Learning Repository. (1988). [Google Scholar] [Crossref]

Metrics

Views & Downloads

Similar Articles