Comparative Analysis of Machine Learning Models for Early Heart Disease Diagnosis

Authors

  • Rajeshree Khande Balaji Institute of Technology & Management, Sri Balaji University, Pune, India
  • Walid Ayadi Mechatronics and Intelligent Systems, Abu Dhabi Polytechnic, UAE
  • Nikita Bhandhari Balaji Institute of Technology & Management, Sri Balaji University, Pune, India
  • Yasser Farhat Academic Support Department, Abu Dhabi Polytechnic, Abu Dhabi, UAE
  • P.S. Metkewar School of Computer Science and Engineering, Dr Vishwanath Karad MIT World Peace University, Pune, India
  • Sharvari R. Shukla Symbiosis Statistical Institute, Symbiosis International (Deemed University), Pune, India
  • Aafaq A. Rather Symbiosis Statistical Institute, Symbiosis International (Deemed University), Pune, India
  • Mushtaq A. Lone Departmet of Statistics, FOH, SKUAST –K, P.O. Box 190025, J&K, India

DOI:

https://doi.org/10.6000/1929-6029.2025.14.56

Keywords:

Heart Disease Prediction, Machine Learning, Support Vector Machine (SVM), Clinical Decision Support, Feature Engineering

Abstract

Heart disease remains among the leading causes of death worldwide, and its early detection ability can be the difference between life and death. In this research, we investigate the capability of machine learning—namely Support Vector Machines (SVM)—to predict the occurrence of heart disease based on regular clinical information. We used the Cleveland Heart Disease dataset, which contains critical patient data like age, gender, blood pressure, cholesterol level, type of chest pain, and other crucial health factors. Prior to creating our model, we pre-processed and cleaned the data by dealing with missing values, changing categorical variables into numerical form, and scaling the features for uniformity. We then optimized the SVM model using grid search and cross-validation to make it run at its optimal level. The resulting model had an accuracy of 86.41% in the test set and performed better than other popular models such as logistic regression and random forest.

The significant about this work is the potential for applying it in practical situations. An SVM-based program such as this could be a second opinion for physicians or integrated into early diagnostic tools—most helpful in clinics with limited access to specialists. It's progress toward smarter, data-driven healthcare that enables faster and more precise diagnoses.

There's still potential for expansion, using bigger, more varied datasets or incorporating real-time patient information could further enhance the model. But this research demonstrates that with the proper data and methodology, machine learning can be a useful tool in the early diagnosis of heart disease.

References

Polaraju K, Durga Prasad D. Prediction of heart disease using multiple linear regression model. International Journal of Engineering Development and Research 2017; 5(4): 2321-9939. https://rjwave.org/ijedr/papers/IJEDR1704226.pdf

Khanna D, Sahu R, Baths V. Comparative study of classification techniques (SVM, logistic regression and neural networks) to predict the prevalence of heart disease. International Journal of Engineering Research and Applications 2015; 5(4): 25-30. DOI: https://doi.org/10.7763/IJMLC.2015.V5.544

Alsabhan,W, Alfadhly A. Effectiveness of machine learning models in diagnosis of heart disease: A comparative study. Scientific Reports 2025; 15 24568. DOI: https://doi.org/10.1038/s41598-025-09423-y

Sharmila S, Manimegalai D. A hybrid big data analytics model using SVM integrated with Hadoop distributed file system for healthcare applications. Cluster Computing 2019; 22(S5): 12243-12251.

Mohan S, Thirumalai C, Srivastava G. Effective heart disease prediction using hybrid machine learning techniques. In 2019 IEEE International Conference on Big Data 2019; pp. 260-266. https://ieeexplore.ieee.org/abstract/document/8740989/

Kai L, Wei H. Optimizing heart disease diagnosis: A reinforcement learning-based ensemble method. Egyptian Informatics Journal 2025; 31: 100750. DOI: https://doi.org/10.1016/j.eij.2025.100750

Sadr H, Salari A, Ashoobi MT, Nazari M. Cardiovascular disease diagnosis: A holistic approach using the integration of machine learning and deep learning models. European Journal of Medical Research 2024; 29(1): 455. DOI: https://doi.org/10.1186/s40001-024-02044-7

Baghdadi NA, Farghaly Abdelaliem SM, Malki A, et al. Advanced machine learning techniques for cardiovascular disease early detection and diagnosis. Journal of Big Data 2023; 10: 144. DOI: https://doi.org/10.1186/s40537-023-00817-1

Chang V, Bhavani VR, Xu AQ, Hossain MA. An artificial intelligence model for heart disease detection using machine learning algorithms. Healthcare Analytics 2022; 2: 100016.

Victor C, Vallabhanent RB, Ariel QX, Hossain MA. An artificial intelligence model for heart disease detection using machine learning algorithms. Healthcare Analytics 2022; 2: 100016. DOI: https://doi.org/10.1016/j.health.2022.100016

Kumar N, Narayan Das N, Gupta D, Gupta K, Bindra J. Efficient automated disease diagnosis using machine learning models. Journal of Healthcare Engineering 2021; 2021: 9983652. DOI: https://doi.org/10.1155/2021/9983652

Muhammad Y, Tahir M, Hayat M, Chong KT. Early and accurate detection and diagnosis of heart disease using intelligent computational model. Scientific Reports 2020; 10: 19747. DOI: https://doi.org/10.1038/s41598-020-76635-9

Nagavelli U, Samanta D, Chakraborty P. Machine learning technology‐based heart disease detection models. Journal of Healthcare Engineering 2022; 2022: 7351061. DOI: https://doi.org/10.1155/2022/7351061

Singh A, Mahapatra H, Biswal AK, Mahapatra M, Singh D, Samantaray M. Heart disease detection using machine learning models. Procedia Computer Science 2024; 235: 937-947. DOI: https://doi.org/10.1016/j.procs.2024.04.089

Smith J, Doe A. A comparative study of machine learning models for classification tasks. Journal of Machine Learning 2020; 45(3): 123-135.

Saadia T, Fazal M, Muhammad AK, Muhammad UK, Dawar A, Neelam G, Shahid K, Amal A-R. A machine learning-based framework for heart disease diagnosis using a comprehensive patient cohort. Computers, Materials and Continua 2025; 84(1): 1253-1278. DOI: https://doi.org/10.32604/cmc.2025.065423

Ogunpola A, Saeed F, Basurra S, Albarrak AM, Qasem SN. Machine learning-based predictive models for detection of cardiovascular diseases. Diagnostics 2024; 14(2): 144. DOI: https://doi.org/10.3390/diagnostics14020144

Pınar C, Ahmet S, Celal ŞE, Uğur A, Özgür A. AI-aided cardiovascular disease diagnosis in cattle from retinal images: Machine learning vs. deep learning models. Computers and Electronics in Agriculture 2024; 226: 109391. DOI: https://doi.org/10.1016/j.compag.2024.109391

Van Miguel Pires G, Marques G, Garcia NM, Ponciano V. Machine learning for the evaluation of the presence of heart disease. Procedia Computer Science 2020; 177: 432-437. DOI: https://doi.org/10.1016/j.procs.2020.10.058

Yian M, Bahiru LJ, Tefera BM. Machine learning algorithms for heart disease diagnosis: A systematic review. Current Problems in Cardiology 2025; 50(8): 103082. DOI: https://doi.org/10.1016/j.cpcardiol.2025.103082

Gibbons RJ, Balady GJ, Bricker JT, Chaitman BR, Fletcher GF, Froelicher VF, Winters WL. ACC/AHA 2002 guideline update for exercise testing. Journal of the American College of Cardiology 2002; 40(8): 1531-1540. DOI: https://doi.org/10.1016/S0735-1097(02)02164-2

Downloads

Published

2025-10-01

How to Cite

Khande, R. ., Ayadi, W. ., Bhandhari, N. ., Farhat, Y. ., Metkewar, P. ., Shukla, S. R. ., Rather, A. A. ., & Lone, M. A. . (2025). Comparative Analysis of Machine Learning Models for Early Heart Disease Diagnosis. International Journal of Statistics in Medical Research, 14, 590–600. https://doi.org/10.6000/1929-6029.2025.14.56

Issue

Section

Special Issue: Trends in Artificial Intelligence and Machine Learning in Healthcare

Most read articles by the same author(s)