Comparative Analysis of Machine Learning Models for Early Heart Disease Diagnosis
DOI:
https://doi.org/10.6000/1929-6029.2025.14.56Keywords:
Heart Disease Prediction, Machine Learning, Support Vector Machine (SVM), Clinical Decision Support, Feature EngineeringAbstract
Heart disease remains among the leading causes of death worldwide, and its early detection ability can be the difference between life and death. In this research, we investigate the capability of machine learning—namely Support Vector Machines (SVM)—to predict the occurrence of heart disease based on regular clinical information. We used the Cleveland Heart Disease dataset, which contains critical patient data like age, gender, blood pressure, cholesterol level, type of chest pain, and other crucial health factors. Prior to creating our model, we pre-processed and cleaned the data by dealing with missing values, changing categorical variables into numerical form, and scaling the features for uniformity. We then optimized the SVM model using grid search and cross-validation to make it run at its optimal level. The resulting model had an accuracy of 86.41% in the test set and performed better than other popular models such as logistic regression and random forest.
The significant about this work is the potential for applying it in practical situations. An SVM-based program such as this could be a second opinion for physicians or integrated into early diagnostic tools—most helpful in clinics with limited access to specialists. It's progress toward smarter, data-driven healthcare that enables faster and more precise diagnoses.
There's still potential for expansion, using bigger, more varied datasets or incorporating real-time patient information could further enhance the model. But this research demonstrates that with the proper data and methodology, machine learning can be a useful tool in the early diagnosis of heart disease.
References
Polaraju K, Durga Prasad D. Prediction of heart disease using multiple linear regression model. International Journal of Engineering Development and Research 2017; 5(4): 2321-9939. https://rjwave.org/ijedr/papers/IJEDR1704226.pdf
Khanna D, Sahu R, Baths V. Comparative study of classification techniques (SVM, logistic regression and neural networks) to predict the prevalence of heart disease. International Journal of Engineering Research and Applications 2015; 5(4): 25-30. DOI: https://doi.org/10.7763/IJMLC.2015.V5.544
Alsabhan,W, Alfadhly A. Effectiveness of machine learning models in diagnosis of heart disease: A comparative study. Scientific Reports 2025; 15 24568. DOI: https://doi.org/10.1038/s41598-025-09423-y
Sharmila S, Manimegalai D. A hybrid big data analytics model using SVM integrated with Hadoop distributed file system for healthcare applications. Cluster Computing 2019; 22(S5): 12243-12251.
Mohan S, Thirumalai C, Srivastava G. Effective heart disease prediction using hybrid machine learning techniques. In 2019 IEEE International Conference on Big Data 2019; pp. 260-266. https://ieeexplore.ieee.org/abstract/document/8740989/
Kai L, Wei H. Optimizing heart disease diagnosis: A reinforcement learning-based ensemble method. Egyptian Informatics Journal 2025; 31: 100750. DOI: https://doi.org/10.1016/j.eij.2025.100750
Sadr H, Salari A, Ashoobi MT, Nazari M. Cardiovascular disease diagnosis: A holistic approach using the integration of machine learning and deep learning models. European Journal of Medical Research 2024; 29(1): 455. DOI: https://doi.org/10.1186/s40001-024-02044-7
Baghdadi NA, Farghaly Abdelaliem SM, Malki A, et al. Advanced machine learning techniques for cardiovascular disease early detection and diagnosis. Journal of Big Data 2023; 10: 144. DOI: https://doi.org/10.1186/s40537-023-00817-1
Chang V, Bhavani VR, Xu AQ, Hossain MA. An artificial intelligence model for heart disease detection using machine learning algorithms. Healthcare Analytics 2022; 2: 100016.
Victor C, Vallabhanent RB, Ariel QX, Hossain MA. An artificial intelligence model for heart disease detection using machine learning algorithms. Healthcare Analytics 2022; 2: 100016. DOI: https://doi.org/10.1016/j.health.2022.100016
Kumar N, Narayan Das N, Gupta D, Gupta K, Bindra J. Efficient automated disease diagnosis using machine learning models. Journal of Healthcare Engineering 2021; 2021: 9983652. DOI: https://doi.org/10.1155/2021/9983652
Muhammad Y, Tahir M, Hayat M, Chong KT. Early and accurate detection and diagnosis of heart disease using intelligent computational model. Scientific Reports 2020; 10: 19747. DOI: https://doi.org/10.1038/s41598-020-76635-9
Nagavelli U, Samanta D, Chakraborty P. Machine learning technology‐based heart disease detection models. Journal of Healthcare Engineering 2022; 2022: 7351061. DOI: https://doi.org/10.1155/2022/7351061
Singh A, Mahapatra H, Biswal AK, Mahapatra M, Singh D, Samantaray M. Heart disease detection using machine learning models. Procedia Computer Science 2024; 235: 937-947. DOI: https://doi.org/10.1016/j.procs.2024.04.089
Smith J, Doe A. A comparative study of machine learning models for classification tasks. Journal of Machine Learning 2020; 45(3): 123-135.
Saadia T, Fazal M, Muhammad AK, Muhammad UK, Dawar A, Neelam G, Shahid K, Amal A-R. A machine learning-based framework for heart disease diagnosis using a comprehensive patient cohort. Computers, Materials and Continua 2025; 84(1): 1253-1278. DOI: https://doi.org/10.32604/cmc.2025.065423
Ogunpola A, Saeed F, Basurra S, Albarrak AM, Qasem SN. Machine learning-based predictive models for detection of cardiovascular diseases. Diagnostics 2024; 14(2): 144. DOI: https://doi.org/10.3390/diagnostics14020144
Pınar C, Ahmet S, Celal ŞE, Uğur A, Özgür A. AI-aided cardiovascular disease diagnosis in cattle from retinal images: Machine learning vs. deep learning models. Computers and Electronics in Agriculture 2024; 226: 109391. DOI: https://doi.org/10.1016/j.compag.2024.109391
Van Miguel Pires G, Marques G, Garcia NM, Ponciano V. Machine learning for the evaluation of the presence of heart disease. Procedia Computer Science 2020; 177: 432-437. DOI: https://doi.org/10.1016/j.procs.2020.10.058
Yian M, Bahiru LJ, Tefera BM. Machine learning algorithms for heart disease diagnosis: A systematic review. Current Problems in Cardiology 2025; 50(8): 103082. DOI: https://doi.org/10.1016/j.cpcardiol.2025.103082
Gibbons RJ, Balady GJ, Bricker JT, Chaitman BR, Fletcher GF, Froelicher VF, Winters WL. ACC/AHA 2002 guideline update for exercise testing. Journal of the American College of Cardiology 2002; 40(8): 1531-1540. DOI: https://doi.org/10.1016/S0735-1097(02)02164-2
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Policy for Journals/Articles with Open Access
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are permitted and encouraged to post links to their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
Policy for Journals / Manuscript with Paid Access
Authors who publish with this journal agree to the following terms:
- Publisher retain copyright .
- Authors are permitted and encouraged to post links to their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work .