Enhanced Prediction of Chronic Kidney Disease using XGBoost Machine Learning Model

Authors

  • Rajeshree Khande Balaji Institute of Techology & Management, Sri Balaji University, Pune, Maharashtra, India
  • Nrashant Singh Department of Forensic Science, Amity University Dubai, Dubai International Academic City (DIAC), Dubai, UAE
  • Sachin Naik School of Computer Science and Engineering, Department of Computer Science and Applications, Dr. Vishwanath Karad MIT World Peace University, Pune, India
  • Satvik Bodke School of Computer Science and Engineering, Department of Computer Science and Applications, Dr. Vishwanath Karad MIT World Peace University, Pune, India
  • Aniket Gaikwad School of Computer Science and Engineering, Department of Computer Science and Applications, Dr. Vishwanath Karad MIT World Peace University, Pune, India
  • P.S. Metkewar School of Computer Science and Engineering, Department of Computer Science and Applications, Dr. Vishwanath Karad MIT World Peace University, Pune, India
  • Raeesa Bashir Department of Mathematics and Statistics, Faculty of Science and Technology, Vishwakarma University, Pune, India
  • Aafaq A. Rather Symbiosis Statistical Institute, Symbiosis International (Deemed University), Pune, India

DOI:

https://doi.org/10.6000/1929-6029.2026.15.10

Keywords:

Chronic Kidney Disease, XGBoost, Machine Learning, Predictive Modeling, Feature Selection

Abstract

Chronic kidney disease (CKD) might progress to end stage renal disease; moreover, cardiovascular dangers are dire. Machine learning used in for more speed and accurate diagnosis of CKD. The CKD prediction model proposed in this paper was developed using the XGBoost algorithm, which is quite effective in classification problems. Other clinical parameters such as blood urea, serum creatinine and white blood cell count are some of the 24 indices identified from among the 400 patient records in the dataset. Feature selection using SelectKBest was relevant, and hyperparameter tuning was done by RandomizedSearchCV Both quantitative and categorical data were preprocessed. Altogether, 75% of data used for training, while 25% of data used for testing. The XGBoost model had a better result with 96.88 % recall, 100% precision, and 98% accuracy. However, the proposed approach has disadvantages; namely, a small sample cross-section and possibly an imbalanced class. Further, the dataset will be increased, the methods of dealing with class imbalance will be applied using SMOTE algorithm, and the effectiveness of the proposed model will be tested in real clinical practice. This work also highlight how crucial it is to employ and enhance machine learning, especially XGBoost to detect early stage of CKD, proper treatment, low mortality rate, and increased survival rate among patients.

References

International Society of Nephrology, "New global kidney health report sheds light on current capacity around the world to deliver kidney care," 2023. . https://www.theisn.org/blog/2023/03/30/new-global-kidney-health-report-sheds-light-on-current-capacity-around-the-world-to-deliver-kidney-care/.

MSD Manuals, "Chronic kidney disease.". https://www.msdmanuals.com/professional/genitourinary-disorders/chronic-kidney-disease/chronic-kidney-disease

Mayo Clinic, "Hemodialysis.". https://www.mayoclinic.org/tests-procedures/hemodialysis/about/pac-20384824

National Center for Biotechnology Information, "The importance of early identification of chronic kidney disease." . https://pmc.ncbi.nlm.nih.gov/articles/PMC6188457/.

National Center for Biotechnology Information, "Machine learning has been extended and applied in the healthcare sector for purposes of diagnosing diseases such as chronic kidney disease." . https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10143586/.

Dash S, Shakyawar SK, Sharma M, Kaushik S. Big data in healthcare: management, analysis and future prospects. Journal of Big Data 2019; 6(1): 1-25. DOI: https://doi.org/10.1186/s40537-019-0217-0

Tusar MTHK, Islam MT, Raju FI. Detecting chronic kidney disease (CKD) at the initial stage: A novel hybrid feature-selection method and robust data preparation pipeline for different ML techniques. in Proc. 2022 5th Int. Conf. on Computing and Informatics (ICCI), 2022; pp. 400-407. DOI: https://doi.org/10.1109/ICCI54321.2022.9756094

Durga P, Sudhakar T. Analytical comparison for the diagnosis of chronic kidney disease applying intelligent ML-based systems. in Proc. 2023 7th Int. Conf. on Intelligent Computing and Control Systems (ICICCS) 2023; pp. 51-59. DOI: https://doi.org/10.1109/ICICCS56967.2023.10142601

Farjana A, et al. Predicting chronic kidney disease using machine learning algorithms. in Proc. 2023 IEEE 13th Annual Computing and Communication Workshop and Conf. (CCWC) 2023; pp. 1267-1271. DOI: https://doi.org/10.1109/CCWC57344.2023.10099221

James N, Kaushik J. Comparison of machine learning algorithms for predicting chronic kidney disease. in Proc. 2022 2nd Int. Conf. on Advance Computing and Innovative Technologies in Engineering (ICACITE) 2022; pp. 1134-1139. DOI: https://doi.org/10.1109/ICACITE53722.2022.9823572

Patel S, et al. An experimental study and performance analysis of supervised machine learning algorithms for prognosis of chronic kidney disease. in Proc. 2022 1st Int. Conf. on Electrical, Electronics, Information and Communication Technologies (ICEEICT) 2022; pp. 1-6. DOI: https://doi.org/10.1109/ICEEICT53079.2022.9768478

Sridevi P, Rabbani M, Ahamed SI. A comprehensive study for predicting chronic kidney disease, diabetes, hypertension, and anemia by machine learning and feature engineering techniques" in Proc. 2023 IEEE Int. Conf. on Digital Health (ICDH) 2023; pp. 248-257. DOI: https://doi.org/10.1109/ICDH60066.2023.00043

Liang Pm et al. Deep learning identifies intelligible predictors of poor prognosis in chronic kidney disease. IEEE Journal of Biomedical and Health Informatics 2023; 27(7): 3677-3685. DOI: https://doi.org/10.1109/JBHI.2023.3266587

Haque A, et al. Determining association between fatal heart failure and chronic kidney disease: A machine learning approach. in Proc. 2022 21st IEEE Int. Conf. on Machine Learning and Applications (ICMLA) 2022; pp. 1679-1686. DOI: https://doi.org/10.1109/ICMLA55696.2022.00258

Ahmed I, et al. Performance analysis of machine learning algorithms in chronic kidney disease prediction. in Proc. 2022 IEEE 13th Annual Information Technology, Electronics and Mobile Communication Conf. (IEMCON) 2022; pp. 0417-0423 DOI: https://doi.org/10.1109/IEMCON56893.2022.9946591

Islam MA, Majumder MZH, Hussein MA. Chronic kidney disease prediction based on machine learning algorithms Journal of Pathology Informatics 2023; 14: 100189. DOI: https://doi.org/10.1016/j.jpi.2023.100189

Raihan MJ, et al. Detection of chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP. Scientific Reports 2023; 13: 6263. DOI: https://doi.org/10.1038/s41598-023-33525-0

Raihan MMS, et al. Chronic renal disease prediction using clinical data and different machine learning technique. in Proc. 2021 2nd Int. Informatics and Software Engineering Conf. (IISEC) 2021; pp. 1-5. DOI: https://doi.org/10.1109/IISEC54230.2021.9672365

Saravanan S, Dar SA, Rather AA, Qayoom D, Ali I. Deep Learning Models for Intrusion Detection Systems in MANETs: A Comparative analysis. Decision Making Advances 2025; 3(1): 96-110. DOI: https://doi.org/10.31181/dma31202556

Downloads

Published

2026-03-18

How to Cite

Khande, R. ., Singh, N. ., Naik, S. ., Bodke, S. ., Gaikwad, A. ., Metkewar, P. ., Bashir, R. ., & Rather, A. A. . (2026). Enhanced Prediction of Chronic Kidney Disease using XGBoost Machine Learning Model. International Journal of Statistics in Medical Research, 15, 109–120. https://doi.org/10.6000/1929-6029.2026.15.10

Issue

Section

General Articles

Most read articles by the same author(s)