Mortality Prediction and Survival Estimation in Dialysis Patients Using Logistic and Cox Regression with Machine Learning Feature Selection

Authors

  • Vajala Ravi Department of Statistics, Sri Venkateswara College, University of Delhi, India
  • Sanjay Kumar Singh Department of Statistics, Pannalal Girdharlal Dayanand Anglo-Vedic College, University of Delhi, India
  • Chandra Bhan Yadav Department of Statistics, Hindu College, University of Delhi, New Delhi, India

DOI:

https://doi.org/10.6000/1929-6029.2026.15.02

Keywords:

Chronic Kidney Disease (CKD), End-Stage Renal Disease (ESRD), Haemodialysis, Mortality Predictors, Risk Factors, Logistic Regression, Machine Learning, Survival Analysis, Cox Proportional Hazards Model, Clinical Outcomes, Risk Stratification, Patient Survival

Abstract

Mortality remains high among patients undergoing maintenance dialysis for end-stage renal disease (ESRD). Identification of key mortality predictors is paramount for improving prognosis and guiding care. Recent advances in machine learning (ML) offer potential to enhance risk stratification beyond traditional statistical models. This study compares feature selection methods—LASSO, Random Forest, and Gradient Boosting—in predicting mortality risk among dialysis patients, integrating logistic regression and Cox proportional hazards modelling. Retrospective data from 224 ESRD patients on maintenance haemodialysis were analysed. Thirty-three clinical and demographic variables were evaluated. Feature subsets were generated using ML algorithms and used for building predictive models. Model performance was assessed via discrimination (AUC), accuracy, sensitivity, specificity, and survival prediction concordance index (C-index). LASSO-selected features yielded an AUC of 0.82 and C-index of 0.81, demonstrating strong discriminatory ability. Random Forest showed highest AUC (0.85) but lower sensitivity. Gradient Boosting offered balanced sensitivity and specificity with an AUC of 0.81. The parsimonious common-feature model (dialysis session frequency, diabetes) achieved the best survival discrimination (C-index 0.83). Full models with all variables demonstrated moderate performance, highlighting potential overfitting. Key mortality predictors included dialysis adequacy, diabetes status, respiratory comorbidities, and hemodynamic parameters. Machine learning–aided feature selection enhances mortality risk prediction in dialysis patients. Parsimonious models focusing on consistent predictors may optimize clinical applicability. These findings support integrating ML and traditional regression approaches to refine prognostic tools and inform personalized care strategies in ESRD.

References

Hill NR, Fatoba ST, Oke JL, et al. Global prevalence of chronic kidney disease: A systematic review and meta-analysis. PLoS One 2016; 11(7): e0158765. DOI: https://doi.org/10.1371/journal.pone.0158765

Jha V, Garcia-Garcia G, Iseki K, et al. Chronic kidney disease: Global dimension and perspectives. The Lancet 2013; 382(9888): 260-272. DOI: https://doi.org/10.1016/S0140-6736(13)60687-X

Sarnak MJ, Jaber BL. Pulmonary infectious mortality among patients with end-stage renal disease. Chest 2001; 120(6): 1883-1887.

Foley RN, Gilbertson DT, Murray, et al. Long interdialytic interval and mortality among patients receiving hemodialysis. New England Journal of Medicine 2011; 365(12): 1099-1107.

Tsai WC, Wu HY, Peng YS, et al. Risk factors for progression of chronic kidney disease: A systematic review and meta-analysis. Medicine (Baltimore) 2016; 95(11): e3013. DOI: https://doi.org/10.1097/MD.0000000000003013

Sankarasubbaiyan S, Pollock CA, Anandh U, et al. Risk factors for mortality among patients on hemodialysis in India: A case-control study. Indian Journal of Nephrology 2025; 35(3): 390-396. DOI: https://doi.org/10.25259/ijn_563_23

Tangri N, Stevens LA, Griffith J, et al. A predictive model for progression of chronic kidney disease to kidney failure. JAMA 2011; 305(15): 1553-1559. DOI: https://doi.org/10.1001/jama.2011.451

Kalantar-Zadeh K, Lockwood MB, Rhee CM, et al. Patient-centred approaches for the management of dialysis patients in the era of precision medicine. Nature Reviews Nephrology 2020; 16(11): 681-700.

Takkavatakarn K, Nadkarni GN, Roytman M, et al. Machine learning models to predict end-stage kidney disease in chronic kidney disease stage 4. BMC Nephrology 2023; 24(1): 132. DOI: https://doi.org/10.1186/s12882-023-03424-7

Peng Z, Zhong S, Li X, et al. An artificial intelligence model to predict mortality among hemodialysis patients: A retrospective validated cohort study. Scientific Reports 2025; 15(1): 27699. DOI: https://doi.org/10.1038/s41598-025-06576-8

Sarnak MJ, Jaber BL. Pulmonary infectious mortality among patients with end-stage renal disease. Chest 2001; 120(6): 1883-1887. DOI: https://doi.org/10.1378/chest.120.6.1883

Riehl-Tonn VJ, MacRae JM, Dumanski SM, et al. Sex and gender differences in health-related quality of life in individuals initiating haemodialysis. Clinical Kidney Journal 2024; 17(10): sfae273. DOI: https://doi.org/10.1093/ckj/sfae273

Soleymanian T, Kokabeh Z, Ramaghi R, et al. Clinical outcomes and quality of life in hemodialysis diabetic patients versus non-diabetics. Journal of Nephropathology 2017; 6(2): 81-89. DOI: https://doi.org/10.15171/jnp.2017.14

Liu S-X, Wang Z-H, Zhang S, et al. The association between dose of hemodialysis and mortality in a prospective cohort study. Scientific Reports 2022; 12: 13708.

Flythe JE, Xue H, Lynch KE, et al. Association of mortality risk with various definitions of intradialytic hypotension. Journal of the American Society of Nephrology 2015; 26(3): 724-734. DOI: https://doi.org/10.1681/ASN.2014020222

Noh J, Park SY, Bae W, et al. Predicting early mortality in haemodialysis patients: A deep learning approach using a nationwide prospective cohort in South Korea. Scientific Reports 2024; 14: 29658. DOI: https://doi.org/10.1038/s41598-024-80900-6

Montemayor VC, Malo AM, Barbieri C. Predicting mortality in hemodialysis patients using machine learning analysis. Clinical Kidney Journal 2020; 14(5): 1388-1395. DOI: https://doi.org/10.1093/ckj/sfaa126

Liu S-X, Wang Z-H, Zhang S, et al. The association between dose of hemodialysis and mortality in a prospective cohort study. Scientific Reports 2022; 12: 13708. DOI: https://doi.org/10.1038/s41598-022-17943-0

Jia W, He W, Chen Z, et al. Determinants of dialysis adequacy in maintenance hemodialysis patients: A cross-sectional study on modifiable risk factors and clinical interventions. BMC Nephrology 2025; 26: 369. DOI: https://doi.org/10.1186/s12882-025-04278-x

Foley RN, Gilbertson DT, Murray et al. Long interdialytic interval and mortality among patients receiving hemodialysis. New England Journal of Medicine 2011; 365(12): 1099-1107. DOI: https://doi.org/10.1056/NEJMoa1103313

El Chamieh C, Liabeuf S, Massy Z. Uremic toxins and cardio- vascular risk in chronic kidney disease: What have we learned recently beyond the past findings? Toxins 2022; 14(4): 280. DOI: https://doi.org/10.3390/toxins14040280

Collins AJ, Foley RN, Herzog C, et al. United States Renal Data System Annual Data Report. Am J Kidney Dis 2019; 73(3): A7-A815.

Bhatti NK, Karimi Galougahi K, Paz, et al. Diagnosis and management of cardiovascular disease in advanced and end-stage renal disease. Journal of the American Heart Association 2016; 5(8): e003648. DOI: https://doi.org/10.1161/JAHA.116.003648

Thomas MC, Cooper ME, Zimmet P. Diabetic kidney disease. Nature Reviews Disease Primers 2015; 1: 15018. DOI: https://doi.org/10.1038/nrdp.2015.70

Babitt JL, Lin HY. Mechanisms of anemia in CKD. New England Journal of Medicine 2012; 367(19): 1901-1911.

Lin Y-C, Chang Y-H, Yang S-Y, et al. Update of pathophysiology and management of diabetic kidney disease. Journal of the Formosan Medical Association 2018; 117(8): 662-675. DOI: https://doi.org/10.1016/j.jfma.2018.02.007

Pisoni RL, Bragg-Gresham JL, Young EW, et al. Anemia management and outcomes in dialysis patients: Results from the Dialysis Outcomes and Practice Patterns Study (DOPPS). American Journal of Kidney Diseases 2018; 71(6): 812-822.

Shoji T, Tsubakihara Y, Fujii M, et al. Hemodialysis- associated hypotension as an independent risk factor for two-year mortality in patients on chronic hemodialysis. Kidney International 2019; 95(5): 1212-1220. DOI: https://doi.org/10.1111/j.1523-1755.2004.00812.x

Steyerberg EW, Eijkemans MJC, Harrell FE, et al. Prognostic modelling with logistic regression analysis: A comparison of selection and estimation methods. Statistics in Medicine 2000; 19(8): 1059-1079. DOI: https://doi.org/10.1002/(SICI)1097-0258(20000430)19:8<1059::AID-SIM412>3.0.CO;2-0

Harrell FE, Jr. Regression modeling strategies: With applications to linear models, logistic regression, and survival analysis (2nd ed.). Springer 2015. DOI: https://doi.org/10.1007/978-3-319-19425-7

Belloni A, Chernozhukov V, Hansen C. High-dimensional methods and inference on structural and treatment effects. Journal of Economic Perspectives 2014; 28(2): 29-50. DOI: https://doi.org/10.1257/jep.28.2.29

Tangri N, Grams ME, Levey AS, et al. Multinational assessment of accuracy of equations for predicting risk of kidney failure. JAMA 2016; 315(2): 164-174. DOI: https://doi.org/10.1001/jama.2015.18202

KDIGO CKD Work Group. KDIGO 2012 clinical practice guideline for the evaluation and management of chronic kidney disease. Kidney International Supplements 2013; 3(1): 1-150.

KDIGO Diabetes Work Group. KDIGO 2020 clinical practice guideline for diabetes management in chronic kidney disease. Kidney International 2020; 98(4S): S1-S115. DOI: https://doi.org/10.1016/j.kint.2020.06.019

National Kidney Foundation. KDOQI clinical practice guideline for hemodialysis adequacy: 2015 update. American Journal of Kidney Diseases 2015; 66(5): 884-930. DOI: https://doi.org/10.1053/j.ajkd.2015.07.015

Goldfarb-Rumyantzev AS, Massry SG. Heart-lung-kidney interactions in chronic kidney disease. Seminars in Nephrology 2019; 39(3): 263-275.

Vanholder R, Van Biesen W, Lameire N. What is the renal-cardio-pulmonary syndrome? Nature Reviews Nephrology 2020; 16(12): 707-723.

McIntyre CW, Odudu A, Eldehni MT, et al. Induced cardiac injury by hemodialysis: Time to turn down the dial. Journal of the American Society of Nephrology 2017; 28(10): 2839-2848.

Agarwal R, Flynn J, Pogue V, et al. Assessment and management of hypertension in patients on dialysis. Journal of the American Society of Nephrology 2018; 29(4): 893-905.

Rajkomar A, Dean J, Kohane I. Machine learning in medicine. New England Journal of Medicine 2019; 380(14): 1347-1358. DOI: https://doi.org/10.1056/NEJMra1814259

Deo RC. Machine learning in medicine. Circulation 2015; 132(20): 1920-1930. DOI: https://doi.org/10.1161/CIRCULATIONAHA.115.001593

Sood MM, Tangri N, Hiebert B, et al. Pulmonary disease and mortality in patients with chronic kidney disease: A population-based cohort study. Kidney International 2017; 92(2): 467-476.

Ravi V, Singh SK, Yadav CB. To Identify the Predictors of Mortality in Renal Patients Undergoing Dialysis. International Journal of Statistics in Medical Research 2025; 14: 755-764. DOI: https://doi.org/10.6000/1929-6029.2025.14.68

Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological) 1996; 58(1): 267-288. DOI: https://doi.org/10.1111/j.2517-6161.1996.tb02080.x

Breiman L. Random forests. Machine Learning 2001; 45(1): 5-32. DOI: https://doi.org/10.1023/A:1010933404324

Friedman JH. Greedy function approximation: A gradient boosting machine. The Annals of Statistics 2001; 29(5): 1189-1232. DOI: https://doi.org/10.1214/aos/1013203451

Downloads

Published

2026-01-30

How to Cite

Ravi, V. ., Singh, S. K. ., & Yadav, C. B. . (2026). Mortality Prediction and Survival Estimation in Dialysis Patients Using Logistic and Cox Regression with Machine Learning Feature Selection . International Journal of Statistics in Medical Research, 15, 17–27. https://doi.org/10.6000/1929-6029.2026.15.02

Issue

Section

General Articles