TY - JOUR
T1 - Development of a machine learning-based model for the prediction and progression of diabetic kidney disease
T2 - A single centred retrospective study
AU - Nayak, Sandhya
AU - Amin, Ashwini
AU - Reghunath, Swetha R.
AU - Thunga, Girish
AU - Acharya U, Dinesh
AU - Shivashankara, K. N.
AU - Prabhu Attur, Ravindra
AU - Acharya, Leelavathi D.
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2024/10
Y1 - 2024/10
N2 - Background: Diabetic kidney disease (DKD) is a diabetic microvascular complication often characterized by an unpredictable progression. Hence, early detection and recognition of patients vulnerable to progression is crucial. Objective: To develop a prediction model to identify the stages of DKD and the factors contributing to progression to each stage using machine learning. Methodology: A retrospective study was conducted in a South Indian tertiary care hospital and collected the details of patients diagnosed with DKD from January 2017 to January 2022. Bayesian optimization-based machine learning techniques such as classification and regression were employed. The model was developed with the help of an optimization framework that effectively balances classification, prediction accuracy, and explainability. Results: Of the 311 patients diagnosed with DKD, 227 were selected for the study. A system for predicting DKD has been created for a patient dataset utilizing a variety of machine-learning approaches. The eXtreme gradient (XG) Boost method excelled, achieving 88.75% accuracy, 88.57% precision, 91.4% sensitivity,100% specificity, and 89.49% F1-score. An interpretable data-driven method highlights significant features for early DKD diagnosis. The best explainable prediction model uses the XG Boost classifier, revealing serum uric acid, urea, phosphorous, red blood cells, calcium, and absolute eosinophil count as the major predictors influencing the progression of DKD. In the case of regression models, the gradient boost regressor performed the best, with an R2 score of 0.97. Conclusion: Machine learning algorithms can effectively predict the stages of DKD and thus help physicians in providing patients with personalized care at the right time.
AB - Background: Diabetic kidney disease (DKD) is a diabetic microvascular complication often characterized by an unpredictable progression. Hence, early detection and recognition of patients vulnerable to progression is crucial. Objective: To develop a prediction model to identify the stages of DKD and the factors contributing to progression to each stage using machine learning. Methodology: A retrospective study was conducted in a South Indian tertiary care hospital and collected the details of patients diagnosed with DKD from January 2017 to January 2022. Bayesian optimization-based machine learning techniques such as classification and regression were employed. The model was developed with the help of an optimization framework that effectively balances classification, prediction accuracy, and explainability. Results: Of the 311 patients diagnosed with DKD, 227 were selected for the study. A system for predicting DKD has been created for a patient dataset utilizing a variety of machine-learning approaches. The eXtreme gradient (XG) Boost method excelled, achieving 88.75% accuracy, 88.57% precision, 91.4% sensitivity,100% specificity, and 89.49% F1-score. An interpretable data-driven method highlights significant features for early DKD diagnosis. The best explainable prediction model uses the XG Boost classifier, revealing serum uric acid, urea, phosphorous, red blood cells, calcium, and absolute eosinophil count as the major predictors influencing the progression of DKD. In the case of regression models, the gradient boost regressor performed the best, with an R2 score of 0.97. Conclusion: Machine learning algorithms can effectively predict the stages of DKD and thus help physicians in providing patients with personalized care at the right time.
UR - http://www.scopus.com/inward/record.url?scp=85198315599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85198315599&partnerID=8YFLogxK
U2 - 10.1016/j.ijmedinf.2024.105546
DO - 10.1016/j.ijmedinf.2024.105546
M3 - Article
AN - SCOPUS:85198315599
SN - 1386-5056
VL - 190
JO - International Journal of Medical Informatics
JF - International Journal of Medical Informatics
M1 - 105546
ER -