Explainable Machine Learning-Based Prediction Model for Diabetic Nephropathy

IF 3.6 3区 医学 Q2 ENDOCRINOLOGY & METABOLISM Journal of Diabetes Research Pub Date : 2024-01-20 DOI:10.1155/2024/8857453
Jing-Mei Yin, Yang Li, Jun-Tang Xue, Guo-Wei Zong, Zhong-Ze Fang, Lang Zou
{"title":"Explainable Machine Learning-Based Prediction Model for Diabetic Nephropathy","authors":"Jing-Mei Yin, Yang Li, Jun-Tang Xue, Guo-Wei Zong, Zhong-Ze Fang, Lang Zou","doi":"10.1155/2024/8857453","DOIUrl":null,"url":null,"abstract":"The aim of this study is to analyze the effect of serum metabolites on diabetic nephropathy (DN) and predict the prevalence of DN through a machine learning approach. The dataset consists of 548 patients from April 2018 to April 2019 in the Second Affiliated Hospital of Dalian Medical University (SAHDMU). We select the optimal 38 features through a least absolute shrinkage and selection operator (LASSO) regression model and a 10-fold cross-validation. We compare four machine learning algorithms, including extreme gradient boosting (XGB), random forest, decision tree, and logistic regression, by AUC-ROC curves, decision curves, and calibration curves. We quantify feature importance and interaction effects in the optimal predictive model by Shapley additive explanation (SHAP) method. The XGB model has the best performance to screen for DN with the highest AUC value of 0.966. The XGB model also gains more clinical net benefits than others, and the fitting degree is better. In addition, there are significant interactions between serum metabolites and duration of diabetes. We develop a predictive model by XGB algorithm to screen for DN. C2, C5DC, Tyr, Ser, Met, C24, C4DC, and Cys have great contribution in the model and can possibly be biomarkers for DN.","PeriodicalId":15576,"journal":{"name":"Journal of Diabetes Research","volume":"14 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Diabetes Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1155/2024/8857453","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

The aim of this study is to analyze the effect of serum metabolites on diabetic nephropathy (DN) and predict the prevalence of DN through a machine learning approach. The dataset consists of 548 patients from April 2018 to April 2019 in the Second Affiliated Hospital of Dalian Medical University (SAHDMU). We select the optimal 38 features through a least absolute shrinkage and selection operator (LASSO) regression model and a 10-fold cross-validation. We compare four machine learning algorithms, including extreme gradient boosting (XGB), random forest, decision tree, and logistic regression, by AUC-ROC curves, decision curves, and calibration curves. We quantify feature importance and interaction effects in the optimal predictive model by Shapley additive explanation (SHAP) method. The XGB model has the best performance to screen for DN with the highest AUC value of 0.966. The XGB model also gains more clinical net benefits than others, and the fitting degree is better. In addition, there are significant interactions between serum metabolites and duration of diabetes. We develop a predictive model by XGB algorithm to screen for DN. C2, C5DC, Tyr, Ser, Met, C24, C4DC, and Cys have great contribution in the model and can possibly be biomarkers for DN.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的糖尿病肾病可解释预测模型
本研究旨在分析血清代谢物对糖尿病肾病(DN)的影响,并通过机器学习方法预测DN的患病率。数据集由大连医科大学附属第二医院(SAHDMU)2018年4月至2019年4月的548名患者组成。我们通过最小绝对收缩和选择算子(LASSO)回归模型和 10 倍交叉验证来选择最佳的 38 个特征。我们通过AUC-ROC曲线、决策曲线和校准曲线比较了四种机器学习算法,包括极梯度提升算法(XGB)、随机森林算法、决策树算法和逻辑回归算法。我们通过夏普利加法解释(SHAP)方法量化了最优预测模型中的特征重要性和交互效应。XGB 模型在筛查 DN 方面表现最佳,其 AUC 值最高,为 0.966。XGB 模型也比其他模型获得了更多的临床净效益,拟合度也更好。此外,血清代谢物与糖尿病病程之间存在明显的交互作用。我们通过 XGB 算法建立了一个预测模型来筛查 DN。在该模型中,C2、C5DC、Tyr、Ser、Met、C24、C4DC 和 Cys 的贡献较大,有可能成为 DN 的生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Diabetes Research
Journal of Diabetes Research ENDOCRINOLOGY & METABOLISM-MEDICINE, RESEARCH & EXPERIMENTAL
CiteScore
8.40
自引率
2.30%
发文量
152
审稿时长
14 weeks
期刊介绍: Journal of Diabetes Research is a peer-reviewed, Open Access journal that publishes research articles, review articles, and clinical studies related to type 1 and type 2 diabetes. The journal welcomes submissions focusing on the epidemiology, etiology, pathogenesis, management, and prevention of diabetes, as well as associated complications, such as diabetic retinopathy, neuropathy and nephropathy.
期刊最新文献
Understanding Experiences of Diabetes Distress: A Systematic Review and Thematic Synthesis. Prevalence of Hypertension and Its Clinical and Psychological Factors in Type 2 Diabetes Patients in Ghana: A Secondary Analysis of a Cross-Sectional Study. Total Water-Soluble Flavonoids From Lithocarpus litseifolius (Hance) Chun (Sweet Tea) Improve Glucose Homeostasis Through Multitarget Signalling in GDM Mice. Lipid Accumulation Product as a Predictor of Prediabetes and Diabetes: Insights From NHANES Data (1999-2018). Visceral Adiposity as an Independent Risk Factor for Diabetic Peripheral Neuropathy in Type 2 Diabetes Mellitus: A Retrospective Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1