Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-11-01 eCollection Date: 2024-01-01 DOI:10.7717/peerj.18436

Jialin Du, Jie Gao, Jie Guan, Bo Jin, Nan Duan, Lu Pang, Haiming Huang, Qian Ma, Chenwei Huang, Haixia Li

{"title":"Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study.","authors":"Jialin Du, Jie Gao, Jie Guan, Bo Jin, Nan Duan, Lu Pang, Haiming Huang, Qian Ma, Chenwei Huang, Haixia Li","doi":"10.7717/peerj.18436","DOIUrl":null,"url":null,"abstract":"Background and objective: Chronic kidney disease (CKD) is a major public health issue, and accurate prediction of the progression of kidney failure is critical for clinical decision-making and helps improve patient outcomes. As such, we aimed to develop and externally validate a machine-learned model to predict the progression of CKD using common laboratory variables, demographic characteristics, and an electronic health records database.Methods: We developed a predictive model using longitudinal clinical data from a single center for Chinese CKD patients. The cohort included 987 patients who were followed up for more than 24 months. Fifty-three laboratory features were considered for inclusion in the model. The primary outcome in our study was an estimated glomerular filtration rate ≤15 mL/min/1.73 m2 or kidney failure. Machine learning algorithms were applied to the modeling dataset (n = 296), and an external dataset (n = 71) was used for model validation. We assessed model discrimination via area under the curve (AUC) values, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.Results: Over a median follow-up period of 3.75 years, 148 patients experienced kidney failure. The optimal model was based on stacking different classifier algorithms with six laboratory features, including 24-h urine protein, potassium, glucose, urea, prealbumin and total protein. The model had considerable predictive power, with AUC values of 0.896 and 0.771 in the validation and external datasets, respectively. This model also accurately predicted the progression of renal function in patients over different follow-up periods after their initial assessment.Conclusions: A prediction model that leverages routinely collected laboratory features in the Chinese population can accurately identify patients with CKD at high risk of progressing to kidney failure. An online version of the model can be easily and quickly applied in clinical management and treatment.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11533905/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18436","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}

引用次数: 0

Abstract

Background and objective: Chronic kidney disease (CKD) is a major public health issue, and accurate prediction of the progression of kidney failure is critical for clinical decision-making and helps improve patient outcomes. As such, we aimed to develop and externally validate a machine-learned model to predict the progression of CKD using common laboratory variables, demographic characteristics, and an electronic health records database.

Methods: We developed a predictive model using longitudinal clinical data from a single center for Chinese CKD patients. The cohort included 987 patients who were followed up for more than 24 months. Fifty-three laboratory features were considered for inclusion in the model. The primary outcome in our study was an estimated glomerular filtration rate ≤15 mL/min/1.73 m² or kidney failure. Machine learning algorithms were applied to the modeling dataset (n = 296), and an external dataset (n = 71) was used for model validation. We assessed model discrimination via area under the curve (AUC) values, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.

Results: Over a median follow-up period of 3.75 years, 148 patients experienced kidney failure. The optimal model was based on stacking different classifier algorithms with six laboratory features, including 24-h urine protein, potassium, glucose, urea, prealbumin and total protein. The model had considerable predictive power, with AUC values of 0.896 and 0.771 in the validation and external datasets, respectively. This model also accurately predicted the progression of renal function in patients over different follow-up periods after their initial assessment.

Conclusions: A prediction model that leverages routinely collected laboratory features in the Chinese population can accurately identify patients with CKD at high risk of progressing to kidney failure. An online version of the model can be easily and quickly applied in clinical management and treatment.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于实验室信息系统的堆叠集合法预测中国人群慢性肾病进展：一项回顾性研究。

背景和目的：慢性肾脏病（CKD）是一个重大的公共卫生问题，准确预测肾衰竭的进展对临床决策至关重要，有助于改善患者的预后。因此，我们旨在利用常见的实验室变量、人口统计学特征和电子健康记录数据库，开发并从外部验证一个机器学习模型来预测 CKD 的进展：方法：我们利用来自一个中心的纵向临床数据，为中国的 CKD 患者开发了一个预测模型。研究对象包括随访超过 24 个月的 987 名患者。模型中考虑了 53 项实验室特征。我们研究的主要结果是估计肾小球滤过率≤15 mL/min/1.73 m2或肾衰竭。建模数据集（n = 296）采用机器学习算法，模型验证采用外部数据集（n = 71）。我们通过曲线下面积（AUC）值、准确性、灵敏度、特异性、阳性预测值、阴性预测值和 F1 分数来评估模型的区分度：中位随访期为 3.75 年，148 名患者出现肾衰竭。最佳模型基于不同分类器算法与六种实验室特征的叠加，包括 24 小时尿蛋白、钾、葡萄糖、尿素、前白蛋白和总蛋白。该模型具有相当高的预测能力，在验证数据集和外部数据集中的AUC值分别为0.896和0.771。该模型还能准确预测患者在初次评估后不同随访期内肾功能的进展情况：结论：利用在中国人群中常规收集的实验室特征建立的预测模型可以准确地识别出有进展到肾衰竭高风险的慢性肾功能衰竭患者。该模型的在线版本可方便快捷地应用于临床管理和治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊