利用 NHANES 数据(1999-2018 年)的机器学习方法预测美国成人多病虚弱情况

{"title":"利用 NHANES 数据(1999-2018 年)的机器学习方法预测美国成人多病虚弱情况","authors":"","doi":"10.1016/j.cmpbup.2024.100164","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>The global increase in an aging population has led to more common age-related health challenges, particularly multimorbidity and frailty, but there is a significant gap.</p></div><div><h3>Methods</h3><p>This cross-sectional study utilized data from the National Health and Nutrition Examination Survey (1999–2018). The association between age and frailty was assessed using a restricted cubic spline (RCS) model, while weighted adjusted multivariable logistic regression evaluated the effect of diseases to frailty. And in machine learning process, feature selection for the frailty prediction model involved three algorithms. The model's performance was optimized using nested cross-validation and tested with various algorithms including decision tree, Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting (XGBoost). We used areas under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AU-PRC) to evaluate six algorithms, select the optimal model, and test the discrimination and consistency of the optimal model.</p></div><div><h3>Results</h3><p>The study included 46,187 participants, with 6,009 cases of frailty. RCS analysis showed a non-linear association between age and frailty, with a turning point at 49 years. Key impacting variables identified are Anemia, Arthritis, Diabetes Mellitus, Coronary Heart Disease, and Hypertension. In the machine learning process, we selected the optimal data set by feature selection, including 13 variables. Through nested cross-validation, a total of 31,900 models were built using 6 algorithms. And the XGBoost model showed the highest performance (AUC = 0.8828 and AU-PRC = 0.624), and clear proficiency in both discrimination and calibration.</p></div><div><h3>Conclusions</h3><p>We found 49 years maintain the balance of physiological reserve and external aggression. In addition, chronic diseases are trigger factor of frailty, while acute diseases are contributing factor that exacerbates the body's rapid decline. Last, the XGBoost frailty prediction model, with its simplicity, high performance and high clinical value holds potential for clinical application.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000314/pdfft?md5=b2ac2f1faea71ce864789e43929be852&pid=1-s2.0-S2666990024000314-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Machine learning approaches for predicting frailty base on multimorbidities in US adults using NHANES data (1999–2018)\",\"authors\":\"\",\"doi\":\"10.1016/j.cmpbup.2024.100164\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>The global increase in an aging population has led to more common age-related health challenges, particularly multimorbidity and frailty, but there is a significant gap.</p></div><div><h3>Methods</h3><p>This cross-sectional study utilized data from the National Health and Nutrition Examination Survey (1999–2018). The association between age and frailty was assessed using a restricted cubic spline (RCS) model, while weighted adjusted multivariable logistic regression evaluated the effect of diseases to frailty. And in machine learning process, feature selection for the frailty prediction model involved three algorithms. The model's performance was optimized using nested cross-validation and tested with various algorithms including decision tree, Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting (XGBoost). We used areas under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AU-PRC) to evaluate six algorithms, select the optimal model, and test the discrimination and consistency of the optimal model.</p></div><div><h3>Results</h3><p>The study included 46,187 participants, with 6,009 cases of frailty. RCS analysis showed a non-linear association between age and frailty, with a turning point at 49 years. Key impacting variables identified are Anemia, Arthritis, Diabetes Mellitus, Coronary Heart Disease, and Hypertension. In the machine learning process, we selected the optimal data set by feature selection, including 13 variables. Through nested cross-validation, a total of 31,900 models were built using 6 algorithms. And the XGBoost model showed the highest performance (AUC = 0.8828 and AU-PRC = 0.624), and clear proficiency in both discrimination and calibration.</p></div><div><h3>Conclusions</h3><p>We found 49 years maintain the balance of physiological reserve and external aggression. In addition, chronic diseases are trigger factor of frailty, while acute diseases are contributing factor that exacerbates the body's rapid decline. Last, the XGBoost frailty prediction model, with its simplicity, high performance and high clinical value holds potential for clinical application.</p></div>\",\"PeriodicalId\":72670,\"journal\":{\"name\":\"Computer methods and programs in biomedicine update\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666990024000314/pdfft?md5=b2ac2f1faea71ce864789e43929be852&pid=1-s2.0-S2666990024000314-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine update\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666990024000314\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine update","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666990024000314","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景全球老龄化人口的增加导致了更常见的与年龄相关的健康挑战,尤其是多病症和虚弱,但目前还存在很大差距。使用受限立方样条(RCS)模型评估了年龄与虚弱之间的关联,而加权调整多变量逻辑回归评估了疾病对虚弱的影响。在机器学习过程中,虚弱预测模型的特征选择涉及三种算法。我们使用嵌套交叉验证对模型的性能进行了优化,并使用多种算法进行了测试,包括决策树、逻辑回归、k-近邻、随机森林、递归分区和回归树以及极梯度提升(XGBoost)。我们使用接收者操作特征曲线下面积(AUC)和精确度-召回曲线下面积(AU-PRC)对六种算法进行了评估,选出了最优模型,并测试了最优模型的区分度和一致性。RCS 分析表明,年龄与虚弱之间存在非线性关系,49 岁时出现转折点。主要影响变量包括贫血、关节炎、糖尿病、冠心病和高血压。在机器学习过程中,我们通过特征选择选出了最佳数据集,其中包括 13 个变量。通过嵌套交叉验证,共使用 6 种算法建立了 31900 个模型。而 XGBoost 模型表现出了最高的性能(AUC = 0.8828 和 AU-PRC = 0.624),并且在判别和校准方面都有明显的优势。此外,慢性疾病是体弱的诱发因素,而急性疾病则是加剧身体快速衰退的诱因。最后,XGBoost 虚弱预测模型具有简单、高性能和高临床价值的特点,具有临床应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Machine learning approaches for predicting frailty base on multimorbidities in US adults using NHANES data (1999–2018)

Background

The global increase in an aging population has led to more common age-related health challenges, particularly multimorbidity and frailty, but there is a significant gap.

Methods

This cross-sectional study utilized data from the National Health and Nutrition Examination Survey (1999–2018). The association between age and frailty was assessed using a restricted cubic spline (RCS) model, while weighted adjusted multivariable logistic regression evaluated the effect of diseases to frailty. And in machine learning process, feature selection for the frailty prediction model involved three algorithms. The model's performance was optimized using nested cross-validation and tested with various algorithms including decision tree, Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting (XGBoost). We used areas under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AU-PRC) to evaluate six algorithms, select the optimal model, and test the discrimination and consistency of the optimal model.

Results

The study included 46,187 participants, with 6,009 cases of frailty. RCS analysis showed a non-linear association between age and frailty, with a turning point at 49 years. Key impacting variables identified are Anemia, Arthritis, Diabetes Mellitus, Coronary Heart Disease, and Hypertension. In the machine learning process, we selected the optimal data set by feature selection, including 13 variables. Through nested cross-validation, a total of 31,900 models were built using 6 algorithms. And the XGBoost model showed the highest performance (AUC = 0.8828 and AU-PRC = 0.624), and clear proficiency in both discrimination and calibration.

Conclusions

We found 49 years maintain the balance of physiological reserve and external aggression. In addition, chronic diseases are trigger factor of frailty, while acute diseases are contributing factor that exacerbates the body's rapid decline. Last, the XGBoost frailty prediction model, with its simplicity, high performance and high clinical value holds potential for clinical application.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.90
自引率
0.00%
发文量
0
审稿时长
10 weeks
期刊最新文献
Fostering digital health literacy to enhance trust and improve health outcomes Machine learning from real data: A mental health registry case study ResfEANet: ResNet-fused External Attention Network for Tuberculosis Diagnosis using Chest X-ray Images Role-playing recovery in social virtual worlds: Adult use of child avatars as PTSD therapy Comparative evaluation of low-cost 3D scanning devices for ear acquisition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1