Explainable machine learning model for pre-frailty risk assessment in community-dwelling older adults

Health Care Science Pub Date : 2024-12-10 DOI:10.1002/hcs2.120

Chenlin Du, Zeyu Zhang, Baoqin Liu, Zijian Cao, Nan Jiang, Zongjiu Zhang

{"title":"Explainable machine learning model for pre-frailty risk assessment in community-dwelling older adults","authors":"Chenlin Du, Zeyu Zhang, Baoqin Liu, Zijian Cao, Nan Jiang, Zongjiu Zhang","doi":"10.1002/hcs2.120","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Frailty in older adults is linked to increased risks and lower quality of life. Pre-frailty, a condition preceding frailty, is intervenable, but its determinants and assessment are challenging. This study aims to develop and validate an explainable machine learning model for pre-frailty risk assessment among community-dwelling older adults.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>The study included 3141 adults aged 60 or above from the China Health and Retirement Longitudinal Study. Pre-frailty was characterized by one or two criteria from the physical frailty phenotype scale. We extracted 80 distinct features across seven dimensions to evaluate pre-frailty risk. A model was constructed using recursive feature elimination and a stacking-CatBoost distillation module on 80% of the sample and validated on a separate 20% holdout data set.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The study used data from 2508 community-dwelling older adults (mean age, 67.24 years [range, 60–96]; 1215 [48.44%] females) to develop a pre-frailty risk assessment model. We selected 57 predictive features and built a distilled CatBoost model, which achieved the highest discrimination (AUROC: 0.7560 [95% CI: 0.7169, 0.7928]) on the 20% holdout data set. The living city, BMI, and peak expiratory flow (PEF) were the three most significant contributors to pre-frailty risk. Physical and environmental factors were the top 2 impactful feature dimensions.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>An accurate and interpretable pre-frailty risk assessment framework using state-of-the-art machine learning techniques and explanation methods has been developed. Our framework incorporates a wide range of features and determinants, allowing for a comprehensive and nuanced understanding of pre-frailty risk.</p>\n </section>\n </div>","PeriodicalId":100601,"journal":{"name":"Health Care Science","volume":"3 6","pages":"426-437"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11671213/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Care Science","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hcs2.120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Frailty in older adults is linked to increased risks and lower quality of life. Pre-frailty, a condition preceding frailty, is intervenable, but its determinants and assessment are challenging. This study aims to develop and validate an explainable machine learning model for pre-frailty risk assessment among community-dwelling older adults.

Methods

The study included 3141 adults aged 60 or above from the China Health and Retirement Longitudinal Study. Pre-frailty was characterized by one or two criteria from the physical frailty phenotype scale. We extracted 80 distinct features across seven dimensions to evaluate pre-frailty risk. A model was constructed using recursive feature elimination and a stacking-CatBoost distillation module on 80% of the sample and validated on a separate 20% holdout data set.

Results

The study used data from 2508 community-dwelling older adults (mean age, 67.24 years [range, 60–96]; 1215 [48.44%] females) to develop a pre-frailty risk assessment model. We selected 57 predictive features and built a distilled CatBoost model, which achieved the highest discrimination (AUROC: 0.7560 [95% CI: 0.7169, 0.7928]) on the 20% holdout data set. The living city, BMI, and peak expiratory flow (PEF) were the three most significant contributors to pre-frailty risk. Physical and environmental factors were the top 2 impactful feature dimensions.

Conclusions

An accurate and interpretable pre-frailty risk assessment framework using state-of-the-art machine learning techniques and explanation methods has been developed. Our framework incorporates a wide range of features and determinants, allowing for a comprehensive and nuanced understanding of pre-frailty risk.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

可解释的机器学习模型在社区居住的老年人脆弱前风险评估。

背景：老年人的虚弱与风险增加和生活质量降低有关。脆弱前期是脆弱之前的一种状态，是可以干预的，但其决定因素和评估是具有挑战性的。本研究旨在开发和验证一个可解释的机器学习模型，用于社区居住老年人的脆弱性前风险评估。方法：研究对象为3141名来自中国健康与退休纵向研究的60岁及以上成年人。体质脆弱表型量表的一个或两个标准表征了体质脆弱前期。我们从7个维度中提取了80个不同的特征来评估脆弱性前的风险。在80%的样本上使用递归特征消除和stack - catboost蒸馏模块构建模型，并在单独的20%保留数据集上进行验证。结果：该研究使用了2508名社区居住老年人的数据(平均年龄67.24岁[范围60-96岁]；1215例（48.44%）女性)建立虚弱前风险评估模型。我们选择了57个预测特征，并建立了一个经过提炼的CatBoost模型，该模型在20%的holdout数据集上获得了最高的判别率（AUROC: 0.7560 [95% CI: 0.7169, 0.7928]）。生活城市、BMI和呼气流量峰值（PEF）是导致脆弱前风险的三个最重要因素。物理和环境因素是影响最大的两个特征维度。结论：利用最先进的机器学习技术和解释方法，开发了一个准确且可解释的脆弱性前风险评估框架。我们的框架包含了广泛的特征和决定因素，允许对脆弱前风险进行全面而细致的理解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Health Care Science

CiteScore

0.90

自引率

0.00%

发文量