Development and evaluation of interpretable machine learning regressors for predicting femoral neck bone mineral density in elderly men using NHANES data.

0 MEDICINE, RESEARCH & EXPERIMENTAL Biomolecules & biomedicine Pub Date : 2025-01-14 DOI:10.17305/bb.2024.10725
Wen He, Song Chen, Xianghong Fu, Licong Xu, Jun Xie, Jinxing Wan
{"title":"Development and evaluation of interpretable machine learning regressors for predicting femoral neck bone mineral density in elderly men using NHANES data.","authors":"Wen He, Song Chen, Xianghong Fu, Licong Xu, Jun Xie, Jinxing Wan","doi":"10.17305/bb.2024.10725","DOIUrl":null,"url":null,"abstract":"<p><p>Osteoporotic femoral neck fractures (OFNFs) pose a significant orthopedic challenge in the elderly population, accounting for up to 40% of all osteoporotic fractures and leading to considerable health deterioration and increased mortality. In addressing the critical need for early identification of osteoporosis through routine screening of femoral neck bone mineral density (FNBMD), this study developed a user-friendly prediction model aimed at men aged 50 years and older, a demographic often overlooked in osteoporosis screening. Utilizing data from the National Health and Nutrition Examination Survey (NHANES), the study involved outlier detection and handling, missing value imputation via the K nearest neighbor (KNN) algorithm, and data normalization and encoding. The dataset was split into training and test sets with a 7:3 ratio, followed by feature screening through the least absolute shrinkage and selection operator (LASSO) and the Boruta algorithm. Eight different machine learning algorithms were then employed to construct predictive models, with their performance evaluated through a comprehensive metric suite. The random forest regressor (RFR) emerged as the most effective model, characterized by key predictors such as age, body mass index (BMI), poverty income ratio (PIR), serum calcium, and race, achieving a coefficient of determination (R²) of 0.218 and maintaining robustness in sensitivity analyses. Notably, excluding race from the model resulted in sustained high performance, underscoring the model's adaptability. Interpretations using Shapley additive explanations (SHAP) highlighted the influence of each feature on FNBMD. These findings indicate that our predictive model effectively aids in the early detection of osteoporosis, potentially reducing the incidence of OFNFs in this high-risk population.</p>","PeriodicalId":72398,"journal":{"name":"Biomolecules & biomedicine","volume":" ","pages":"375-390"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11734819/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomolecules & biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17305/bb.2024.10725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Osteoporotic femoral neck fractures (OFNFs) pose a significant orthopedic challenge in the elderly population, accounting for up to 40% of all osteoporotic fractures and leading to considerable health deterioration and increased mortality. In addressing the critical need for early identification of osteoporosis through routine screening of femoral neck bone mineral density (FNBMD), this study developed a user-friendly prediction model aimed at men aged 50 years and older, a demographic often overlooked in osteoporosis screening. Utilizing data from the National Health and Nutrition Examination Survey (NHANES), the study involved outlier detection and handling, missing value imputation via the K nearest neighbor (KNN) algorithm, and data normalization and encoding. The dataset was split into training and test sets with a 7:3 ratio, followed by feature screening through the least absolute shrinkage and selection operator (LASSO) and the Boruta algorithm. Eight different machine learning algorithms were then employed to construct predictive models, with their performance evaluated through a comprehensive metric suite. The random forest regressor (RFR) emerged as the most effective model, characterized by key predictors such as age, body mass index (BMI), poverty income ratio (PIR), serum calcium, and race, achieving a coefficient of determination (R²) of 0.218 and maintaining robustness in sensitivity analyses. Notably, excluding race from the model resulted in sustained high performance, underscoring the model's adaptability. Interpretations using Shapley additive explanations (SHAP) highlighted the influence of each feature on FNBMD. These findings indicate that our predictive model effectively aids in the early detection of osteoporosis, potentially reducing the incidence of OFNFs in this high-risk population.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 NHANES 数据开发和评估用于预测老年男性股骨颈骨矿物质密度的可解释机器学习回归因子。
骨质疏松性股骨颈骨折(OFNFs)对老年人群的骨科治疗构成了巨大挑战,在所有骨质疏松性骨折中占比高达 40%,并导致严重的健康恶化和死亡率上升。为了满足通过股骨颈骨密度(FNBMD)常规筛查早期识别骨质疏松症的迫切需求,本研究针对骨质疏松症筛查中经常被忽视的 50 岁及以上男性人群开发了一个用户友好型预测模型。这项研究利用了美国国家健康与营养调查(NHANES)的数据,包括离群值的检测和处理、通过 K 近邻(KNN)算法对缺失值进行估算,以及数据归一化和编码。数据集以 7:3 的比例分成训练集和测试集,然后通过最小绝对收缩和选择算子(LASSO)和 Boruta 算法进行特征筛选。然后采用八种不同的机器学习算法构建预测模型,并通过一套综合指标对其性能进行评估。随机森林回归模型(RFR)是最有效的模型,它以年龄、体重指数(BMI)、贫困收入比(PIR)、血清钙和种族等关键预测因子为特征,决定系数(R²)达到 0.218,并在敏感性分析中保持稳健性。值得注意的是,将种族排除在模型之外也能保持较高的性能,这凸显了模型的适应性。使用夏普利加法解释(SHAP)进行的解释强调了每个特征对全骨畸形的影响。这些研究结果表明,我们的预测模型可以有效地帮助早期检测骨质疏松症,从而降低高危人群中 OFNF 的发病率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.10
自引率
0.00%
发文量
0
期刊最新文献
Clinical profile and risk factors for respiratory failure in children with Mycoplasma pneumoniae infection. Association between diabetes mellitus and tinnitus: A meta-analysis. Long-term smoking contributes to aging frailty and inflammatory response. Deep learning approach based on a patch residual for pediatric supracondylar subtle fracture detection. The molecular mechanisms of cuproptosis and its relevance to atherosclerosis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1