Development and validation of machine learning models for MASLD: based on multiple potential screening indicators.

IF 4.6 2区 医学 Q2 ENDOCRINOLOGY & METABOLISM Frontiers in Endocrinology Pub Date : 2025-01-21 eCollection Date: 2024-01-01 DOI:10.3389/fendo.2024.1449064
Hao Chen, Jingjing Zhang, Xueqin Chen, Ling Luo, Wenjiao Dong, Yongjie Wang, Jiyu Zhou, Canjin Chen, Wenhao Wang, Wenbin Zhang, Zhiyi Zhang, Yongguang Cai, Danli Kong, Yuanlin Ding
{"title":"Development and validation of machine learning models for MASLD: based on multiple potential screening indicators.","authors":"Hao Chen, Jingjing Zhang, Xueqin Chen, Ling Luo, Wenjiao Dong, Yongjie Wang, Jiyu Zhou, Canjin Chen, Wenhao Wang, Wenbin Zhang, Zhiyi Zhang, Yongguang Cai, Danli Kong, Yuanlin Ding","doi":"10.3389/fendo.2024.1449064","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Multifaceted factors play a crucial role in the prevention and treatment of metabolic dysfunction-associated steatotic liver disease (MASLD). This study aimed to utilize multifaceted indicators to construct MASLD risk prediction machine learning models and explore the core factors within these models.</p><p><strong>Methods: </strong>MASLD risk prediction models were constructed based on seven machine learning algorithms using all variables, insulin-related variables, demographic characteristics variables, and other indicators, respectively. Subsequently, the partial dependence plot(PDP) method and SHapley Additive exPlanations (SHAP) were utilized to explain the roles of important variables in the model to filter out the optimal indicators for constructing the MASLD risk model.</p><p><strong>Results: </strong>Ranking the feature importance of the Random Forest (RF) model and eXtreme Gradient Boosting (XGBoost) model constructed using all variables found that both homeostasis model assessment of insulin resistance (HOMA-IR) and triglyceride glucose-waist circumference (TyG-WC) were the first and second most important variables. The MASLD risk prediction model constructed using the variables with top 10 importance was superior to the previous model. The PDP and SHAP methods were further utilized to screen the best indicators (including HOMA-IR, TyG-WC, age, aspartate aminotransferase (AST), and ethnicity) for constructing the model, and the mean area under the curve value of the models was 0.960.</p><p><strong>Conclusions: </strong>HOMA-IR and TyG-WC are core factors in predicting MASLD risk. Ultimately, our study constructed the optimal MASLD risk prediction model using HOMA-IR, TyG-WC, age, AST, and ethnicity.</p>","PeriodicalId":12447,"journal":{"name":"Frontiers in Endocrinology","volume":"15 ","pages":"1449064"},"PeriodicalIF":4.6000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11790477/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Endocrinology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fendo.2024.1449064","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Multifaceted factors play a crucial role in the prevention and treatment of metabolic dysfunction-associated steatotic liver disease (MASLD). This study aimed to utilize multifaceted indicators to construct MASLD risk prediction machine learning models and explore the core factors within these models.

Methods: MASLD risk prediction models were constructed based on seven machine learning algorithms using all variables, insulin-related variables, demographic characteristics variables, and other indicators, respectively. Subsequently, the partial dependence plot(PDP) method and SHapley Additive exPlanations (SHAP) were utilized to explain the roles of important variables in the model to filter out the optimal indicators for constructing the MASLD risk model.

Results: Ranking the feature importance of the Random Forest (RF) model and eXtreme Gradient Boosting (XGBoost) model constructed using all variables found that both homeostasis model assessment of insulin resistance (HOMA-IR) and triglyceride glucose-waist circumference (TyG-WC) were the first and second most important variables. The MASLD risk prediction model constructed using the variables with top 10 importance was superior to the previous model. The PDP and SHAP methods were further utilized to screen the best indicators (including HOMA-IR, TyG-WC, age, aspartate aminotransferase (AST), and ethnicity) for constructing the model, and the mean area under the curve value of the models was 0.960.

Conclusions: HOMA-IR and TyG-WC are core factors in predicting MASLD risk. Ultimately, our study constructed the optimal MASLD risk prediction model using HOMA-IR, TyG-WC, age, AST, and ethnicity.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MASLD机器学习模型的开发和验证:基于多个潜在筛选指标。
背景:多种因素在代谢功能障碍相关脂肪变性肝病(MASLD)的预防和治疗中起关键作用。本研究旨在利用多方面指标构建MASLD风险预测机器学习模型,并探索这些模型中的核心因素。方法:分别使用所有变量、胰岛素相关变量、人口统计学特征变量和其他指标,基于7种机器学习算法构建MASLD风险预测模型。随后,利用偏相关图(PDP)法和SHapley加性解释(SHAP)法解释模型中重要变量的作用,筛选出构建MASLD风险模型的最优指标。结果:对随机森林(RF)模型和使用所有变量构建的极端梯度增强(XGBoost)模型的特征重要性排序发现,胰岛素抵抗的稳态模型评估(HOMA-IR)和甘油三酯葡萄糖-腰围(TyG-WC)是第一和第二重要的变量。利用重要性前10的变量构建的MASLD风险预测模型优于之前的模型。进一步利用PDP和SHAP方法筛选构建模型的最佳指标(HOMA-IR、TyG-WC、年龄、天冬氨酸转氨酶(AST)和种族),模型曲线下平均面积为0.960。结论:HOMA-IR和TyG-WC是预测MASLD风险的核心因素。最终,我们的研究利用HOMA-IR、TyG-WC、年龄、AST和种族构建了最优的MASLD风险预测模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Endocrinology
Frontiers in Endocrinology Medicine-Endocrinology, Diabetes and Metabolism
CiteScore
5.70
自引率
9.60%
发文量
3023
审稿时长
14 weeks
期刊介绍: Frontiers in Endocrinology is a field journal of the "Frontiers in" journal series. In today’s world, endocrinology is becoming increasingly important as it underlies many of the challenges societies face - from obesity and diabetes to reproduction, population control and aging. Endocrinology covers a broad field from basic molecular and cellular communication through to clinical care and some of the most crucial public health issues. The journal, thus, welcomes outstanding contributions in any domain of endocrinology. Frontiers in Endocrinology publishes articles on the most outstanding discoveries across a wide research spectrum of Endocrinology. The mission of Frontiers in Endocrinology is to bring all relevant Endocrinology areas together on a single platform.
期刊最新文献
Association of a novel index, the total cholesterol-high-density lipoprotein-glucose index, with diabetes mellitus in Chinese adults: a retrospective cohort study. Atherogenic index of plasma and cardiovascular high-risk status in the ChinaHEART luohe cohort: multivariable association modeling with nonlinear dose-response and effect heterogeneity. Comparative efficacy and safety of antiresorptive and anabolic therapies for male osteoporosis: an updated Bayesian network meta-analysis. Clinical and endocrine effects of pharmacological therapy in endometriosis: a systematic review and meta-analysis. Comparing the response of triple therapy and conventional treatment in male congenital hypogonadotropic hypogonadism: a randomized controlled trial.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1