Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches

IF 1 Q4 ENDOCRINOLOGY & METABOLISM Diabetes epidemiology and management Pub Date : 2024-01-01 DOI:10.1016/j.deman.2023.100191
Marenao Tanaka , Yukinori Akiyama , Kazuma Mori , Itaru Hosaka , Kenichi Kato , Keisuke Endo , Toshifumi Ogawa , Tatsuya Sato , Toru Suzuki , Toshiyuki Yano , Hirofumi Ohnishi , Nagisa Hanawa , Masato Furuhashi
{"title":"Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches","authors":"Marenao Tanaka ,&nbsp;Yukinori Akiyama ,&nbsp;Kazuma Mori ,&nbsp;Itaru Hosaka ,&nbsp;Kenichi Kato ,&nbsp;Keisuke Endo ,&nbsp;Toshifumi Ogawa ,&nbsp;Tatsuya Sato ,&nbsp;Toru Suzuki ,&nbsp;Toshiyuki Yano ,&nbsp;Hirofumi Ohnishi ,&nbsp;Nagisa Hanawa ,&nbsp;Masato Furuhashi","doi":"10.1016/j.deman.2023.100191","DOIUrl":null,"url":null,"abstract":"<div><h3>Aims</h3><p>Machine learning (ML) approaches are beneficial when automatic identification of relevant features among numerous candidates is desired. We investigated the predictive ability of several ML models for new onset of diabetes mellitus.</p></div><div><h3>Methods</h3><p>In 10,248 subjects who received annual health examinations, 58 candidates including fatty liver index (FLI), which is calculated by using waist circumference, body mass index and levels of triglycerides and γ-glutamyl transferase, were used.</p></div><div><h3>Results</h3><p>During a 10-year follow-up period (mean period: 6.9 years), 322 subjects (6.5 %) in the training group (70 %, n=7,173) and 127 subjects (6.2 %) in the test group (30 %, n=3,075) had new onset of diabetes mellitus. Hemoglobin A1c, fasting glucose and FLI were identified as the top 3 predictors by random forest feature selection with 10-fold cross-validation. When hemoglobin A1c and FLI were used as the selected features, C-statistics analogous in receiver operating characteristic curve analysis in ML models including logistic regression, naïve Bayes, extreme gradient boosting and artificial neural network were 0.874, 0.869, 0.856 and 0.869, respectively. There was no significant difference in the discriminatory capacity among the ML models.</p></div><div><h3>Conclusions</h3><p>ML models incorporating hemoglobin A1c and FLI provide an accurate and straightforward approach for predicting the development of diabetes mellitus.</p></div>","PeriodicalId":72796,"journal":{"name":"Diabetes epidemiology and management","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666970623000707/pdfft?md5=29183cb351f691865659fdb42480574b&pid=1-s2.0-S2666970623000707-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diabetes epidemiology and management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666970623000707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Aims

Machine learning (ML) approaches are beneficial when automatic identification of relevant features among numerous candidates is desired. We investigated the predictive ability of several ML models for new onset of diabetes mellitus.

Methods

In 10,248 subjects who received annual health examinations, 58 candidates including fatty liver index (FLI), which is calculated by using waist circumference, body mass index and levels of triglycerides and γ-glutamyl transferase, were used.

Results

During a 10-year follow-up period (mean period: 6.9 years), 322 subjects (6.5 %) in the training group (70 %, n=7,173) and 127 subjects (6.2 %) in the test group (30 %, n=3,075) had new onset of diabetes mellitus. Hemoglobin A1c, fasting glucose and FLI were identified as the top 3 predictors by random forest feature selection with 10-fold cross-validation. When hemoglobin A1c and FLI were used as the selected features, C-statistics analogous in receiver operating characteristic curve analysis in ML models including logistic regression, naïve Bayes, extreme gradient boosting and artificial neural network were 0.874, 0.869, 0.856 and 0.869, respectively. There was no significant difference in the discriminatory capacity among the ML models.

Conclusions

ML models incorporating hemoglobin A1c and FLI provide an accurate and straightforward approach for predicting the development of diabetes mellitus.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用各种机器学习方法中的关键因素建立糖尿病发展预测模型
目的当需要从众多候选者中自动识别相关特征时,机器学习(ML)方法是非常有益的。方法 在接受年度健康检查的 10248 名受试者中,使用了包括脂肪肝指数(FLI)在内的 58 个候选指标,脂肪肝指数是通过腰围、体重指数以及甘油三酯和γ-谷氨酰转移酶水平计算得出的。结果在 10 年的随访期间(平均时间:6.9 年),培训组(70%,人数=7173)有 322 名受试者(6.5%)新发糖尿病,试验组(30%,人数=3075)有 127 名受试者(6.2%)新发糖尿病。通过随机森林特征选择和 10 倍交叉验证,血红蛋白 A1c、空腹血糖和 FLI 被确定为前 3 个预测因子。当使用血红蛋白 A1c 和 FLI 作为所选特征时,包括逻辑回归、奈夫贝叶斯、极端梯度提升和人工神经网络在内的多模型接收者工作特征曲线分析的 C 统计量分别为 0.874、0.869、0.856 和 0.869。结论 结合血红蛋白 A1c 和 FLI 的ML 模型为预测糖尿病的发展提供了一种准确而直接的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Diabetes epidemiology and management
Diabetes epidemiology and management Endocrinology, Diabetes and Metabolism, Public Health and Health Policy
CiteScore
1.10
自引率
0.00%
发文量
0
审稿时长
14 days
期刊最新文献
Assessment of vitamin D status in obese and non-obese patients: A case-control study The impact of the COVID-19 pandemic on diabetes-related mortality Diabetes in pregnancy: Anxious minds precarious times Primary care monitoring of HbA1c tests in type 2 diabetic patients in Northern France and impact of the successive COVID-19 pandemic containments Evaluation of a Support Tool for Diabetes Control in primary care – A Qualitative study with primary care staff
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1