人工智能识别未确诊的非酒精性脂肪性肝炎患者。

IF 2.3 Q2 ECONOMICS Journal of Health Economics and Outcomes Research Pub Date : 2024-09-25 eCollection Date: 2024-01-01 DOI:10.36469/001c.123645

Onur Baser, Gabriela Samayoa, Nehir Yapar, Erdem Baser

{"title":"人工智能识别未确诊的非酒精性脂肪性肝炎患者。","authors":"Onur Baser, Gabriela Samayoa, Nehir Yapar, Erdem Baser","doi":"10.36469/001c.123645","DOIUrl":null,"url":null,"abstract":"Background: Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. Objective: This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. Methods: From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. Results: In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. Conclusions: Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.","PeriodicalId":16012,"journal":{"name":"Journal of Health Economics and Outcomes Research","volume":"11 2","pages":"86-94"},"PeriodicalIF":2.3000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11441708/pdf/","citationCount":"0","resultStr":"{\"title\":\"Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis.\",\"authors\":\"Onur Baser, Gabriela Samayoa, Nehir Yapar, Erdem Baser\",\"doi\":\"10.36469/001c.123645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. Objective: This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. Methods: From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. Results: In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. Conclusions: Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.\",\"PeriodicalId\":16012,\"journal\":{\"name\":\"Journal of Health Economics and Outcomes Research\",\"volume\":\"11 2\",\"pages\":\"86-94\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11441708/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Health Economics and Outcomes Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36469/001c.123645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Health Economics and Outcomes Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36469/001c.123645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

背景：尽管非酒精性脂肪性肝炎（NASH）的发病率越来越高，但在临床实践中却常常得不到诊断。研究目的本研究利用机器学习算法在退伍军人事务（VA）医疗系统中识别出可能患有未确诊的非酒精性脂肪性肝炎的患者。方法从退伍军人事务部 2,500 万成年参保者的数据集中，将研究人群分为 NASH 阳性、非 NASH 和高危人群。我们使用机器学习算法对索赔数据进行了分析。为了建立模型，我们将研究人群随机分为 80% 的训练子集和 20% 的测试子集，并使用交叉验证技术进行测试和训练。除了基线模型外，我们还创建了梯度提升分类树、天真贝叶斯和随机森林模型，并使用接受者操作者特征、曲线下面积和准确率进行了比较。在全部 80% 的训练子集中对表现最好的模型进行再训练，并将其应用到 20% 的测试子集中以计算性能指标。结果：共有 4 223 443 例患者符合研究纳入标准，其中 4903 例为 NASH 阳性，35 528 例为非 NASH 患者。其余为高危患者，其中 514 997 名患者（12%）被确定为可能患有 NASH。年龄、肥胖和肝功能检测异常是确定 NASH 概率的首要因素。结论：利用机器学习预测 NASH 可以更广泛地识别、及时干预和有针对性的治疗，以改善或缓解疾病进展，可用作初步筛查工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis.

Background: Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. Objective: This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. Methods: From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. Results: In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. Conclusions: Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊