Accurate prediction of all-cause mortality in patients with metabolic dysfunction-associated steatotic liver disease using electronic health records

IF 3.7 3区医学 Q2 GASTROENTEROLOGY & HEPATOLOGY Annals of hepatology Pub Date : 2024-07-04 DOI:10.1016/j.aohep.2024.101528

{"title":"Accurate prediction of all-cause mortality in patients with metabolic dysfunction-associated steatotic liver disease using electronic health records","authors":"","doi":"10.1016/j.aohep.2024.101528","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction and Objectives</h3><p>Despite the huge clinical burden of MASLD, validated tools for early risk stratification are lacking, and heterogeneous disease expression and a highly variable rate of progression to clinical outcomes result in prognostic uncertainty. We aimed to investigate longitudinal electronic health record-based outcome prediction in MASLD using a state-of-the-art machine learning model.</p></div><div><h3>Patients and Methods</h3><p><em>n</em> = 940 patients with histologically-defined MASLD were used to develop a deep-learning model for all-cause mortality prediction. Patient timelines, spanning 12 years, were fully-annotated with demographic/clinical characteristics, ICD-9 and -10 codes, blood test results, prescribing data, and secondary care activity. A Transformer neural network (TNN) was trained to output concomitant probabilities of 12-, 24-, and 36-month all-cause mortality. In-sample performance was assessed using 5-fold cross-validation. Out-of-sample performance was assessed in an independent set of <em>n</em> = 528 MASLD patients.</p></div><div><h3>Results</h3><p>In-sample model performance achieved AUROC curve 0.74–0.90 (95 % CI: 0.72–0.94), sensitivity 64 %-82 %, specificity 75 %–92 % and Positive Predictive Value (PPV) 94 %-98 %. Out-of-sample model validation had AUROC 0.70–0.86 (95 % CI: 0.67–0.90), sensitivity 69 %–70 %, specificity 96 %–97 % and PPV 75 %–77 %. Key predictive factors, identified using coefficients of determination, were age, presence of type 2 diabetes, and history of hospital admissions with length of stay >14 days.</p></div><div><h3>Conclusions</h3><p>A TNN, applied to routinely-collected longitudinal electronic health records, achieved good performance in prediction of 12-, 24-, and 36-month all-cause mortality in patients with MASLD. Extrapolation of our technique to population-level data will enable scalable and accurate risk stratification to identify people most likely to benefit from anticipatory health care and personalized interventions.</p></div>","PeriodicalId":7979,"journal":{"name":"Annals of hepatology","volume":"29 5","pages":"Article 101528"},"PeriodicalIF":3.7000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1665268124003223/pdfft?md5=78eda29d27beb056fd99526cd9d223f2&pid=1-s2.0-S1665268124003223-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of hepatology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1665268124003223","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction and Objectives

Despite the huge clinical burden of MASLD, validated tools for early risk stratification are lacking, and heterogeneous disease expression and a highly variable rate of progression to clinical outcomes result in prognostic uncertainty. We aimed to investigate longitudinal electronic health record-based outcome prediction in MASLD using a state-of-the-art machine learning model.

Patients and Methods

n = 940 patients with histologically-defined MASLD were used to develop a deep-learning model for all-cause mortality prediction. Patient timelines, spanning 12 years, were fully-annotated with demographic/clinical characteristics, ICD-9 and -10 codes, blood test results, prescribing data, and secondary care activity. A Transformer neural network (TNN) was trained to output concomitant probabilities of 12-, 24-, and 36-month all-cause mortality. In-sample performance was assessed using 5-fold cross-validation. Out-of-sample performance was assessed in an independent set of n = 528 MASLD patients.

Results

In-sample model performance achieved AUROC curve 0.74–0.90 (95 % CI: 0.72–0.94), sensitivity 64 %-82 %, specificity 75 %–92 % and Positive Predictive Value (PPV) 94 %-98 %. Out-of-sample model validation had AUROC 0.70–0.86 (95 % CI: 0.67–0.90), sensitivity 69 %–70 %, specificity 96 %–97 % and PPV 75 %–77 %. Key predictive factors, identified using coefficients of determination, were age, presence of type 2 diabetes, and history of hospital admissions with length of stay >14 days.

Conclusions

A TNN, applied to routinely-collected longitudinal electronic health records, achieved good performance in prediction of 12-, 24-, and 36-month all-cause mortality in patients with MASLD. Extrapolation of our technique to population-level data will enable scalable and accurate risk stratification to identify people most likely to benefit from anticipatory health care and personalized interventions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用电子健康记录准确预测代谢功能障碍相关性脂肪肝患者的全因死亡率。

导言和目的：尽管MASLD造成了巨大的临床负担，但却缺乏用于早期风险分层的有效工具，而且异质性疾病表达和临床结果进展率的高度可变性导致了预后的不确定性。我们的目的是利用最先进的机器学习模型，研究基于电子健康记录的MASLD纵向预后预测。患者和方法：940名组织学定义的MASLD患者被用于开发一个深度学习模型，以预测全因死亡率。患者时间跨度长达12年，其人口学/临床特征、ICD-9和-10代码、血液检测结果、处方数据和二级护理活动均已完全标注。对变压器神经网络 (TNN) 进行了训练，以输出 12 个月、24 个月和 36 个月全因死亡率的伴随概率。采用 5 倍交叉验证评估样本内性能。对一组独立的 n=528 MASLD 患者进行了样本外性能评估：样本内模型性能达到 AUROC 曲线 0.74-0.90（95% CI：0.72-0.94），灵敏度 64%-82%，特异性 75%-92%，阳性预测值 (PPV) 94%-98%。样本外模型验证的 AUROC 为 0.70-0.86（95% CI：0.67-0.90），灵敏度为 69%-70%，特异度为 96%-97%，PPV 为 75%-77%。利用决定系数确定的主要预测因素是年龄、是否患有 2 型糖尿病以及住院时间超过 14 天的入院史：应用于常规收集的纵向电子健康记录的 TNN 在预测 MASLD 患者 12 个月、24 个月和 36 个月的全因死亡率方面表现良好。将我们的技术推广到人群数据中，将实现可扩展的、准确的风险分层，以确定最有可能从预见性医疗保健和个性化干预中获益的人群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Annals of hepatology 医学-胃肠肝病学

CiteScore

7.90

自引率

2.60%

发文量

183

审稿时长

4-8 weeks

期刊介绍： Annals of Hepatology publishes original research on the biology and diseases of the liver in both humans and experimental models. Contributions may be submitted as regular articles. The journal also publishes concise reviews of both basic and clinical topics.