Identification of heart failure subtypes using transformer-based deep learning modelling: a population-based study of 379,108 individuals.

IF 9.7 1区 医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL EBioMedicine Pub Date : 2025-03-19 DOI:10.1016/j.ebiom.2025.105657
Zhengxian Fan, Mohammad Mamouei, Yikuan Li, Shishir Rao, Kazem Rahimi
{"title":"Identification of heart failure subtypes using transformer-based deep learning modelling: a population-based study of 379,108 individuals.","authors":"Zhengxian Fan, Mohammad Mamouei, Yikuan Li, Shishir Rao, Kazem Rahimi","doi":"10.1016/j.ebiom.2025.105657","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Heart failure (HF) is a complex syndrome with varied presentations and progression patterns. Traditional classification systems based on left ventricular ejection fraction (LVEF) have limitations in capturing the heterogeneity of HF. We aimed to explore the application of deep learning, specifically a Transformer-based approach, to analyse electronic health records (EHR) for a refined subtyping of patients with HF.</p><p><strong>Methods: </strong>We utilised linked EHR from primary and secondary care, sourced from the Clinical Practice Research Datalink (CPRD) Aurum, which encompassed health data of over 30 million individuals in the UK. Individuals aged 35 and above with incident reports of HF between January 1, 2005, and January 1, 2018, were included. We proposed a Transformer-based approach to cluster patients based on all clinical diagnoses, procedures, and medication records in EHR. Statistical machine learning (ML) methods were used for comparative benchmarking. The models were trained on a derivation cohort and assessed for their ability to delineate distinct clusters and prognostic value by comparing one-year all-cause mortality and HF hospitalisation rates among the identified subgroups in a separate validation cohort. Association analyses were conducted to elucidate the clinical characteristics of the derived clusters.</p><p><strong>Findings: </strong>A total of 379,108 patients were included in the HF subtyping analysis. The Transformer-based approach outperformed alternative methods, delineating more distinct and prognostically valuable clusters. This approach identified seven unique HF patient clusters characterised by differing patterns of mortality, hospitalisation, and comorbidities. These clusters were labelled based on the dominant clinical features present at the initial diagnosis of HF: early-onset, hypertension, ischaemic heart disease, metabolic problems, chronic obstructive pulmonary disease (COPD), thyroid dysfunction, and late-onset clusters. The Transformer-based subtyping approach successfully captured the multifaceted nature of HF.</p><p><strong>Interpretation: </strong>This study identified seven distinct subtypes, including COPD-related and thyroid dysfunction-related subgroups, which are two high-risk subgroups not recognised in previous subtyping analyses. These insights lay the groundwork for further investigations into tailored and effective management strategies for HF.</p><p><strong>Funding: </strong>British Heart Foundation, European Union - Horizon Europe, and Novo Nordisk Research Centre Oxford.</p>","PeriodicalId":11494,"journal":{"name":"EBioMedicine","volume":"114 ","pages":"105657"},"PeriodicalIF":9.7000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EBioMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ebiom.2025.105657","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Heart failure (HF) is a complex syndrome with varied presentations and progression patterns. Traditional classification systems based on left ventricular ejection fraction (LVEF) have limitations in capturing the heterogeneity of HF. We aimed to explore the application of deep learning, specifically a Transformer-based approach, to analyse electronic health records (EHR) for a refined subtyping of patients with HF.

Methods: We utilised linked EHR from primary and secondary care, sourced from the Clinical Practice Research Datalink (CPRD) Aurum, which encompassed health data of over 30 million individuals in the UK. Individuals aged 35 and above with incident reports of HF between January 1, 2005, and January 1, 2018, were included. We proposed a Transformer-based approach to cluster patients based on all clinical diagnoses, procedures, and medication records in EHR. Statistical machine learning (ML) methods were used for comparative benchmarking. The models were trained on a derivation cohort and assessed for their ability to delineate distinct clusters and prognostic value by comparing one-year all-cause mortality and HF hospitalisation rates among the identified subgroups in a separate validation cohort. Association analyses were conducted to elucidate the clinical characteristics of the derived clusters.

Findings: A total of 379,108 patients were included in the HF subtyping analysis. The Transformer-based approach outperformed alternative methods, delineating more distinct and prognostically valuable clusters. This approach identified seven unique HF patient clusters characterised by differing patterns of mortality, hospitalisation, and comorbidities. These clusters were labelled based on the dominant clinical features present at the initial diagnosis of HF: early-onset, hypertension, ischaemic heart disease, metabolic problems, chronic obstructive pulmonary disease (COPD), thyroid dysfunction, and late-onset clusters. The Transformer-based subtyping approach successfully captured the multifaceted nature of HF.

Interpretation: This study identified seven distinct subtypes, including COPD-related and thyroid dysfunction-related subgroups, which are two high-risk subgroups not recognised in previous subtyping analyses. These insights lay the groundwork for further investigations into tailored and effective management strategies for HF.

Funding: British Heart Foundation, European Union - Horizon Europe, and Novo Nordisk Research Centre Oxford.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
背景:心力衰竭(HF)是一种复杂的综合征,表现和发展模式各不相同。基于左心室射血分数(LVEF)的传统分类系统在捕捉心衰的异质性方面存在局限性。我们旨在探索深度学习的应用,特别是基于Transformer的方法,以分析电子健康记录(EHR),从而对高血压患者进行细化分型:我们利用了来自初级和二级医疗机构的链接电子病历,这些病历来自临床实践研究数据链(CPRD)Aurum,其中包含英国 3000 多万人的健康数据。2005年1月1日至2018年1月1日期间,年龄在35岁及以上、有高血压事件报告的个人被纳入研究范围。我们提出了一种基于 Transformer 的方法,根据 EHR 中的所有临床诊断、手术和用药记录对患者进行聚类。统计机器学习(ML)方法用于比较基准。在衍生队列中对模型进行了训练,并通过在单独的验证队列中比较已识别亚群的一年全因死亡率和高血压住院率,评估了模型划分不同群组的能力和预后价值。研究还进行了关联分析,以阐明所得出分组的临床特征:共有 379 108 名患者被纳入高频亚型分析。基于转换器的方法优于其他方法,能划分出更多不同的、对预后更有价值的群组。这种方法确定了七个独特的高血压患者群,其死亡率、住院率和合并症的模式各不相同。这些群组是根据最初诊断为高血压时的主要临床特征划分的:早发群组、高血压群组、缺血性心脏病群组、代谢问题群组、慢性阻塞性肺病群组、甲状腺功能障碍群组以及晚发群组。基于转换器的亚型划分方法成功地捕捉到了高频的多面性:这项研究确定了七个不同的亚型,包括慢性阻塞性肺病相关亚型和甲状腺功能障碍相关亚型,这两个高风险亚型在以往的亚型分析中未被确认。这些见解为进一步研究量身定制的有效高频管理策略奠定了基础:资金来源:英国心脏基金会、欧盟-地平线欧洲和诺和诺德牛津研究中心。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
EBioMedicine
EBioMedicine Biochemistry, Genetics and Molecular Biology-General Biochemistry,Genetics and Molecular Biology
CiteScore
17.70
自引率
0.90%
发文量
579
审稿时长
5 weeks
期刊介绍: eBioMedicine is a comprehensive biomedical research journal that covers a wide range of studies that are relevant to human health. Our focus is on original research that explores the fundamental factors influencing human health and disease, including the discovery of new therapeutic targets and treatments, the identification of biomarkers and diagnostic tools, and the investigation and modification of disease pathways and mechanisms. We welcome studies from any biomedical discipline that contribute to our understanding of disease and aim to improve human health.
期刊最新文献
AI in radiological imaging of soft-tissue and bone tumours: a systematic review evaluating against CLAIM and FUTURE-AI guidelines. Chromatin landscape in paired human visceral and subcutaneous adipose tissue and its impact on clinical variables in obesity. Author response to "A step closer to optimal paroxetine dosing: what is next?" Identification of heart failure subtypes using transformer-based deep learning modelling: a population-based study of 379,108 individuals. Unlocking the potential of circular RNA vaccines: a bioinformatics and computational biology perspective.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1