Lead-fusion Barlow twins: A fused self-supervised learning method for multi-lead electrocardiograms

IF 14.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Information Fusion Pub Date : 2024-09-16 DOI:10.1016/j.inffus.2024.102698

Wenhan Liu , Shurong Pan , Zhoutong Li , Sheng Chang , Qijun Huang , Nan Jiang

{"title":"Lead-fusion Barlow twins: A fused self-supervised learning method for multi-lead electrocardiograms","authors":"Wenhan Liu , Shurong Pan , Zhoutong Li , Sheng Chang , Qijun Huang , Nan Jiang","doi":"10.1016/j.inffus.2024.102698","DOIUrl":null,"url":null,"abstract":"<div><p>Nowadays, deep learning depends on large-scale labeled datasets, which limits its broader application in electrocardiogram (ECG) analysis, as manual labeling of ECGs is consistently costly. To overcome this issue, this paper proposes a fused self-supervised learning (SSL) method for multi-lead ECGs: lead-fusion Barlow twins (LFBT). It utilizes unlabeled ECG datasets to pretrain an encoder group using a fused loss. This loss fuses intra-lead and inter-lead BT loss. By employing BT, LFBT avoids the need for additional techniques to prevent trivial solutions (collapse) in pretraining. Moreover, multi-branch concatenation (MBC) fuses information from all leads when transferring pretrained encoders to downstream tasks. According to the experiments, LFBT can extract prior knowledge from unlabeled ECG datasets, making a deep learning model yield comparable performances with its supervised counterpart (trained from scratch) using 3<span><math><mrow><mo>∼</mo><mn>5</mn><mo>×</mo></mrow></math></span> fewer labels. Furthermore, LFBT is robust when applied to uncurated ECGs from real-world hospitals, with no significant performance decline observed after pretraining. Model interpretation based on gradient-weighted class activation mapping (GradCAM) indicates that LFBT helps models focus on critical waveform changes when training data and labels are insufficient. Compared with previous methods, LFBT demonstrates advantages in performance and implementation. To summarize, LFBT shows considerable potential in reducing the need for manual labeling of ECGs, thereby advancing deep learning applications in real-world ECG-based diagnoses. Code is available at <span><span>https://github.com/Aiwiscal/ECG_SSL_LFBT</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"114 ","pages":"Article 102698"},"PeriodicalIF":14.7000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524004767","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Nowadays, deep learning depends on large-scale labeled datasets, which limits its broader application in electrocardiogram (ECG) analysis, as manual labeling of ECGs is consistently costly. To overcome this issue, this paper proposes a fused self-supervised learning (SSL) method for multi-lead ECGs: lead-fusion Barlow twins (LFBT). It utilizes unlabeled ECG datasets to pretrain an encoder group using a fused loss. This loss fuses intra-lead and inter-lead BT loss. By employing BT, LFBT avoids the need for additional techniques to prevent trivial solutions (collapse) in pretraining. Moreover, multi-branch concatenation (MBC) fuses information from all leads when transferring pretrained encoders to downstream tasks. According to the experiments, LFBT can extract prior knowledge from unlabeled ECG datasets, making a deep learning model yield comparable performances with its supervised counterpart (trained from scratch) using 3 $\sim 5 \times$ fewer labels. Furthermore, LFBT is robust when applied to uncurated ECGs from real-world hospitals, with no significant performance decline observed after pretraining. Model interpretation based on gradient-weighted class activation mapping (GradCAM) indicates that LFBT helps models focus on critical waveform changes when training data and labels are insufficient. Compared with previous methods, LFBT demonstrates advantages in performance and implementation. To summarize, LFBT shows considerable potential in reducing the need for manual labeling of ECGs, thereby advancing deep learning applications in real-world ECG-based diagnoses. Code is available at https://github.com/Aiwiscal/ECG_SSL_LFBT.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

导联融合巴洛双胞胎：多导联心电图的融合自监督学习方法

如今，深度学习依赖于大规模标记数据集，这限制了其在心电图（ECG）分析中的广泛应用，因为手动标记心电图的成本一直很高。为了克服这一问题，本文提出了一种针对多导联心电图的融合自监督学习（SSL）方法：导联融合巴洛双胞胎（LFBT）。它利用未标记的心电图数据集，使用融合损失预训练编码器组。这种损耗融合了导联内和导联间的 BT 损耗。通过使用 BT，LFBT 避免了在预训练中使用额外的技术来防止琐碎解（崩溃）。此外，在将预训练编码器转移到下游任务时，多分支连接（MBC）可融合所有线索的信息。实验结果表明，LFBT 可以从无标记的心电图数据集中提取先验知识，从而使深度学习模型在使用 3∼5 倍较少标记的情况下，获得与监督模型（从头开始训练）相当的性能。此外，当将 LFBT 应用于来自真实世界医院的未经标注的心电图时，它具有很强的鲁棒性，在预训练后没有观察到明显的性能下降。基于梯度加权类激活映射（GradCAM）的模型解释表明，当训练数据和标签不足时，LFBT 能帮助模型关注关键的波形变化。与之前的方法相比，LFBT 在性能和实施方面都具有优势。总之，LFBT 在减少对心电图手动标记的需求方面显示出了巨大的潜力，从而推动了深度学习在基于心电图的实际诊断中的应用。代码见 https://github.com/Aiwiscal/ECG_SSL_LFBT。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Information Fusion 工程技术-计算机：理论方法

CiteScore

33.20

自引率

4.30%

发文量

161

审稿时长

7.9 months

期刊介绍： Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.