Wenhan Liu , Shurong Pan , Zhoutong Li , Sheng Chang , Qijun Huang , Nan Jiang
{"title":"Lead-fusion Barlow twins: A fused self-supervised learning method for multi-lead electrocardiograms","authors":"Wenhan Liu , Shurong Pan , Zhoutong Li , Sheng Chang , Qijun Huang , Nan Jiang","doi":"10.1016/j.inffus.2024.102698","DOIUrl":null,"url":null,"abstract":"<div><p>Nowadays, deep learning depends on large-scale labeled datasets, which limits its broader application in electrocardiogram (ECG) analysis, as manual labeling of ECGs is consistently costly. To overcome this issue, this paper proposes a fused self-supervised learning (SSL) method for multi-lead ECGs: lead-fusion Barlow twins (LFBT). It utilizes unlabeled ECG datasets to pretrain an encoder group using a fused loss. This loss fuses intra-lead and inter-lead BT loss. By employing BT, LFBT avoids the need for additional techniques to prevent trivial solutions (collapse) in pretraining. Moreover, multi-branch concatenation (MBC) fuses information from all leads when transferring pretrained encoders to downstream tasks. According to the experiments, LFBT can extract prior knowledge from unlabeled ECG datasets, making a deep learning model yield comparable performances with its supervised counterpart (trained from scratch) using 3<span><math><mrow><mo>∼</mo><mn>5</mn><mo>×</mo></mrow></math></span> fewer labels. Furthermore, LFBT is robust when applied to uncurated ECGs from real-world hospitals, with no significant performance decline observed after pretraining. Model interpretation based on gradient-weighted class activation mapping (GradCAM) indicates that LFBT helps models focus on critical waveform changes when training data and labels are insufficient. Compared with previous methods, LFBT demonstrates advantages in performance and implementation. To summarize, LFBT shows considerable potential in reducing the need for manual labeling of ECGs, thereby advancing deep learning applications in real-world ECG-based diagnoses. Code is available at <span><span>https://github.com/Aiwiscal/ECG_SSL_LFBT</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"114 ","pages":"Article 102698"},"PeriodicalIF":14.7000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524004767","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, deep learning depends on large-scale labeled datasets, which limits its broader application in electrocardiogram (ECG) analysis, as manual labeling of ECGs is consistently costly. To overcome this issue, this paper proposes a fused self-supervised learning (SSL) method for multi-lead ECGs: lead-fusion Barlow twins (LFBT). It utilizes unlabeled ECG datasets to pretrain an encoder group using a fused loss. This loss fuses intra-lead and inter-lead BT loss. By employing BT, LFBT avoids the need for additional techniques to prevent trivial solutions (collapse) in pretraining. Moreover, multi-branch concatenation (MBC) fuses information from all leads when transferring pretrained encoders to downstream tasks. According to the experiments, LFBT can extract prior knowledge from unlabeled ECG datasets, making a deep learning model yield comparable performances with its supervised counterpart (trained from scratch) using 3 fewer labels. Furthermore, LFBT is robust when applied to uncurated ECGs from real-world hospitals, with no significant performance decline observed after pretraining. Model interpretation based on gradient-weighted class activation mapping (GradCAM) indicates that LFBT helps models focus on critical waveform changes when training data and labels are insufficient. Compared with previous methods, LFBT demonstrates advantages in performance and implementation. To summarize, LFBT shows considerable potential in reducing the need for manual labeling of ECGs, thereby advancing deep learning applications in real-world ECG-based diagnoses. Code is available at https://github.com/Aiwiscal/ECG_SSL_LFBT.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.