José-María Lillo-Castellano , Inmaculada Mora-Jiménez , María Martín-Méndez , Laia Cerdá , Arcadi García-Alberola , José Luis Rojo-Álvarez , Devis Tuia
{"title":"Active learning and margin strategies for arrhythmia classification in implantable devices","authors":"José-María Lillo-Castellano , Inmaculada Mora-Jiménez , María Martín-Méndez , Laia Cerdá , Arcadi García-Alberola , José Luis Rojo-Álvarez , Devis Tuia","doi":"10.1016/j.compbiomed.2025.109747","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objectives:</h3><div>The massive storage of cardiac arrhythmic episodes from Implantable Cardioverter Defibrillators (ICD) and the advent of new artificial intelligence algorithms are opening up new opportunities for electrophysiological knowledge extraction. However, in this context, accurate and reliable episode labeling by expert cardiologists still remains a manual, costly, and time-consuming process.</div></div><div><h3>Methods:</h3><div>In this work, we propose using Active Learning (AL) to design classification models that streamline the manual labeling of cardiac arrhythmic episodes. When AL is used, relevant episodes for classification are selected and then presented to the human expert for labeling, thereby dramatically reducing the manual labeling burden.</div></div><div><h3>Results:</h3><div>We adapted four large-margin-based AL strategies to a previously proposed classification methodology. We benchmarked them on problems involving 3 and 8 arrhythmia types using 9908 episodes from a massive national ICD data repository. Specifically, the relevance of episode–patient diversity for classification was evaluated. Results showed that the gold standard performance, achieved using all episodes, was reached by using approximately 20% (50%) of episodes from 60% (85%) of patients in the 3-class (8-class) model design.</div></div><div><h3>Conclusions:</h3><div>We can conclude that AL techniques are advantageous for designing classification models and can streamline the human labeling process of large ICD datasets.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"188 ","pages":"Article 109747"},"PeriodicalIF":7.0000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525000976","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Objectives:
The massive storage of cardiac arrhythmic episodes from Implantable Cardioverter Defibrillators (ICD) and the advent of new artificial intelligence algorithms are opening up new opportunities for electrophysiological knowledge extraction. However, in this context, accurate and reliable episode labeling by expert cardiologists still remains a manual, costly, and time-consuming process.
Methods:
In this work, we propose using Active Learning (AL) to design classification models that streamline the manual labeling of cardiac arrhythmic episodes. When AL is used, relevant episodes for classification are selected and then presented to the human expert for labeling, thereby dramatically reducing the manual labeling burden.
Results:
We adapted four large-margin-based AL strategies to a previously proposed classification methodology. We benchmarked them on problems involving 3 and 8 arrhythmia types using 9908 episodes from a massive national ICD data repository. Specifically, the relevance of episode–patient diversity for classification was evaluated. Results showed that the gold standard performance, achieved using all episodes, was reached by using approximately 20% (50%) of episodes from 60% (85%) of patients in the 3-class (8-class) model design.
Conclusions:
We can conclude that AL techniques are advantageous for designing classification models and can streamline the human labeling process of large ICD datasets.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.