Mohammed Guhdar , Abdulhakeem O. Mohammed , Ramadhan J. Mstafa
{"title":"Advanced deep learning framework for ECG arrhythmia classification using 1D-CNN with attention mechanism","authors":"Mohammed Guhdar , Abdulhakeem O. Mohammed , Ramadhan J. Mstafa","doi":"10.1016/j.knosys.2025.113301","DOIUrl":null,"url":null,"abstract":"<div><div>Cardiovascular diseases, particularly cardiac arrhythmias, remain a leading cause of global mortality, necessitating efficient and accurate diagnostic tools. Despite advances in deep learning for ECG analysis, current models face challenges in cross-population performance, signal noise robustness, limited training data efficiency, and clinical result interpretability. Additionally, most current approaches struggle to generalize across different ECG databases and require extensive computational resources for real-time analysis. This paper presents a novel hybrid deep learning framework for automated ECG analysis, combining one-dimensional convolutional neural networks (1D-CNN) with a specialized attention mechanism. The proposed architecture implements a four-stage CNN backbone enhanced with a squeeze-and-excitation attention block, enabling adaptive feature selection across multiple scales. The model incorporates advanced regularization techniques, including focal loss, L2 regularization, and an ensemble approach with mixed precision training. We conducted extensive experiments across multiple datasets to evaluate generalization capabilities. This study utilizes two standard databases: the MIT-BIH Arrhythmia Database (48 half-hour recordings sampled at 360 Hz) and the PTB Diagnostic ECG Database (549 records from 290 subjects sampled at 1000 Hz). Through rigorous validation including five-fold cross-validation and statistical significance testing, our model attained remarkable performance, achieving 99.48% accuracy on MIT-BIH, 99.83% accuracy on PTB, and 99.64% accuracy on the combined dataset, with corresponding F1-scores of 0.99, 1.00, and 1.00 respectively. The findings demonstrate robust generalization across varied ECG morphologies and recording conditions, with particular effectiveness in handling class imbalance without data augmentation. The model’s reliable performance across multiple datasets indicates significant potential for clinical applications in automated cardiac diagnostics.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"315 ","pages":"Article 113301"},"PeriodicalIF":7.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512500348X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Cardiovascular diseases, particularly cardiac arrhythmias, remain a leading cause of global mortality, necessitating efficient and accurate diagnostic tools. Despite advances in deep learning for ECG analysis, current models face challenges in cross-population performance, signal noise robustness, limited training data efficiency, and clinical result interpretability. Additionally, most current approaches struggle to generalize across different ECG databases and require extensive computational resources for real-time analysis. This paper presents a novel hybrid deep learning framework for automated ECG analysis, combining one-dimensional convolutional neural networks (1D-CNN) with a specialized attention mechanism. The proposed architecture implements a four-stage CNN backbone enhanced with a squeeze-and-excitation attention block, enabling adaptive feature selection across multiple scales. The model incorporates advanced regularization techniques, including focal loss, L2 regularization, and an ensemble approach with mixed precision training. We conducted extensive experiments across multiple datasets to evaluate generalization capabilities. This study utilizes two standard databases: the MIT-BIH Arrhythmia Database (48 half-hour recordings sampled at 360 Hz) and the PTB Diagnostic ECG Database (549 records from 290 subjects sampled at 1000 Hz). Through rigorous validation including five-fold cross-validation and statistical significance testing, our model attained remarkable performance, achieving 99.48% accuracy on MIT-BIH, 99.83% accuracy on PTB, and 99.64% accuracy on the combined dataset, with corresponding F1-scores of 0.99, 1.00, and 1.00 respectively. The findings demonstrate robust generalization across varied ECG morphologies and recording conditions, with particular effectiveness in handling class imbalance without data augmentation. The model’s reliable performance across multiple datasets indicates significant potential for clinical applications in automated cardiac diagnostics.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.