Kim-Ngoc T Le, Gyurin Byun, Syed M Raza, Duc-Tai Le, Hyunseung Choo
{"title":"Respiratory Anomaly and Disease Detection Using Multi-Level Temporal Convolutional Networks.","authors":"Kim-Ngoc T Le, Gyurin Byun, Syed M Raza, Duc-Tai Le, Hyunseung Choo","doi":"10.1109/JBHI.2025.3545156","DOIUrl":null,"url":null,"abstract":"<p><p>An automated analysis of respiratory sounds using Deep Learning (DL) plays a pivotal role in the early detection of lung diseases. However, current DL methods often examine the spatial and temporal characteristics of respiratory sounds in isolation, which inherently limit their potential. This study proposes a novel DL framework that captures spatial features through convolution operations and exploits the spatiotemporal correlations of these features using temporal convolution networks. The proposed framework incorporates Multi-Level Temporal Convolutional Networks (ML-TCN) to considerably enhance the model accuracy in detecting anomaly breathing cycles and respiratory recordings from lung sound audio. Moreover, a transfer learning technique is also employed to extract semantic features efficiently from limited and imbalanced data in this domain. Thorough experiments on the well-known ICBHI 2017 challenge dataset show that the proposed framework outperforms state-of-the-art methods in both binary and multi-class classification tasks for respiratory anomaly and disease detection. In particular, improvements of up to 2.29% and 2.27% in terms of the Score metric, average sensitivity and specificity, are demonstrated in binary and multi-class anomaly breathing cycle detection tasks, respectively. In respiratory recording classification tasks, the classification accuracy is improved by 2.69% for healthy-unhealthy binary classification and 1.47% for healthy, chronic, and non-chronic diagnosis. These results highlight the marked advantage of the ML-TCN over existing techniques, showcasing its potential to drive future innovations in respiratory healthcare technology.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3545156","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
An automated analysis of respiratory sounds using Deep Learning (DL) plays a pivotal role in the early detection of lung diseases. However, current DL methods often examine the spatial and temporal characteristics of respiratory sounds in isolation, which inherently limit their potential. This study proposes a novel DL framework that captures spatial features through convolution operations and exploits the spatiotemporal correlations of these features using temporal convolution networks. The proposed framework incorporates Multi-Level Temporal Convolutional Networks (ML-TCN) to considerably enhance the model accuracy in detecting anomaly breathing cycles and respiratory recordings from lung sound audio. Moreover, a transfer learning technique is also employed to extract semantic features efficiently from limited and imbalanced data in this domain. Thorough experiments on the well-known ICBHI 2017 challenge dataset show that the proposed framework outperforms state-of-the-art methods in both binary and multi-class classification tasks for respiratory anomaly and disease detection. In particular, improvements of up to 2.29% and 2.27% in terms of the Score metric, average sensitivity and specificity, are demonstrated in binary and multi-class anomaly breathing cycle detection tasks, respectively. In respiratory recording classification tasks, the classification accuracy is improved by 2.69% for healthy-unhealthy binary classification and 1.47% for healthy, chronic, and non-chronic diagnosis. These results highlight the marked advantage of the ML-TCN over existing techniques, showcasing its potential to drive future innovations in respiratory healthcare technology.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.