{"title":"Enhancing cross-domain robustness in phonocardiogram signal classification using domain-invariant preprocessing and transfer learning","authors":"Arnab Maity, Goutam Saha","doi":"10.1016/j.cmpb.2024.108462","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objective:</h3><div>Phonocardiogram (PCG) signal analysis is a non-invasive and cost-efficient approach for diagnosing cardiovascular diseases. Existing PCG-based approaches employ signal processing and machine learning (ML) for automatic disease detection. However, machine learning techniques are known to underperform in cross-corpora arrangements. A drastic effect on disease detection performance is observed when training and testing sets come from different PCG databases with varying data acquisition settings. This study investigates the impact of data acquisition parameter variations in the PCG data across different databases and develops methods to achieve robustness against these variations.</div></div><div><h3>Methods:</h3><div>To alleviate the effect of dataset-induced variations, it employs a combination of three strategies: domain-invariant preprocessing, transfer learning, and domain-balanced variable hop fragment selection (DBVHFS). The domain-invariant preprocessing normalizes the PCG to reduce the stethoscope and environment-induced variations. The transfer learning utilizes a pre-trained model trained on diverse audio data to reduce the impact of data variability by generalizing feature representations. DBVHFS facilitates unbiased fine-tuning of the pre-trained model by balancing the training fragments across all domains, ensuring equal distribution from each class.</div></div><div><h3>Results:</h3><div>The proposed method is evaluated on six independent PhysioNet/CinC Challenge <span><math><mrow><mn>2016</mn></mrow></math></span> PCG databases using leave-one-dataset-out cross-validation. Results indicate that our system outperforms the existing study with a relative improvement of <strong>5.92%</strong> in unweighted average recall and <strong>17.71%</strong> in sensitivity.</div></div><div><h3>Conclusions:</h3><div>The methods proposed in this study address variations in PCG data originating from different sources, potentially enhancing the implementation possibility of automated cardiac screening systems in real-life scenarios.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"257 ","pages":"Article 108462"},"PeriodicalIF":4.9000,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260724004553","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and objective:
Phonocardiogram (PCG) signal analysis is a non-invasive and cost-efficient approach for diagnosing cardiovascular diseases. Existing PCG-based approaches employ signal processing and machine learning (ML) for automatic disease detection. However, machine learning techniques are known to underperform in cross-corpora arrangements. A drastic effect on disease detection performance is observed when training and testing sets come from different PCG databases with varying data acquisition settings. This study investigates the impact of data acquisition parameter variations in the PCG data across different databases and develops methods to achieve robustness against these variations.
Methods:
To alleviate the effect of dataset-induced variations, it employs a combination of three strategies: domain-invariant preprocessing, transfer learning, and domain-balanced variable hop fragment selection (DBVHFS). The domain-invariant preprocessing normalizes the PCG to reduce the stethoscope and environment-induced variations. The transfer learning utilizes a pre-trained model trained on diverse audio data to reduce the impact of data variability by generalizing feature representations. DBVHFS facilitates unbiased fine-tuning of the pre-trained model by balancing the training fragments across all domains, ensuring equal distribution from each class.
Results:
The proposed method is evaluated on six independent PhysioNet/CinC Challenge PCG databases using leave-one-dataset-out cross-validation. Results indicate that our system outperforms the existing study with a relative improvement of 5.92% in unweighted average recall and 17.71% in sensitivity.
Conclusions:
The methods proposed in this study address variations in PCG data originating from different sources, potentially enhancing the implementation possibility of automated cardiac screening systems in real-life scenarios.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.