{"title":"CheXDouble:双重监督可解释疾病诊断模型","authors":"Zhiwei Tang , You Yang","doi":"10.1016/j.bspc.2024.107026","DOIUrl":null,"url":null,"abstract":"<div><div>Chest X-ray imaging, commonly used for diagnosing cardiopulmonary diseases, typically requires radiologists to devote considerable effort to reading and interpreting the images. Moreover, diagnostic outcomes can vary due to differences in radiologists’ experience. Deep learning for chest X-ray disease diagnosis holds great promise for enhancing diagnostic accuracy and reducing the workload of radiologists. However, traditional deep learning models for medical image classification are often difficult to interpret. To address this, we introduce the Global Attention Alignment Module, which utilizes cardiopulmonary mask for supervised training. This provides the model with spatial location priors during training, thereby enhancing the interpretability of the saliency maps and the disease classification performance. Additionally, most chest X-ray datasets suffer from severe imbalances between positive and negative samples for diseases, leading to classification imbalance issues when training models. Thus, we propose the Improved Focal Loss, which dynamically adjusts the weight of negative samples in the loss function based on sample statistics, effectively mitigating the imbalance issue in the dataset. Moreover, the training of deep learning models for medical image classification requires substantial data support. Therefore, we conducted a quantitative analysis to explore the impact of five different data augmentation methods on model classification performance across various input image sizes, identifying the most effective data augmentation strategy. Ultimately, through these proposed methods, we developed the dual-supervised medical imaging disease diagnosis model CheXDouble, which surpasses previous state-of-the-art models with its highly competitive disease classification performance.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107026"},"PeriodicalIF":4.9000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CheXDouble: Dual-Supervised interpretable disease diagnosis model\",\"authors\":\"Zhiwei Tang , You Yang\",\"doi\":\"10.1016/j.bspc.2024.107026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Chest X-ray imaging, commonly used for diagnosing cardiopulmonary diseases, typically requires radiologists to devote considerable effort to reading and interpreting the images. Moreover, diagnostic outcomes can vary due to differences in radiologists’ experience. Deep learning for chest X-ray disease diagnosis holds great promise for enhancing diagnostic accuracy and reducing the workload of radiologists. However, traditional deep learning models for medical image classification are often difficult to interpret. To address this, we introduce the Global Attention Alignment Module, which utilizes cardiopulmonary mask for supervised training. This provides the model with spatial location priors during training, thereby enhancing the interpretability of the saliency maps and the disease classification performance. Additionally, most chest X-ray datasets suffer from severe imbalances between positive and negative samples for diseases, leading to classification imbalance issues when training models. Thus, we propose the Improved Focal Loss, which dynamically adjusts the weight of negative samples in the loss function based on sample statistics, effectively mitigating the imbalance issue in the dataset. Moreover, the training of deep learning models for medical image classification requires substantial data support. Therefore, we conducted a quantitative analysis to explore the impact of five different data augmentation methods on model classification performance across various input image sizes, identifying the most effective data augmentation strategy. Ultimately, through these proposed methods, we developed the dual-supervised medical imaging disease diagnosis model CheXDouble, which surpasses previous state-of-the-art models with its highly competitive disease classification performance.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"100 \",\"pages\":\"Article 107026\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S174680942401084X\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S174680942401084X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
摘要
胸部 X 光成像常用于诊断心肺疾病,通常需要放射科医生花费大量精力来阅读和解读图像。此外,由于放射科医生的经验不同,诊断结果也会有所差异。用于胸部 X 射线疾病诊断的深度学习在提高诊断准确性和减少放射科医生工作量方面大有可为。然而,用于医学图像分类的传统深度学习模型往往难以解释。为解决这一问题,我们引入了全局注意力对齐模块,该模块利用心肺掩模进行监督训练。这就在训练过程中为模型提供了空间位置先验,从而提高了突出图的可解释性和疾病分类性能。此外,大多数胸部 X 光数据集都存在疾病正负样本严重不平衡的问题,导致模型训练时出现分类不平衡问题。因此,我们提出了 "改进的焦点损失"(Improved Focal Loss),根据样本统计数据动态调整损失函数中负样本的权重,有效缓解了数据集的不平衡问题。此外,用于医学图像分类的深度学习模型的训练需要大量的数据支持。因此,我们进行了定量分析,探讨了五种不同的数据增强方法对不同输入图像大小的模型分类性能的影响,找出了最有效的数据增强策略。最终,通过这些建议的方法,我们开发出了双监督医学影像疾病诊断模型 CheXDouble,它以极具竞争力的疾病分类性能超越了以往最先进的模型。
CheXDouble: Dual-Supervised interpretable disease diagnosis model
Chest X-ray imaging, commonly used for diagnosing cardiopulmonary diseases, typically requires radiologists to devote considerable effort to reading and interpreting the images. Moreover, diagnostic outcomes can vary due to differences in radiologists’ experience. Deep learning for chest X-ray disease diagnosis holds great promise for enhancing diagnostic accuracy and reducing the workload of radiologists. However, traditional deep learning models for medical image classification are often difficult to interpret. To address this, we introduce the Global Attention Alignment Module, which utilizes cardiopulmonary mask for supervised training. This provides the model with spatial location priors during training, thereby enhancing the interpretability of the saliency maps and the disease classification performance. Additionally, most chest X-ray datasets suffer from severe imbalances between positive and negative samples for diseases, leading to classification imbalance issues when training models. Thus, we propose the Improved Focal Loss, which dynamically adjusts the weight of negative samples in the loss function based on sample statistics, effectively mitigating the imbalance issue in the dataset. Moreover, the training of deep learning models for medical image classification requires substantial data support. Therefore, we conducted a quantitative analysis to explore the impact of five different data augmentation methods on model classification performance across various input image sizes, identifying the most effective data augmentation strategy. Ultimately, through these proposed methods, we developed the dual-supervised medical imaging disease diagnosis model CheXDouble, which surpasses previous state-of-the-art models with its highly competitive disease classification performance.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.