OMSF2: optimizing multi-scale feature fusion learning for pneumoconiosis staging diagnosis through data specificity augmentation

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Complex & Intelligent Systems Pub Date : 2024-12-30 DOI:10.1007/s40747-024-01729-0

Xueting Ren, Surong Chu, Guohua Ji, Zijuan Zhao, Juanjuan Zhao, Yan Qiang, Yangyang Wei, Yan Wang

{"title":"OMSF2: optimizing multi-scale feature fusion learning for pneumoconiosis staging diagnosis through data specificity augmentation","authors":"Xueting Ren, Surong Chu, Guohua Ji, Zijuan Zhao, Juanjuan Zhao, Yan Qiang, Yangyang Wei, Yan Wang","doi":"10.1007/s40747-024-01729-0","DOIUrl":null,"url":null,"abstract":"<p>Diagnosing pneumoconiosis is challenging because the lesions are not easily visible on chest X-rays, and the images often lack clear details. Existing deep detection models utilize Feature Pyramid Networks (FPNs) to identify objects at different scales. However, they struggle with insufficient perception of small targets and gradient inconsistency in medical image detection tasks, hindering the full utilization of multi-scale features. To address these issues, we propose an Optimized Multi-Scale Feature Fusion learning framework, OMSF2, which includes the following components: (1) Data specificity augmentation module is introduced to capture intrinsic data representations and introduce diversity by learning morphological variations and lesion locations. (2) Multi-scale feature learning module is utilized that refines micro-feature localization guided by heatmaps, enabling full extraction of multi-directional features of subtle diffuse targets. (3) Multi-scale feature fusion module is employed that facilitates the fusion of high-level and low-level features to better understand subtle differences between disease stages. Notably, this paper innovatively proposes a method for fine learning of low-resolution micro-features in pneumoconiosis, addressing the issue of maintaining cross-layer gradient consistency under multi-scale feature fusion. We established an enhanced pneumoconiosis X-ray dataset to optimize the lesion detection capability of the OMSF2 model. We also introduced an external dataset to evaluate other chest X-rays with complex lesions. On the AP-50 and R-50 evaluation metrics, OMSF2 improved by 3.25% and 3.31% on the internal dataset, and by 2.28% and 0.24% on the external dataset, respectively. Experimental results show that OMSF2 achieves significantly better performance than state-of-the-art baselines in medical image detection tasks.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"33 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-024-01729-0","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Diagnosing pneumoconiosis is challenging because the lesions are not easily visible on chest X-rays, and the images often lack clear details. Existing deep detection models utilize Feature Pyramid Networks (FPNs) to identify objects at different scales. However, they struggle with insufficient perception of small targets and gradient inconsistency in medical image detection tasks, hindering the full utilization of multi-scale features. To address these issues, we propose an Optimized Multi-Scale Feature Fusion learning framework, OMSF2, which includes the following components: (1) Data specificity augmentation module is introduced to capture intrinsic data representations and introduce diversity by learning morphological variations and lesion locations. (2) Multi-scale feature learning module is utilized that refines micro-feature localization guided by heatmaps, enabling full extraction of multi-directional features of subtle diffuse targets. (3) Multi-scale feature fusion module is employed that facilitates the fusion of high-level and low-level features to better understand subtle differences between disease stages. Notably, this paper innovatively proposes a method for fine learning of low-resolution micro-features in pneumoconiosis, addressing the issue of maintaining cross-layer gradient consistency under multi-scale feature fusion. We established an enhanced pneumoconiosis X-ray dataset to optimize the lesion detection capability of the OMSF2 model. We also introduced an external dataset to evaluate other chest X-rays with complex lesions. On the AP-50 and R-50 evaluation metrics, OMSF2 improved by 3.25% and 3.31% on the internal dataset, and by 2.28% and 0.24% on the external dataset, respectively. Experimental results show that OMSF2 achieves significantly better performance than state-of-the-art baselines in medical image detection tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

OMSF2：通过数据特异性增强优化多尺度特征融合学习用于尘肺分期诊断

诊断尘肺病具有挑战性，因为在胸部x光片上不容易看到病变，而且图像通常缺乏清晰的细节。现有的深度检测模型利用特征金字塔网络（Feature Pyramid Networks, FPNs）来识别不同尺度的目标。但在医学图像检测任务中存在小目标感知不足、梯度不一致等问题，阻碍了多尺度特征的充分利用。为了解决这些问题，我们提出了一个优化的多尺度特征融合学习框架OMSF2，该框架包括以下部分：(1)引入数据特异性增强模块，通过学习形态变化和病变位置来捕获内在数据表示并引入多样性。(2)利用多尺度特征学习模块，细化热图引导下的微特征定位，充分提取细微漫射目标的多向特征。(3)采用多尺度特征融合模块，实现高、低尺度特征融合，更好地了解疾病分期之间的细微差异。值得注意的是，本文创新性地提出了一种尘肺低分辨率微特征的精细学习方法，解决了多尺度特征融合下保持跨层梯度一致性的问题。我们建立了一个增强型尘肺x射线数据集，以优化OMSF2模型的病变检测能力。我们还引入了一个外部数据集来评估其他具有复杂病变的胸部x光片。在AP-50和R-50评价指标上，OMSF2在内部数据集上分别提高了3.25%和3.31%，在外部数据集上分别提高了2.28%和0.24%。实验结果表明，OMSF2在医学图像检测任务中的性能明显优于现有基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.