MCA-Unet：用于从CT图像分割新冠肺炎病变的多尺度上下文聚合U-Net

Computer methods and programs in biomedicine update Pub Date : 2023-01-01 Epub Date: 2023-08-25 DOI:10.1016/j.cmpbup.2023.100114

Alyaa Amer , Xujiong Ye

{"title":"MCA-Unet：用于从CT图像分割新冠肺炎病变的多尺度上下文聚合U-Net","authors":"Alyaa Amer , Xujiong Ye","doi":"10.1016/j.cmpbup.2023.100114","DOIUrl":null,"url":null,"abstract":"<div><p>The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":"4 ","pages":"Article 100114"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MCA-Unet: A multiscale context aggregation U-Net for the segmentation of COVID-19 lesions from CT images\",\"authors\":\"Alyaa Amer , Xujiong Ye\",\"doi\":\"10.1016/j.cmpbup.2023.100114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.</p></div>\",\"PeriodicalId\":72670,\"journal\":{\"name\":\"Computer methods and programs in biomedicine update\",\"volume\":\"4 \",\"pages\":\"Article 100114\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine update\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266699002300023X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/8/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine update","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266699002300023X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/8/25 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

冠状病毒病(COVID-19)大流行使世界面临生存健康危机。目前，从CT图像中分割COVID-19病变是评估疾病严重程度和肺部损伤程度的重要步骤。深度学习为医学图像分割带来了突破，其中U-Net是最突出的深度网络。然而，在本研究中，我们认为其架构在某些方面还存在不足。首先，在编码器和解码器特征之间的U-Net跳过连接中存在不兼容性，这对最终预测产生不利影响。其次，缺乏对多尺度上下文信息的捕获，忽略了所有语义信息在分割过程中的贡献。因此，我们提出了一种新的多尺度深度学习分割模型MCA-Unet，并对U-Net模型进行了一些改进。MCA-Unet集成了一个多尺度上下文聚合模块，该模块由两个块组成;上下文嵌入块(CEB)和级联扩展卷积块(CDCB)。CEB旨在减小沿U-Net跳过连接的连接特征之间的语义差距，它通过从后续高级特征继承的丰富语义来丰富低级编码器特征，从而减小低处理编码器特征与高处理解码器特征之间的语义差距，从而保证有效的连接。CDCB集成用于解决COVID-19病变形状和大小的可变性，它通过逐渐扩大接受野来捕获全局背景信息，然后反向操作，通过扩大接受野来捕获可能分散的小细节。为了验证我们模型的稳健性，我们在1705个不同类型COVID-19感染的轴向CT图像的公开数据集上对其进行了测试。实验结果表明，与基本U-Net及其变体相比，MCA-Unet的性能有了显著提高。通过使用不同的评价指标，该算法获得了88.6%的Dice相似系数、85.4%的Jaccard指数和93.5%的F-score度量，从而获得了较高的性能。这种优异的表现显示了巨大的潜力，以帮助医生在他们的检查和改善临床工作流程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

MCA-Unet: A multiscale context aggregation U-Net for the segmentation of COVID-19 lesions from CT images

The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊