MCA-Unet:用于从CT图像分割新冠肺炎病变的多尺度上下文聚合U-Net

Alyaa Amer , Xujiong Ye
{"title":"MCA-Unet:用于从CT图像分割新冠肺炎病变的多尺度上下文聚合U-Net","authors":"Alyaa Amer ,&nbsp;Xujiong Ye","doi":"10.1016/j.cmpbup.2023.100114","DOIUrl":null,"url":null,"abstract":"<div><p>The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":"4 ","pages":"Article 100114"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MCA-Unet: A multiscale context aggregation U-Net for the segmentation of COVID-19 lesions from CT images\",\"authors\":\"Alyaa Amer ,&nbsp;Xujiong Ye\",\"doi\":\"10.1016/j.cmpbup.2023.100114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.</p></div>\",\"PeriodicalId\":72670,\"journal\":{\"name\":\"Computer methods and programs in biomedicine update\",\"volume\":\"4 \",\"pages\":\"Article 100114\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine update\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266699002300023X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine update","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266699002300023X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

冠状病毒病(COVID-19)大流行使世界面临生存健康危机。目前,从CT图像中分割COVID-19病变是评估疾病严重程度和肺部损伤程度的重要步骤。深度学习为医学图像分割带来了突破,其中U-Net是最突出的深度网络。然而,在本研究中,我们认为其架构在某些方面还存在不足。首先,在编码器和解码器特征之间的U-Net跳过连接中存在不兼容性,这对最终预测产生不利影响。其次,缺乏对多尺度上下文信息的捕获,忽略了所有语义信息在分割过程中的贡献。因此,我们提出了一种新的多尺度深度学习分割模型MCA-Unet,并对U-Net模型进行了一些改进。MCA-Unet集成了一个多尺度上下文聚合模块,该模块由两个块组成;上下文嵌入块(CEB)和级联扩展卷积块(CDCB)。CEB旨在减小沿U-Net跳过连接的连接特征之间的语义差距,它通过从后续高级特征继承的丰富语义来丰富低级编码器特征,从而减小低处理编码器特征与高处理解码器特征之间的语义差距,从而保证有效的连接。CDCB集成用于解决COVID-19病变形状和大小的可变性,它通过逐渐扩大接受野来捕获全局背景信息,然后反向操作,通过扩大接受野来捕获可能分散的小细节。为了验证我们模型的稳健性,我们在1705个不同类型COVID-19感染的轴向CT图像的公开数据集上对其进行了测试。实验结果表明,与基本U-Net及其变体相比,MCA-Unet的性能有了显著提高。通过使用不同的评价指标,该算法获得了88.6%的Dice相似系数、85.4%的Jaccard指数和93.5%的F-score度量,从而获得了较高的性能。这种优异的表现显示了巨大的潜力,以帮助医生在他们的检查和改善临床工作流程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MCA-Unet: A multiscale context aggregation U-Net for the segmentation of COVID-19 lesions from CT images

The pandemic of coronavirus disease (COVID-19) caused the world to face an existential health crisis. COVID-19 lesions segmentation from CT images is nowadays an essential step to assess the severity of the disease and the amount of damage to the lungs. Deep learning has brought about a breakthrough in medical image segmentation where U-Net is the most prominent deep network. However, in this study, we argue that its architecture still lacks in certain aspects. First, there is an incompatibility in the U-Net skip connection between the encoder and decoder features which adversely affects the final prediction. Second, it lacks capturing multiscale context information and ignores the contribution of all semantic information through the segmentation process. Therefore, we propose a model named MCA-Unet, a novel multiscale deep learning segmentation model, which proposes some modifications to improve upon the U-Net model. MCA-Unet is integrated with a multiscale context aggregation module which is constituted of two blocks; a context embedding block (CEB) and a cascaded dilated convolution block (CDCB). The CEB aims at reducing the semantic gap between the concatenated features along the U-Net skip connections, it enriches the low-level encoder features with rich semantics inherited from the subsequent higher-level features, to reduce the semantic gap between the low-processed encoder features and the highly-processed decoder features, thus ensuring effectual concatenation. The CDCB is integrated to address the variability in shape and size of the COVID-19 lesions, it captures global context information by gradually expanding the receptive field, then operates reversely to capture the small fine details that might be scattered by enlarging the receptive field. To validate the robustness of our model, we tested it on a publicly available dataset of 1705 axial CT images with different types of COVID-19 infection. Experimental results show that MCA-Unet has attained a remarkable gain in performance in comparison with the basic U-Net and its variant. It achieved high performance using different evaluation metrics showing 88.6% Dice similarity coefficient, 85.4% Jaccard index, and 93.5% F-score measure. This outperformance shows great potential to help physicians during their examination and improve the clinical workflow.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.90
自引率
0.00%
发文量
0
审稿时长
10 weeks
期刊最新文献
Fostering digital health literacy to enhance trust and improve health outcomes Machine learning from real data: A mental health registry case study ResfEANet: ResNet-fused External Attention Network for Tuberculosis Diagnosis using Chest X-ray Images Role-playing recovery in social virtual worlds: Adult use of child avatars as PTSD therapy Comparative evaluation of low-cost 3D scanning devices for ear acquisition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1