Qian Tang;Qikui Zhu;Yuxuan Xiong;Yongchao Xu;Bo Du
{"title":"用于医学图像分割的边缘与掩膜集成驱动扩散模型","authors":"Qian Tang;Qikui Zhu;Yuxuan Xiong;Yongchao Xu;Bo Du","doi":"10.1109/LSP.2024.3466608","DOIUrl":null,"url":null,"abstract":"Denoising diffusion probabilistic models (DDPMs) exhibit significant potential in the realm of medical image segmentation. Nevertheless, current DDPM implementations rely on original image features as conditional information, thus lacking the ability to specifically emphasize edge information, a critical aspect in addressing the primary challenge of segmentation. Furthermore, the necessary semantic features for conditioning the diffusion process lack effective alignment with the noise embedding. To address the above issues, we propose a novel edge-and-mask integration-driven diffusion model (EMidDiff). Specifically, 1) an edge-and-mask condition strategy is proposed for the segmentation diffusion model to effectively leverage rich semantic features, particularly the edge feature. 2) A novel co-attention guidance block is designed to align the segmentation map and condition features. The experimental results on brain tumor segmentation and optic-cup segmentation underscore the effectiveness of our approach, surpassing the performance of some state-of-the-art segmentation diffusion models.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"31 ","pages":"2665-2669"},"PeriodicalIF":3.2000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Edge-and-Mask Integration-Driven Diffusion Models for Medical Image Segmentation\",\"authors\":\"Qian Tang;Qikui Zhu;Yuxuan Xiong;Yongchao Xu;Bo Du\",\"doi\":\"10.1109/LSP.2024.3466608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Denoising diffusion probabilistic models (DDPMs) exhibit significant potential in the realm of medical image segmentation. Nevertheless, current DDPM implementations rely on original image features as conditional information, thus lacking the ability to specifically emphasize edge information, a critical aspect in addressing the primary challenge of segmentation. Furthermore, the necessary semantic features for conditioning the diffusion process lack effective alignment with the noise embedding. To address the above issues, we propose a novel edge-and-mask integration-driven diffusion model (EMidDiff). Specifically, 1) an edge-and-mask condition strategy is proposed for the segmentation diffusion model to effectively leverage rich semantic features, particularly the edge feature. 2) A novel co-attention guidance block is designed to align the segmentation map and condition features. The experimental results on brain tumor segmentation and optic-cup segmentation underscore the effectiveness of our approach, surpassing the performance of some state-of-the-art segmentation diffusion models.\",\"PeriodicalId\":13154,\"journal\":{\"name\":\"IEEE Signal Processing Letters\",\"volume\":\"31 \",\"pages\":\"2665-2669\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Signal Processing Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10689371/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10689371/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Edge-and-Mask Integration-Driven Diffusion Models for Medical Image Segmentation
Denoising diffusion probabilistic models (DDPMs) exhibit significant potential in the realm of medical image segmentation. Nevertheless, current DDPM implementations rely on original image features as conditional information, thus lacking the ability to specifically emphasize edge information, a critical aspect in addressing the primary challenge of segmentation. Furthermore, the necessary semantic features for conditioning the diffusion process lack effective alignment with the noise embedding. To address the above issues, we propose a novel edge-and-mask integration-driven diffusion model (EMidDiff). Specifically, 1) an edge-and-mask condition strategy is proposed for the segmentation diffusion model to effectively leverage rich semantic features, particularly the edge feature. 2) A novel co-attention guidance block is designed to align the segmentation map and condition features. The experimental results on brain tumor segmentation and optic-cup segmentation underscore the effectiveness of our approach, surpassing the performance of some state-of-the-art segmentation diffusion models.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.