{"title":"利用去噪扩散概率模型改进多厚度 CT 分割","authors":"","doi":"10.1016/j.neucom.2024.128573","DOIUrl":null,"url":null,"abstract":"<div><p>Organs-at-risk (OARs) segmentation in computed tomography (CT) is a fundamental step in the radiotherapy workflow, which has been prone as a time-consuming and labor-intensive task. Deep neural networks (DNNs) have gained significant popularity in the field of OAR segmentation tasks, achieving remarkable progress in clinical practice. Typically, OARs are distributed throughout different areas of the body and require varying thicknesses of CT scans for better diagnosis and segmentation in clinical. Most DNN-based segmentation focuses on single-thickness CT scans, limiting their applicability to varying thicknesses due to a lack of diverse thickness-related feature learning. While pre-training with the denoising diffusion probabilistic model (DDPM) offers an effective solution for dense feature learning, current works are constrained in addressing feature diversity, as exemplified by scenarios such as multi-thickness CT. To address the above challenges, this paper introduces a novel pre-training approach called DiffMT. This approach leverages the DDPM to extract valuable features from multi-thickness CT images. By transferring the pre-trained DDPM to the downstream segmentation for fine-tuning, the model gains proficiency in learning diverse multi-thickness CT features, leading to precise segmentation across varied thicknesses. We explore DiffMT’s feature learning capacity through experiments involving pre-trained models of varying sizes and different denoising thicknesses. Subsequently, thorough experiments comparing DDPM-based segmentation with other state-of-the-art (SOTA) CT segmentation methods, along with assessments on diverse OARs and modalities, empirically demonstrate that the proposed DiffMT method outperforms the control methods. The codes are available at <span><span>https://github.com/ychengrong/DiffMT</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging denoising diffusion probabilistic model to improve the multi-thickness CT segmentation\",\"authors\":\"\",\"doi\":\"10.1016/j.neucom.2024.128573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Organs-at-risk (OARs) segmentation in computed tomography (CT) is a fundamental step in the radiotherapy workflow, which has been prone as a time-consuming and labor-intensive task. Deep neural networks (DNNs) have gained significant popularity in the field of OAR segmentation tasks, achieving remarkable progress in clinical practice. Typically, OARs are distributed throughout different areas of the body and require varying thicknesses of CT scans for better diagnosis and segmentation in clinical. Most DNN-based segmentation focuses on single-thickness CT scans, limiting their applicability to varying thicknesses due to a lack of diverse thickness-related feature learning. While pre-training with the denoising diffusion probabilistic model (DDPM) offers an effective solution for dense feature learning, current works are constrained in addressing feature diversity, as exemplified by scenarios such as multi-thickness CT. To address the above challenges, this paper introduces a novel pre-training approach called DiffMT. This approach leverages the DDPM to extract valuable features from multi-thickness CT images. By transferring the pre-trained DDPM to the downstream segmentation for fine-tuning, the model gains proficiency in learning diverse multi-thickness CT features, leading to precise segmentation across varied thicknesses. We explore DiffMT’s feature learning capacity through experiments involving pre-trained models of varying sizes and different denoising thicknesses. Subsequently, thorough experiments comparing DDPM-based segmentation with other state-of-the-art (SOTA) CT segmentation methods, along with assessments on diverse OARs and modalities, empirically demonstrate that the proposed DiffMT method outperforms the control methods. The codes are available at <span><span>https://github.com/ychengrong/DiffMT</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231224013444\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224013444","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Leveraging denoising diffusion probabilistic model to improve the multi-thickness CT segmentation
Organs-at-risk (OARs) segmentation in computed tomography (CT) is a fundamental step in the radiotherapy workflow, which has been prone as a time-consuming and labor-intensive task. Deep neural networks (DNNs) have gained significant popularity in the field of OAR segmentation tasks, achieving remarkable progress in clinical practice. Typically, OARs are distributed throughout different areas of the body and require varying thicknesses of CT scans for better diagnosis and segmentation in clinical. Most DNN-based segmentation focuses on single-thickness CT scans, limiting their applicability to varying thicknesses due to a lack of diverse thickness-related feature learning. While pre-training with the denoising diffusion probabilistic model (DDPM) offers an effective solution for dense feature learning, current works are constrained in addressing feature diversity, as exemplified by scenarios such as multi-thickness CT. To address the above challenges, this paper introduces a novel pre-training approach called DiffMT. This approach leverages the DDPM to extract valuable features from multi-thickness CT images. By transferring the pre-trained DDPM to the downstream segmentation for fine-tuning, the model gains proficiency in learning diverse multi-thickness CT features, leading to precise segmentation across varied thicknesses. We explore DiffMT’s feature learning capacity through experiments involving pre-trained models of varying sizes and different denoising thicknesses. Subsequently, thorough experiments comparing DDPM-based segmentation with other state-of-the-art (SOTA) CT segmentation methods, along with assessments on diverse OARs and modalities, empirically demonstrate that the proposed DiffMT method outperforms the control methods. The codes are available at https://github.com/ychengrong/DiffMT.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.