Enyuan Pan , Yuan Zhong , Ping Li , Yi Yang , Jin Zhou
{"title":"Hybrid-noise generative diffusion probabilistic model for cervical spine MRI image generation","authors":"Enyuan Pan , Yuan Zhong , Ping Li , Yi Yang , Jin Zhou","doi":"10.1016/j.cmpb.2025.108639","DOIUrl":null,"url":null,"abstract":"<div><div>Medical imaging is crucial for artificial intelligence-based clinician decision-making. However, learning anatomical features from limited samples poses a challenge. To address this issue, recent studies have employed diffusion models to generate medical imaging data, demonstrating the potential for high-quality medical image generation through deep learning. In this paper, we propose a high-quality cervical MRI image generation method called the Cervical Spine MRI Diffusion Probabilistic Model (CSM-DPM). Considering the complexity of neck MRI image data, our method uses a hybrid of standard Gaussian noise and point noise obtained by sampling within 2D Gaussian noise fields to approximate the true distribution of the image data. Meanwhile, the cosine noise schedule is used to make the morphology of the generated vertebral blocks and other focal areas more visually natural and clear. Furthermore, to enhance the noise prediction capabilities of UNet in DDPM, we devise the Asa-ResUNet module, which incorporates an asymmetric attention mechanism. This mechanism includes spatial attention on different ResUNet layers to improve feature extraction and incorporates high-level semantic information. We further enhance the stability and robustness of the Asa-ResUNet by using an exponential weighting strategy (EMA). Experiments demonstrate that our method produces cervical spine MRI images with FID values that are up to 15.79%, 52.61%, and 46.57% lower than those produced by DDPM, DDIM, and F-PNDM, respectively, indicating superior image quality. Segmentation experiments confirm that the generated images can enhance the overall performance of segmentation models when used for training.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"262 ","pages":"Article 108639"},"PeriodicalIF":4.9000,"publicationDate":"2025-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725000562","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Medical imaging is crucial for artificial intelligence-based clinician decision-making. However, learning anatomical features from limited samples poses a challenge. To address this issue, recent studies have employed diffusion models to generate medical imaging data, demonstrating the potential for high-quality medical image generation through deep learning. In this paper, we propose a high-quality cervical MRI image generation method called the Cervical Spine MRI Diffusion Probabilistic Model (CSM-DPM). Considering the complexity of neck MRI image data, our method uses a hybrid of standard Gaussian noise and point noise obtained by sampling within 2D Gaussian noise fields to approximate the true distribution of the image data. Meanwhile, the cosine noise schedule is used to make the morphology of the generated vertebral blocks and other focal areas more visually natural and clear. Furthermore, to enhance the noise prediction capabilities of UNet in DDPM, we devise the Asa-ResUNet module, which incorporates an asymmetric attention mechanism. This mechanism includes spatial attention on different ResUNet layers to improve feature extraction and incorporates high-level semantic information. We further enhance the stability and robustness of the Asa-ResUNet by using an exponential weighting strategy (EMA). Experiments demonstrate that our method produces cervical spine MRI images with FID values that are up to 15.79%, 52.61%, and 46.57% lower than those produced by DDPM, DDIM, and F-PNDM, respectively, indicating superior image quality. Segmentation experiments confirm that the generated images can enhance the overall performance of segmentation models when used for training.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.