CUNSB-RFIE:视网膜眼底图像增强中的情境感知非配对神经 Schr"{o}dinger 桥接器

Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang
{"title":"CUNSB-RFIE:视网膜眼底图像增强中的情境感知非配对神经 Schr\"{o}dinger 桥接器","authors":"Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang","doi":"arxiv-2409.10966","DOIUrl":null,"url":null,"abstract":"Retinal fundus photography is significant in diagnosing and monitoring\nretinal diseases. However, systemic imperfections and operator/patient-related\nfactors can hinder the acquisition of high-quality retinal images. Previous\nefforts in retinal image enhancement primarily relied on GANs, which are\nlimited by the trade-off between training stability and output diversity. In\ncontrast, the Schr\\\"{o}dinger Bridge (SB), offers a more stable solution by\nutilizing Optimal Transport (OT) theory to model a stochastic differential\nequation (SDE) between two arbitrary distributions. This allows SB to\neffectively transform low-quality retinal images into their high-quality\ncounterparts. In this work, we leverage the SB framework to propose an\nimage-to-image translation pipeline for retinal image enhancement.\nAdditionally, previous methods often fail to capture fine structural details,\nsuch as blood vessels. To address this, we enhance our pipeline by introducing\nDynamic Snake Convolution, whose tortuous receptive field can better preserve\ntubular structures. We name the resulting retinal fundus image enhancement\nframework the Context-aware Unpaired Neural Schr\\\"{o}dinger Bridge\n(CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use\nthe SB approach for retinal image enhancement. Experimental results on a\nlarge-scale dataset demonstrate the advantage of the proposed method compared\nto several state-of-the-art supervised and unsupervised methods in terms of\nimage quality and performance on downstream tasks.The code is available at\n\\url{https://github.com/Retinal-Research/CUNSB-RFIE}.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CUNSB-RFIE: Context-aware Unpaired Neural Schr\\\"{o}dinger Bridge in Retinal Fundus Image Enhancement\",\"authors\":\"Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang\",\"doi\":\"arxiv-2409.10966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Retinal fundus photography is significant in diagnosing and monitoring\\nretinal diseases. However, systemic imperfections and operator/patient-related\\nfactors can hinder the acquisition of high-quality retinal images. Previous\\nefforts in retinal image enhancement primarily relied on GANs, which are\\nlimited by the trade-off between training stability and output diversity. In\\ncontrast, the Schr\\\\\\\"{o}dinger Bridge (SB), offers a more stable solution by\\nutilizing Optimal Transport (OT) theory to model a stochastic differential\\nequation (SDE) between two arbitrary distributions. This allows SB to\\neffectively transform low-quality retinal images into their high-quality\\ncounterparts. In this work, we leverage the SB framework to propose an\\nimage-to-image translation pipeline for retinal image enhancement.\\nAdditionally, previous methods often fail to capture fine structural details,\\nsuch as blood vessels. To address this, we enhance our pipeline by introducing\\nDynamic Snake Convolution, whose tortuous receptive field can better preserve\\ntubular structures. We name the resulting retinal fundus image enhancement\\nframework the Context-aware Unpaired Neural Schr\\\\\\\"{o}dinger Bridge\\n(CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use\\nthe SB approach for retinal image enhancement. Experimental results on a\\nlarge-scale dataset demonstrate the advantage of the proposed method compared\\nto several state-of-the-art supervised and unsupervised methods in terms of\\nimage quality and performance on downstream tasks.The code is available at\\n\\\\url{https://github.com/Retinal-Research/CUNSB-RFIE}.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.10966\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10966","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

视网膜眼底摄影在诊断和监测视网膜疾病方面具有重要意义。然而,系统缺陷和操作员/患者相关因素会阻碍高质量视网膜图像的获取。以前在视网膜图像增强方面的努力主要依赖于 GAN,而 GAN 受限于训练稳定性和输出多样性之间的权衡。相比之下,薛定谔桥(SB)利用最优传输(OT)理论对两个任意分布之间的随机微分方程(SDE)进行建模,从而提供了一种更稳定的解决方案。这使得 SB 能够有效地将低质量视网膜图像转换为高质量图像。在这项工作中,我们利用 SB 框架提出了用于视网膜图像增强的动画到图像转换管道。为了解决这个问题,我们通过引入动态蛇卷积(Dynamic Snake Convolution)来增强我们的管道,其迂回的感受野可以更好地保留管状结构。我们将由此产生的视网膜眼底图像增强框架命名为 "上下文感知非配对神经桥接(CUNSB-RFIE)"。据我们所知,这是首次将 SB 方法用于视网膜图像增强。在大规模数据集上的实验结果表明,与几种最先进的有监督和无监督方法相比,所提出的方法在图像质量和下游任务性能方面更具优势。代码可在(url{https://github.com/Retinal-Research/CUNSB-RFIE}.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CUNSB-RFIE: Context-aware Unpaired Neural Schr"{o}dinger Bridge in Retinal Fundus Image Enhancement
Retinal fundus photography is significant in diagnosing and monitoring retinal diseases. However, systemic imperfections and operator/patient-related factors can hinder the acquisition of high-quality retinal images. Previous efforts in retinal image enhancement primarily relied on GANs, which are limited by the trade-off between training stability and output diversity. In contrast, the Schr\"{o}dinger Bridge (SB), offers a more stable solution by utilizing Optimal Transport (OT) theory to model a stochastic differential equation (SDE) between two arbitrary distributions. This allows SB to effectively transform low-quality retinal images into their high-quality counterparts. In this work, we leverage the SB framework to propose an image-to-image translation pipeline for retinal image enhancement. Additionally, previous methods often fail to capture fine structural details, such as blood vessels. To address this, we enhance our pipeline by introducing Dynamic Snake Convolution, whose tortuous receptive field can better preserve tubular structures. We name the resulting retinal fundus image enhancement framework the Context-aware Unpaired Neural Schr\"{o}dinger Bridge (CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use the SB approach for retinal image enhancement. Experimental results on a large-scale dataset demonstrate the advantage of the proposed method compared to several state-of-the-art supervised and unsupervised methods in terms of image quality and performance on downstream tasks.The code is available at \url{https://github.com/Retinal-Research/CUNSB-RFIE}.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
multiPI-TransBTS: A Multi-Path Learning Framework for Brain Tumor Image Segmentation Based on Multi-Physical Information Autopet III challenge: Incorporating anatomical knowledge into nnUNet for lesion segmentation in PET/CT Denoising diffusion models for high-resolution microscopy image restoration Tumor aware recurrent inter-patient deformable image registration of computed tomography scans with lung cancer Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to Fine-tune Vision Foundation Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1