CUNSB-RFIE：视网膜眼底图像增强中的情境感知非配对神经 Schr"{o}dinger 桥接器

arXiv - EE - Image and Video Processing Pub Date : 2024-09-17 DOI:arxiv-2409.10966

Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang

{"title":"CUNSB-RFIE：视网膜眼底图像增强中的情境感知非配对神经 Schr\"{o}dinger 桥接器","authors":"Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang","doi":"arxiv-2409.10966","DOIUrl":null,"url":null,"abstract":"Retinal fundus photography is significant in diagnosing and monitoring\nretinal diseases. However, systemic imperfections and operator/patient-related\nfactors can hinder the acquisition of high-quality retinal images. Previous\nefforts in retinal image enhancement primarily relied on GANs, which are\nlimited by the trade-off between training stability and output diversity. In\ncontrast, the Schr\\\"{o}dinger Bridge (SB), offers a more stable solution by\nutilizing Optimal Transport (OT) theory to model a stochastic differential\nequation (SDE) between two arbitrary distributions. This allows SB to\neffectively transform low-quality retinal images into their high-quality\ncounterparts. In this work, we leverage the SB framework to propose an\nimage-to-image translation pipeline for retinal image enhancement.\nAdditionally, previous methods often fail to capture fine structural details,\nsuch as blood vessels. To address this, we enhance our pipeline by introducing\nDynamic Snake Convolution, whose tortuous receptive field can better preserve\ntubular structures. We name the resulting retinal fundus image enhancement\nframework the Context-aware Unpaired Neural Schr\\\"{o}dinger Bridge\n(CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use\nthe SB approach for retinal image enhancement. Experimental results on a\nlarge-scale dataset demonstrate the advantage of the proposed method compared\nto several state-of-the-art supervised and unsupervised methods in terms of\nimage quality and performance on downstream tasks.The code is available at\n\\url{https://github.com/Retinal-Research/CUNSB-RFIE}.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CUNSB-RFIE: Context-aware Unpaired Neural Schr\\\"{o}dinger Bridge in Retinal Fundus Image Enhancement\",\"authors\":\"Xuanzhao Dong, Vamsi Krishna Vasa, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Yi Su, Yujian Xiong, Zhangsihao Yang, Yanxi Chen, Yalin Wang\",\"doi\":\"arxiv-2409.10966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Retinal fundus photography is significant in diagnosing and monitoring\\nretinal diseases. However, systemic imperfections and operator/patient-related\\nfactors can hinder the acquisition of high-quality retinal images. Previous\\nefforts in retinal image enhancement primarily relied on GANs, which are\\nlimited by the trade-off between training stability and output diversity. In\\ncontrast, the Schr\\\\\\\"{o}dinger Bridge (SB), offers a more stable solution by\\nutilizing Optimal Transport (OT) theory to model a stochastic differential\\nequation (SDE) between two arbitrary distributions. This allows SB to\\neffectively transform low-quality retinal images into their high-quality\\ncounterparts. In this work, we leverage the SB framework to propose an\\nimage-to-image translation pipeline for retinal image enhancement.\\nAdditionally, previous methods often fail to capture fine structural details,\\nsuch as blood vessels. To address this, we enhance our pipeline by introducing\\nDynamic Snake Convolution, whose tortuous receptive field can better preserve\\ntubular structures. We name the resulting retinal fundus image enhancement\\nframework the Context-aware Unpaired Neural Schr\\\\\\\"{o}dinger Bridge\\n(CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use\\nthe SB approach for retinal image enhancement. Experimental results on a\\nlarge-scale dataset demonstrate the advantage of the proposed method compared\\nto several state-of-the-art supervised and unsupervised methods in terms of\\nimage quality and performance on downstream tasks.The code is available at\\n\\\\url{https://github.com/Retinal-Research/CUNSB-RFIE}.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.10966\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10966","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

视网膜眼底摄影在诊断和监测视网膜疾病方面具有重要意义。然而，系统缺陷和操作员/患者相关因素会阻碍高质量视网膜图像的获取。以前在视网膜图像增强方面的努力主要依赖于 GAN，而 GAN 受限于训练稳定性和输出多样性之间的权衡。相比之下，薛定谔桥（SB）利用最优传输（OT）理论对两个任意分布之间的随机微分方程（SDE）进行建模，从而提供了一种更稳定的解决方案。这使得 SB 能够有效地将低质量视网膜图像转换为高质量图像。在这项工作中，我们利用 SB 框架提出了用于视网膜图像增强的动画到图像转换管道。为了解决这个问题，我们通过引入动态蛇卷积（Dynamic Snake Convolution）来增强我们的管道，其迂回的感受野可以更好地保留管状结构。我们将由此产生的视网膜眼底图像增强框架命名为 "上下文感知非配对神经桥接（CUNSB-RFIE）"。据我们所知，这是首次将 SB 方法用于视网膜图像增强。在大规模数据集上的实验结果表明，与几种最先进的有监督和无监督方法相比，所提出的方法在图像质量和下游任务性能方面更具优势。代码可在（url{https://github.com/Retinal-Research/CUNSB-RFIE}.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

CUNSB-RFIE: Context-aware Unpaired Neural Schr"{o}dinger Bridge in Retinal Fundus Image Enhancement

Retinal fundus photography is significant in diagnosing and monitoring retinal diseases. However, systemic imperfections and operator/patient-related factors can hinder the acquisition of high-quality retinal images. Previous efforts in retinal image enhancement primarily relied on GANs, which are limited by the trade-off between training stability and output diversity. In contrast, the Schr\"{o}dinger Bridge (SB), offers a more stable solution by utilizing Optimal Transport (OT) theory to model a stochastic differential equation (SDE) between two arbitrary distributions. This allows SB to effectively transform low-quality retinal images into their high-quality counterparts. In this work, we leverage the SB framework to propose an image-to-image translation pipeline for retinal image enhancement. Additionally, previous methods often fail to capture fine structural details, such as blood vessels. To address this, we enhance our pipeline by introducing Dynamic Snake Convolution, whose tortuous receptive field can better preserve tubular structures. We name the resulting retinal fundus image enhancement framework the Context-aware Unpaired Neural Schr\"{o}dinger Bridge (CUNSB-RFIE). To the best of our knowledge, this is the first endeavor to use the SB approach for retinal image enhancement. Experimental results on a large-scale dataset demonstrate the advantage of the proposed method compared to several state-of-the-art supervised and unsupervised methods in terms of image quality and performance on downstream tasks.The code is available at \url{https://github.com/Retinal-Research/CUNSB-RFIE}.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - EE - Image and Video Processing

自引率

0.00%

发文量