HyFormer:用于视网膜 OCT 图像分割的混合变换器-CNN 架构。

IF 2.9 2区 医学 Q2 BIOCHEMICAL RESEARCH METHODS Biomedical optics express Pub Date : 2024-10-02 eCollection Date: 2024-11-01 DOI:10.1364/BOE.538959
Qingxin Jiang, Ying Fan, Menghan Li, Sheng Fang, Weifang Zhu, Dehui Xiang, Tao Peng, Xinjian Chen, Xun Xu, Fei Shi
{"title":"HyFormer:用于视网膜 OCT 图像分割的混合变换器-CNN 架构。","authors":"Qingxin Jiang, Ying Fan, Menghan Li, Sheng Fang, Weifang Zhu, Dehui Xiang, Tao Peng, Xinjian Chen, Xun Xu, Fei Shi","doi":"10.1364/BOE.538959","DOIUrl":null,"url":null,"abstract":"<p><p>Optical coherence tomography (OCT) has become the leading imaging technique in diagnosing and treatment planning for retinal diseases. Retinal OCT image segmentation involves extracting lesions and/or tissue structures to aid in the decisions of ophthalmologists, and multi-class segmentation is commonly needed. As the target regions often spread widely inside the retina, and the intensities and locations of different categories can be close, good segmentation networks must possess both global modeling capabilities and the ability to capture fine details. To address the challenge in capturing both global and local features simultaneously, we propose HyFormer, an efficient, lightweight, and robust hybrid network architecture. The proposed architecture features parallel Transformer and convolutional encoders for independent feature capture. A multi-scale gated attention block and a group positional embedding block are introduced within the Transformer encoder to enhance feature extraction. Feature integration is achieved in the decoder composed of the proposed three-path fusion modules. A class activation map-based cross-entropy loss function is also proposed to improve segmentation results. Evaluations are performed on a private dataset with myopic traction maculopathy lesions and the public AROI dataset for retinal layer and lesion segmentation with age-related degeneration. The results demonstrate HyFormer's superior segmentation performance and robustness compared to existing methods, showing promise for accurate and efficient OCT image segmentation. .</p>","PeriodicalId":8969,"journal":{"name":"Biomedical optics express","volume":"15 11","pages":"6156-6170"},"PeriodicalIF":2.9000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11563338/pdf/","citationCount":"0","resultStr":"{\"title\":\"HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation.\",\"authors\":\"Qingxin Jiang, Ying Fan, Menghan Li, Sheng Fang, Weifang Zhu, Dehui Xiang, Tao Peng, Xinjian Chen, Xun Xu, Fei Shi\",\"doi\":\"10.1364/BOE.538959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Optical coherence tomography (OCT) has become the leading imaging technique in diagnosing and treatment planning for retinal diseases. Retinal OCT image segmentation involves extracting lesions and/or tissue structures to aid in the decisions of ophthalmologists, and multi-class segmentation is commonly needed. As the target regions often spread widely inside the retina, and the intensities and locations of different categories can be close, good segmentation networks must possess both global modeling capabilities and the ability to capture fine details. To address the challenge in capturing both global and local features simultaneously, we propose HyFormer, an efficient, lightweight, and robust hybrid network architecture. The proposed architecture features parallel Transformer and convolutional encoders for independent feature capture. A multi-scale gated attention block and a group positional embedding block are introduced within the Transformer encoder to enhance feature extraction. Feature integration is achieved in the decoder composed of the proposed three-path fusion modules. A class activation map-based cross-entropy loss function is also proposed to improve segmentation results. Evaluations are performed on a private dataset with myopic traction maculopathy lesions and the public AROI dataset for retinal layer and lesion segmentation with age-related degeneration. The results demonstrate HyFormer's superior segmentation performance and robustness compared to existing methods, showing promise for accurate and efficient OCT image segmentation. .</p>\",\"PeriodicalId\":8969,\"journal\":{\"name\":\"Biomedical optics express\",\"volume\":\"15 11\",\"pages\":\"6156-6170\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11563338/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical optics express\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1364/BOE.538959\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical optics express","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1364/BOE.538959","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

光学相干断层扫描(OCT)已成为诊断和治疗视网膜疾病的主要成像技术。视网膜 OCT 图像分割涉及提取病变和/或组织结构,以帮助眼科医生做出决定,通常需要进行多类分割。由于目标区域通常广泛分布在视网膜内部,而且不同类别的强度和位置可能很接近,因此好的分割网络必须同时具备全局建模能力和捕捉精细细节的能力。为了应对同时捕捉全局和局部特征的挑战,我们提出了一种高效、轻便、稳健的混合网络架构 HyFormer。该架构具有并行变换器和卷积编码器,可实现独立的特征捕捉。在变换器编码器中引入了多尺度门控注意力块和组位置嵌入块,以加强特征提取。特征整合在解码器中实现,解码器由建议的三路径融合模块组成。此外,还提出了一种基于类激活图的交叉熵损失函数,以改善分割结果。我们在一个包含近视牵引性黄斑病变的私人数据集和一个包含年龄相关变性的视网膜层和病变分割的公共 AROI 数据集上进行了评估。结果表明,与现有方法相比,HyFormer 的分割性能和鲁棒性更优越,有望实现准确、高效的 OCT 图像分割。.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation.

Optical coherence tomography (OCT) has become the leading imaging technique in diagnosing and treatment planning for retinal diseases. Retinal OCT image segmentation involves extracting lesions and/or tissue structures to aid in the decisions of ophthalmologists, and multi-class segmentation is commonly needed. As the target regions often spread widely inside the retina, and the intensities and locations of different categories can be close, good segmentation networks must possess both global modeling capabilities and the ability to capture fine details. To address the challenge in capturing both global and local features simultaneously, we propose HyFormer, an efficient, lightweight, and robust hybrid network architecture. The proposed architecture features parallel Transformer and convolutional encoders for independent feature capture. A multi-scale gated attention block and a group positional embedding block are introduced within the Transformer encoder to enhance feature extraction. Feature integration is achieved in the decoder composed of the proposed three-path fusion modules. A class activation map-based cross-entropy loss function is also proposed to improve segmentation results. Evaluations are performed on a private dataset with myopic traction maculopathy lesions and the public AROI dataset for retinal layer and lesion segmentation with age-related degeneration. The results demonstrate HyFormer's superior segmentation performance and robustness compared to existing methods, showing promise for accurate and efficient OCT image segmentation. .

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical optics express
Biomedical optics express BIOCHEMICAL RESEARCH METHODS-OPTICS
CiteScore
6.80
自引率
11.80%
发文量
633
审稿时长
1 months
期刊介绍: The journal''s scope encompasses fundamental research, technology development, biomedical studies and clinical applications. BOEx focuses on the leading edge topics in the field, including: Tissue optics and spectroscopy Novel microscopies Optical coherence tomography Diffuse and fluorescence tomography Photoacoustic and multimodal imaging Molecular imaging and therapies Nanophotonic biosensing Optical biophysics/photobiology Microfluidic optical devices Vision research.
期刊最新文献
Super resolution reconstruction of fluorescence microscopy images by a convolutional network with physical priors. Physics-guided deep learning-based real-time image reconstruction of Fourier-domain optical coherence tomography. On bench evaluation of intraocular lenses: performance of a commercial interferometer. Predictive coding compressive sensing optical coherence tomography hardware implementation. Development of silicone-based phantoms for biomedical optics from 400 to 1550 nm.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1