Position-aware representation learning with anatomical priors for enhanced pancreas tumor segmentation

IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neurocomputing Pub Date : 2024-11-16 DOI:10.1016/j.neucom.2024.128881
Kaiqi Dong , Peijun Hu , Yu Tian , Yan Zhu , Xiang Li , Tianshu Zhou , Xueli Bai , Tingbo Liang , Jingsong Li
{"title":"Position-aware representation learning with anatomical priors for enhanced pancreas tumor segmentation","authors":"Kaiqi Dong ,&nbsp;Peijun Hu ,&nbsp;Yu Tian ,&nbsp;Yan Zhu ,&nbsp;Xiang Li ,&nbsp;Tianshu Zhou ,&nbsp;Xueli Bai ,&nbsp;Tingbo Liang ,&nbsp;Jingsong Li","doi":"10.1016/j.neucom.2024.128881","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate pancreatic tumor segmentation in CT images is crucial but challenging due to the complex anatomy and varied tumor appearance. Previous methods predominantly adopt two-stage segmentation approaches to identify and localize tumors and rely heavily on CNN-extracted texture features. In this study, we propose a tumor position-aware branch to learn pancreatic anatomical priors and integrate them into a standard 3D U-Net segmentation network. The tumor position-aware branch consists of three innovative components. Firstly, the proposed method utilizes discrete information bottleneck theory to extract compact and informative segmentation features with pancreatic anatomical priors. Secondly, we propose a coordinate position encoding transformer that encodes the spatial coordinates of each patch within the CT volume. This encoding provides the model with a global positional context, allowing it to effectively model the spatial relationships between anatomical structures. Thirdly, a probability margin regularization loss is proposed to further eliminate the interference of background patches on the learning of pancreatic anatomical positions. Our model is trained and validated our model on the public Medical Segmentation Decathlon (MSD) dataset and a private clinical dataset. Experimental results demonstrate that our approach achieves competitive performance compared to state-of-the-art (SOTA) methods in both pancreas and tumor segmentation, with Dice scores of 82.11% for the pancreas and 55.56% for the tumor on the MSD dataset. The proposed framework offers an effective solution to leverage anatomical priors and enhance representation learning for improved pancreatic tumor segmentation.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"616 ","pages":"Article 128881"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224016527","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate pancreatic tumor segmentation in CT images is crucial but challenging due to the complex anatomy and varied tumor appearance. Previous methods predominantly adopt two-stage segmentation approaches to identify and localize tumors and rely heavily on CNN-extracted texture features. In this study, we propose a tumor position-aware branch to learn pancreatic anatomical priors and integrate them into a standard 3D U-Net segmentation network. The tumor position-aware branch consists of three innovative components. Firstly, the proposed method utilizes discrete information bottleneck theory to extract compact and informative segmentation features with pancreatic anatomical priors. Secondly, we propose a coordinate position encoding transformer that encodes the spatial coordinates of each patch within the CT volume. This encoding provides the model with a global positional context, allowing it to effectively model the spatial relationships between anatomical structures. Thirdly, a probability margin regularization loss is proposed to further eliminate the interference of background patches on the learning of pancreatic anatomical positions. Our model is trained and validated our model on the public Medical Segmentation Decathlon (MSD) dataset and a private clinical dataset. Experimental results demonstrate that our approach achieves competitive performance compared to state-of-the-art (SOTA) methods in both pancreas and tumor segmentation, with Dice scores of 82.11% for the pancreas and 55.56% for the tumor on the MSD dataset. The proposed framework offers an effective solution to leverage anatomical priors and enhance representation learning for improved pancreatic tumor segmentation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于解剖先验的位置感知表征学习增强胰腺肿瘤分割
由于复杂的解剖结构和不同的肿瘤外观,在CT图像中准确的胰腺肿瘤分割是至关重要的,但具有挑战性。以前的方法主要采用两阶段分割方法来识别和定位肿瘤,并且严重依赖于cnn提取的纹理特征。在这项研究中,我们提出了一个肿瘤位置感知分支来学习胰腺解剖先验,并将它们整合到标准的3D U-Net分割网络中。肿瘤位置感知分支由三个创新组件组成。该方法首先利用离散信息瓶颈理论提取具有胰腺解剖先验的紧凑、信息丰富的分割特征;其次,我们提出了一种坐标位置编码转换器,对CT体内每个patch的空间坐标进行编码。这种编码为模型提供了一个全局位置上下文,使其能够有效地模拟解剖结构之间的空间关系。再次,提出一种概率边缘正则化损失,进一步消除背景斑块对胰腺解剖位置学习的干扰。我们的模型在公共医疗分割十项全能(MSD)数据集和私人临床数据集上进行了训练和验证。实验结果表明,与最先进的(SOTA)方法相比,我们的方法在胰腺和肿瘤分割方面都取得了具有竞争力的性能,在MSD数据集上,胰腺的Dice得分为82.11%,肿瘤的Dice得分为55.56%。所提出的框架提供了有效的解决方案,利用解剖先验和增强表征学习来改进胰腺肿瘤分割。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
期刊最新文献
Agricultural surface water extraction in environmental remote sensing: A novel semantic segmentation model emphasizing contextual information enhancement and foreground detail attention Physics embedded neural network: Novel data-free approach towards scientific computing and applications in transfer learning View-Channel Mixer Network for Double Incomplete Multi-View Multi-Label learning Diffusion models for image super-resolution: State-of-the-art and future directions Investigating the effects of recursion in convolutional layers using analytical methods
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1