A pseudo-3D coarse-to-fine architecture for 3D medical landmark detection

IF 6.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neurocomputing Pub Date : 2024-10-28 DOI:10.1016/j.neucom.2024.128782
Li Cui , Boyan Liu , Guikun Xu , Jixiang Guo , Wei Tang , Tao He
{"title":"A pseudo-3D coarse-to-fine architecture for 3D medical landmark detection","authors":"Li Cui ,&nbsp;Boyan Liu ,&nbsp;Guikun Xu ,&nbsp;Jixiang Guo ,&nbsp;Wei Tang ,&nbsp;Tao He","doi":"10.1016/j.neucom.2024.128782","DOIUrl":null,"url":null,"abstract":"<div><div>The coarse-to-fine architecture is a benchmark method designed to enhance the accuracy of 3D medical landmark detection. However, incorporating 3D convolutional neural networks into the coarse-to-fine architecture leads to a significant increase in model parameters, making it costly for deployment in clinical applications. This paper introduces a novel lightweight pseudo-3D coarse-to-fine architecture, consisting of a Plane-wise Attention Pseudo-3D (PA-P3D) model and a Spatial Separation Pseudo-3D (SS-P3D) model. The PA-P3D inherits the lightweight structure of the general pseudo-3D and enhances cross-plane feature interaction in 3D medical images. On the other hand, the SS-P3D replaces the 3D model with three spatially separated 2D models to simultaneously detect 2D landmarks on axial, sagittal, and coronal planes. In comparison to the conventional coarse-to-fine architecture, the proposed method requires only approximately a quarter of the model parameters (60% reduced by PA-P3D and 40% reduced by SS-P3D) while simultaneously improving landmark detection performance. Experimental results demonstrate the effectiveness of the proposed method, achieving state-of-the-art performance on both a public dataset for mandibular molar landmark detection and a private dataset for cephalometric landmark detection. Overall, this paper highlights the potential of the coarse-to-fine method for cost-effective model deployment, thanks to its lightweight model structure.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"614 ","pages":"Article 128782"},"PeriodicalIF":6.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224015534","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The coarse-to-fine architecture is a benchmark method designed to enhance the accuracy of 3D medical landmark detection. However, incorporating 3D convolutional neural networks into the coarse-to-fine architecture leads to a significant increase in model parameters, making it costly for deployment in clinical applications. This paper introduces a novel lightweight pseudo-3D coarse-to-fine architecture, consisting of a Plane-wise Attention Pseudo-3D (PA-P3D) model and a Spatial Separation Pseudo-3D (SS-P3D) model. The PA-P3D inherits the lightweight structure of the general pseudo-3D and enhances cross-plane feature interaction in 3D medical images. On the other hand, the SS-P3D replaces the 3D model with three spatially separated 2D models to simultaneously detect 2D landmarks on axial, sagittal, and coronal planes. In comparison to the conventional coarse-to-fine architecture, the proposed method requires only approximately a quarter of the model parameters (60% reduced by PA-P3D and 40% reduced by SS-P3D) while simultaneously improving landmark detection performance. Experimental results demonstrate the effectiveness of the proposed method, achieving state-of-the-art performance on both a public dataset for mandibular molar landmark detection and a private dataset for cephalometric landmark detection. Overall, this paper highlights the potential of the coarse-to-fine method for cost-effective model deployment, thanks to its lightweight model structure.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于三维医学地标检测的伪三维粗到细架构
从粗到细架构是一种基准方法,旨在提高三维医学地标检测的准确性。然而,将三维卷积神经网络纳入粗到细架构会导致模型参数大幅增加,使其在临床应用中部署成本高昂。本文介绍了一种新型轻量级伪三维粗到细架构,由平面注意力伪三维(PA-P3D)模型和空间分离伪三维(SS-P3D)模型组成。PA-P3D 继承了一般伪三维模型的轻量级结构,增强了三维医学图像中的跨平面特征交互。另一方面,SS-P3D 用三个空间分离的二维模型取代了三维模型,可同时检测轴向、矢状和冠状面上的二维地标。与传统的从粗到细结构相比,所提出的方法只需要大约四分之一的模型参数(PA-P3D 减少了 60%,SS-P3D 减少了 40%),同时还提高了地标检测性能。实验结果证明了所提方法的有效性,在下颌臼齿地标检测的公共数据集和头面部地标检测的私有数据集上都达到了最先进的性能。总之,本文强调了从粗到细方法因其轻量级模型结构而在经济高效地部署模型方面所具有的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
期刊最新文献
Editorial Board Dismantling strategies for cost networks based on multi-view deep learning Contrastive coarse-to-fine medical segmentation with prototype guidance and dual-granularity fusion LECMARL: A cooperative multi-agent reinforcement learning method based on lazy mechanisms and efficient exploration Offset-corrected query generation strategies for cross-modality misalignment in 3D object detection: aligning LiDAR and camera
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1