{"title":"Neighborhood transformer for sparse-view X-ray 3D foot reconstruction","authors":"Wei Wang , Li An , Mingquan Zhou , Gengyin Han","doi":"10.1016/j.bspc.2024.107082","DOIUrl":null,"url":null,"abstract":"<div><div>In medical imaging, Sparse-View X-ray 3D reconstruction is crucial for analyzing and diagnosing foot bone structures. However, existing methods face limitations when handling sparse view data and complex bone structures. To enhance reconstruction accuracy and detail preservation, this paper proposes an innovative Sparse-View X-ray 3D foot reconstruction technique based on Neighborhood Transformer. A new Neighborhood Position Encoding strategy is introduced, which divides X-ray images into local regions using a window mechanism and precisely selects these regions through nearest neighbor methods, thereby capturing detailed features in the images. Building upon existing NeRF (Neural Radiance Fields) technology, the paper introduces the Neighborhood Transformer module. This module significantly improves the expression capability for complex foot bone structures through depthwise separable convolutions and a dual-branch local–global Transformer network. Additionally, an adaptive weight learning strategy is applied within the Transformer module, enabling the model to better capture long-distance dependencies, thereby improving its ability to handle sparse view data.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107082"},"PeriodicalIF":4.9000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424011406","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In medical imaging, Sparse-View X-ray 3D reconstruction is crucial for analyzing and diagnosing foot bone structures. However, existing methods face limitations when handling sparse view data and complex bone structures. To enhance reconstruction accuracy and detail preservation, this paper proposes an innovative Sparse-View X-ray 3D foot reconstruction technique based on Neighborhood Transformer. A new Neighborhood Position Encoding strategy is introduced, which divides X-ray images into local regions using a window mechanism and precisely selects these regions through nearest neighbor methods, thereby capturing detailed features in the images. Building upon existing NeRF (Neural Radiance Fields) technology, the paper introduces the Neighborhood Transformer module. This module significantly improves the expression capability for complex foot bone structures through depthwise separable convolutions and a dual-branch local–global Transformer network. Additionally, an adaptive weight learning strategy is applied within the Transformer module, enabling the model to better capture long-distance dependencies, thereby improving its ability to handle sparse view data.
在医学成像中,稀疏视图 X 射线三维重建对于分析和诊断足部骨骼结构至关重要。然而,现有方法在处理稀疏视图数据和复杂骨骼结构时面临局限。为了提高重建精度并保留细节,本文提出了一种基于邻域变换器的创新稀疏视图 X 射线三维足部重建技术。本文引入了一种新的邻域位置编码策略,利用窗口机制将 X 射线图像划分为局部区域,并通过近邻方法精确选择这些区域,从而捕捉图像中的细节特征。在现有 NeRF(神经辐射场)技术的基础上,本文引入了 Neighborhood Transformer 模块。该模块通过深度可分离卷积和局部-全局双分支变换器网络,大大提高了复杂足骨结构的表达能力。此外,变换器模块还采用了自适应权重学习策略,使模型能够更好地捕捉长距离依赖关系,从而提高处理稀疏视图数据的能力。
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.