CephTransXnet: An attention enhanced feature fusion network leveraging neighborhood rough set approach for cephalometric landmark prediction

Computers in Biology and Medicine · IF 6.3 · CAS Region 2 (Medicine) · JCR Q1 (Biology) · Pub Date: 2025-04-01 (Epub: 2025-02-25) · DOI: 10.1016/j.compbiomed.2025.109891
R. Neeraja, L. Jani Anbarasi
Computers in Biology and Medicine, Volume 188, Article 109891.
Citations: 0

Abstract

The convergence of medical imaging, computer vision, and orthodontics has made automatic cephalometric landmark detection a pivotal area of research. Accurate cephalometric analysis is crucial in orthodontics and in orthognathic and maxillofacial surgery for diagnosis, treatment planning, and monitoring craniofacial growth. In this study, a multi-branch fused feature extraction network titled CephTransXnet is proposed to automatically predict landmark coordinates from cephalometric radiographs. The initial sequential branch enhances discriminative local feature learning and feature extraction through parallel feature fusion, integrating Convolved Pooled Normalized (CPNB) and Gradient Optimized Multi-Path Bottleneck (GMBB) blocks with a Channel and Spatial Attention (CSATM) module. The Swin Transformer (STB) branch efficiently handles long-range dependencies and extracts global features from cephalometric radiographs. The multi-branch fused features, along with features from the skip connections of the CPNB and GMBB blocks, are concatenated using a Coordinate Attention module (CoATM) to capture the positional relationships between landmark features. A Landmark Discriminative Deviation Factor (LDDF) is determined by applying the Neighborhood Rough Set (NRS) approach, which analyzes the features surrounding each landmark using spatial relationships or similarity measures between the landmarks and their neighboring regions. The Spatial Pyramid Pooling (SPPL) layer in the final phase of the CephTransXnet model extracts multi-scale features by pooling over sub-regions of varying sizes, enabling the network to capture both local and global context for precise cephalometric landmark identification. The CephTransXnet framework achieved average Successful Detection Rates (SDRs) of 88.71 % and 79.05 % within 2 mm on the 2015 International Symposium on Biomedical Imaging (ISBI) Grand Challenge dental X-ray analysis dataset.
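As an illustration of the multi-scale pooling step described above, the sketch below implements a generic spatial pyramid pooling layer over a (C, H, W) feature map in NumPy. The pyramid bin sizes `(1, 2, 4)` and the use of max-pooling are assumptions for illustration; the abstract does not specify the exact configuration of CephTransXnet's SPPL layer.

```python
import numpy as np

def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Pool a (C, H, W) feature map over grids of varying sizes and
    concatenate the results into one fixed-length vector, so that inputs
    of different spatial sizes yield outputs of the same length."""
    c, h, w = feature_map.shape
    pooled = []
    for n in levels:  # an n x n grid of sub-regions per level
        for i in range(n):
            for j in range(n):
                r0, r1 = (i * h) // n, ((i + 1) * h) // n
                c0, c1 = (j * w) // n, ((j + 1) * w) // n
                # max-pool each sub-region down to one value per channel
                pooled.append(feature_map[:, r0:r1, c0:c1].max(axis=(1, 2)))
    # output length = C * sum(n*n for n in levels), independent of H and W
    return np.concatenate(pooled)
```

With `levels=(1, 2, 4)` the output has length `C * (1 + 4 + 16)` regardless of the input's spatial resolution, which is what lets such a layer combine local and global context in a fixed-size descriptor.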
The effectiveness of the CephTransXnet model was further evaluated on a private clinical dataset obtained from Solanki Dental Care Clinic in Sharjah, UAE, where it attained an average SDR of 74.38 % within the 2 mm precision range.
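For reference, the SDR figures quoted above can be computed as in the sketch below. The `mm_per_pixel` calibration is an assumption (the ISBI 2015 cephalograms use roughly 0.1 mm per pixel); the abstract itself does not state the conversion.

```python
import numpy as np

def successful_detection_rate(pred_px, gt_px, radius_mm=2.0, mm_per_pixel=0.1):
    """SDR: the fraction of landmark predictions whose Euclidean distance
    from the ground-truth annotation falls within `radius_mm`.
    `pred_px` and `gt_px` are arrays of (x, y) coordinates in pixels."""
    diff = np.asarray(pred_px, dtype=float) - np.asarray(gt_px, dtype=float)
    dist_mm = np.linalg.norm(diff, axis=-1) * mm_per_pixel  # per-landmark error in mm
    return float(np.mean(dist_mm <= radius_mm))
```

Reported SDRs are typically averaged over all landmarks and all test images at several radii (commonly 2, 2.5, 3, and 4 mm in the ISBI challenge protocol).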
Source journal: Computers in Biology and Medicine (Engineering/Technology – Biomedical Engineering)
CiteScore: 11.70 · Self-citation rate: 10.40 % · Articles per year: 1086 · Review time: 74 days
Journal description: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.