{"title":"MiFDeU: Multi-information fusion network based on dual-encoder for pelvic bones segmentation","authors":"Fujiao Ju , Yichu Wu , Mingjie Dong , Jingxin Zhao","doi":"10.1016/j.engappai.2025.110230","DOIUrl":null,"url":null,"abstract":"<div><div>The segmentation of bone fragments is crucial for preoperative planning and intraoperative navigation in reduction surgery. Recent advances in medical segmentation have predominantly focused on U-shaped frameworks that employ convolutional neural networks or transformer variants as the backbone. However, these frameworks, which rely on a single encoder, often struggle with integrating information from diverse features and processing irregular shapes in visual objects. Such limitations can reduce segmentation accuracy and impair generalization performance across different datasets. To address these issues, we introduce a multi-information fusion network based on dual-encoder for pelvic bones segmentation. In order to capture global contextual information and local features simultaneously, our model takes a light resnet and a graph neural network with swin-pool module as dual-encoder for effectively representing the global and local topologies. We construct a high-low multi-dimensional paired attention in the bottleneck for fusing spatial and channel information from different dimensions. Instead of using the traditional dice loss in the unet-like architecture, our model employs both topological loss and boundary loss to enhance the goal optimization. In the experiments, our model achieves a substantially lower dice similarity coefficient and comparable 95 % Hausdorff distance compared to other state-of-the-art. The experiments on across datasets verify the superiority and generalization of the proposed model.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"147 ","pages":"Article 110230"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625002301","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The segmentation of bone fragments is crucial for preoperative planning and intraoperative navigation in reduction surgery. Recent advances in medical segmentation have predominantly focused on U-shaped frameworks that employ convolutional neural networks or transformer variants as the backbone. However, these frameworks, which rely on a single encoder, often struggle with integrating information from diverse features and processing irregular shapes in visual objects. Such limitations can reduce segmentation accuracy and impair generalization performance across different datasets. To address these issues, we introduce a multi-information fusion network based on dual-encoder for pelvic bones segmentation. In order to capture global contextual information and local features simultaneously, our model takes a light resnet and a graph neural network with swin-pool module as dual-encoder for effectively representing the global and local topologies. We construct a high-low multi-dimensional paired attention in the bottleneck for fusing spatial and channel information from different dimensions. Instead of using the traditional dice loss in the unet-like architecture, our model employs both topological loss and boundary loss to enhance the goal optimization. In the experiments, our model achieves a substantially lower dice similarity coefficient and comparable 95 % Hausdorff distance compared to other state-of-the-art. The experiments on across datasets verify the superiority and generalization of the proposed model.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.