{"title":"UTSN-net: medical image semantic segmentation model based on skip non-local attention module","authors":"Li Zhang, BinBing Zhu, Chunpeng Ma","doi":"10.1117/12.2682365","DOIUrl":null,"url":null,"abstract":"The semantic segmentation task of medical image is to segment the focus, organ or substructure of human body in medical image. It plays an important role in locating and identifying the diseased area and making medical plan. In various medical image segmentation tasks, the U-shaped architecture has achieved great success. Transunet introduces Transformer with global attention mechanism into the U-shaped architecture, which overcomes the inherent limitations of convolution, but because it still continues the original skip connections structure, it will bring the strong noise from features in the shallow network into the high semantic features of the deep network, thus affecting the segmentation accuracy. UTSN-net model based on the combination of Transformer and nonlocal attention mechanism is proposed. UTSN-net uses Transformer to overcome the inherent limitations of convolution, and introduces the skip connections module based on nonlocal attention mechanism into the U-shaped network, which can comprehensively consider the deep features with global context information and the shallow features with accurate high-resolution positioning information to improve the accuracy of segmentation results. Experiments on synapse multi-organ abdominal CT dataset verify that UTSN-net has better semantic segmentation performance.","PeriodicalId":440430,"journal":{"name":"International Conference on Electronic Technology and Information Science","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Technology and Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The semantic segmentation task of medical image is to segment the focus, organ or substructure of human body in medical image. It plays an important role in locating and identifying the diseased area and making medical plan. In various medical image segmentation tasks, the U-shaped architecture has achieved great success. Transunet introduces Transformer with global attention mechanism into the U-shaped architecture, which overcomes the inherent limitations of convolution, but because it still continues the original skip connections structure, it will bring the strong noise from features in the shallow network into the high semantic features of the deep network, thus affecting the segmentation accuracy. UTSN-net model based on the combination of Transformer and nonlocal attention mechanism is proposed. UTSN-net uses Transformer to overcome the inherent limitations of convolution, and introduces the skip connections module based on nonlocal attention mechanism into the U-shaped network, which can comprehensively consider the deep features with global context information and the shallow features with accurate high-resolution positioning information to improve the accuracy of segmentation results. Experiments on synapse multi-organ abdominal CT dataset verify that UTSN-net has better semantic segmentation performance.