{"title":"基于混合建模的前视声纳图像语义分割","authors":"Yike Wang;Zhi Liu;Gongyang Li;Xiaofeng Lu;Xuefeng Liu;Hongwei Zhang","doi":"10.1109/JOE.2024.3467309","DOIUrl":null,"url":null,"abstract":"Semantic segmentation of forward-looking sonar (FLS) images plays a key role in the perception and interaction of autonomous underwater vehicles with the surrounding environment. Due to the strong noise and blurred object edges in sonar images, there is a high demand for the model's feature extraction and anti-interference ability. Currently, most methods are based on convolutional neural networks (CNNs), which are sensitive to local noise, and have a heavy computational burden, making them difficult to meet real-time requirements. This article re-examines CNNs and vision transformers, proposing a hybrid modeling-based network called HMSeg that combines both convolution modeling and attention modeling approaches for sonar image segmentation. In addition, a dynamic attention gate module is proposed to dynamically enhance feature maps with high-level features and eliminate interference. Furthermore, we propose a composite loss function to guide the model in extracting pure features and accurate semantic information. We present a new FLS image data set and conducted a series of experiments on a marine debris data set and a UATD-Seg data set. The results demonstrate that our proposed HMSeg achieves the best performance, proving its robustness and efficiency in different environments.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 1","pages":"380-393"},"PeriodicalIF":3.8000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Modeling Based Semantic Segmentation of Forward-Looking Sonar Images\",\"authors\":\"Yike Wang;Zhi Liu;Gongyang Li;Xiaofeng Lu;Xuefeng Liu;Hongwei Zhang\",\"doi\":\"10.1109/JOE.2024.3467309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic segmentation of forward-looking sonar (FLS) images plays a key role in the perception and interaction of autonomous underwater vehicles with the surrounding environment. Due to the strong noise and blurred object edges in sonar images, there is a high demand for the model's feature extraction and anti-interference ability. Currently, most methods are based on convolutional neural networks (CNNs), which are sensitive to local noise, and have a heavy computational burden, making them difficult to meet real-time requirements. This article re-examines CNNs and vision transformers, proposing a hybrid modeling-based network called HMSeg that combines both convolution modeling and attention modeling approaches for sonar image segmentation. In addition, a dynamic attention gate module is proposed to dynamically enhance feature maps with high-level features and eliminate interference. Furthermore, we propose a composite loss function to guide the model in extracting pure features and accurate semantic information. We present a new FLS image data set and conducted a series of experiments on a marine debris data set and a UATD-Seg data set. The results demonstrate that our proposed HMSeg achieves the best performance, proving its robustness and efficiency in different environments.\",\"PeriodicalId\":13191,\"journal\":{\"name\":\"IEEE Journal of Oceanic Engineering\",\"volume\":\"50 1\",\"pages\":\"380-393\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Oceanic Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10753270/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10753270/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Hybrid Modeling Based Semantic Segmentation of Forward-Looking Sonar Images
Semantic segmentation of forward-looking sonar (FLS) images plays a key role in the perception and interaction of autonomous underwater vehicles with the surrounding environment. Due to the strong noise and blurred object edges in sonar images, there is a high demand for the model's feature extraction and anti-interference ability. Currently, most methods are based on convolutional neural networks (CNNs), which are sensitive to local noise, and have a heavy computational burden, making them difficult to meet real-time requirements. This article re-examines CNNs and vision transformers, proposing a hybrid modeling-based network called HMSeg that combines both convolution modeling and attention modeling approaches for sonar image segmentation. In addition, a dynamic attention gate module is proposed to dynamically enhance feature maps with high-level features and eliminate interference. Furthermore, we propose a composite loss function to guide the model in extracting pure features and accurate semantic information. We present a new FLS image data set and conducted a series of experiments on a marine debris data set and a UATD-Seg data set. The results demonstrate that our proposed HMSeg achieves the best performance, proving its robustness and efficiency in different environments.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.