{"title":"An Effective Strategy of Object Instance Segmentation in Sonar Images","authors":"Pengfei Shi, Huanru Sun, Qi He, Hanren Wang, Xinnan Fan, Yuanxue Xin","doi":"10.1049/2024/1357293","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Instance segmentation is a task that involves pixel-level classification and segmentation of each object instance in images. Various CNN-based methods have achieved promising results in natural image instance segmentation. However, the noise interference, low resolution, and blurred edges bring more significant challenges for sonar image instance segmentation. To solve these problems, we propose the Effective Strategy for Sonar Images Instance Segmentation (ESSIIS). We introduce ASception, a new network combining Atrous Spatial Pyramid Pooling (ASPP) and Extreme Inception (Xception). By integrating this with ResNet and transforming traditional convolutions into deformable convolutions, we further improve the ability of the network to extract features from sonar images. Additionally, we incorporate a bidirectional feature fusion module to enhance information fusion. Finally, we evaluate the detection accuracy and segmentation accuracy of the proposed method on the public sonar image dataset and the self-constructed dataset. ESSIIS attains a detection accuracy of 0.981 and a segmentation accuracy of 0.951 on SCTD, further impressively achieving 0.986 in both metrics when appraised on our dataset. The evaluation results demonstrate that the proposed method is more accurate, robust, and considerable for sonar image detection and segmentation.</p>\n </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"2024 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/1357293","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/2024/1357293","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Instance segmentation is a task that involves pixel-level classification and segmentation of each object instance in images. Various CNN-based methods have achieved promising results in natural image instance segmentation. However, the noise interference, low resolution, and blurred edges bring more significant challenges for sonar image instance segmentation. To solve these problems, we propose the Effective Strategy for Sonar Images Instance Segmentation (ESSIIS). We introduce ASception, a new network combining Atrous Spatial Pyramid Pooling (ASPP) and Extreme Inception (Xception). By integrating this with ResNet and transforming traditional convolutions into deformable convolutions, we further improve the ability of the network to extract features from sonar images. Additionally, we incorporate a bidirectional feature fusion module to enhance information fusion. Finally, we evaluate the detection accuracy and segmentation accuracy of the proposed method on the public sonar image dataset and the self-constructed dataset. ESSIIS attains a detection accuracy of 0.981 and a segmentation accuracy of 0.951 on SCTD, further impressively achieving 0.986 in both metrics when appraised on our dataset. The evaluation results demonstrate that the proposed method is more accurate, robust, and considerable for sonar image detection and segmentation.
期刊介绍:
IET Signal Processing publishes research on a diverse range of signal processing and machine learning topics, covering a variety of applications, disciplines, modalities, and techniques in detection, estimation, inference, and classification problems. The research published includes advances in algorithm design for the analysis of single and high-multi-dimensional data, sparsity, linear and non-linear systems, recursive and non-recursive digital filters and multi-rate filter banks, as well a range of topics that span from sensor array processing, deep convolutional neural network based approaches to the application of chaos theory, and far more.
Topics covered by scope include, but are not limited to:
advances in single and multi-dimensional filter design and implementation
linear and nonlinear, fixed and adaptive digital filters and multirate filter banks
statistical signal processing techniques and analysis
classical, parametric and higher order spectral analysis
signal transformation and compression techniques, including time-frequency analysis
system modelling and adaptive identification techniques
machine learning based approaches to signal processing
Bayesian methods for signal processing, including Monte-Carlo Markov-chain and particle filtering techniques
theory and application of blind and semi-blind signal separation techniques
signal processing techniques for analysis, enhancement, coding, synthesis and recognition of speech signals
direction-finding and beamforming techniques for audio and electromagnetic signals
analysis techniques for biomedical signals
baseband signal processing techniques for transmission and reception of communication signals
signal processing techniques for data hiding and audio watermarking
sparse signal processing and compressive sensing
Special Issue Call for Papers:
Intelligent Deep Fuzzy Model for Signal Processing - https://digital-library.theiet.org/files/IET_SPR_CFP_IDFMSP.pdf