Semantic labelling of remote sensing images is crucial for many remote sensing applications. However, densely distributed man-made and natural objects that share similar colours and lie close together make consistent, accurate labelling difficult. To address this issue, a novel deep learning model, U-ONet, is presented that incorporates octave convolutions into an end-to-end U-shaped convolutional neural network (CNN). Unlike conventional CNNs, it employs octave convolutions instead of standard convolutions, which reduces low-frequency information redundancy while maintaining segmentation accuracy. Furthermore, coordination attention is introduced in the encoder module to enhance the network's ability to extract useful features by modelling spatial and channel dependencies within the feature maps, enabling the network to better capture channel, direction, and location information. Finally, the U-shaped network is designed with a completely symmetric structure that employs skip connections to merge low-resolution information, used for object class recognition, with high-resolution information, used for precise localization, which ultimately improves segmentation accuracy. Experimental results on two public datasets demonstrate that U-ONet achieves state-of-the-art performance, making it a compelling choice for remote sensing image semantic labelling applications.
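The redundancy reduction attributed to octave convolution can be illustrated with a small cost calculation. The sketch below assumes the commonly used formulation, in which a fraction `alpha` of the channels is treated as low-frequency and stored at half the spatial resolution along each axis. The abstract does not state U-ONet's actual channel ratio or layer configuration, so `alpha`, the function names, and the simplification of ignoring pooling and upsampling cost are all illustrative assumptions.

```python
def octave_cost_ratio(alpha: float) -> float:
    """Approximate multiply-add cost of one octave convolution relative to a
    standard convolution with the same kernel size and channel counts.

    Assumptions (not taken from the abstract): the same low-frequency ratio
    `alpha` is used for input and output channels, low-frequency maps hold a
    quarter of the pixels, and pooling/upsampling cost is ignored.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    high, low = 1.0 - alpha, alpha
    high_to_high = high * high        # computed at full resolution
    high_to_low = high * low / 4.0    # input pooled first, computed at low resolution
    low_to_high = low * high / 4.0    # computed at low resolution, then upsampled
    low_to_low = low * low / 4.0      # computed at low resolution
    return high_to_high + high_to_low + low_to_high + low_to_low


def octave_memory_ratio(alpha: float) -> float:
    """Feature-map storage relative to keeping every channel at full resolution."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return (1.0 - alpha) + alpha / 4.0


if __name__ == "__main__":
    # Hypothetical ratios chosen only to show the trend.
    for alpha in (0.0, 0.25, 0.5, 0.75):
        print(alpha, octave_cost_ratio(alpha), octave_memory_ratio(alpha))
```

With `alpha = 0` the layer reduces to a standard convolution, and larger values shift more channels to the cheaper low-resolution branch. How much accuracy is retained at a given ratio is an empirical question that this sketch does not address.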
Qiongqiong Hu, Feiting Wang, Yuechao Wu, Ying Li. "U-ONet: Remote sensing image semantic labelling based on octave convolution and coordination attention in U-shape deep neural network." Electronics Letters, published 2024-10-17. DOI: 10.1049/ell2.70014, https://doi.org/10.1049/ell2.70014