遥感语义分割的高效特征生成与空间聚合研究

Algorithms Pub Date : 2024-04-04 DOI:10.3390/a17040151

Ruoyang Li, Shuping Xiong, Yinchao Che, Lei Shi, Xinming Ma, Lei Xi

{"title":"遥感语义分割的高效特征生成与空间聚合研究","authors":"Ruoyang Li, Shuping Xiong, Yinchao Che, Lei Shi, Xinming Ma, Lei Xi","doi":"10.3390/a17040151","DOIUrl":null,"url":null,"abstract":"Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and slow execution. To address these issues, we introduce a semantic segmentation network model emphasizing the rapid generation of redundant features and multi-level spatial aggregation. This model applies cost-efficient linear transformations instead of standard convolution operations during feature map generation, effectively managing memory usage and reducing computational complexity. To enhance the feature maps’ representation ability post-linear transformation, a specifically designed dual-attention mechanism is implemented, enhancing the model’s capacity for semantic understanding of both local and global image information. Moreover, the model integrates sparse self-attention with multi-scale contextual strategies, effectively combining features across different scales and spatial extents. This approach optimizes computational efficiency and retains crucial information, enabling precise and quick image segmentation. To assess the model’s segmentation performance, we conducted experiments in Changge City, Henan Province, using datasets such as LoveDA, PASCAL VOC, LandCoverNet, and DroneDeploy. These experiments demonstrated the model’s outstanding performance on public remote sensing datasets, significantly reducing the parameter count and computational complexity while maintaining high accuracy in segmentation tasks. This advancement offers substantial technical benefits for applications in agriculture and forestry, including land cover classification and crop health monitoring, thereby underscoring the model’s potential to support these critical sectors effectively.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"62 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Efficient Feature Generation and Spatial Aggregation for Remote Sensing Semantic Segmentation\",\"authors\":\"Ruoyang Li, Shuping Xiong, Yinchao Che, Lei Shi, Xinming Ma, Lei Xi\",\"doi\":\"10.3390/a17040151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and slow execution. To address these issues, we introduce a semantic segmentation network model emphasizing the rapid generation of redundant features and multi-level spatial aggregation. This model applies cost-efficient linear transformations instead of standard convolution operations during feature map generation, effectively managing memory usage and reducing computational complexity. To enhance the feature maps’ representation ability post-linear transformation, a specifically designed dual-attention mechanism is implemented, enhancing the model’s capacity for semantic understanding of both local and global image information. Moreover, the model integrates sparse self-attention with multi-scale contextual strategies, effectively combining features across different scales and spatial extents. This approach optimizes computational efficiency and retains crucial information, enabling precise and quick image segmentation. To assess the model’s segmentation performance, we conducted experiments in Changge City, Henan Province, using datasets such as LoveDA, PASCAL VOC, LandCoverNet, and DroneDeploy. These experiments demonstrated the model’s outstanding performance on public remote sensing datasets, significantly reducing the parameter count and computational complexity while maintaining high accuracy in segmentation tasks. This advancement offers substantial technical benefits for applications in agriculture and forestry, including land cover classification and crop health monitoring, thereby underscoring the model’s potential to support these critical sectors effectively.\",\"PeriodicalId\":502609,\"journal\":{\"name\":\"Algorithms\",\"volume\":\"62 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Algorithms\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/a17040151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/a17040151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

利用深度卷积神经网络的语义分割算法往往会遇到参数多、计算复杂度高、执行速度慢等挑战。为了解决这些问题，我们引入了一种语义分割网络模型，强调快速生成冗余特征和多级空间聚合。该模型在特征图生成过程中采用了具有成本效益的线性变换，而不是标准的卷积操作，从而有效地管理了内存使用，降低了计算复杂度。为了提高线性变换后特征图的表示能力，该模型采用了专门设计的双重关注机制，增强了模型对局部和全局图像信息的语义理解能力。此外，该模型还将稀疏自我注意与多尺度上下文策略相结合，有效地结合了不同尺度和空间范围的特征。这种方法既能优化计算效率，又能保留关键信息，从而实现精确、快速的图像分割。为了评估该模型的分割性能，我们在河南省长葛市使用 LoveDA、PASCAL VOC、LandCoverNet 和 DroneDeploy 等数据集进行了实验。这些实验证明了该模型在公共遥感数据集上的出色表现，在保持高精度分割任务的同时，大大减少了参数数量和计算复杂度。这一进步为农业和林业应用（包括土地覆被分类和作物健康监测）带来了巨大的技术优势，从而凸显了该模型有效支持这些关键领域的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Research on Efficient Feature Generation and Spatial Aggregation for Remote Sensing Semantic Segmentation

Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and slow execution. To address these issues, we introduce a semantic segmentation network model emphasizing the rapid generation of redundant features and multi-level spatial aggregation. This model applies cost-efficient linear transformations instead of standard convolution operations during feature map generation, effectively managing memory usage and reducing computational complexity. To enhance the feature maps’ representation ability post-linear transformation, a specifically designed dual-attention mechanism is implemented, enhancing the model’s capacity for semantic understanding of both local and global image information. Moreover, the model integrates sparse self-attention with multi-scale contextual strategies, effectively combining features across different scales and spatial extents. This approach optimizes computational efficiency and retains crucial information, enabling precise and quick image segmentation. To assess the model’s segmentation performance, we conducted experiments in Changge City, Henan Province, using datasets such as LoveDA, PASCAL VOC, LandCoverNet, and DroneDeploy. These experiments demonstrated the model’s outstanding performance on public remote sensing datasets, significantly reducing the parameter count and computational complexity while maintaining high accuracy in segmentation tasks. This advancement offers substantial technical benefits for applications in agriculture and forestry, including land cover classification and crop health monitoring, thereby underscoring the model’s potential to support these critical sectors effectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助