{"title":"利用残留注意力多尺度聚合全卷积网络从遥感图像中提取建筑物足迹","authors":"Nima Ahmadian, Amin Sedaghat, Nazila Mohammadi","doi":"10.1007/s12524-024-01961-8","DOIUrl":null,"url":null,"abstract":"<p>Building footprint extraction is crucial for various applications, including disaster management, change detection, and 3D modeling. Satellite and aerial images, when combined with deep learning techniques, offer an effective means for this task. The Multi-scale Aggregation Fully Convolutional Network (MA-FCN) is an encoder-decoder model that emphasizes scale information, producing the final segmentation map by concatenating four feature maps from different stages of the decoder. To enhance segmentation accuracy, we propose two novel deep learning models: Attention MA-FCN and Residual Attention MA-FCN. Attention MA-FCN incorporates attention gates in the skip connections to emphasize relevant features, directing the model’s focus to essential areas. Residual Attention MA-FCN further integrates residual blocks into the architecture, using both attention mechanisms and residual blocks to improve stability against gradient vanishing and overfitting, thereby enabling deeper training. These models were evaluated on the WHU, Massachusetts, and Jinghai District datasets, showing superior performance compared to the original MA-FCN. Specifically, Residual Attention MA-FCN outperformed MA-FCN and Attention MA-FCN by 3.6% and 0.92% on the WHU dataset, and by 5.51% and 0.91% on the Massachusetts dataset in terms of the Intersection Over Union (IOU) metric. Additionally, Residual Attention MA-FCN surpassed MA-FCN, Attention MA-FCN, Mask-RCNN, and U-Net models on the Jinghai District dataset. Due to the significance of building footprint extraction in various applications, the results of this study indicates that the proposed methods are more accurate than the MA-FCN model with better performances in IOU and F1-score metrics.</p>","PeriodicalId":17510,"journal":{"name":"Journal of the Indian Society of Remote Sensing","volume":"214 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Building Footprint Extraction from Remote Sensing Images with Residual Attention Multi-Scale Aggregation Fully Convolutional Network\",\"authors\":\"Nima Ahmadian, Amin Sedaghat, Nazila Mohammadi\",\"doi\":\"10.1007/s12524-024-01961-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Building footprint extraction is crucial for various applications, including disaster management, change detection, and 3D modeling. Satellite and aerial images, when combined with deep learning techniques, offer an effective means for this task. The Multi-scale Aggregation Fully Convolutional Network (MA-FCN) is an encoder-decoder model that emphasizes scale information, producing the final segmentation map by concatenating four feature maps from different stages of the decoder. To enhance segmentation accuracy, we propose two novel deep learning models: Attention MA-FCN and Residual Attention MA-FCN. Attention MA-FCN incorporates attention gates in the skip connections to emphasize relevant features, directing the model’s focus to essential areas. Residual Attention MA-FCN further integrates residual blocks into the architecture, using both attention mechanisms and residual blocks to improve stability against gradient vanishing and overfitting, thereby enabling deeper training. These models were evaluated on the WHU, Massachusetts, and Jinghai District datasets, showing superior performance compared to the original MA-FCN. Specifically, Residual Attention MA-FCN outperformed MA-FCN and Attention MA-FCN by 3.6% and 0.92% on the WHU dataset, and by 5.51% and 0.91% on the Massachusetts dataset in terms of the Intersection Over Union (IOU) metric. Additionally, Residual Attention MA-FCN surpassed MA-FCN, Attention MA-FCN, Mask-RCNN, and U-Net models on the Jinghai District dataset. Due to the significance of building footprint extraction in various applications, the results of this study indicates that the proposed methods are more accurate than the MA-FCN model with better performances in IOU and F1-score metrics.</p>\",\"PeriodicalId\":17510,\"journal\":{\"name\":\"Journal of the Indian Society of Remote Sensing\",\"volume\":\"214 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Indian Society of Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s12524-024-01961-8\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Indian Society of Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12524-024-01961-8","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Building Footprint Extraction from Remote Sensing Images with Residual Attention Multi-Scale Aggregation Fully Convolutional Network
Building footprint extraction is crucial for various applications, including disaster management, change detection, and 3D modeling. Satellite and aerial images, when combined with deep learning techniques, offer an effective means for this task. The Multi-scale Aggregation Fully Convolutional Network (MA-FCN) is an encoder-decoder model that emphasizes scale information, producing the final segmentation map by concatenating four feature maps from different stages of the decoder. To enhance segmentation accuracy, we propose two novel deep learning models: Attention MA-FCN and Residual Attention MA-FCN. Attention MA-FCN incorporates attention gates in the skip connections to emphasize relevant features, directing the model’s focus to essential areas. Residual Attention MA-FCN further integrates residual blocks into the architecture, using both attention mechanisms and residual blocks to improve stability against gradient vanishing and overfitting, thereby enabling deeper training. These models were evaluated on the WHU, Massachusetts, and Jinghai District datasets, showing superior performance compared to the original MA-FCN. Specifically, Residual Attention MA-FCN outperformed MA-FCN and Attention MA-FCN by 3.6% and 0.92% on the WHU dataset, and by 5.51% and 0.91% on the Massachusetts dataset in terms of the Intersection Over Union (IOU) metric. Additionally, Residual Attention MA-FCN surpassed MA-FCN, Attention MA-FCN, Mask-RCNN, and U-Net models on the Jinghai District dataset. Due to the significance of building footprint extraction in various applications, the results of this study indicates that the proposed methods are more accurate than the MA-FCN model with better performances in IOU and F1-score metrics.
期刊介绍:
The aims and scope of the Journal of the Indian Society of Remote Sensing are to help towards advancement, dissemination and application of the knowledge of Remote Sensing technology, which is deemed to include photo interpretation, photogrammetry, aerial photography, image processing, and other related technologies in the field of survey, planning and management of natural resources and other areas of application where the technology is considered to be appropriate, to promote interaction among all persons, bodies, institutions (private and/or state-owned) and industries interested in achieving advancement, dissemination and application of the technology, to encourage and undertake research in remote sensing and related technologies and to undertake and execute all acts which shall promote all or any of the aims and objectives of the Indian Society of Remote Sensing.