{"title":"建筑物检测使用密集的关注网络从激光雷达和图像数据","authors":"N. Ghasemian, Jinfei Wang, Mohammad Reza Najafi","doi":"10.1139/geomat-2021-0013","DOIUrl":null,"url":null,"abstract":"Accurate building mapping using remote sensing data is challenging because of the complexity of building structures, particularly in populated cities. LiDAR data are widely used for building extraction because they provide height information, which can help distinguish buildings from other tall objects. However, tall trees and bridges in the vicinity of buildings can limit the application of LiDAR data, particularly in urban areas. Combining LiDAR and orthoimages can help in such situations, because orthoimages can provide information on the physical properties of objects, such as reflectance characteristics. One efficient way to combine these two data sources is to use convolutional neural networks (CNN). This study proposes a CNN architecture based on dense attention blocks for building detection in southern Toronto and Massachusetts. The stacking of information from multiple previous layers was inspired by dense attention networks (DANs). DAN blocks consist of batch normalization, convolution, dropout, and average pooling layers to extract high- and low-level features. Compared with two other widely used deep learning techniques, VGG16 and Resnet50, the proposed method has a simpler architecture and converges faster with higher accuracy. In addition, a comparison with the two other state-of-the-art deep learning methods, including U-net and ResUnet, showed that our proposed technique could achieve a higher F1-score, of 0.71, compared with 0.42 for U-net and 0.49 for ResUnet.","PeriodicalId":35938,"journal":{"name":"Geomatica","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Building detection using a dense attention network from LiDAR and image data\",\"authors\":\"N. Ghasemian, Jinfei Wang, Mohammad Reza Najafi\",\"doi\":\"10.1139/geomat-2021-0013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate building mapping using remote sensing data is challenging because of the complexity of building structures, particularly in populated cities. LiDAR data are widely used for building extraction because they provide height information, which can help distinguish buildings from other tall objects. However, tall trees and bridges in the vicinity of buildings can limit the application of LiDAR data, particularly in urban areas. Combining LiDAR and orthoimages can help in such situations, because orthoimages can provide information on the physical properties of objects, such as reflectance characteristics. One efficient way to combine these two data sources is to use convolutional neural networks (CNN). This study proposes a CNN architecture based on dense attention blocks for building detection in southern Toronto and Massachusetts. The stacking of information from multiple previous layers was inspired by dense attention networks (DANs). DAN blocks consist of batch normalization, convolution, dropout, and average pooling layers to extract high- and low-level features. Compared with two other widely used deep learning techniques, VGG16 and Resnet50, the proposed method has a simpler architecture and converges faster with higher accuracy. In addition, a comparison with the two other state-of-the-art deep learning methods, including U-net and ResUnet, showed that our proposed technique could achieve a higher F1-score, of 0.71, compared with 0.42 for U-net and 0.49 for ResUnet.\",\"PeriodicalId\":35938,\"journal\":{\"name\":\"Geomatica\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geomatica\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1139/geomat-2021-0013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geomatica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1139/geomat-2021-0013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
Building detection using a dense attention network from LiDAR and image data
Accurate building mapping using remote sensing data is challenging because of the complexity of building structures, particularly in populated cities. LiDAR data are widely used for building extraction because they provide height information, which can help distinguish buildings from other tall objects. However, tall trees and bridges in the vicinity of buildings can limit the application of LiDAR data, particularly in urban areas. Combining LiDAR and orthoimages can help in such situations, because orthoimages can provide information on the physical properties of objects, such as reflectance characteristics. One efficient way to combine these two data sources is to use convolutional neural networks (CNN). This study proposes a CNN architecture based on dense attention blocks for building detection in southern Toronto and Massachusetts. The stacking of information from multiple previous layers was inspired by dense attention networks (DANs). DAN blocks consist of batch normalization, convolution, dropout, and average pooling layers to extract high- and low-level features. Compared with two other widely used deep learning techniques, VGG16 and Resnet50, the proposed method has a simpler architecture and converges faster with higher accuracy. In addition, a comparison with the two other state-of-the-art deep learning methods, including U-net and ResUnet, showed that our proposed technique could achieve a higher F1-score, of 0.71, compared with 0.42 for U-net and 0.49 for ResUnet.
GeomaticaSocial Sciences-Geography, Planning and Development
CiteScore
1.50
自引率
0.00%
发文量
7
期刊介绍:
Geomatica (formerly CISM Journal ACSGC), is the official quarterly publication of the Canadian Institute of Geomatics. It is the oldest surveying and mapping publication in Canada and was first published in 1922 as the Journal of the Dominion Land Surveyors’ Association. Geomatica is dedicated to the dissemination of information on technical advances in the geomatics sciences. The internationally respected publication contains special features, notices of conferences, calendar of event, articles on personalities, review of current books, industry news and new products, all of which keep the publication lively and informative.