{"title":"Classification of buildings from VHR satellite images using ensemble of U-Net and ResNet","authors":"S. Vasavi, Hema Sri Somagani, Yarlagadda Sai","doi":"10.1016/j.ejrs.2023.11.008","DOIUrl":null,"url":null,"abstract":"<div><p>The urbanization rate of India is 35.9 % by 2022 reports. In this 45.23 % of urbanization is happening in Maharashtra and it is the third most urbanized state of India after Tamil Nadu and Kerala. In metropolitan areas, the classification of land cover from satellite images has been the focus of remote sensing over the years. Due to complex architecture and a lack of labeled data, classifying buildings in metropolitan areas from very high resolution (VHR) satellite imagery is challenging. Traditional approaches for building classification include hand-crafted features and transfer learning methods. These methods often struggle with the variability in building shapes, orientation, and viewpoint, leading to low accuracy in densely populated urban areas and limited performance when dealing with high- resolution satellite images. A deep-learning based approach for semantic segmentation using U-Net with a backbone of ResNet-34 is proposed for building classification. Urban area Dataset with Images of 0.5 m resolution is prepared from SASPlanet. One hot Encoding is applied for classifying buildings. U-Net is trained with encoded data. The proposed model is evaluated on the Indian dataset, specifically, the urban areas of Nashik, Maharashtra state and the accuracy obtained for the classification dataset is 60 % and the accuracy of the building detection is about 85 %. Change detection is calculated from bi-temporal images. The GIS maps are updated to detect changes in buildings, represented by different colors to distinguish newly constructed buildings, existing structures and demolished ones.</p></div>","PeriodicalId":48539,"journal":{"name":"Egyptian Journal of Remote Sensing and Space Sciences","volume":"26 4","pages":"Pages 937-953"},"PeriodicalIF":3.7000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110982323000960/pdfft?md5=a76e7cd6bd6e8d8ffed83cfbb5f8197e&pid=1-s2.0-S1110982323000960-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Journal of Remote Sensing and Space Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110982323000960","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The urbanization rate of India is 35.9 % by 2022 reports. In this 45.23 % of urbanization is happening in Maharashtra and it is the third most urbanized state of India after Tamil Nadu and Kerala. In metropolitan areas, the classification of land cover from satellite images has been the focus of remote sensing over the years. Due to complex architecture and a lack of labeled data, classifying buildings in metropolitan areas from very high resolution (VHR) satellite imagery is challenging. Traditional approaches for building classification include hand-crafted features and transfer learning methods. These methods often struggle with the variability in building shapes, orientation, and viewpoint, leading to low accuracy in densely populated urban areas and limited performance when dealing with high- resolution satellite images. A deep-learning based approach for semantic segmentation using U-Net with a backbone of ResNet-34 is proposed for building classification. Urban area Dataset with Images of 0.5 m resolution is prepared from SASPlanet. One hot Encoding is applied for classifying buildings. U-Net is trained with encoded data. The proposed model is evaluated on the Indian dataset, specifically, the urban areas of Nashik, Maharashtra state and the accuracy obtained for the classification dataset is 60 % and the accuracy of the building detection is about 85 %. Change detection is calculated from bi-temporal images. The GIS maps are updated to detect changes in buildings, represented by different colors to distinguish newly constructed buildings, existing structures and demolished ones.
期刊介绍:
The Egyptian Journal of Remote Sensing and Space Sciences (EJRS) encompasses a comprehensive range of topics within Remote Sensing, Geographic Information Systems (GIS), planetary geology, and space technology development, including theories, applications, and modeling. EJRS aims to disseminate high-quality, peer-reviewed research focusing on the advancement of remote sensing and GIS technologies and their practical applications for effective planning, sustainable development, and environmental resource conservation. The journal particularly welcomes innovative papers with broad scientific appeal.