Combining readily available population and land cover maps to generate non-residential built-up labels to train Sentinel-2 image segmentation models

Diogo Duarte , Cidália C. Fonte
{"title":"Combining readily available population and land cover maps to generate non-residential built-up labels to train Sentinel-2 image segmentation models","authors":"Diogo Duarte ,&nbsp;Cidália C. Fonte","doi":"10.1016/j.jag.2024.104272","DOIUrl":null,"url":null,"abstract":"<div><div>The localization of non-residential buildings over wide geographical areas is used as input within several contexts such as disaster management, regional and national planning, policy making and evaluation, among others. While the built-up environment has been continuously and globally mapped, given the efforts on producing synoptic land cover information; little attention has been given to the land use component of such built-up. This is due to, for example, difficulties in distinguishing built-up land use in non-commercial satellite imagery (e.g., Sentinel-2, with spatial resolution of up to 10 m), difficulties in collecting training data for supervised classification approaches, and the fact that variations in features of the built-up environment not always translate to a specific land use. This is even more critical when considering nadir viewing satellite or aerial imagery. However, map producers have been addressing this issue. For example, the Copernicus program (European Commission), through their pan-European CORINE Land Cover (CLC), and Urban Atlas restricted to several European metropolitan areas, have been making available land use information of the built-up cover, with 6-year intervals. The Global Human Settlement Layer (Copernicus program) has been providing built-up land use information by distinguishing residential from non-residential built-up since 2023 (GHSL_NRES). Currently these are also provided with a time interval of 5 years. National map producers often provide this information but usually with an interval between editions of several years. In this paper we combine readily available population counts and land cover maps to generate non-residential training labels that can be used to train a Sentinel-2 image segmentation model capable of distinguishing non-residential built-up from the remaining built-up. Leveraging two publicly available datasets, population counts (WorldPop) and built-up land cover (ESA WorldCover), allowed to produce training data from which an image segmentation model was able to learn relevant features to distinguish non-residential areas from other built-up in Sentinel-2 images. The results within a study area of 4 Sentinel-2 tiles shown that it improves the detection of non-residential built-up areas when comparing with CLC and GHSL_NRES (F1-score of 32 %, 25 % and 29 %, respectively), which are the products providing pan-European information regarding the built-up land use. These results indicate that the combination of publicly available geospatial datasets may be used to produce higher quality geospatial information.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"135 ","pages":"Article 104272"},"PeriodicalIF":7.6000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224006289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0

Abstract

The localization of non-residential buildings over wide geographical areas is used as input within several contexts such as disaster management, regional and national planning, policy making and evaluation, among others. While the built-up environment has been continuously and globally mapped, given the efforts on producing synoptic land cover information; little attention has been given to the land use component of such built-up. This is due to, for example, difficulties in distinguishing built-up land use in non-commercial satellite imagery (e.g., Sentinel-2, with spatial resolution of up to 10 m), difficulties in collecting training data for supervised classification approaches, and the fact that variations in features of the built-up environment not always translate to a specific land use. This is even more critical when considering nadir viewing satellite or aerial imagery. However, map producers have been addressing this issue. For example, the Copernicus program (European Commission), through their pan-European CORINE Land Cover (CLC), and Urban Atlas restricted to several European metropolitan areas, have been making available land use information of the built-up cover, with 6-year intervals. The Global Human Settlement Layer (Copernicus program) has been providing built-up land use information by distinguishing residential from non-residential built-up since 2023 (GHSL_NRES). Currently these are also provided with a time interval of 5 years. National map producers often provide this information but usually with an interval between editions of several years. In this paper we combine readily available population counts and land cover maps to generate non-residential training labels that can be used to train a Sentinel-2 image segmentation model capable of distinguishing non-residential built-up from the remaining built-up. Leveraging two publicly available datasets, population counts (WorldPop) and built-up land cover (ESA WorldCover), allowed to produce training data from which an image segmentation model was able to learn relevant features to distinguish non-residential areas from other built-up in Sentinel-2 images. The results within a study area of 4 Sentinel-2 tiles shown that it improves the detection of non-residential built-up areas when comparing with CLC and GHSL_NRES (F1-score of 32 %, 25 % and 29 %, respectively), which are the products providing pan-European information regarding the built-up land use. These results indicate that the combination of publicly available geospatial datasets may be used to produce higher quality geospatial information.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
结合现成的人口和土地覆盖图生成非住宅建筑标签,以训练哨兵-2 图像分割模型
在灾害管理、区域和国家规划、政策制定和评估等多个方面,对广阔地理区域内的非住宅建筑进行定位都是一种投入。虽然在制作综合土地覆被信息方面做出了努力,对建成区环境进行了持续的全球测绘,但对建成区的土地利用部分却关注甚少。其原因包括:在非商业卫星图像(如空间分辨率高达 10 米的哨兵-2 号)中难以区分建成区的土地利用;难以收集用于监督分类方法的训练数据;以及建成区环境特征的变化并不总能转化为特定的土地利用。当考虑从天顶观测卫星或航空图像时,这一点甚至更为关键。不过,地图制作者一直在解决这个问题。例如,哥白尼计划(欧洲委员会)通过其泛欧洲 CORINE 土地覆盖(CLC)和仅限于欧洲几个大都市地区的城市地图,以 6 年的间隔提供了建筑覆盖的土地利用信息。自 2023 年以来,全球人类住区图层(哥白尼计划)一直在通过区分住宅和非住宅建筑(GHSL_NRES)提供建筑用地信息。目前,这些信息的时间间隔也是 5 年。国家地图编制者通常会提供此类信息,但其版本之间的时间间隔通常为数年。在本文中,我们将现成的人口数量和土地覆被图结合起来,生成非住宅区训练标签,用于训练能够区分非住宅区建筑群和其余建筑群的哨兵-2 图像分割模型。利用两个公开数据集--人口数量(WorldPop)和建成区土地覆盖(ESA WorldCover)--可以生成训练数据,图像分割模型能够从中学习相关特征,以区分哨兵-2 图像中的非居民区和其他建成区。在 4 个哨兵-2 瓦片研究区域内的结果表明,与 CLC 和 GHSL_NRES 相比(F1 分数分别为 32%、25% 和 29%),它提高了对非住宅建筑区的检测能力,而 CLC 和 GHSL_NRES 是提供泛欧建筑用地信息的产品。这些结果表明,将可公开获取的地理空间数据集结合起来,可以生成更高质量的地理空间信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International journal of applied earth observation and geoinformation : ITC journal
International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences
CiteScore
12.00
自引率
0.00%
发文量
0
审稿时长
77 days
期刊介绍: The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.
期刊最新文献
Combining readily available population and land cover maps to generate non-residential built-up labels to train Sentinel-2 image segmentation models An intercomparison of national and global land use and land cover products for Fiji The illusion of success: Test set disproportion causes inflated accuracy in remote sensing mapping research Multispectral imaging and terrestrial laser scanning for the detection of drought-induced paraheliotropic leaf movement in soybean DeLA: An extremely faster network with decoupled local aggregation for large scale point cloud learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1