Spatially explicit accuracy assessment of deep learning-based, fine-resolution built-up land data in the United States.

IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences International Journal of Applied Earth Observation and Geoinformation Pub Date : 2023-09-01 Epub Date: 2023-08-28 DOI:10.1016/j.jag.2023.103469
Johannes H Uhl, Stefan Leyk
{"title":"Spatially explicit accuracy assessment of deep learning-based, fine-resolution built-up land data in the United States.","authors":"Johannes H Uhl, Stefan Leyk","doi":"10.1016/j.jag.2023.103469","DOIUrl":null,"url":null,"abstract":"<p><p>Geospatial datasets derived from remote sensing data by means of machine learning methods are often based on probabilistic outputs of abstract nature, which are difficult to translate into interpretable measures. For example, the Global Human Settlement Layer GHS-BUILT-S2 product reports the probability of the presence of built-up areas in 2018 in a global 10 m × 10 m grid. However, practitioners typically require interpretable measures such as binary surfaces indicating the presence or absence of built-up areas or estimates of sub-pixel built-up surface fractions. Herein, we assess the relationship between the built-up probability in GHS-BUILT-S2 and reference built-up surface fractions derived from a highly reliable reference database for several regions in the United States. Furthermore, we identify a binarization threshold using an agreement maximization method that creates binary built-up land data from these built-up probabilities. These binary surfaces are input to a spatially explicit, scale-sensitive accuracy assessment which includes the use of a novel, visual-analytical tool which we call focal precision-recall signature plots. Our analysis reveals that a threshold of 0.5 applied to GHS-BUILT-S2 maximizes the agreement with binarized built-up land data derived from the reference built-up area fraction. We find high levels of accuracy (i.e., county-level F-1 scores of almost 0.8 on average) in the derived built-up areas, and consistently high accuracy along the rural-urban gradient in our study area. These results reveal considerable accuracy improvements in human settlement models based on Sentinel-2 data and deep learning, as compared to earlier, Landsat-based versions of the Global Human Settlement Layer.</p>","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"123 1","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10653213/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Earth Observation and Geoinformation","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1016/j.jag.2023.103469","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/8/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 1

Abstract

Geospatial datasets derived from remote sensing data by means of machine learning methods are often based on probabilistic outputs of abstract nature, which are difficult to translate into interpretable measures. For example, the Global Human Settlement Layer GHS-BUILT-S2 product reports the probability of the presence of built-up areas in 2018 in a global 10 m × 10 m grid. However, practitioners typically require interpretable measures such as binary surfaces indicating the presence or absence of built-up areas or estimates of sub-pixel built-up surface fractions. Herein, we assess the relationship between the built-up probability in GHS-BUILT-S2 and reference built-up surface fractions derived from a highly reliable reference database for several regions in the United States. Furthermore, we identify a binarization threshold using an agreement maximization method that creates binary built-up land data from these built-up probabilities. These binary surfaces are input to a spatially explicit, scale-sensitive accuracy assessment which includes the use of a novel, visual-analytical tool which we call focal precision-recall signature plots. Our analysis reveals that a threshold of 0.5 applied to GHS-BUILT-S2 maximizes the agreement with binarized built-up land data derived from the reference built-up area fraction. We find high levels of accuracy (i.e., county-level F-1 scores of almost 0.8 on average) in the derived built-up areas, and consistently high accuracy along the rural-urban gradient in our study area. These results reveal considerable accuracy improvements in human settlement models based on Sentinel-2 data and deep learning, as compared to earlier, Landsat-based versions of the Global Human Settlement Layer.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习的美国精细分辨率建筑用地数据的空间显式精度评估。
利用机器学习方法从遥感数据中获得的地理空间数据集往往基于抽象性质的概率输出,难以转化为可解释的度量。例如,全球人类住区层GHS-BUILT-S2产品报告了2018年全球10米× 10米网格中建成区存在的概率。然而,从业者通常需要可解释的测量,如二元表面,表明建成区的存在或不存在,或亚像素建成区表面分数的估计。在此,我们评估了GHS-BUILT-S2中堆积概率与参考堆积地表分数之间的关系,这些参考地表分数来自于美国几个地区的一个高度可靠的参考数据库。此外,我们使用协议最大化方法确定二值化阈值,该方法从这些累积概率中创建二元累积土地数据。这些二元曲面被输入到一个空间显式的、尺度敏感的精度评估中,其中包括使用一种新颖的视觉分析工具,我们称之为焦点精度召回签名图。我们的分析表明,GHS-BUILT-S2的阈值为0.5,与参考建成区分数得到的二值化建成区数据的一致性最大。我们发现,在衍生的建成区中,准确率很高(即,县级F-1得分平均接近0.8),并且在我们的研究区域中,沿着城乡梯度,准确率始终很高。这些结果表明,与早期基于landsat的全球人类住区层版本相比,基于Sentinel-2数据和深度学习的人类住区模型的准确性有很大提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
10.20
自引率
8.00%
发文量
49
审稿时长
7.2 months
期刊介绍: The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.
期刊最新文献
Automatic seismic source modeling of InSAR displacements HCPNet: Learning discriminative prototypes for few-shot remote sensing image scene classification 3D building similarity for a random single-view-image pair based on a local 3D shape A polarization-spectrum fusion framework based on multiscale transform and generative adversarial network for improving water and different vegetation distinguishability Spatially explicit accuracy assessment of deep learning-based, fine-resolution built-up land data in the United States.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1