首页 > 最新文献

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science最新文献

英文 中文
Weighted Multiple Point Cloud Fusion 加权多点云融合
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-09-12 DOI: 10.1007/s41064-024-00310-1
Kwasi Nyarko Poku-Agyemang, Alexander Reiterer

Multiple viewpoint 3D reconstruction has been used in recent years to create accurate complete scenes and objects used for various applications. This is to overcome limitations of single viewpoint 3D digital imaging such as occlusion within the scene during the reconstruction process. In this paper, we propose a weighted point cloud fusion process using both local and global spatial information of the point clouds to fuse them together. The process aims to minimize duplication and remove noise while maintaining a consistent level of details using spatial information from point clouds to compute a weight to fuse them. The algorithm improves the overall accuracy of the fused point cloud while maintaining a similar degree of coverage comparable with state-of-the-art point cloud fusion algorithms.

近年来,多视点三维重建技术已被用于创建精确完整的场景和对象,并被广泛应用于各种领域。这是为了克服单视点三维数字成像的局限性,如重建过程中场景内的遮挡。在本文中,我们提出了一种加权点云融合流程,利用点云的局部和全局空间信息将它们融合在一起。该流程旨在利用点云的空间信息来计算融合点云的权重,从而最大限度地减少重复和去除噪音,同时保持细节的一致性。该算法提高了融合点云的整体精度,同时保持了与最先进的点云融合算法类似的覆盖程度。
{"title":"Weighted Multiple Point Cloud Fusion","authors":"Kwasi Nyarko Poku-Agyemang, Alexander Reiterer","doi":"10.1007/s41064-024-00310-1","DOIUrl":"https://doi.org/10.1007/s41064-024-00310-1","url":null,"abstract":"<p>Multiple viewpoint 3D reconstruction has been used in recent years to create accurate complete scenes and objects used for various applications. This is to overcome limitations of single viewpoint 3D digital imaging such as occlusion within the scene during the reconstruction process. In this paper, we propose a weighted point cloud fusion process using both local and global spatial information of the point clouds to fuse them together. The process aims to minimize duplication and remove noise while maintaining a consistent level of details using spatial information from point clouds to compute a weight to fuse them. The algorithm improves the overall accuracy of the fused point cloud while maintaining a similar degree of coverage comparable with state-of-the-art point cloud fusion algorithms.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142200717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stripe Error Correction for Landsat-7 Using Deep Learning 利用深度学习对 Landsat-7 进行条纹误差校正
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-29 DOI: 10.1007/s41064-024-00306-x
Hilal Adıyaman, Yunus Emre Varul, Tolga Bakırman, Bülent Bayram

Long-term time series satellite imagery became highly essential for analyzing earth cycles such as global warming, climate change, and urbanization. Landsat‑7 satellite imagery plays a key role in this domain since it provides open-access data with expansive coverage and consistent temporal resolution for more than two decades. This paper addresses the challenge of stripe errors induced by Scan Line Corrector sensor malfunction in Landsat‑7 ETM+ satellite imagery, resulting in data loss and degradation. To overcome this problem, we propose a Generative Adversarial Networks approach to fill the gaps in the Landsat‑7 ETM+ panchromatic images. First, we introduce the YTU_STRIPE dataset, comprising Landsat‑8 OLI panchromatic images with synthetically induced stripe errors, for model training and testing. Our results indicate sufficient performance of the Pix2Pix GAN for this purpose. We demonstrate the efficiency of our approach through systematic experimentation and evaluation using various accuracy metrics, including Peak Signal-to-Noise Ratio, Structural Similarity Index Measurement, Universal Image Quality Index, Correlation Coefficient, and Root Mean Square Error which were calculated as 38.5570, 0.9206, 0.7670, 0.7753 and 3.8212, respectively. Our findings suggest promising prospects for utilizing synthetic imagery from Landsat‑8 OLI to mitigate stripe errors in Landsat‑7 ETM+ SLC-off imagery, thereby enhancing image reconstruction efforts. The datasets and model weights generated in this study are publicly available for further research and development: https://github.com/ynsemrevrl/eliminating-stripe-errors.

长期的时间序列卫星图像对于分析全球变暖、气候变化和城市化等地球周期非常重要。Landsat-7 卫星图像在这一领域发挥着关键作用,因为它提供了二十多年来覆盖范围广、时间分辨率一致的开放式数据。本文探讨了 Landsat-7 ETM+ 卫星图像中因扫描线校正器传感器故障而引起的条纹错误,从而导致数据丢失和质量下降的难题。为了克服这一问题,我们提出了一种生成对抗网络方法来填补 Landsat-7 ETM+ 全色图像中的空白。首先,我们引入了 YTU_STRIPE 数据集,该数据集由具有合成条纹误差的 Landsat-8 OLI 全色图像组成,用于模型训练和测试。我们的结果表明,Pix2Pix GAN 在这方面具有足够的性能。我们通过系统实验和使用各种精度指标(包括峰值信噪比、结构相似性指数测量、通用图像质量指数、相关系数和均方根误差)进行评估,证明了我们方法的效率,计算结果分别为 38.5570、0.9206、0.7670、0.7753 和 3.8212。我们的研究结果表明,利用来自 Landsat-8 OLI 的合成图像来减少 Landsat-7 ETM+ SLC-off 图像中的条纹误差,从而提高图像重建工作的效率,前景十分广阔。本研究生成的数据集和模型权重可公开用于进一步的研究和开发:https://github.com/ynsemrevrl/eliminating-stripe-errors。
{"title":"Stripe Error Correction for Landsat-7 Using Deep Learning","authors":"Hilal Adıyaman, Yunus Emre Varul, Tolga Bakırman, Bülent Bayram","doi":"10.1007/s41064-024-00306-x","DOIUrl":"https://doi.org/10.1007/s41064-024-00306-x","url":null,"abstract":"<p>Long-term time series satellite imagery became highly essential for analyzing earth cycles such as global warming, climate change, and urbanization. Landsat‑7 satellite imagery plays a key role in this domain since it provides open-access data with expansive coverage and consistent temporal resolution for more than two decades. This paper addresses the challenge of stripe errors induced by Scan Line Corrector sensor malfunction in Landsat‑7 ETM+ satellite imagery, resulting in data loss and degradation. To overcome this problem, we propose a Generative Adversarial Networks approach to fill the gaps in the Landsat‑7 ETM+ panchromatic images. First, we introduce the YTU_STRIPE dataset, comprising Landsat‑8 OLI panchromatic images with synthetically induced stripe errors, for model training and testing. Our results indicate sufficient performance of the Pix2Pix GAN for this purpose. We demonstrate the efficiency of our approach through systematic experimentation and evaluation using various accuracy metrics, including Peak Signal-to-Noise Ratio, Structural Similarity Index Measurement, Universal Image Quality Index, Correlation Coefficient, and Root Mean Square Error which were calculated as 38.5570, 0.9206, 0.7670, 0.7753 and 3.8212, respectively. Our findings suggest promising prospects for utilizing synthetic imagery from Landsat‑8 OLI to mitigate stripe errors in Landsat‑7 ETM+ SLC-off imagery, thereby enhancing image reconstruction efforts. The datasets and model weights generated in this study are publicly available for further research and development: https://github.com/ynsemrevrl/eliminating-stripe-errors.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142200722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EnhancedNet, an End-to-End Network for Dense Disparity Estimation and its Application to Aerial Images 增强型网络(EnhancedNet)--用于密集差异估计的端到端网络及其在航空图像中的应用
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-28 DOI: 10.1007/s41064-024-00307-w
Junhua Kang, Lin Chen, Christian Heipke

Recent developments in deep learning technology have boosted the performance of dense stereo reconstruction. However, the state-of-the-art deep learning-based stereo matching methods are mainly trained using close-range synthetic images. Consequently, the application of these methods in aerial photogrammetry and remote sensing is currently far from straightforward. In this paper, we propose a new disparity estimation network for stereo matching and investigate its generalization abilities in regard to aerial images. First, we propose an end-to-end deep learning network for stereo matching, regularized by disparity gradients, which includes a residual cost volume and a reconstruction error volume in a refinement module, and multiple losses. In order to investigate the influence of the multiple losses, a comprehensive analysis is presented. Second, based on this network trained with synthetic close-range data, we propose a new pipeline for matching high-resolution aerial imagery. The experimental results show that the proposed network improves the disparity accuracy by up to 40% in terms of errors larger than 1 px compared to results when not including the refinement network, especially in areas containing detailed small objects. In addition, in qualitative and quantitative experiments, we are able to show that our model, pre-trained on a synthetic stereo dataset, achieves very competitive sub-pixel geometric accuracy on aerial images. These results confirm that the domain gap between synthetic close-range and real aerial images can be satisfactorily bridged using the proposed new deep learning method for dense image matching.

深度学习技术的最新发展提升了密集立体重建的性能。然而,基于深度学习的最新立体匹配方法主要是利用近距离合成图像进行训练的。因此,这些方法目前在航空摄影测量和遥感中的应用还不够直接。在本文中,我们提出了一种新的用于立体匹配的差异估计网络,并研究了其在航空图像方面的泛化能力。首先,我们提出了一种用于立体匹配的端到端深度学习网络,该网络由差距梯度正则化,包括细化模块中的残差成本卷和重建误差卷,以及多重损失。为了研究多重损失的影响,本文进行了综合分析。其次,基于这个用合成近距离数据训练的网络,我们提出了一种新的高分辨率航空图像匹配管道。实验结果表明,与不包含细化网络的结果相比,在误差大于 1 px 的情况下,所提出的网络可将差异精度提高 40%,尤其是在包含细节小物体的区域。此外,在定性和定量实验中,我们还证明了我们在合成立体数据集上预先训练的模型在航空图像上实现了极具竞争力的亚像素几何精度。这些结果证实,利用所提出的新深度学习方法进行密集图像匹配,可以令人满意地缩小合成近距离图像与真实航空图像之间的领域差距。
{"title":"EnhancedNet, an End-to-End Network for Dense Disparity Estimation and its Application to Aerial Images","authors":"Junhua Kang, Lin Chen, Christian Heipke","doi":"10.1007/s41064-024-00307-w","DOIUrl":"https://doi.org/10.1007/s41064-024-00307-w","url":null,"abstract":"<p>Recent developments in deep learning technology have boosted the performance of dense stereo reconstruction. However, the state-of-the-art deep learning-based stereo matching methods are mainly trained using close-range synthetic images. Consequently, the application of these methods in aerial photogrammetry and remote sensing is currently far from straightforward. In this paper, we propose a new disparity estimation network for stereo matching and investigate its generalization abilities in regard to aerial images. First, we propose an end-to-end deep learning network for stereo matching, regularized by disparity gradients, which includes a residual cost volume and a reconstruction error volume in a refinement module, and multiple losses. In order to investigate the influence of the multiple losses, a comprehensive analysis is presented. Second, based on this network trained with synthetic close-range data, we propose a new pipeline for matching high-resolution aerial imagery. The experimental results show that the proposed network improves the disparity accuracy by up to 40% in terms of errors larger than 1 px compared to results when not including the refinement network, especially in areas containing detailed small objects. In addition, in qualitative and quantitative experiments, we are able to show that our model, pre-trained on a synthetic stereo dataset, achieves very competitive sub-pixel geometric accuracy on aerial images. These results confirm that the domain gap between synthetic close-range and real aerial images can be satisfactorily bridged using the proposed new deep learning method for dense image matching.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142200718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fresh Concrete Properties from Stereoscopic Image Sequences 从立体图像序列中获取新鲜混凝土特性
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-26 DOI: 10.1007/s41064-024-00303-0
Max Meyer, Amadeus Langer, Max Mehltretter, Dries Beyer, Max Coenen, Tobias Schack, Michael Haist, Christian Heipke

Increasing the degree of digitization and automation in concrete production can make a decisive contribution to reducing the associated (text{CO}_{2}) emissions. This paper presents a method which predicts the properties of fresh concrete during the mixing process on the basis of stereoscopic image sequences of the moving concrete and mix design information or a variation of these. A Convolutional Neural Network (CNN) is used for the prediction, which receives the images supported by information about the mix design as input. In addition, the network receives temporal information in the form of the time difference between image acquisition and the point in time for which the concrete properties are to be predicted. During training, the times at which the reference values were captured are used for the latter. With this temporal information, the network implicitly learns the time-dependent behavior of the concrete properties. The network predicts the slump flow diameter, the yield stress and the plastic viscosity. The time-dependent prediction opens up the possibility of forecasting the temporal development of the fresh concrete properties during mixing. This is a significant advantage for the concrete industry, as countermeasures can then be taken in a timely manner, if the properties deviate from the desired ones. In various experiments it is shown that both the stereoscopic observations and the mix design information contain valuable information for the time-dependent prediction of the fresh concrete properties.

提高混凝土生产的数字化和自动化程度可以为减少相关排放做出决定性贡献。本文介绍了一种方法,该方法可根据移动混凝土的立体图像序列和混合设计信息或这些信息的变体,预测搅拌过程中新拌混凝土的特性。预测使用了卷积神经网络 (CNN),该网络接收由混合设计信息支持的图像作为输入。此外,该网络还能接收时间信息,即图像采集与预测混凝土特性的时间点之间的时间差。在训练过程中,参考值的采集时间被用于后者。有了这些时间信息,网络就能隐式地学习随时间变化的混凝土特性行为。该网络可预测坍落度流动直径、屈服应力和塑性粘度。随时间变化的预测为预报搅拌过程中新拌混凝土性能的随时间变化提供了可能。这对混凝土行业来说是一个重大优势,因为如果性能偏离预期,就可以及时采取应对措施。各种实验表明,立体观测和混合设计信息都包含了对新拌混凝土性能随时间变化进行预测的宝贵信息。
{"title":"Fresh Concrete Properties from Stereoscopic Image Sequences","authors":"Max Meyer, Amadeus Langer, Max Mehltretter, Dries Beyer, Max Coenen, Tobias Schack, Michael Haist, Christian Heipke","doi":"10.1007/s41064-024-00303-0","DOIUrl":"https://doi.org/10.1007/s41064-024-00303-0","url":null,"abstract":"<p>Increasing the degree of digitization and automation in concrete production can make a decisive contribution to reducing the associated <span>(text{CO}_{2})</span> emissions. This paper presents a method which predicts the properties of fresh concrete during the mixing process on the basis of stereoscopic image sequences of the moving concrete and mix design information or a variation of these. A Convolutional Neural Network (CNN) is used for the prediction, which receives the images supported by information about the mix design as input. In addition, the network receives temporal information in the form of the time difference between image acquisition and the point in time for which the concrete properties are to be predicted. During training, the times at which the reference values were captured are used for the latter. With this temporal information, the network implicitly learns the time-dependent behavior of the concrete properties. The network predicts the slump flow diameter, the yield stress and the plastic viscosity. The time-dependent prediction opens up the possibility of forecasting the temporal development of the fresh concrete properties during mixing. This is a significant advantage for the concrete industry, as countermeasures can then be taken in a timely manner, if the properties deviate from the desired ones. In various experiments it is shown that both the stereoscopic observations and the mix design information contain valuable information for the time-dependent prediction of the fresh concrete properties.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142200719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Patterns and Trends in Urbanization and Land Use Efficiency Across the Philippines: A Comprehensive Analysis Using Global Earth Observation Data and SDG 11.3.1 Indicators 评估菲律宾各地城市化和土地使用效率的模式和趋势:利用全球地球观测数据和可持续发展目标 11.3.1 指标进行综合分析
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-13 DOI: 10.1007/s41064-024-00305-y
Jojene R. Santillan, Christian Heipke

Urbanization, a global phenomenon with profound implications for sustainable development, is a focal point of Sustainable Development Goal 11 (SDG 11). Aimed at fostering inclusive, resilient, and sustainable urbanization by 2030, SDG 11 emphasizes the importance of monitoring land use efficiency (LUE) through indicator 11.3.1. In the Philippines, urbanization has surged over recent decades. Despite its importance, research on urbanization and LUE has predominantly focused on the country’s national capital region (Metro Manila), while little to no attention is given to comprehensive investigations across different regions, provinces, cities, and municipalities of the country. Additionally, challenges in acquiring consistent spatial data, especially due to the Philippines’ archipelagic nature, have hindered comprehensive analysis. To address these gaps, this study conducts a thorough examination of urbanization patterns and LUE dynamics in the Philippines from 1975 to 2020, leveraging Global Human Settlement Layers (GHSL) data and secondary indicators associated with SDG 11.3.1. Our study examines spatial patterns and temporal trends in built-up area expansion, population growth, and LUE characteristics at both city and municipal levels. Among the major findings are the substantial growth in built-up areas and population across the country. We also found a shift in urban growth dynamics, with Metro Manila showing limited expansion in recent years while new urban growth emerges in other regions of the country. Our analysis of the spatiotemporal patterns of Land Consumption Rate (LCR) revealed three distinct evolutional phases: a growth phase between 1975–1990, followed by a decline phase between 1990–2005, and a resurgence phase from 2005–2020. Generally declining trends in LCR and Population Growth Rate (PGR) were evident, demonstrating the country’s direction towards efficient built-up land utilization. However, this efficiency coincides with overcrowding issues as revealed by additional indicators such as the Abstract Achieved Population Density in Expansion Areas (AAPDEA) and Marginal Land Consumption per New Inhabitant (MLCNI). We also analyzed the spatial patterns and temporal trends of LUE across the country and found distinct clusters of transitioning urban centers, densely inhabited metropolises, expanding metropolitan regions, and rapidly growing urban hubs. The study’s findings suggest the need for policy interventions that promote compact and sustainable urban development, equitable regional development, and measures to address overcrowding in urban areas. By aligning policies with the observed spatial and temporal trends, decision-makers can work towards achieving SDG 11, fostering inclusive, resilient, and sustainable urbanization in the Philippines.

城市化是一个对可持续发展具有深远影响的全球现象,是可持续发展目标 11(SDG 11)的一个焦点。可持续发展目标 11 旨在到 2030 年促进包容、有韧性和可持续的城市化,通过指标 11.3.1 强调了监测土地使用效率(LUE)的重要性。近几十年来,菲律宾的城市化进程迅猛发展。尽管城市化和土地使用效率非常重要,但有关城市化和土地使用效率的研究却主要集中在国家首都地区(大马尼拉),而对全国不同地区、省、市和直辖市的全面调查却几乎没有给予关注。此外,在获取一致的空间数据方面存在的挑战,尤其是菲律宾的群岛性质,也阻碍了综合分析的进行。为了弥补这些不足,本研究利用全球人类住区图层(GHSL)数据和与可持续发展目标 11.3.1 相关的二级指标,对 1975 年至 2020 年菲律宾的城市化模式和土地使用效率动态进行了全面研究。我们的研究考察了城市和市镇层面建成区扩张、人口增长和土地利用效率特征的空间模式和时间趋势。主要发现包括全国范围内建成区和人口的大幅增长。我们还发现城市增长动态发生了变化,近年来大马尼拉市的扩张有限,而全国其他地区则出现了新的城市增长。我们对土地消耗率(LCR)时空模式的分析表明了三个不同的演变阶段:1975-1990 年为增长阶段,1990-2005 年为下降阶段,2005-2020 年为恢复阶段。土地消耗率(LCR)和人口增长率(PGR)总体呈下降趋势,这表明该国正朝着高效利用建筑用地的方向发展。然而,这种效率与过度拥挤问题同时存在,扩展区人口密度(AAPDEA)和每新增居民边际土地消耗(MLCNI)等其他指标也揭示了这一问题。我们还分析了全国土地利用效率的空间模式和时间趋势,发现了明显的城市中心转型群、人口稠密的大都市、扩张中的大都市地区和快速发展的城市中心。研究结果表明,有必要采取政策干预措施,促进紧凑和可持续的城市发展、公平的区域发展以及解决城市地区过度拥挤问题的措施。通过使政策与观察到的空间和时间趋势相一致,决策者可以努力实现可持续发展目标 11,促进菲律宾的包容性、弹性和可持续城市化。
{"title":"Assessing Patterns and Trends in Urbanization and Land Use Efficiency Across the Philippines: A Comprehensive Analysis Using Global Earth Observation Data and SDG 11.3.1 Indicators","authors":"Jojene R. Santillan, Christian Heipke","doi":"10.1007/s41064-024-00305-y","DOIUrl":"https://doi.org/10.1007/s41064-024-00305-y","url":null,"abstract":"<p>Urbanization, a global phenomenon with profound implications for sustainable development, is a focal point of Sustainable Development Goal 11 (SDG 11). Aimed at fostering inclusive, resilient, and sustainable urbanization by 2030, SDG 11 emphasizes the importance of monitoring land use efficiency (LUE) through indicator 11.3.1. In the Philippines, urbanization has surged over recent decades. Despite its importance, research on urbanization and LUE has predominantly focused on the country’s national capital region (Metro Manila), while little to no attention is given to comprehensive investigations across different regions, provinces, cities, and municipalities of the country. Additionally, challenges in acquiring consistent spatial data, especially due to the Philippines’ archipelagic nature, have hindered comprehensive analysis. To address these gaps, this study conducts a thorough examination of urbanization patterns and LUE dynamics in the Philippines from 1975 to 2020, leveraging Global Human Settlement Layers (GHSL) data and secondary indicators associated with SDG 11.3.1. Our study examines spatial patterns and temporal trends in built-up area expansion, population growth, and LUE characteristics at both city and municipal levels. Among the major findings are the substantial growth in built-up areas and population across the country. We also found a shift in urban growth dynamics, with Metro Manila showing limited expansion in recent years while new urban growth emerges in other regions of the country. Our analysis of the spatiotemporal patterns of Land Consumption Rate (LCR) revealed three distinct evolutional phases: a growth phase between 1975–1990, followed by a decline phase between 1990–2005, and a resurgence phase from 2005–2020. Generally declining trends in LCR and Population Growth Rate (PGR) were evident, demonstrating the country’s direction towards efficient built-up land utilization. However, this efficiency coincides with overcrowding issues as revealed by additional indicators such as the Abstract Achieved Population Density in Expansion Areas (AAPDEA) and Marginal Land Consumption per New Inhabitant (MLCNI). We also analyzed the spatial patterns and temporal trends of LUE across the country and found distinct clusters of transitioning urban centers, densely inhabited metropolises, expanding metropolitan regions, and rapidly growing urban hubs. The study’s findings suggest the need for policy interventions that promote compact and sustainable urban development, equitable regional development, and measures to address overcrowding in urban areas. By aligning policies with the observed spatial and temporal trends, decision-makers can work towards achieving SDG 11, fostering inclusive, resilient, and sustainable urbanization in the Philippines.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142200720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer models for Land Cover Classification with Satellite Image Time Series 利用卫星图像时间序列进行土地覆盖分类的变换器模型
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-06 DOI: 10.1007/s41064-024-00299-7
Mirjana Voelsen, Franz Rottensteiner, Christian Heipke

In this paper we address the task of pixel-wise land cover (LC) classification using satellite image time series (SITS). For that purpose, we use a supervised deep learning model and focus on combining spatial and temporal features. Our method is based on the Swin Transformer and captures global temporal features by using self-attention and local spatial features by convolutions. We extend the architecture to receive multi-temporal input to generate one output label map for every input image. In our experiments we focus on the application of pixel-wise LC classification from Sentinel‑2 SITS over the whole area of Lower Saxony (Germany). The experiments with our new model show that by using convolutions for spatial feature extraction or a temporal weighting module in the skip connections the performance improves and is more stable. The combined usage of both adaptations results in the overall best performance although this improvement is only minimal. Compared to a fully convolutional neural network without any self-attention layers our model improves the results by 2.1% in the mean F1-Score on a corrected test dataset. Additionally, we investigate different types of temporal position encoding, which do not have a significant impact on the performance.

在本文中,我们利用卫星图像时间序列(SITS)解决了像素级土地覆盖(LC)分类任务。为此,我们使用了一个有监督的深度学习模型,并侧重于结合空间和时间特征。我们的方法以 Swin 变换器为基础,通过自我关注捕捉全局时间特征,并通过卷积捕捉局部空间特征。我们对架构进行了扩展,以接收多时态输入,为每张输入图像生成一个输出标签图。在实验中,我们重点应用了下萨克森州(德国)整个地区哨兵-2 SITS 的像素级 LC 分类。使用我们的新模型进行的实验表明,通过使用卷积进行空间特征提取或在跳转连接中使用时间加权模块,可以提高性能并使其更加稳定。综合使用这两种适配方法可获得最佳的整体性能,尽管这种改进微乎其微。与没有任何自我注意层的完全卷积神经网络相比,我们的模型在校正测试数据集上的平均 F1 分数提高了 2.1%。此外,我们还研究了不同类型的时间位置编码,但这些编码对性能并无显著影响。
{"title":"Transformer models for Land Cover Classification with Satellite Image Time Series","authors":"Mirjana Voelsen, Franz Rottensteiner, Christian Heipke","doi":"10.1007/s41064-024-00299-7","DOIUrl":"https://doi.org/10.1007/s41064-024-00299-7","url":null,"abstract":"<p>In this paper we address the task of pixel-wise land cover (LC) classification using satellite image time series (SITS). For that purpose, we use a supervised deep learning model and focus on combining spatial and temporal features. Our method is based on the Swin Transformer and captures global temporal features by using self-attention and local spatial features by convolutions. We extend the architecture to receive multi-temporal input to generate one output label map for every input image. In our experiments we focus on the application of pixel-wise LC classification from Sentinel‑2 SITS over the whole area of Lower Saxony (Germany). The experiments with our new model show that by using convolutions for spatial feature extraction or a temporal weighting module in the skip connections the performance improves and is more stable. The combined usage of both adaptations results in the overall best performance although this improvement is only minimal. Compared to a fully convolutional neural network without any self-attention layers our model improves the results by 2.1% in the mean F1-Score on a corrected test dataset. Additionally, we investigate different types of temporal position encoding, which do not have a significant impact on the performance.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enabling Spatial Digital Twins: Technologies, Challenges, and Future Research Directions 实现空间数字孪生:技术、挑战和未来研究方向
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-06 DOI: 10.1007/s41064-024-00301-2
Mohammed Eunus Ali, Muhammad Aamir Cheema, Tanzima Hashem, Anwaar Ulhaq, Muhammad Ali Babar

A Digital Twin (DT) is a virtual replica of a physical object or system, created to monitor, analyze, and optimize its behavior and characteristics. A Spatial Digital Twin (SDT) is a specific type of digital twin that emphasizes the geospatial aspects of the physical entity, incorporating precise location and dimensional attributes for a comprehensive understanding of its spatial environment. With the recent advancement in spatial technologies and breakthroughs in other computing technologies such as Artificial Intelligence (AI) and Machine Learning (ML), the SDTs market is expected to rise to 25 billion, covering a wide range of applications. The majority of existing research focuses on DTs and often fails to address the necessary spatial technologies essential for constructing SDTs. The current body of research on SDTs primarily concentrates on analyzing their potential impact and opportunities within various application domains. As building an SDT is a complex process and requires a variety of spatial computing technologies, it is not straightforward for practitioners and researchers of this multi-disciplinary domain to grasp the underlying details of enabling technologies of the SDT. In this paper, we are the first to systematically analyze different spatial technologies relevant to building an SDT in a layered approach (starting from data acquisition to visualization). More specifically, we present the tech stack of SDTs into five distinct layers of technologies: (i) data acquisition and processing; (ii) data integration, cataloging, and metadata management; (iii) data modeling, database management & big data analytics systems; (iv) Geographic Information System (GIS) software, maps, & APIs; and (v) key functional components such as visualizing, querying, mining, simulation, and prediction. Moreover, we discuss how modern technologies such as AI/ML, blockchains, and cloud computing can be effectively utilized in enabling and enhancing SDTs. Finally, we identify a number of research challenges and opportunities in SDTs. This work serves as an important resource for SDT researchers and practitioners as it explicitly distinguishes SDTs from traditional DTs, identifies unique applications, outlines the essential technological components of SDTs, and presents a vision for their future development along with the challenges that lie ahead.

数字孪生(DT)是物理对象或系统的虚拟复制品,用于监控、分析和优化其行为和特征。空间数字孪生(SDT)是数字孪生的一种特殊类型,它强调物理实体的地理空间方面,包含精确的位置和尺寸属性,以全面了解其空间环境。随着近年来空间技术的进步以及人工智能(AI)和机器学习(ML)等其他计算技术的突破,SDTs 的市场规模预计将上升到 250 亿美元,涵盖广泛的应用领域。现有研究大多集中在 DT 上,往往未能涉及构建 SDT 所必需的空间技术。目前对 SDT 的研究主要集中在分析其在不同应用领域的潜在影响和机遇。由于构建 SDT 是一个复杂的过程,需要多种空间计算技术,因此对于这一跨学科领域的从业人员和研究人员来说,要掌握 SDT 使能技术的基本细节并非易事。在本文中,我们首次以分层方法(从数据采集到可视化)系统分析了与构建 SDT 相关的各种空间技术。更具体地说,我们将 SDT 的技术堆栈分为五个不同的技术层:(i) 数据采集和处理;(ii) 数据集成、编目和元数据管理;(iii) 数据建模、数据库管理及大数据分析系统;(iv) 地理信息系统(GIS)软件、地图及应用程序接口;以及 (v) 可视化、查询、挖掘、模拟和预测等关键功能组件。此外,我们还讨论了如何有效利用人工智能/ML、区块链和云计算等现代技术来支持和增强 SDT。最后,我们确定了 SDTs 的一系列研究挑战和机遇。本著作明确区分了 SDT 与传统 DT,确定了 SDT 的独特应用,概述了 SDT 的基本技术组件,并提出了 SDT 的未来发展愿景和面临的挑战,是 SDT 研究人员和从业人员的重要资源。
{"title":"Enabling Spatial Digital Twins: Technologies, Challenges, and Future Research Directions","authors":"Mohammed Eunus Ali, Muhammad Aamir Cheema, Tanzima Hashem, Anwaar Ulhaq, Muhammad Ali Babar","doi":"10.1007/s41064-024-00301-2","DOIUrl":"https://doi.org/10.1007/s41064-024-00301-2","url":null,"abstract":"<p>A <i>Digital Twin (DT)</i> is a virtual replica of a physical object or system, created to monitor, analyze, and optimize its behavior and characteristics. A <i>Spatial Digital Twin (SDT)</i> is a specific type of digital twin that emphasizes the geospatial aspects of the physical entity, incorporating precise location and dimensional attributes for a comprehensive understanding of its spatial environment. With the recent advancement in spatial technologies and breakthroughs in other computing technologies such as Artificial Intelligence (AI) and Machine Learning (ML), the SDTs market is expected to rise to 25 billion, covering a wide range of applications. The majority of existing research focuses on DTs and often fails to address the necessary spatial technologies essential for constructing SDTs. The current body of research on SDTs primarily concentrates on analyzing their potential impact and opportunities within various application domains. As building an SDT is a complex process and requires a variety of spatial computing technologies, it is not straightforward for practitioners and researchers of this multi-disciplinary domain to grasp the underlying details of enabling technologies of the SDT. In this paper, we are the first to systematically analyze different spatial technologies relevant to building an SDT in a layered approach (starting from data acquisition to visualization). More specifically, we present the <i>tech stack of SDTs</i> into five distinct layers of technologies: (i) data acquisition and processing; (ii) data integration, cataloging, and metadata management; (iii) data modeling, database management &amp; big data analytics systems; (iv) Geographic Information System (GIS) software, maps, &amp; APIs; and (v) key functional components such as visualizing, querying, mining, simulation, and prediction. Moreover, we discuss how modern technologies such as AI/ML, blockchains, and cloud computing can be effectively utilized in enabling and enhancing SDTs. Finally, we identify a number of research challenges and opportunities in SDTs. This work serves as an important resource for SDT researchers and practitioners as it explicitly distinguishes SDTs from traditional DTs, identifies unique applications, outlines the essential technological components of SDTs, and presents a vision for their future development along with the challenges that lie ahead.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Treating Tropospheric Phase Delay in Large-scale Sentinel-1 Stacks to Analyze Land Subsidence 处理大规模哨兵-1 叠加数据中的对流层相位延迟以分析地面沉降
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-08-02 DOI: 10.1007/s41064-024-00304-z
Mahmud Haghshenas Haghighi, Mahdi Motagh

Variations in the tropospheric phase delay pose a primary challenge to achieving precise displacement measurements in Interferometric Synthetic Aperture Radar (InSAR) analysis. This study presents a cluster-based empirical tropospheric phase correction approach to analyze land subsidence rates from large-scale Sentinel‑1 data stacks. Our method identifies the optimum number of clusters in individual interferograms for K‑means clustering, and segments extensive interferograms into areas with consistent tropospheric phase delay behaviors. It then performs tropospheric phase correction based on empirical topography-phase correlation, addressing stratified and broad-scale tropospheric phase delays. Applied to a six-year data stack along a 1000-km track in Iran, we demonstrate that this approach enhances interferogram quality by reducing the standard deviation by 50% and lowering the semivariance of the interferograms to 20 cm2 at distances up to 800 km in 97% of the interferograms. Additionally, the corrected time series of deformation shows a 40% reduction in the root mean square of residuals at the most severely deformed points. By analyzing the corrected interferograms, we show that our method improves the efficiency of country-scale InSAR surveys to detect and quantify present-day land subsidence in Iran, which is essential for groundwater management and sustainable water resource planning.

对流层相位延迟的变化是干涉合成孔径雷达(InSAR)分析中实现精确位移测量的主要挑战。本研究提出了一种基于集群的对流层经验相位校正方法,用于分析来自大规模哨兵-1 数据集的土地沉降率。我们的方法在单个干涉图中确定 K-means 聚类的最佳簇数,并将大范围干涉图分割成具有一致对流层相位延迟行为的区域。然后根据经验地形-相位相关性进行对流层相位校正,解决分层和大尺度对流层相位延迟问题。我们将这种方法应用于伊朗 1000 公里轨道上的六年数据堆栈,结果表明,这种方法提高了干涉图的质量,将标准偏差降低了 50%,在距离达 800 公里的干涉图中,97%的干涉图的半方差降低到 20 平方厘米。此外,校正后的变形时间序列显示,变形最严重点的残差均方根降低了 40%。通过分析校正后的干涉图,我们发现我们的方法提高了国家尺度 InSAR 勘测的效率,可用于探测和量化伊朗现今的土地沉降,这对地下水管理和可持续水资源规划至关重要。
{"title":"Treating Tropospheric Phase Delay in Large-scale Sentinel-1 Stacks to Analyze Land Subsidence","authors":"Mahmud Haghshenas Haghighi, Mahdi Motagh","doi":"10.1007/s41064-024-00304-z","DOIUrl":"https://doi.org/10.1007/s41064-024-00304-z","url":null,"abstract":"<p>Variations in the tropospheric phase delay pose a primary challenge to achieving precise displacement measurements in Interferometric Synthetic Aperture Radar (InSAR) analysis. This study presents a cluster-based empirical tropospheric phase correction approach to analyze land subsidence rates from large-scale Sentinel‑1 data stacks. Our method identifies the optimum number of clusters in individual interferograms for K‑means clustering, and segments extensive interferograms into areas with consistent tropospheric phase delay behaviors. It then performs tropospheric phase correction based on empirical topography-phase correlation, addressing stratified and broad-scale tropospheric phase delays. Applied to a six-year data stack along a 1000-km track in Iran, we demonstrate that this approach enhances interferogram quality by reducing the standard deviation by 50% and lowering the semivariance of the interferograms to 20 cm<sup>2</sup> at distances up to 800 km in 97% of the interferograms. Additionally, the corrected time series of deformation shows a 40% reduction in the root mean square of residuals at the most severely deformed points. By analyzing the corrected interferograms, we show that our method improves the efficiency of country-scale InSAR surveys to detect and quantify present-day land subsidence in Iran, which is essential for groundwater management and sustainable water resource planning.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cooperative Image Orientation with Dynamic Objects 与动态物体合作确定图像方向
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-07-26 DOI: 10.1007/s41064-024-00296-w
Philipp Trusheim, Max Mehltretter, Franz Rottensteiner, Christian Heipke

Using images to supplement classical navigation solutions purely based on global navigation satellite systems (GNSSs) has the potential to overcome problems in densely built-up areas. These approaches usually assume a static environment; however, this assumption is not necessarily valid in urban areas. Therefore, many approaches delete information stemming from moving objects in a first processing step, but this results in information being lost. In this paper, we present an approach that detects and models so-called dynamic objects based on image sequences and includes these object models into a bundle adjustment. We distinguish dynamic objects that provide information about their position to others (cooperating objects) and those that do not (non-cooperating objects). Dynamic objects that observe the environment with the help of sensors in order to determine their position are called observing objects. In the experiments discussed here, the observing object is equipped with a stereo camera and a GNSS receiver. We show that cooperating objects can have a positive effect on the exterior orientation of the observing object after the bundle adjustment, both in terms of precision and accuracy. However, we found that introducing non-cooperating objects did not result in further improvements, probably because in our case the photogrammetric block was already stable without them due to the large number and good distribution of static tie points.

利用图像来补充纯粹基于全球导航卫星系统(GNSS)的传统导航解决方案,有可能解决建筑密集地区的问题。这些方法通常假设环境是静态的,但这一假设在城市地区并不一定成立。因此,许多方法都会在第一步处理过程中删除来自移动物体的信息,但这会导致信息丢失。在本文中,我们提出了一种基于图像序列检测所谓动态物体并建立模型的方法,并将这些物体模型纳入捆绑调整。我们区分了向他人提供自身位置信息的动态物体(合作物体)和不提供位置信息的动态物体(非合作物体)。借助传感器观察环境以确定自身位置的动态物体被称为观察物体。在本文讨论的实验中,观测物体配备了一个立体摄像机和一个全球导航卫星系统接收器。我们的研究表明,合作对象在进行捆绑调整后,无论在精度还是准确度方面,都能对观测对象的外部方位产生积极影响。然而,我们发现,引入非合作对象并不会带来进一步的改进,这可能是因为在我们的案例中,由于静态连接点数量多且分布均匀,因此在没有这些对象的情况下,摄影测量块已经很稳定了。
{"title":"Cooperative Image Orientation with Dynamic Objects","authors":"Philipp Trusheim, Max Mehltretter, Franz Rottensteiner, Christian Heipke","doi":"10.1007/s41064-024-00296-w","DOIUrl":"https://doi.org/10.1007/s41064-024-00296-w","url":null,"abstract":"<p>Using images to supplement classical navigation solutions purely based on global navigation satellite systems (GNSSs) has the potential to overcome problems in densely built-up areas. These approaches usually assume a static environment; however, this assumption is not necessarily valid in urban areas. Therefore, many approaches delete information stemming from moving objects in a first processing step, but this results in information being lost. In this paper, we present an approach that detects and models so-called dynamic objects based on image sequences and includes these object models into a bundle adjustment. We distinguish dynamic objects that provide information about their position to others (cooperating objects) and those that do not (non-cooperating objects). Dynamic objects that observe the environment with the help of sensors in order to determine their position are called observing objects. In the experiments discussed here, the observing object is equipped with a stereo camera and a GNSS receiver. We show that cooperating objects can have a positive effect on the exterior orientation of the observing object after the bundle adjustment, both in terms of precision and accuracy. However, we found that introducing non-cooperating objects did not result in further improvements, probably because in our case the photogrammetric block was already stable without them due to the large number and good distribution of static tie points.</p>","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local Evaluation of Large-scale Remote Sensing Machine Learning-generated Building and Road Dataset: The Case of Rwanda 大规模遥感机器学习生成的建筑物和道路数据集的地方评估:卢旺达案例
IF 4.1 4区 地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2024-07-24 DOI: 10.1007/s41064-024-00297-9
Emmanuel Nyandwi, Markus Gerke, Pedro Achanccaray

Accurate and up-to-date building and road data are crucial for informed spatial planning. In developing regions in particular, major challenges arise due to the limited availability of these data, primarily as a result of the inherent inefficiency of traditional field-based surveys and manual data generation methods. Importantly, this limitation has prompted the exploration of alternative solutions, including the use of remote sensing machine learning-generated (RSML) datasets. Within the field of RSML datasets, a plethora of models have been proposed. However, these methods, evaluated in a research setting, may not translate perfectly to massive real-world applications, attributable to potential inaccuracies in unknown geographic spaces. The scepticism surrounding the usefulness of datasets generated by global models, owing to unguaranteed local accuracy, appears to be particularly concerning. As a consequence, rigorous evaluations of these datasets in local scenarios are essential for gaining insights into their usability. To address this concern, this study investigates the local accuracy of large RSML datasets. For this evaluation, we employed a dataset generated using models pre-trained on a variety of samples drawn from across the world and accessible from public repositories of open benchmark datasets. Subsequently, these models were fine-tuned with a limited set of local samples specific to Rwanda. In addition, the evaluation included Microsoft’s and Google’s global datasets. Using ResNet and Mask R‑CNN, we explored the performance variations of different building detection approaches: bottom-up, end-to-end, and their combination. For road extraction, we explored the approach of training multiple models on subsets representing different road types. Our testing dataset was carefully designed to be diverse, incorporating both easy and challenging scenes. It includes areas purposefully chosen for their high level of clutter, making it difficult to detect structures like buildings. This inclusion of complex scenarios alongside simpler ones allows us to thoroughly assess the robustness of DL-based detection models for handling diverse real-world conditions. In addition, buildings were evaluated using a polygon-wise comparison, while roads were assessed using network length-derived metrics.

Our results showed a precision (P) of around 75% and a recall (R) of around 60% for the locally fine-tuned building model. This performance was achieved in three out of six testing sites and is considered the lowest limit needed for practical utility of RSML datasets, according to the literature. In contrast, comparable results were obtained in only one out of six sites for the Google and Microsoft datasets. Our locally fine-tuned road model achieved moderate success, meeting the minimum usability threshold in four out of six sites. In contrast, the Microsoft dataset performed well on all sites. In summary, our findings suggest improved performance

准确、最新的建筑和道路数据对于知情的空间规划至关重要。特别是在发展中地区,由于这些数据的可用性有限,主要是由于传统的实地调查和人工数据生成方法固有的低效率造成的,因此面临着重大挑战。重要的是,这种局限性促使人们探索其他解决方案,包括使用遥感机器学习生成(RSML)数据集。在 RSML 数据集领域,已经提出了大量模型。然而,这些在研究环境中进行评估的方法可能无法完美地应用于大规模的现实世界,原因是在未知的地理空间中可能存在误差。由于无法保证局部准确性,人们对全球模型生成的数据集的实用性持怀疑态度,这似乎尤其令人担忧。因此,在本地场景中对这些数据集进行严格评估对于深入了解其可用性至关重要。为了解决这个问题,本研究调查了大型 RSML 数据集的局部准确性。在评估过程中,我们使用了一个数据集,该数据集是使用在来自世界各地的各种样本上预先训练的模型生成的,这些样本可从开放基准数据集的公共存储库中获取。随后,我们使用卢旺达本地的有限样本集对这些模型进行了微调。此外,评估还包括微软和谷歌的全球数据集。利用 ResNet 和 Mask R-CNN,我们探索了不同建筑物检测方法的性能差异:自下而上、端到端以及它们的组合。在道路提取方面,我们探索了在代表不同道路类型的子集上训练多个模型的方法。我们的测试数据集经过精心设计,既包括简单场景,也包括具有挑战性的场景。测试数据集特意选择了杂乱程度较高的区域,这样就很难检测到建筑物等结构。将复杂场景与简单场景结合在一起,使我们能够全面评估基于 DL 的检测模型在处理现实世界各种条件时的鲁棒性。此外,我们还使用多边形比较法对建筑物进行了评估,并使用源自网络长度的指标对道路进行了评估。我们的结果显示,局部微调的建筑物模型的精确度(P)约为 75%,召回率(R)约为 60%。我们的结果表明,局部微调建筑模型的精确度(P)约为 75%,召回率(R)约为 60%,在六个测试点中的三个测试点都达到了这一性能,根据文献,这被认为是 RSML 数据集实用性所需的最低限度。相比之下,谷歌和微软数据集在六个测试点中只有一个测试点取得了类似的结果。我们的局部微调道路模型取得了中等程度的成功,在六个站点中的四个站点达到了最低可用性要求。相比之下,微软数据集在所有网站上的表现都很好。总之,我们的研究结果表明,相对于建筑物提取任务,道路提取的性能有所提高。此外,我们还发现,与开放的全局数据集相比,依靠自下而上和自上而下相结合的分割方法,同时利用开放的全局基准标注数据集和少量样本进行微调,可以提供更准确的 RSML 数据集。我们的研究结果表明,仅仅依赖综合准确度指标可能会产生误导。根据我们的评估,即使是城市级别的衍生指标也可能无法捕捉到城市内部性能的显著差异,例如特定街区的准确度较低。探索其他方法,包括整合激光雷达数据、无人机图像、航空图像或使用其他网络架构,可能有利于克服复杂地区的挑战。
{"title":"Local Evaluation of Large-scale Remote Sensing Machine Learning-generated Building and Road Dataset: The Case of Rwanda","authors":"Emmanuel Nyandwi, Markus Gerke, Pedro Achanccaray","doi":"10.1007/s41064-024-00297-9","DOIUrl":"https://doi.org/10.1007/s41064-024-00297-9","url":null,"abstract":"<p>Accurate and up-to-date building and road data are crucial for informed spatial planning. In developing regions in particular, major challenges arise due to the limited availability of these data, primarily as a result of the inherent inefficiency of traditional field-based surveys and manual data generation methods. Importantly, this limitation has prompted the exploration of alternative solutions, including the use of remote sensing machine learning-generated (RSML) datasets. Within the field of RSML datasets, a plethora of models have been proposed. However, these methods, evaluated in a research setting, may not translate perfectly to massive real-world applications, attributable to potential inaccuracies in unknown geographic spaces. The scepticism surrounding the usefulness of datasets generated by global models, owing to unguaranteed local accuracy, appears to be particularly concerning. As a consequence, rigorous evaluations of these datasets in local scenarios are essential for gaining insights into their usability. To address this concern, this study investigates the local accuracy of large RSML datasets. For this evaluation, we employed a dataset generated using models pre-trained on a variety of samples drawn from across the world and accessible from public repositories of open benchmark datasets. Subsequently, these models were fine-tuned with a limited set of local samples specific to Rwanda. In addition, the evaluation included Microsoft’s and Google’s global datasets. Using ResNet and Mask R‑CNN, we explored the performance variations of different building detection approaches: bottom-up, end-to-end, and their combination. For road extraction, we explored the approach of training multiple models on subsets representing different road types. Our testing dataset was carefully designed to be diverse, incorporating both easy and challenging scenes. It includes areas purposefully chosen for their high level of clutter, making it difficult to detect structures like buildings. This inclusion of complex scenarios alongside simpler ones allows us to thoroughly assess the robustness of DL-based detection models for handling diverse real-world conditions. In addition, buildings were evaluated using a polygon-wise comparison, while roads were assessed using network length-derived metrics.</p><p>Our results showed a precision (P) of around 75% and a recall (R) of around 60% for the locally fine-tuned building model. This performance was achieved in three out of six testing sites and is considered the lowest limit needed for practical utility of RSML datasets, according to the literature. In contrast, comparable results were obtained in only one out of six sites for the Google and Microsoft datasets. Our locally fine-tuned road model achieved moderate success, meeting the minimum usability threshold in four out of six sites. In contrast, the Microsoft dataset performed well on all sites. In summary, our findings suggest improved performance ","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":null,"pages":null},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1