PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science最新文献_第2页

Transformer models for Land Cover Classification with Satellite Image Time Series 利用卫星图像时间序列进行土地覆盖分类的变换器模型

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-08-06 DOI: 10.1007/s41064-024-00299-7

Mirjana Voelsen, Franz Rottensteiner, Christian Heipke

In this paper we address the task of pixel-wise land cover (LC) classification using satellite image time series (SITS). For that purpose, we use a supervised deep learning model and focus on combining spatial and temporal features. Our method is based on the Swin Transformer and captures global temporal features by using self-attention and local spatial features by convolutions. We extend the architecture to receive multi-temporal input to generate one output label map for every input image. In our experiments we focus on the application of pixel-wise LC classification from Sentinel‑2 SITS over the whole area of Lower Saxony (Germany). The experiments with our new model show that by using convolutions for spatial feature extraction or a temporal weighting module in the skip connections the performance improves and is more stable. The combined usage of both adaptations results in the overall best performance although this improvement is only minimal. Compared to a fully convolutional neural network without any self-attention layers our model improves the results by 2.1% in the mean F1-Score on a corrected test dataset. Additionally, we investigate different types of temporal position encoding, which do not have a significant impact on the performance.

在本文中，我们利用卫星图像时间序列（SITS）解决了像素级土地覆盖（LC）分类任务。为此，我们使用了一个有监督的深度学习模型，并侧重于结合空间和时间特征。我们的方法以 Swin 变换器为基础，通过自我关注捕捉全局时间特征，并通过卷积捕捉局部空间特征。我们对架构进行了扩展，以接收多时态输入，为每张输入图像生成一个输出标签图。在实验中，我们重点应用了下萨克森州（德国）整个地区哨兵-2 SITS 的像素级 LC 分类。使用我们的新模型进行的实验表明，通过使用卷积进行空间特征提取或在跳转连接中使用时间加权模块，可以提高性能并使其更加稳定。综合使用这两种适配方法可获得最佳的整体性能，尽管这种改进微乎其微。与没有任何自我注意层的完全卷积神经网络相比，我们的模型在校正测试数据集上的平均 F1 分数提高了 2.1%。此外，我们还研究了不同类型的时间位置编码，但这些编码对性能并无显著影响。

{"title":"Transformer models for Land Cover Classification with Satellite Image Time Series","authors":"Mirjana Voelsen, Franz Rottensteiner, Christian Heipke","doi":"10.1007/s41064-024-00299-7","DOIUrl":"https://doi.org/10.1007/s41064-024-00299-7","url":null,"abstract":"In this paper we address the task of pixel-wise land cover (LC) classification using satellite image time series (SITS). For that purpose, we use a supervised deep learning model and focus on combining spatial and temporal features. Our method is based on the Swin Transformer and captures global temporal features by using self-attention and local spatial features by convolutions. We extend the architecture to receive multi-temporal input to generate one output label map for every input image. In our experiments we focus on the application of pixel-wise LC classification from Sentinel‑2 SITS over the whole area of Lower Saxony (Germany). The experiments with our new model show that by using convolutions for spatial feature extraction or a temporal weighting module in the skip connections the performance improves and is more stable. The combined usage of both adaptations results in the overall best performance although this improvement is only minimal. Compared to a fully convolutional neural network without any self-attention layers our model improves the results by 2.1% in the mean F1-Score on a corrected test dataset. Additionally, we investigate different types of temporal position encoding, which do not have a significant impact on the performance.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"23 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enabling Spatial Digital Twins: Technologies, Challenges, and Future Research Directions 实现空间数字孪生：技术、挑战和未来研究方向

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-08-06 DOI: 10.1007/s41064-024-00301-2

Mohammed Eunus Ali, Muhammad Aamir Cheema, Tanzima Hashem, Anwaar Ulhaq, Muhammad Ali Babar

A Digital Twin (DT) is a virtual replica of a physical object or system, created to monitor, analyze, and optimize its behavior and characteristics. A Spatial Digital Twin (SDT) is a specific type of digital twin that emphasizes the geospatial aspects of the physical entity, incorporating precise location and dimensional attributes for a comprehensive understanding of its spatial environment. With the recent advancement in spatial technologies and breakthroughs in other computing technologies such as Artificial Intelligence (AI) and Machine Learning (ML), the SDTs market is expected to rise to 25 billion, covering a wide range of applications. The majority of existing research focuses on DTs and often fails to address the necessary spatial technologies essential for constructing SDTs. The current body of research on SDTs primarily concentrates on analyzing their potential impact and opportunities within various application domains. As building an SDT is a complex process and requires a variety of spatial computing technologies, it is not straightforward for practitioners and researchers of this multi-disciplinary domain to grasp the underlying details of enabling technologies of the SDT. In this paper, we are the first to systematically analyze different spatial technologies relevant to building an SDT in a layered approach (starting from data acquisition to visualization). More specifically, we present the tech stack of SDTs into five distinct layers of technologies: (i) data acquisition and processing; (ii) data integration, cataloging, and metadata management; (iii) data modeling, database management & big data analytics systems; (iv) Geographic Information System (GIS) software, maps, & APIs; and (v) key functional components such as visualizing, querying, mining, simulation, and prediction. Moreover, we discuss how modern technologies such as AI/ML, blockchains, and cloud computing can be effectively utilized in enabling and enhancing SDTs. Finally, we identify a number of research challenges and opportunities in SDTs. This work serves as an important resource for SDT researchers and practitioners as it explicitly distinguishes SDTs from traditional DTs, identifies unique applications, outlines the essential technological components of SDTs, and presents a vision for their future development along with the challenges that lie ahead.

数字孪生（DT）是物理对象或系统的虚拟复制品，用于监控、分析和优化其行为和特征。空间数字孪生（SDT）是数字孪生的一种特殊类型，它强调物理实体的地理空间方面，包含精确的位置和尺寸属性，以全面了解其空间环境。随着近年来空间技术的进步以及人工智能（AI）和机器学习（ML）等其他计算技术的突破，SDTs 的市场规模预计将上升到 250 亿美元，涵盖广泛的应用领域。现有研究大多集中在 DT 上，往往未能涉及构建 SDT 所必需的空间技术。目前对 SDT 的研究主要集中在分析其在不同应用领域的潜在影响和机遇。由于构建 SDT 是一个复杂的过程，需要多种空间计算技术，因此对于这一跨学科领域的从业人员和研究人员来说，要掌握 SDT 使能技术的基本细节并非易事。在本文中，我们首次以分层方法（从数据采集到可视化）系统分析了与构建 SDT 相关的各种空间技术。更具体地说，我们将 SDT 的技术堆栈分为五个不同的技术层：(i) 数据采集和处理；(ii) 数据集成、编目和元数据管理；(iii) 数据建模、数据库管理及大数据分析系统；(iv) 地理信息系统（GIS）软件、地图及应用程序接口；以及 (v) 可视化、查询、挖掘、模拟和预测等关键功能组件。此外，我们还讨论了如何有效利用人工智能/ML、区块链和云计算等现代技术来支持和增强 SDT。最后，我们确定了 SDTs 的一系列研究挑战和机遇。本著作明确区分了 SDT 与传统 DT，确定了 SDT 的独特应用，概述了 SDT 的基本技术组件，并提出了 SDT 的未来发展愿景和面临的挑战，是 SDT 研究人员和从业人员的重要资源。

{"title":"Enabling Spatial Digital Twins: Technologies, Challenges, and Future Research Directions","authors":"Mohammed Eunus Ali, Muhammad Aamir Cheema, Tanzima Hashem, Anwaar Ulhaq, Muhammad Ali Babar","doi":"10.1007/s41064-024-00301-2","DOIUrl":"https://doi.org/10.1007/s41064-024-00301-2","url":null,"abstract":"A Digital Twin (DT) is a virtual replica of a physical object or system, created to monitor, analyze, and optimize its behavior and characteristics. A Spatial Digital Twin (SDT) is a specific type of digital twin that emphasizes the geospatial aspects of the physical entity, incorporating precise location and dimensional attributes for a comprehensive understanding of its spatial environment. With the recent advancement in spatial technologies and breakthroughs in other computing technologies such as Artificial Intelligence (AI) and Machine Learning (ML), the SDTs market is expected to rise to 25 billion, covering a wide range of applications. The majority of existing research focuses on DTs and often fails to address the necessary spatial technologies essential for constructing SDTs. The current body of research on SDTs primarily concentrates on analyzing their potential impact and opportunities within various application domains. As building an SDT is a complex process and requires a variety of spatial computing technologies, it is not straightforward for practitioners and researchers of this multi-disciplinary domain to grasp the underlying details of enabling technologies of the SDT. In this paper, we are the first to systematically analyze different spatial technologies relevant to building an SDT in a layered approach (starting from data acquisition to visualization). More specifically, we present the tech stack of SDTs into five distinct layers of technologies: (i) data acquisition and processing; (ii) data integration, cataloging, and metadata management; (iii) data modeling, database management & big data analytics systems; (iv) Geographic Information System (GIS) software, maps, & APIs; and (v) key functional components such as visualizing, querying, mining, simulation, and prediction. Moreover, we discuss how modern technologies such as AI/ML, blockchains, and cloud computing can be effectively utilized in enabling and enhancing SDTs. Finally, we identify a number of research challenges and opportunities in SDTs. This work serves as an important resource for SDT researchers and practitioners as it explicitly distinguishes SDTs from traditional DTs, identifies unique applications, outlines the essential technological components of SDTs, and presents a vision for their future development along with the challenges that lie ahead.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"30 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Treating Tropospheric Phase Delay in Large-scale Sentinel-1 Stacks to Analyze Land Subsidence 处理大规模哨兵-1 叠加数据中的对流层相位延迟以分析地面沉降

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-08-02 DOI: 10.1007/s41064-024-00304-z

Mahmud Haghshenas Haghighi, Mahdi Motagh

Variations in the tropospheric phase delay pose a primary challenge to achieving precise displacement measurements in Interferometric Synthetic Aperture Radar (InSAR) analysis. This study presents a cluster-based empirical tropospheric phase correction approach to analyze land subsidence rates from large-scale Sentinel‑1 data stacks. Our method identifies the optimum number of clusters in individual interferograms for K‑means clustering, and segments extensive interferograms into areas with consistent tropospheric phase delay behaviors. It then performs tropospheric phase correction based on empirical topography-phase correlation, addressing stratified and broad-scale tropospheric phase delays. Applied to a six-year data stack along a 1000-km track in Iran, we demonstrate that this approach enhances interferogram quality by reducing the standard deviation by 50% and lowering the semivariance of the interferograms to 20 cm² at distances up to 800 km in 97% of the interferograms. Additionally, the corrected time series of deformation shows a 40% reduction in the root mean square of residuals at the most severely deformed points. By analyzing the corrected interferograms, we show that our method improves the efficiency of country-scale InSAR surveys to detect and quantify present-day land subsidence in Iran, which is essential for groundwater management and sustainable water resource planning.

对流层相位延迟的变化是干涉合成孔径雷达（InSAR）分析中实现精确位移测量的主要挑战。本研究提出了一种基于集群的对流层经验相位校正方法，用于分析来自大规模哨兵-1 数据集的土地沉降率。我们的方法在单个干涉图中确定 K-means 聚类的最佳簇数，并将大范围干涉图分割成具有一致对流层相位延迟行为的区域。然后根据经验地形-相位相关性进行对流层相位校正，解决分层和大尺度对流层相位延迟问题。我们将这种方法应用于伊朗 1000 公里轨道上的六年数据堆栈，结果表明，这种方法提高了干涉图的质量，将标准偏差降低了 50%，在距离达 800 公里的干涉图中，97%的干涉图的半方差降低到 20 平方厘米。此外，校正后的变形时间序列显示，变形最严重点的残差均方根降低了 40%。通过分析校正后的干涉图，我们发现我们的方法提高了国家尺度 InSAR 勘测的效率，可用于探测和量化伊朗现今的土地沉降，这对地下水管理和可持续水资源规划至关重要。

{"title":"Treating Tropospheric Phase Delay in Large-scale Sentinel-1 Stacks to Analyze Land Subsidence","authors":"Mahmud Haghshenas Haghighi, Mahdi Motagh","doi":"10.1007/s41064-024-00304-z","DOIUrl":"https://doi.org/10.1007/s41064-024-00304-z","url":null,"abstract":"Variations in the tropospheric phase delay pose a primary challenge to achieving precise displacement measurements in Interferometric Synthetic Aperture Radar (InSAR) analysis. This study presents a cluster-based empirical tropospheric phase correction approach to analyze land subsidence rates from large-scale Sentinel‑1 data stacks. Our method identifies the optimum number of clusters in individual interferograms for K‑means clustering, and segments extensive interferograms into areas with consistent tropospheric phase delay behaviors. It then performs tropospheric phase correction based on empirical topography-phase correlation, addressing stratified and broad-scale tropospheric phase delays. Applied to a six-year data stack along a 1000-km track in Iran, we demonstrate that this approach enhances interferogram quality by reducing the standard deviation by 50% and lowering the semivariance of the interferograms to 20 cm2 at distances up to 800 km in 97% of the interferograms. Additionally, the corrected time series of deformation shows a 40% reduction in the root mean square of residuals at the most severely deformed points. By analyzing the corrected interferograms, we show that our method improves the efficiency of country-scale InSAR surveys to detect and quantify present-day land subsidence in Iran, which is essential for groundwater management and sustainable water resource planning.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"55 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cooperative Image Orientation with Dynamic Objects 与动态物体合作确定图像方向

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-07-26 DOI: 10.1007/s41064-024-00296-w

Philipp Trusheim, Max Mehltretter, Franz Rottensteiner, Christian Heipke

Using images to supplement classical navigation solutions purely based on global navigation satellite systems (GNSSs) has the potential to overcome problems in densely built-up areas. These approaches usually assume a static environment; however, this assumption is not necessarily valid in urban areas. Therefore, many approaches delete information stemming from moving objects in a first processing step, but this results in information being lost. In this paper, we present an approach that detects and models so-called dynamic objects based on image sequences and includes these object models into a bundle adjustment. We distinguish dynamic objects that provide information about their position to others (cooperating objects) and those that do not (non-cooperating objects). Dynamic objects that observe the environment with the help of sensors in order to determine their position are called observing objects. In the experiments discussed here, the observing object is equipped with a stereo camera and a GNSS receiver. We show that cooperating objects can have a positive effect on the exterior orientation of the observing object after the bundle adjustment, both in terms of precision and accuracy. However, we found that introducing non-cooperating objects did not result in further improvements, probably because in our case the photogrammetric block was already stable without them due to the large number and good distribution of static tie points.

利用图像来补充纯粹基于全球导航卫星系统（GNSS）的传统导航解决方案，有可能解决建筑密集地区的问题。这些方法通常假设环境是静态的，但这一假设在城市地区并不一定成立。因此，许多方法都会在第一步处理过程中删除来自移动物体的信息，但这会导致信息丢失。在本文中，我们提出了一种基于图像序列检测所谓动态物体并建立模型的方法，并将这些物体模型纳入捆绑调整。我们区分了向他人提供自身位置信息的动态物体（合作物体）和不提供位置信息的动态物体（非合作物体）。借助传感器观察环境以确定自身位置的动态物体被称为观察物体。在本文讨论的实验中，观测物体配备了一个立体摄像机和一个全球导航卫星系统接收器。我们的研究表明，合作对象在进行捆绑调整后，无论在精度还是准确度方面，都能对观测对象的外部方位产生积极影响。然而，我们发现，引入非合作对象并不会带来进一步的改进，这可能是因为在我们的案例中，由于静态连接点数量多且分布均匀，因此在没有这些对象的情况下，摄影测量块已经很稳定了。

{"title":"Cooperative Image Orientation with Dynamic Objects","authors":"Philipp Trusheim, Max Mehltretter, Franz Rottensteiner, Christian Heipke","doi":"10.1007/s41064-024-00296-w","DOIUrl":"https://doi.org/10.1007/s41064-024-00296-w","url":null,"abstract":"Using images to supplement classical navigation solutions purely based on global navigation satellite systems (GNSSs) has the potential to overcome problems in densely built-up areas. These approaches usually assume a static environment; however, this assumption is not necessarily valid in urban areas. Therefore, many approaches delete information stemming from moving objects in a first processing step, but this results in information being lost. In this paper, we present an approach that detects and models so-called dynamic objects based on image sequences and includes these object models into a bundle adjustment. We distinguish dynamic objects that provide information about their position to others (cooperating objects) and those that do not (non-cooperating objects). Dynamic objects that observe the environment with the help of sensors in order to determine their position are called observing objects. In the experiments discussed here, the observing object is equipped with a stereo camera and a GNSS receiver. We show that cooperating objects can have a positive effect on the exterior orientation of the observing object after the bundle adjustment, both in terms of precision and accuracy. However, we found that introducing non-cooperating objects did not result in further improvements, probably because in our case the photogrammetric block was already stable without them due to the large number and good distribution of static tie points.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"14 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Local Evaluation of Large-scale Remote Sensing Machine Learning-generated Building and Road Dataset: The Case of Rwanda 大规模遥感机器学习生成的建筑物和道路数据集的地方评估：卢旺达案例

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-07-24 DOI: 10.1007/s41064-024-00297-9

Emmanuel Nyandwi, Markus Gerke, Pedro Achanccaray

Accurate and up-to-date building and road data are crucial for informed spatial planning. In developing regions in particular, major challenges arise due to the limited availability of these data, primarily as a result of the inherent inefficiency of traditional field-based surveys and manual data generation methods. Importantly, this limitation has prompted the exploration of alternative solutions, including the use of remote sensing machine learning-generated (RSML) datasets. Within the field of RSML datasets, a plethora of models have been proposed. However, these methods, evaluated in a research setting, may not translate perfectly to massive real-world applications, attributable to potential inaccuracies in unknown geographic spaces. The scepticism surrounding the usefulness of datasets generated by global models, owing to unguaranteed local accuracy, appears to be particularly concerning. As a consequence, rigorous evaluations of these datasets in local scenarios are essential for gaining insights into their usability. To address this concern, this study investigates the local accuracy of large RSML datasets. For this evaluation, we employed a dataset generated using models pre-trained on a variety of samples drawn from across the world and accessible from public repositories of open benchmark datasets. Subsequently, these models were fine-tuned with a limited set of local samples specific to Rwanda. In addition, the evaluation included Microsoft’s and Google’s global datasets. Using ResNet and Mask R‑CNN, we explored the performance variations of different building detection approaches: bottom-up, end-to-end, and their combination. For road extraction, we explored the approach of training multiple models on subsets representing different road types. Our testing dataset was carefully designed to be diverse, incorporating both easy and challenging scenes. It includes areas purposefully chosen for their high level of clutter, making it difficult to detect structures like buildings. This inclusion of complex scenarios alongside simpler ones allows us to thoroughly assess the robustness of DL-based detection models for handling diverse real-world conditions. In addition, buildings were evaluated using a polygon-wise comparison, while roads were assessed using network length-derived metrics.Our results showed a precision (P) of around 75% and a recall (R) of around 60% for the locally fine-tuned building model. This performance was achieved in three out of six testing sites and is considered the lowest limit needed for practical utility of RSML datasets, according to the literature. In contrast, comparable results were obtained in only one out of six sites for the Google and Microsoft datasets. Our locally fine-tuned road model achieved moderate success, meeting the minimum usability threshold in four out of six sites. In contrast, the Microsoft dataset performed well on all sites. In summary, our findings suggest improved performance

准确、最新的建筑和道路数据对于知情的空间规划至关重要。特别是在发展中地区，由于这些数据的可用性有限，主要是由于传统的实地调查和人工数据生成方法固有的低效率造成的，因此面临着重大挑战。重要的是，这种局限性促使人们探索其他解决方案，包括使用遥感机器学习生成（RSML）数据集。在 RSML 数据集领域，已经提出了大量模型。然而，这些在研究环境中进行评估的方法可能无法完美地应用于大规模的现实世界，原因是在未知的地理空间中可能存在误差。由于无法保证局部准确性，人们对全球模型生成的数据集的实用性持怀疑态度，这似乎尤其令人担忧。因此，在本地场景中对这些数据集进行严格评估对于深入了解其可用性至关重要。为了解决这个问题，本研究调查了大型 RSML 数据集的局部准确性。在评估过程中，我们使用了一个数据集，该数据集是使用在来自世界各地的各种样本上预先训练的模型生成的，这些样本可从开放基准数据集的公共存储库中获取。随后，我们使用卢旺达本地的有限样本集对这些模型进行了微调。此外，评估还包括微软和谷歌的全球数据集。利用 ResNet 和 Mask R-CNN，我们探索了不同建筑物检测方法的性能差异：自下而上、端到端以及它们的组合。在道路提取方面，我们探索了在代表不同道路类型的子集上训练多个模型的方法。我们的测试数据集经过精心设计，既包括简单场景，也包括具有挑战性的场景。测试数据集特意选择了杂乱程度较高的区域，这样就很难检测到建筑物等结构。将复杂场景与简单场景结合在一起，使我们能够全面评估基于 DL 的检测模型在处理现实世界各种条件时的鲁棒性。此外，我们还使用多边形比较法对建筑物进行了评估，并使用源自网络长度的指标对道路进行了评估。我们的结果显示，局部微调的建筑物模型的精确度（P）约为 75%，召回率（R）约为 60%。我们的结果表明，局部微调建筑模型的精确度（P）约为 75%，召回率（R）约为 60%，在六个测试点中的三个测试点都达到了这一性能，根据文献，这被认为是 RSML 数据集实用性所需的最低限度。相比之下，谷歌和微软数据集在六个测试点中只有一个测试点取得了类似的结果。我们的局部微调道路模型取得了中等程度的成功，在六个站点中的四个站点达到了最低可用性要求。相比之下，微软数据集在所有网站上的表现都很好。总之，我们的研究结果表明，相对于建筑物提取任务，道路提取的性能有所提高。此外，我们还发现，与开放的全局数据集相比，依靠自下而上和自上而下相结合的分割方法，同时利用开放的全局基准标注数据集和少量样本进行微调，可以提供更准确的 RSML 数据集。我们的研究结果表明，仅仅依赖综合准确度指标可能会产生误导。根据我们的评估，即使是城市级别的衍生指标也可能无法捕捉到城市内部性能的显著差异，例如特定街区的准确度较低。探索其他方法，包括整合激光雷达数据、无人机图像、航空图像或使用其他网络架构，可能有利于克服复杂地区的挑战。

{"title":"Local Evaluation of Large-scale Remote Sensing Machine Learning-generated Building and Road Dataset: The Case of Rwanda","authors":"Emmanuel Nyandwi, Markus Gerke, Pedro Achanccaray","doi":"10.1007/s41064-024-00297-9","DOIUrl":"https://doi.org/10.1007/s41064-024-00297-9","url":null,"abstract":"Accurate and up-to-date building and road data are crucial for informed spatial planning. In developing regions in particular, major challenges arise due to the limited availability of these data, primarily as a result of the inherent inefficiency of traditional field-based surveys and manual data generation methods. Importantly, this limitation has prompted the exploration of alternative solutions, including the use of remote sensing machine learning-generated (RSML) datasets. Within the field of RSML datasets, a plethora of models have been proposed. However, these methods, evaluated in a research setting, may not translate perfectly to massive real-world applications, attributable to potential inaccuracies in unknown geographic spaces. The scepticism surrounding the usefulness of datasets generated by global models, owing to unguaranteed local accuracy, appears to be particularly concerning. As a consequence, rigorous evaluations of these datasets in local scenarios are essential for gaining insights into their usability. To address this concern, this study investigates the local accuracy of large RSML datasets. For this evaluation, we employed a dataset generated using models pre-trained on a variety of samples drawn from across the world and accessible from public repositories of open benchmark datasets. Subsequently, these models were fine-tuned with a limited set of local samples specific to Rwanda. In addition, the evaluation included Microsoft’s and Google’s global datasets. Using ResNet and Mask R‑CNN, we explored the performance variations of different building detection approaches: bottom-up, end-to-end, and their combination. For road extraction, we explored the approach of training multiple models on subsets representing different road types. Our testing dataset was carefully designed to be diverse, incorporating both easy and challenging scenes. It includes areas purposefully chosen for their high level of clutter, making it difficult to detect structures like buildings. This inclusion of complex scenarios alongside simpler ones allows us to thoroughly assess the robustness of DL-based detection models for handling diverse real-world conditions. In addition, buildings were evaluated using a polygon-wise comparison, while roads were assessed using network length-derived metrics.Our results showed a precision (P) of around 75% and a recall (R) of around 60% for the locally fine-tuned building model. This performance was achieved in three out of six testing sites and is considered the lowest limit needed for practical utility of RSML datasets, according to the literature. In contrast, comparable results were obtained in only one out of six sites for the Google and Microsoft datasets. Our locally fine-tuned road model achieved moderate success, meeting the minimum usability threshold in four out of six sites. In contrast, the Microsoft dataset performed well on all sites. In summary, our findings suggest improved performance ","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"9 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Interactive Mixed Reality Methods for Visualization of Underground Utilities 地下公用设施可视化的交互式混合现实方法

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-07-03 DOI: 10.1007/s41064-024-00295-x

Mohamed Zahlan Abdul Muthalif, Davood Shojaei, Kourosh Khoshelham

This research aims to overcome the difficulties associated with visualizing underground utilities by proposing six interactive visualization methods that utilize Mixed Reality (MR) technology. By leveraging MR technology, which enables the seamless integration of virtual and real-world content, a more immersive and authentic experience is possible. The study evaluates the proposed visualization methods based on scene complexity, parallax effect, real-world occlusion, depth perception, and overall effectiveness, aiming to identify the most effective methods for addressing visual perceptual challenges in the context of underground utilities. The findings suggest that certain MR visualization methods are more effective than others in mitigating the challenges of visualizing underground utilities. The research highlights the potential of these methods, and feedback from industry professionals suggests that each method can be valuable in specific contexts.

这项研究旨在利用混合现实（MR）技术提出六种交互式可视化方法，从而克服地下公用设施可视化方面的困难。混合现实技术能将虚拟内容与现实内容无缝结合，通过利用混合现实技术，可以获得更加身临其境的真实体验。本研究根据场景复杂度、视差效应、真实世界遮挡、深度感知和整体效果对所提出的可视化方法进行了评估，旨在找出最有效的方法，以解决地下公用事业背景下的视觉感知挑战。研究结果表明，某些磁共振可视化方法在减轻地下公用设施可视化挑战方面比其他方法更有效。研究强调了这些方法的潜力，业内专业人士的反馈表明，每种方法在特定情况下都很有价值。

引用次数: 0

Satellite-based Bathymetry Supported by Extracted Coastlines 以提取的海岸线为支持的卫星测深法

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-07-02 DOI: 10.1007/s41064-024-00298-8

Hakan Uzakara, Nusret Demir, Serkan Karakış

Bathymetry is the measurement of ocean depths using a variety of techniques. Available techniques include sonar systems, light detection and ranging (LIDAR), and remote sensing systems. Acoustic systems, also known as LIDAR, are inefficient in terms of both time and money. This study applied remote sensing techniques to reduce both time and cost. The objective of this study is to use freely accessible Sentinel‑2 multispectral images to extract the depth information. Temporal variation was minimized by comparing the histograms of satellite images obtained over four consecutive months. The sea topography is determined using regression analysis, utilizing samples from reference data. The reference data is adjusted with the changes in shorelines, as the alteration of shorelines serves as a parameter for these modifications. Using the regression coefficients, analyses were conducted in regions with undetermined depths. The bathymetry maps were evaluated against a reference dataset and improved by incorporating shorelines. The analyses were carried out individually over four months, and the derived bathymetric data showed significant monthly average and monthly shoreline changes. The employed methodology offers an alternative approach for bathymetry studies that require temporal resolution when the available reference bathymetric data is insufficient.

测深是利用各种技术测量海洋深度。现有技术包括声纳系统、光探测和测距（激光雷达）以及遥感系统。声学系统（也称为激光雷达）在时间和资金方面都效率低下。本研究采用遥感技术来减少时间和成本。本研究的目的是利用可免费获取的哨兵-2 多光谱图像来提取深度信息。通过比较连续四个月获得的卫星图像直方图，将时间变化降至最低。利用参考数据的样本，通过回归分析确定海洋地形。参考数据根据海岸线的变化进行调整，因为海岸线的变化是这些变化的参数。利用回归系数，对深度未定的区域进行分析。根据参考数据集对水深测量图进行了评估，并通过纳入海岸线进行了改进。分析分别在四个月内进行，得出的水深数据显示出显著的月平均值和月海岸线变化。在现有参考测深数据不足的情况下，所采用的方法为需要时间分辨率的测深研究提供了另一种方法。

{"title":"Satellite-based Bathymetry Supported by Extracted Coastlines","authors":"Hakan Uzakara, Nusret Demir, Serkan Karakış","doi":"10.1007/s41064-024-00298-8","DOIUrl":"https://doi.org/10.1007/s41064-024-00298-8","url":null,"abstract":"Bathymetry is the measurement of ocean depths using a variety of techniques. Available techniques include sonar systems, light detection and ranging (LIDAR), and remote sensing systems. Acoustic systems, also known as LIDAR, are inefficient in terms of both time and money. This study applied remote sensing techniques to reduce both time and cost. The objective of this study is to use freely accessible Sentinel‑2 multispectral images to extract the depth information. Temporal variation was minimized by comparing the histograms of satellite images obtained over four consecutive months. The sea topography is determined using regression analysis, utilizing samples from reference data. The reference data is adjusted with the changes in shorelines, as the alteration of shorelines serves as a parameter for these modifications. Using the regression coefficients, analyses were conducted in regions with undetermined depths. The bathymetry maps were evaluated against a reference dataset and improved by incorporating shorelines. The analyses were carried out individually over four months, and the derived bathymetric data showed significant monthly average and monthly shoreline changes. The employed methodology offers an alternative approach for bathymetry studies that require temporal resolution when the available reference bathymetric data is insufficient.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"26 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141527337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Coastal Shoreline Change in Eastern Indian Metropolises 印度东部大都市的海岸线变化

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-04-29 DOI: 10.1007/s41064-024-00286-y

Vijay K. Kannaujiya, Abhishek K. Rai, Sukanta Malakar

The coastal regions of India have a high population density and are ecologically productive. However, they are also susceptible to both human activity and natural calamities, which can lead to erosion and accretion. As part of the sustainable management of coastal zones, these threats have taken precedence in evaluating shoreline dynamicity. This study demonstrated the effectiveness of integrating remote sensing and geographic information systems for comprehensive long-term coastal change analyses. The analysis reveals that the mean erosion rate along the Chennai coast ranges from −0.2 to −2.5 m/year. Accretion is also recorded along certain parts of the Chennai coast, with rates ranging from 1 to 4.6 m/year. The Vishakhapatnam shoreline has a consistent pattern of both erosion and accretion, with erosion rates ranging from −0.1 to −6.8 m/year and accretion from 0.2 to 5 m/year. However, most of the Puri coast exhibits an accretion pattern, with values ranging from approximately 0.1 to 3.22 m/year. The fluctuations in shorelines of these three metropolises are a matter of great concern, given that these coastal cities play a substantial part in India’s economic and cultural endeavors. The ongoing occurrence of climate change and global warming has led to an elevation in the worldwide sea level, along with a heightened intensity and frequency of extreme occurrences like tropical cyclones in the Bay of Bengal, where these three coasts are situated. The coastlines of these urban areas may experience alterations due to natural phenomena like rising sea levels and tropical cyclones, as well as a diverse array of human activity. This study may help to facilitate the formulation of suitable management strategies and regulations for the coastal areas of Vishakhapatnam, Puri, Chennai, and other Indian coastal places that have similar physical attributes.

印度沿海地区人口密度高，生态资源丰富。然而，它们也容易受到人类活动和自然灾害的影响，从而导致侵蚀和增生。作为沿海地区可持续管理的一部分，这些威胁已成为评估海岸线动态的优先考虑因素。这项研究表明，将遥感和地理信息系统结合起来进行长期海岸变化综合分析是有效的。分析表明，钦奈海岸的平均侵蚀速率为-0.2 至-2.5 米/年。钦奈海岸的某些地段也有增生现象，增生速率为 1 至 4.6 米/年。维沙卡帕特南海岸线的侵蚀和增生模式一致，侵蚀率为每年-0.1 至-6.8 米，增生率为每年 0.2 至 5 米。不过，普里海岸的大部分地区呈现出增生模式，增生值约为 0.1 至 3.22 米/年。由于这三个沿海城市在印度的经济和文化事业中发挥着重要作用，因此它们的海岸线波动引起了人们的极大关注。持续的气候变化和全球变暖导致全球海平面上升，这三个海岸所在的孟加拉湾热带气旋等极端事件的强度和频率也随之增加。由于海平面上升和热带气旋等自然现象以及各种人类活动，这些城市地区的海岸线可能会发生变化。这项研究可能有助于为维沙卡帕特南、普里、钦奈和其他具有类似物理属性的印度沿海地区制定合适的管理战略和法规。

{"title":"Coastal Shoreline Change in Eastern Indian Metropolises","authors":"Vijay K. Kannaujiya, Abhishek K. Rai, Sukanta Malakar","doi":"10.1007/s41064-024-00286-y","DOIUrl":"https://doi.org/10.1007/s41064-024-00286-y","url":null,"abstract":"The coastal regions of India have a high population density and are ecologically productive. However, they are also susceptible to both human activity and natural calamities, which can lead to erosion and accretion. As part of the sustainable management of coastal zones, these threats have taken precedence in evaluating shoreline dynamicity. This study demonstrated the effectiveness of integrating remote sensing and geographic information systems for comprehensive long-term coastal change analyses. The analysis reveals that the mean erosion rate along the Chennai coast ranges from −0.2 to −2.5 m/year. Accretion is also recorded along certain parts of the Chennai coast, with rates ranging from 1 to 4.6 m/year. The Vishakhapatnam shoreline has a consistent pattern of both erosion and accretion, with erosion rates ranging from −0.1 to −6.8 m/year and accretion from 0.2 to 5 m/year. However, most of the Puri coast exhibits an accretion pattern, with values ranging from approximately 0.1 to 3.22 m/year. The fluctuations in shorelines of these three metropolises are a matter of great concern, given that these coastal cities play a substantial part in India’s economic and cultural endeavors. The ongoing occurrence of climate change and global warming has led to an elevation in the worldwide sea level, along with a heightened intensity and frequency of extreme occurrences like tropical cyclones in the Bay of Bengal, where these three coasts are situated. The coastlines of these urban areas may experience alterations due to natural phenomena like rising sea levels and tropical cyclones, as well as a diverse array of human activity. This study may help to facilitate the formulation of suitable management strategies and regulations for the coastal areas of Vishakhapatnam, Puri, Chennai, and other Indian coastal places that have similar physical attributes.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"6 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140812972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Evaluating Sea Level Rise Impacts on the Southeastern Türkiye Coastline: a Coastal Vulnerability Perspective 评估海平面上升对土耳其东南部海岸线的影响：沿海脆弱性视角

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-04-19 DOI: 10.1007/s41064-024-00284-0

Fahri Aykut, Devrim Tezcan

Coastal areas are inherently sensitive and dynamic, susceptible to natural forces like waves, winds, currents, and tides. Human activities further accelerate coastal changes, while climate change and global sea level rise add to the challenges. Recognizing and safeguarding these coasts, vital for both socioeconomic and environmental reasons, becomes imperative. The objective of this study is to categorize the coasts of the Mersin and İskenderun bays along the southeastern coast of Türkiye based on their vulnerability to natural forces and human-induced factors using the coastal vulnerability index (CVI) method. The study area encompasses approximately 520 km of coastline. The coastal vulnerability analysis reveals that the coastal zone comprises various levels of vulnerability along the total coastline: 24.7% (128 km) is categorized as very high vulnerability, 27.4% (142 km) as high vulnerability, 23.7% (123 km) as moderate vulnerability, and 24.3% (126 km) as low vulnerability. Key parameters influencing vulnerability include coastal slope, land use, and population density. High and very high vulnerability are particularly prominent in coastal plains characterized by gentle slopes, weak geological and geomorphological features, and significant socioeconomic value.

沿海地区本身具有敏感性和动态性，易受海浪、风、海流和潮汐等自然力量的影响。人类活动进一步加速了沿海地区的变化，而气候变化和全球海平面上升则加剧了这些挑战。认识和保护这些对社会经济和环境都至关重要的海岸已成为当务之急。本研究的目的是利用海岸脆弱性指数 (CVI) 方法，根据这些海岸在自然力和人为因素面前的脆弱性，对土耳其东南沿海的梅尔辛海湾和伊斯肯德伦海湾的海岸进行分类。研究区域的海岸线长约 520 公里。海岸脆弱性分析表明，沿海地区的海岸线总长度存在不同程度的脆弱性：24.7%（128 公里）被归类为极高脆弱性，27.4%（142 公里）被归类为高脆弱性，23.7%（123 公里）被归类为中等脆弱性，24.3%（126 公里）被归类为低脆弱性。影响脆弱性的主要参数包括海岸坡度、土地利用和人口密度。高脆弱性和极高脆弱性在沿海平原尤为突出，这些平原的特点是坡度较缓，地质和地貌特征较弱，具有重要的社会经济价值。

{"title":"Evaluating Sea Level Rise Impacts on the Southeastern Türkiye Coastline: a Coastal Vulnerability Perspective","authors":"Fahri Aykut, Devrim Tezcan","doi":"10.1007/s41064-024-00284-0","DOIUrl":"https://doi.org/10.1007/s41064-024-00284-0","url":null,"abstract":"Coastal areas are inherently sensitive and dynamic, susceptible to natural forces like waves, winds, currents, and tides. Human activities further accelerate coastal changes, while climate change and global sea level rise add to the challenges. Recognizing and safeguarding these coasts, vital for both socioeconomic and environmental reasons, becomes imperative. The objective of this study is to categorize the coasts of the Mersin and İskenderun bays along the southeastern coast of Türkiye based on their vulnerability to natural forces and human-induced factors using the coastal vulnerability index (CVI) method. The study area encompasses approximately 520 km of coastline. The coastal vulnerability analysis reveals that the coastal zone comprises various levels of vulnerability along the total coastline: 24.7% (128 km) is categorized as very high vulnerability, 27.4% (142 km) as high vulnerability, 23.7% (123 km) as moderate vulnerability, and 24.3% (126 km) as low vulnerability. Key parameters influencing vulnerability include coastal slope, land use, and population density. High and very high vulnerability are particularly prominent in coastal plains characterized by gentle slopes, weak geological and geomorphological features, and significant socioeconomic value.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"51 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140627940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Building a Fully-Automatized Active Learning Framework for the Semantic Segmentation of Geospatial 3D Point Clouds 为地理空间三维点云的语义分割构建完全自动化的主动学习框架

IF 4.1 4区地球科学 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science

Pub Date : 2024-04-03 DOI: 10.1007/s41064-024-00281-3

Michael Kölle, Volker Walter, Uwe Sörgel

In recent years, significant progress has been made in developing supervised Machine Learning (ML) systems like Convolutional Neural Networks. However, it’s crucial to recognize that the performance of these systems heavily relies on the quality of labeled training data. To address this, we propose a shift in focus towards developing sustainable methods of acquiring such data instead of solely building new classifiers in the ever-evolving ML field. Specifically, in the geospatial domain, the process of generating training data for ML systems has been largely neglected in research. Traditionally, experts have been burdened with the laborious task of labeling, which is not only time-consuming but also inefficient. In our system for the semantic interpretation of Airborne Laser Scanning point clouds, we break with this convention and completely remove labeling obligations from domain experts who have completed special training in geosciences and instead adopt a hybrid intelligence approach. This involves active and iterative collaboration between the ML model and humans through Active Learning, which identifies the most critical samples justifying manual inspection. Only these samples (typically (ll 1{%}) of Passive Learning training points) are subject to human annotation. To carry out this annotation, we choose to outsource the task to a large group of non-specialists, referred to as the crowd, which comes with the inherent challenge of guiding those inexperienced annotators (i.e., “short-term employees”) to still produce labels of sufficient quality. However, we acknowledge that attracting enough volunteers for crowdsourcing campaigns can be challenging due to the tedious nature of labeling tasks. To address this, we propose employing paid crowdsourcing and providing monetary incentives to crowdworkers. This approach ensures access to a vast pool of prospective workers through respective platforms, ensuring timely completion of jobs. Effectively, crowdworkers become human processing units in our hybrid intelligence system mirroring the functionality of electronic processing units.

近年来，卷积神经网络等有监督机器学习（ML）系统的开发取得了重大进展。然而，我们必须认识到，这些系统的性能在很大程度上依赖于标注训练数据的质量。为了解决这个问题，我们建议将重点转移到开发获取此类数据的可持续方法上，而不是仅仅在不断发展的 ML 领域构建新的分类器。具体来说，在地理空间领域，为 ML 系统生成训练数据的过程在很大程度上被研究人员所忽视。传统上，专家们一直承担着费力的标注任务，这不仅耗时，而且效率低下。在我们的机载激光扫描点云语义解释系统中，我们打破了这一传统，完全免除了已完成地理科学专门培训的领域专家的标注义务，转而采用混合智能方法。这包括通过主动学习（Active Learning）在人工智能模型和人类之间进行积极的迭代协作，从而识别出最关键的样本，证明人工检查是合理的。只有这些样本（通常是被动学习训练点的）才需要人工标注。为了进行注释，我们选择将这项任务外包给一大批非专业人员，也就是我们所说的 "群众"，这就带来了一个固有的挑战，那就是如何指导这些缺乏经验的注释者（即 "短期雇员"），使他们仍然能够生成质量足够高的标签。不过，我们也承认，由于标注任务的乏味性，吸引足够的志愿者参与众包活动可能具有挑战性。为了解决这个问题，我们建议采用有偿众包，并为众包者提供金钱奖励。这种方法可以确保通过各自的平台接触到大量的潜在工作者，从而确保及时完成工作。实际上，在我们的混合智能系统中，众包工成为了人类处理单元，与电子处理单元的功能如出一辙。

{"title":"Building a Fully-Automatized Active Learning Framework for the Semantic Segmentation of Geospatial 3D Point Clouds","authors":"Michael Kölle, Volker Walter, Uwe Sörgel","doi":"10.1007/s41064-024-00281-3","DOIUrl":"https://doi.org/10.1007/s41064-024-00281-3","url":null,"abstract":"In recent years, significant progress has been made in developing supervised Machine Learning (ML) systems like Convolutional Neural Networks. However, it’s crucial to recognize that the performance of these systems heavily relies on the quality of labeled training data. To address this, we propose a shift in focus towards developing sustainable methods of acquiring such data instead of solely building new classifiers in the ever-evolving ML field. Specifically, in the geospatial domain, the process of generating training data for ML systems has been largely neglected in research. Traditionally, experts have been burdened with the laborious task of labeling, which is not only time-consuming but also inefficient. In our system for the semantic interpretation of Airborne Laser Scanning point clouds, we break with this convention and completely remove labeling obligations from domain experts who have completed special training in geosciences and instead adopt a hybrid intelligence approach. This involves active and iterative collaboration between the ML model and humans through Active Learning, which identifies the most critical samples justifying manual inspection. Only these samples (typically (ll 1{%}) of Passive Learning training points) are subject to human annotation. To carry out this annotation, we choose to outsource the task to a large group of non-specialists, referred to as the crowd, which comes with the inherent challenge of guiding those inexperienced annotators (i.e., “short-term employees”) to still produce labels of sufficient quality. However, we acknowledge that attracting enough volunteers for crowdsourcing campaigns can be challenging due to the tedious nature of labeling tasks. To address this, we propose employing paid crowdsourcing and providing monetary incentives to crowdworkers. This approach ensures access to a vast pool of prospective workers through respective platforms, ensuring timely completion of jobs. Effectively, crowdworkers become human processing units in our hybrid intelligence system mirroring the functionality of electronic processing units.","PeriodicalId":56035,"journal":{"name":"PFG-Journal of Photogrammetry Remote Sensing and Geoinformation Science","volume":"18 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140580122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0