Deep learning with simulated laser scanning data for 3D point cloud classification

IF 10.6 1区地球科学 Q1 GEOGRAPHY, PHYSICAL ISPRS Journal of Photogrammetry and Remote Sensing Pub Date : 2024-07-13 DOI:10.1016/j.isprsjprs.2024.06.018

Alberto M. Esmorís , Hannah Weiser , Lukas Winiwarter , Jose C. Cabaleiro , Bernhard Höfle

{"title":"Deep learning with simulated laser scanning data for 3D point cloud classification","authors":"Alberto M. Esmorís , Hannah Weiser , Lukas Winiwarter , Jose C. Cabaleiro , Bernhard Höfle","doi":"10.1016/j.isprsjprs.2024.06.018","DOIUrl":null,"url":null,"abstract":"<div><p>Laser scanning is an active remote sensing technique applied in many disciplines to acquire state-of-the-art spatial measurements. Semantic labeling is often necessary to extract information from the raw point cloud. Deep learning methods constitute a data-hungry solution for the semantic segmentation of point clouds. In this work, we investigate the use of simulated laser scanning for training deep learning models, which are applied to real data subsequently. We show that training a deep learning model purely on virtual laser scanning data can produce results comparable to models trained on real data when evaluated on real data. For leaf-wood segmentation of trees, using the KPConv model trained with virtual data achieves 93.7% overall accuracy, while the model trained on real data reaches 94.7% overall accuracy. In urban contexts, a KPConv model trained on virtual data achieves 74.1% overall accuracy on real validation data, while the model trained on real data achieves 82.4%. Our models outperform the state-of-the-art model FSCT in terms of generalization to unseen real data as well as a baseline model trained on points randomly sampled from the tree mesh surface. From our results, we conclude that the combination of laser scanning simulation and deep learning is a cost-effective alternative to real data acquisition and manual labeling in the domain of geospatial point cloud analysis. The strengths of this approach are that (a) a large amount of diverse laser scanning training data can be generated quickly and without the need for expensive equipment, (b) the simulation configurations can be adapted so that the virtual training data have similar characteristics to the targeted real data, and (c) the whole workflow can be automated through procedural scene generation.</p></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":null,"pages":null},"PeriodicalIF":10.6000,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0924271624002569/pdfft?md5=85e93537aadaaaa88cbcdae2d9791d9d&pid=1-s2.0-S0924271624002569-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271624002569","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Laser scanning is an active remote sensing technique applied in many disciplines to acquire state-of-the-art spatial measurements. Semantic labeling is often necessary to extract information from the raw point cloud. Deep learning methods constitute a data-hungry solution for the semantic segmentation of point clouds. In this work, we investigate the use of simulated laser scanning for training deep learning models, which are applied to real data subsequently. We show that training a deep learning model purely on virtual laser scanning data can produce results comparable to models trained on real data when evaluated on real data. For leaf-wood segmentation of trees, using the KPConv model trained with virtual data achieves 93.7% overall accuracy, while the model trained on real data reaches 94.7% overall accuracy. In urban contexts, a KPConv model trained on virtual data achieves 74.1% overall accuracy on real validation data, while the model trained on real data achieves 82.4%. Our models outperform the state-of-the-art model FSCT in terms of generalization to unseen real data as well as a baseline model trained on points randomly sampled from the tree mesh surface. From our results, we conclude that the combination of laser scanning simulation and deep learning is a cost-effective alternative to real data acquisition and manual labeling in the domain of geospatial point cloud analysis. The strengths of this approach are that (a) a large amount of diverse laser scanning training data can be generated quickly and without the need for expensive equipment, (b) the simulation configurations can be adapted so that the virtual training data have similar characteristics to the targeted real data, and (c) the whole workflow can be automated through procedural scene generation.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用模拟激光扫描数据进行三维点云分类的深度学习

激光扫描是一种主动遥感技术，应用于许多学科，以获取最先进的空间测量数据。要从原始点云中提取信息，通常需要进行语义标注。深度学习方法是一种对数据要求极高的点云语义分割解决方案。在这项工作中，我们研究了如何利用模拟激光扫描来训练深度学习模型，并随后将其应用于真实数据。我们的研究表明，在真实数据上进行评估时，纯粹在虚拟激光扫描数据上训练深度学习模型所产生的结果可与在真实数据上训练的模型相媲美。在树木的叶木分割方面，使用虚拟数据训练的 KPConv 模型达到了 93.7% 的总体准确率，而使用真实数据训练的模型则达到了 94.7% 的总体准确率。在城市环境中，使用虚拟数据训练的 KPConv 模型在真实验证数据上的总体准确率为 74.1%，而使用真实数据训练的模型则达到了 82.4%。在对未见真实数据的泛化方面，我们的模型优于最先进的模型 FSCT，也优于根据从树网格表面随机取样的点训练的基线模型。根据我们的研究结果，我们得出结论：在地理空间点云分析领域，激光扫描模拟与深度学习的结合是一种经济有效的方法，可以替代真实数据采集和人工标注。这种方法的优势在于：(a) 可以快速生成大量不同的激光扫描训练数据，无需昂贵的设备；(b) 可以调整模拟配置，使虚拟训练数据具有与目标真实数据相似的特征；(c) 可以通过程序化场景生成实现整个工作流程的自动化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术

CiteScore

21.00

自引率

6.30%

发文量

273

审稿时长

40 days

期刊介绍： The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.