基于迁移学习和性能增强的建筑遗产点云深度语义分割技术

IF 1.6 0 ARCHAEOLOGY Virtual Archaeology Review Pub Date : 2021-05-05 DOI:10.4995/VAR.2021.15318

F. Matrone, M. Martini

{"title":"基于迁移学习和性能增强的建筑遗产点云深度语义分割技术","authors":"F. Matrone, M. Martini","doi":"10.4995/VAR.2021.15318","DOIUrl":null,"url":null,"abstract":"The growing availability of three-dimensional (3D) data, such as point clouds, coming from Light Detection and Ranging (LiDAR), Mobile Mapping Systems (MMSs) or Unmanned Aerial Vehicles (UAVs), provides the opportunity to rapidly generate 3D models to support the restoration, conservation, and safeguarding activities of cultural heritage (CH). The so-called scan-to-BIM process can, in fact, benefit from such data, and they can themselves be a source for further analyses or activities on the archaeological and built heritage. There are several ways to exploit this type of data, such as Historic Building Information Modelling (HBIM), mesh creation, rasterisation, classification, and semantic segmentation. The latter, referring to point clouds, is a trending topic not only in the CH domain but also in other fields like autonomous navigation, medicine or retail. Precisely in these sectors, the task of semantic segmentation has been mainly exploited and developed with artificial intelligence techniques. In particular, machine learning (ML) algorithms, and their deep learning (DL) subset, are increasingly applied and have established a solid state-of-the-art in the last half-decade. However, applications of DL techniques on heritage point clouds are still scarce; therefore, we propose to tackle this framework within the built heritage field. Starting from some previous tests with the Dynamic Graph Convolutional Neural Network (DGCNN), in this contribution close attention is paid to: i) the investigation of fine-tuned models, used as a transfer learning technique, ii) the combination of external classifiers, such as Random Forest (RF), with the artificial neural network, and iii) the evaluation of the data augmentation results for the domain-specific ArCH dataset. Finally, after taking into account the main advantages and criticalities, considerations are made on the possibility to profit by this methodology also for non-programming or domain experts.Highlights:Semantic segmentation of built heritage point clouds through deep neural networks can provide performances comparable to those of more consolidated state-of-the-art ML classifiers.Transfer learning approaches, as fine-tuning, can considerably reduce computational time also for CH domain-specific datasets, as well as improve metrics for some challenging categories (i.e. windows or mouldings).Data augmentation techniques do not significantly improve overall performances.","PeriodicalId":44206,"journal":{"name":"Virtual Archaeology Review","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2021-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Transfer learning and performance enhancement techniques for deep semantic segmentation of built heritage point clouds\",\"authors\":\"F. Matrone, M. Martini\",\"doi\":\"10.4995/VAR.2021.15318\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The growing availability of three-dimensional (3D) data, such as point clouds, coming from Light Detection and Ranging (LiDAR), Mobile Mapping Systems (MMSs) or Unmanned Aerial Vehicles (UAVs), provides the opportunity to rapidly generate 3D models to support the restoration, conservation, and safeguarding activities of cultural heritage (CH). The so-called scan-to-BIM process can, in fact, benefit from such data, and they can themselves be a source for further analyses or activities on the archaeological and built heritage. There are several ways to exploit this type of data, such as Historic Building Information Modelling (HBIM), mesh creation, rasterisation, classification, and semantic segmentation. The latter, referring to point clouds, is a trending topic not only in the CH domain but also in other fields like autonomous navigation, medicine or retail. Precisely in these sectors, the task of semantic segmentation has been mainly exploited and developed with artificial intelligence techniques. In particular, machine learning (ML) algorithms, and their deep learning (DL) subset, are increasingly applied and have established a solid state-of-the-art in the last half-decade. However, applications of DL techniques on heritage point clouds are still scarce; therefore, we propose to tackle this framework within the built heritage field. Starting from some previous tests with the Dynamic Graph Convolutional Neural Network (DGCNN), in this contribution close attention is paid to: i) the investigation of fine-tuned models, used as a transfer learning technique, ii) the combination of external classifiers, such as Random Forest (RF), with the artificial neural network, and iii) the evaluation of the data augmentation results for the domain-specific ArCH dataset. Finally, after taking into account the main advantages and criticalities, considerations are made on the possibility to profit by this methodology also for non-programming or domain experts.Highlights:Semantic segmentation of built heritage point clouds through deep neural networks can provide performances comparable to those of more consolidated state-of-the-art ML classifiers.Transfer learning approaches, as fine-tuning, can considerably reduce computational time also for CH domain-specific datasets, as well as improve metrics for some challenging categories (i.e. windows or mouldings).Data augmentation techniques do not significantly improve overall performances.\",\"PeriodicalId\":44206,\"journal\":{\"name\":\"Virtual Archaeology Review\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2021-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virtual Archaeology Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4995/VAR.2021.15318\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ARCHAEOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Archaeology Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4995/VAR.2021.15318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ARCHAEOLOGY","Score":null,"Total":0}

引用次数: 3

摘要

来自光探测和测距(LiDAR)、移动测绘系统(mss)或无人机(uav)的三维(3D)数据(如点云)的可用性日益增加，为快速生成3D模型提供了机会，以支持文化遗产(CH)的修复、保护和保护活动。事实上，所谓的扫描到bim过程可以从这些数据中受益，并且它们本身可以成为进一步分析或考古和建筑遗产活动的来源。有几种方法可以利用这种类型的数据，如历史建筑信息建模(HBIM)、网格创建、光栅化、分类和语义分割。后者指的是点云，不仅在智能智能领域，而且在自主导航、医疗或零售等其他领域都是一个热门话题。正是在这些领域，语义分割的任务主要是利用人工智能技术来开发和发展的。特别是，机器学习(ML)算法及其深度学习(DL)子集越来越多地得到应用，并在过去五年中建立了坚实的先进技术。然而，深度学习技术在遗产点云上的应用仍然很少;因此，我们建议在建筑遗产领域内解决这个框架。从之前使用动态图卷积神经网络(DGCNN)进行的一些测试开始，在此贡献中密切关注:i)作为迁移学习技术的微调模型的研究，ii)外部分类器(如随机森林(RF))与人工神经网络的组合，以及iii)评估特定领域ArCH数据集的数据增强结果。最后，在考虑了主要的优点和缺点之后，对非编程或领域专家也可以通过这种方法获利的可能性进行了考虑。亮点:通过深度神经网络对构建的遗产点云进行语义分割，可以提供与更整合的最先进的ML分类器相当的性能。迁移学习方法，作为微调，也可以大大减少CH领域特定数据集的计算时间，以及改进一些具有挑战性的类别(即窗口或造型)的指标。数据增强技术并不能显著提高整体性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Transfer learning and performance enhancement techniques for deep semantic segmentation of built heritage point clouds

The growing availability of three-dimensional (3D) data, such as point clouds, coming from Light Detection and Ranging (LiDAR), Mobile Mapping Systems (MMSs) or Unmanned Aerial Vehicles (UAVs), provides the opportunity to rapidly generate 3D models to support the restoration, conservation, and safeguarding activities of cultural heritage (CH). The so-called scan-to-BIM process can, in fact, benefit from such data, and they can themselves be a source for further analyses or activities on the archaeological and built heritage. There are several ways to exploit this type of data, such as Historic Building Information Modelling (HBIM), mesh creation, rasterisation, classification, and semantic segmentation. The latter, referring to point clouds, is a trending topic not only in the CH domain but also in other fields like autonomous navigation, medicine or retail. Precisely in these sectors, the task of semantic segmentation has been mainly exploited and developed with artificial intelligence techniques. In particular, machine learning (ML) algorithms, and their deep learning (DL) subset, are increasingly applied and have established a solid state-of-the-art in the last half-decade. However, applications of DL techniques on heritage point clouds are still scarce; therefore, we propose to tackle this framework within the built heritage field. Starting from some previous tests with the Dynamic Graph Convolutional Neural Network (DGCNN), in this contribution close attention is paid to: i) the investigation of fine-tuned models, used as a transfer learning technique, ii) the combination of external classifiers, such as Random Forest (RF), with the artificial neural network, and iii) the evaluation of the data augmentation results for the domain-specific ArCH dataset. Finally, after taking into account the main advantages and criticalities, considerations are made on the possibility to profit by this methodology also for non-programming or domain experts.Highlights:Semantic segmentation of built heritage point clouds through deep neural networks can provide performances comparable to those of more consolidated state-of-the-art ML classifiers.Transfer learning approaches, as fine-tuning, can considerably reduce computational time also for CH domain-specific datasets, as well as improve metrics for some challenging categories (i.e. windows or mouldings).Data augmentation techniques do not significantly improve overall performances.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Virtual Archaeology Review ARCHAEOLOGY-

CiteScore

5.20

自引率

21.70%

发文量

审稿时长

20 weeks

期刊介绍： Virtual Archaeology Review (VAR) aims the publication of original papers, interdisciplinary reviews and essays on the new discipline of virtual archaeology, which is continuously evolving and currently on its way to achieve scientific consolidation. In fact, Virtual Archaeology deals with the digital representation of historical heritage objects, buildings and landscapes through 3D acquisition, digital recording and interactive and immersive tools for analysis, interpretation, dissemination and communication purposes by means of multidimensional geometric properties and visual computational modelling. VAR will publish full-length original papers which reflect both current research and practice throughout the world, in order to contribute to the advancement of the new field of virtual archaeology, ranging from new ways of digital recording and documentation, advanced reconstruction and 3D modelling up to cyber-archaeology, virtual exhibitions and serious gaming. Thus acceptable material may emerge from interesting applications as well as from original developments or research. OBJECTIVES: - OFFER researchers working in the field of virtual archaeology and cultural heritage an appropriate editorial frame to publish state-of-the-art research works, as well as theoretical and methodological contributions. - GATHER virtual archaeology progresses achieved as a new international scientific discipline. - ENCOURAGE the publication of the latest, state-of-the-art, significant research and meaningful applications in the field of virtual archaeology. - ENHANCE international connections in the field of virtual archaeology and cultural heritage.