Learning point cloud context information based on 3D transformer for more accurate and efficient classification

The Photogrammetric Record Pub Date : 2023-12-10 DOI:10.1111/phor.12469

Yiping Chen, Shuai Zhang, Weisheng Lin, Shuhang Zhang, Wuming Zhang

{"title":"Learning point cloud context information based on 3D transformer for more accurate and efficient classification","authors":"Yiping Chen, Shuai Zhang, Weisheng Lin, Shuhang Zhang, Wuming Zhang","doi":"10.1111/phor.12469","DOIUrl":null,"url":null,"abstract":"The point cloud semantic understanding task has made remarkable progress along with the development of 3D deep learning. However, aggregating spatial information to improve the local feature learning capability of the network remains a major challenge. Many methods have been used for improving local information learning, such as constructing a multi-area structure for capturing different area information. However, it will lose some local information due to the independent learning point feature. To solve this problem, a new network is proposed that considers the importance of the differences between points in the neighbourhood. Capturing local feature information can be enhanced by highlighting the different feature importance of the point cloud in the neighbourhood. First, T-Net is constructed to learn the point cloud transformation matrix for point cloud disorder. Second, transformer is used to improve the problem of local information loss due to the independence of each point in the neighbourhood. The experimental results show that 92.2% accuracy overall was achieved on the ModelNet40 dataset and 93.8% accuracy overall was achieved on the ModelNet10 dataset.","PeriodicalId":22881,"journal":{"name":"The Photogrammetric Record","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Photogrammetric Record","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/phor.12469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The point cloud semantic understanding task has made remarkable progress along with the development of 3D deep learning. However, aggregating spatial information to improve the local feature learning capability of the network remains a major challenge. Many methods have been used for improving local information learning, such as constructing a multi-area structure for capturing different area information. However, it will lose some local information due to the independent learning point feature. To solve this problem, a new network is proposed that considers the importance of the differences between points in the neighbourhood. Capturing local feature information can be enhanced by highlighting the different feature importance of the point cloud in the neighbourhood. First, T-Net is constructed to learn the point cloud transformation matrix for point cloud disorder. Second, transformer is used to improve the problem of local information loss due to the independence of each point in the neighbourhood. The experimental results show that 92.2% accuracy overall was achieved on the ModelNet40 dataset and 93.8% accuracy overall was achieved on the ModelNet10 dataset.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于 3D 变换器学习点云上下文信息，实现更准确、更高效的分类

随着三维深度学习的发展，点云语义理解任务取得了显著进展。然而，聚合空间信息以提高网络的局部特征学习能力仍是一大挑战。为了提高局部信息的学习能力，人们采用了很多方法，例如构建多区域结构来捕捉不同的区域信息。但是，由于学习点特征的独立性，它会丢失一些局部信息。为了解决这个问题，我们提出了一种新的网络，它考虑了邻域中点之间差异的重要性。通过突出邻域中点云的不同特征重要性，可以增强对局部特征信息的捕捉。首先，构建 T-Net 来学习点云变换矩阵，以解决点云紊乱问题。其次，利用变换器来改善由于邻域中各点的独立性而导致的局部信息丢失问题。实验结果表明，ModelNet40 数据集的总体准确率为 92.2%，ModelNet10 数据集的总体准确率为 93.8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

The Photogrammetric Record

自引率

0.00%

发文量