DSANet: Dynamic and Structure-Aware GCN for Sparse and Incomplete Point Cloud Learning.

IF 10.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE transactions on neural networks and learning systems Pub Date : 2024-08-27 DOI:10.1109/TNNLS.2024.3439706

Yushi Li, George Baciu, Rong Chen, Chenhui Li, Hao Wang, Yushan Pan, Weiping Ding

{"title":"DSANet: Dynamic and Structure-Aware GCN for Sparse and Incomplete Point Cloud Learning.","authors":"Yushi Li, George Baciu, Rong Chen, Chenhui Li, Hao Wang, Yushan Pan, Weiping Ding","doi":"10.1109/TNNLS.2024.3439706","DOIUrl":null,"url":null,"abstract":"<p><p>Learning 3-D structures from incomplete point clouds with extreme sparsity and random distributions is a challenge since it is difficult to infer topological connectivity and structural details from fragmentary representations. Missing large portions of informative structures further aggravates this problem. To overcome this, a novel graph convolutional network (GCN) called dynamic and structure-aware NETwork (DSANet) is presented in this article. This framework is formulated based on a pyramidic auto-encoder (AE) architecture to address accurate structure reconstruction on the sparse and incomplete point clouds. A PointNet-like neural network is applied as the encoder to efficiently aggregate the global representations of coarse point clouds. On the decoder side, we design a dynamic graph learning module with a structure-aware attention (SAA) to take advantage of the topology relationships maintained in the dynamic latent graph. Relying on gradually unfolding the extracted representation into a sequence of graphs, DSANet is able to reconstruct complicated point clouds with rich and descriptive details. To associate analogous structure awareness with semantic estimation, we further propose a mechanism, called structure similarity assessment (SSA). This method allows our model to surmise semantic homogeneity in an unsupervised manner. Finally, we optimize the proposed model by minimizing a new distortion-aware objective end-to-end. Extensive qualitative and quantitative experiments demonstrate the impressive performance of our model in reconstructing unbroken 3-D shapes from deficient point clouds and preserving semantic relationships among different regional structures.</p>","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"PP ","pages":""},"PeriodicalIF":10.2000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/TNNLS.2024.3439706","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Learning 3-D structures from incomplete point clouds with extreme sparsity and random distributions is a challenge since it is difficult to infer topological connectivity and structural details from fragmentary representations. Missing large portions of informative structures further aggravates this problem. To overcome this, a novel graph convolutional network (GCN) called dynamic and structure-aware NETwork (DSANet) is presented in this article. This framework is formulated based on a pyramidic auto-encoder (AE) architecture to address accurate structure reconstruction on the sparse and incomplete point clouds. A PointNet-like neural network is applied as the encoder to efficiently aggregate the global representations of coarse point clouds. On the decoder side, we design a dynamic graph learning module with a structure-aware attention (SAA) to take advantage of the topology relationships maintained in the dynamic latent graph. Relying on gradually unfolding the extracted representation into a sequence of graphs, DSANet is able to reconstruct complicated point clouds with rich and descriptive details. To associate analogous structure awareness with semantic estimation, we further propose a mechanism, called structure similarity assessment (SSA). This method allows our model to surmise semantic homogeneity in an unsupervised manner. Finally, we optimize the proposed model by minimizing a new distortion-aware objective end-to-end. Extensive qualitative and quantitative experiments demonstrate the impressive performance of our model in reconstructing unbroken 3-D shapes from deficient point clouds and preserving semantic relationships among different regional structures.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DSANet：用于稀疏和不完整点云学习的动态和结构感知 GCN。

从具有极端稀疏性和随机分布的不完整点云中学习三维结构是一项挑战，因为很难从零散的表征中推断拓扑连接性和结构细节。而大量信息结构的缺失又进一步加剧了这一问题。为了克服这一问题，本文提出了一种名为动态结构感知网络（DSANet）的新型图卷积网络（GCN）。该框架基于金字塔式自动编码器（AE）架构，可解决稀疏和不完整点云的精确结构重建问题。编码器采用类似于 PointNet 的神经网络，以有效聚合粗糙点云的全局表示。在解码器方面，我们设计了一个具有结构感知注意力（SAA）的动态图学习模块，以利用动态潜在图中保持的拓扑关系。依靠将提取的表征逐步展开为一系列图形，DSANet 能够重建具有丰富描述细节的复杂点云。为了将类似的结构意识与语义估计联系起来，我们进一步提出了一种机制，称为结构相似性评估（SSA）。该方法允许我们的模型以无监督的方式推测语义同质性。最后，我们通过端到端最小化新的失真感知目标来优化所提出的模型。广泛的定性和定量实验证明，我们的模型在从缺陷点云中重建完整的三维形状以及保留不同区域结构之间的语义关系方面表现出色。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.