PointeNet：有效、高效的点云分析轻量级框架

IF 1.3 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Computer Aided Geometric Design Pub Date : 2024-05-01 DOI:10.1016/j.cagd.2024.102311

Lipeng Gu , Xuefeng Yan , Liangliang Nan , Dingkun Zhu , Honghua Chen , Weiming Wang , Mingqiang Wei

{"title":"PointeNet：有效、高效的点云分析轻量级框架","authors":"Lipeng Gu , Xuefeng Yan , Liangliang Nan , Dingkun Zhu , Honghua Chen , Weiming Wang , Mingqiang Wei","doi":"10.1016/j.cagd.2024.102311","DOIUrl":null,"url":null,"abstract":"<div><p>The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce <strong>PointeNet</strong>, an <strong>e</strong>fficient <strong>net</strong>work designed specifically for <strong>point</strong> cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an <em>optional</em> Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in <span><math><mn>3</mn><mi>D</mi><mspace></mspace><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mi>R</mi><mn>40</mn></mrow></msub></math></span> for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at <span>https://github.com/lipeng-gu/PointeNet</span><svg><path></path></svg>.</p></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"110 ","pages":"Article 102311"},"PeriodicalIF":1.3000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PointeNet: A lightweight framework for effective and efficient point cloud analysis\",\"authors\":\"Lipeng Gu , Xuefeng Yan , Liangliang Nan , Dingkun Zhu , Honghua Chen , Weiming Wang , Mingqiang Wei\",\"doi\":\"10.1016/j.cagd.2024.102311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce <strong>PointeNet</strong>, an <strong>e</strong>fficient <strong>net</strong>work designed specifically for <strong>point</strong> cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an <em>optional</em> Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in <span><math><mn>3</mn><mi>D</mi><mspace></mspace><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mi>R</mi><mn>40</mn></mrow></msub></math></span> for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at <span>https://github.com/lipeng-gu/PointeNet</span><svg><path></path></svg>.</p></div>\",\"PeriodicalId\":55226,\"journal\":{\"name\":\"Computer Aided Geometric Design\",\"volume\":\"110 \",\"pages\":\"Article 102311\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Aided Geometric Design\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167839624000451\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839624000451","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

点云分析的传统智慧主要是探索三维几何图形。这通常是通过在编码器中引入复杂的可学习几何提取器或通过重复块深化网络来实现的。然而，这些方法包含大量可学习参数，导致大量计算成本，并对 CPU/GPU 造成内存负担。此外，这些方法主要针对对象级点云分类和分割任务，对关键场景级应用（如自动驾驶）的扩展有限。为此，我们引入了专门为点云分析设计的高效网络 PointeNet。PointeNet 以其轻量级架构、低训练成本和即插即用功能而与众不同，同时还能有效捕捉具有代表性的特征。该网络由一个多变量几何编码（MGE）模块和一个可选的距离感知语义增强（DSE）模块组成。MGE 采用采样、分组、汇集和多变量几何聚合等操作，以轻量级捕获和自适应聚合多变量几何特征，提供全面的三维几何描述。DSE 专为真实世界的自动驾驶场景而设计，可增强点云的语义感知，尤其是对远距离点的感知。我们的方法具有灵活性，可与分类/分割头无缝集成，或嵌入现成的三维物体检测网络，以最小的成本实现显著的性能提升。在对象级数据集（包括 ModelNet40、ScanObjectNN、ShapeNetPart 和场景级数据集 KITTI）上进行的大量实验表明，PointeNet 在点云分析方面的性能优于最先进的方法。值得注意的是，在 ModelNet40、ScanObjectNN 和 ShapeNetPart 上，PointeNet 用更少的参数就超越了 PointMLP；在 KITTI 上，PointRCNN 在 3DAPR40 上取得了超过 2% 的大幅提升，而参数成本仅为 140 万。代码可在 https://github.com/lipeng-gu/PointeNet 公开获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

PointeNet: A lightweight framework for effective and efficient point cloud analysis

The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce PointeNet, an efficient network designed specifically for point cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an optional Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in $3 D A P_{R 40}$ for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at https://github.com/lipeng-gu/PointeNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Aided Geometric Design 工程技术-计算机：软件工程

CiteScore

3.50

自引率

13.30%

发文量

审稿时长

60 days

期刊介绍： The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following: -Mathematical and Geometric Foundations- Curve, Surface, and Volume generation- CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision- Industrial, medical, and scientific applications. The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.

期刊最新文献

Approximation properties over self-similar meshes of curved finite elements and applications to subdivision based isogeometric analysis Editorial Board A novel heterogeneous deformable surface model based on elasticity Optimal one-sided approximants of circular arc Editorial Board