Compressed point cloud classification with point-based edge sampling

IF 2.4 4区计算机科学 Eurasip Journal on Image and Video Processing Pub Date : 2024-08-07 DOI:10.1186/s13640-024-00637-0

Zhe Luo, Wenjing Jia, Stuart Perry

{"title":"Compressed point cloud classification with point-based edge sampling","authors":"Zhe Luo, Wenjing Jia, Stuart Perry","doi":"10.1186/s13640-024-00637-0","DOIUrl":null,"url":null,"abstract":"<p>3D point cloud data, as an immersive detailed data source, has been increasingly used in numerous applications. To deal with the computational and storage challenges of this data, it needs to be compressed before transmission, storage, and processing, especially in real-time systems. Instead of decoding the compressed data stream and subsequently conducting downstream tasks on the decompressed data, analyzing point clouds directly in their compressed domain has attracted great interest. In this paper, we dive into the realm of compressed point cloud classification (CPCC), aiming to achieve high point cloud classification accuracy in a bitrate-saving way by ensuring the bit stream contains a high degree of representative information of the point cloud. Edge information is one of the most important and representative attributes of the point cloud because it can display the outlines or main shapes. However, extracting edge points or information from point cloud models is challenging due to their irregularity and sparsity. To address this challenge, we adopt an advanced edge-sampling method that enhances existing state-of-the-art (SOTA) point cloud edge-sampling techniques based on attention mechanisms and consequently develop a novel CPCC method “CPCC-PES” that focuses on point cloud’s edge information. The result obtained on the benchmark ModelNet40 dataset shows that our model has superior rate-accuracy trade-off performance than SOTA works. Specifically, our method achieves over 90% Top-1 Accuracy with a mere 0.08 bits-per-point (bpp), marking a remarkable over 96% reduction in BD-bitrate compared with specialized codecs. This means that our method only consumes 20% of the bitrate of other SOTA works while maintaining comparable accuracy. Furthermore, we propose a new evaluation metric named BD-Top-1 Accuracy to evaluate the trade-off performance between bitrate and Top-1 Accuracy for future CPCC research.</p>","PeriodicalId":49322,"journal":{"name":"Eurasip Journal on Image and Video Processing","volume":"28 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eurasip Journal on Image and Video Processing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1186/s13640-024-00637-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

3D point cloud data, as an immersive detailed data source, has been increasingly used in numerous applications. To deal with the computational and storage challenges of this data, it needs to be compressed before transmission, storage, and processing, especially in real-time systems. Instead of decoding the compressed data stream and subsequently conducting downstream tasks on the decompressed data, analyzing point clouds directly in their compressed domain has attracted great interest. In this paper, we dive into the realm of compressed point cloud classification (CPCC), aiming to achieve high point cloud classification accuracy in a bitrate-saving way by ensuring the bit stream contains a high degree of representative information of the point cloud. Edge information is one of the most important and representative attributes of the point cloud because it can display the outlines or main shapes. However, extracting edge points or information from point cloud models is challenging due to their irregularity and sparsity. To address this challenge, we adopt an advanced edge-sampling method that enhances existing state-of-the-art (SOTA) point cloud edge-sampling techniques based on attention mechanisms and consequently develop a novel CPCC method “CPCC-PES” that focuses on point cloud’s edge information. The result obtained on the benchmark ModelNet40 dataset shows that our model has superior rate-accuracy trade-off performance than SOTA works. Specifically, our method achieves over 90% Top-1 Accuracy with a mere 0.08 bits-per-point (bpp), marking a remarkable over 96% reduction in BD-bitrate compared with specialized codecs. This means that our method only consumes 20% of the bitrate of other SOTA works while maintaining comparable accuracy. Furthermore, we propose a new evaluation metric named BD-Top-1 Accuracy to evaluate the trade-off performance between bitrate and Top-1 Accuracy for future CPCC research.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用基于点的边缘采样进行压缩点云分类

三维点云数据作为一种身临其境的详细数据源，已越来越多地应用于众多领域。为了应对这些数据在计算和存储方面的挑战，需要在传输、存储和处理之前对其进行压缩，尤其是在实时系统中。与解码压缩数据流并随后在解压缩数据上执行下游任务相比，直接在压缩域中分析点云引起了人们的极大兴趣。本文深入探讨了压缩点云分类（CPCC）领域，旨在通过确保比特流包含点云的高度代表性信息，以节省比特率的方式实现高点云分类精度。边缘信息是点云最重要、最具代表性的属性之一，因为它可以显示轮廓或主要形状。然而，由于点云模型的不规则性和稀疏性，从点云模型中提取边缘点或信息具有挑战性。为了应对这一挑战，我们采用了一种先进的边缘采样方法，该方法基于注意力机制增强了现有的最先进（SOTA）点云边缘采样技术，并由此开发出一种新型的 CPCC 方法 "CPCC-PES"，该方法重点关注点云的边缘信息。在基准 ModelNet40 数据集上获得的结果表明，与 SOTA 方法相比，我们的模型具有更优越的速率-精度权衡性能。具体来说，我们的方法仅用 0.08 比特/点 (bpp) 就达到了 90% 以上的 Top-1 准确率，与专用编解码器相比，BD 比特率显著降低了 96% 以上。这意味着我们的方法只需消耗其他 SOTA 方法 20% 的比特率，同时还能保持相当的准确率。此外，我们还提出了一个名为 BD-Top-1 Accuracy 的新评估指标，以评估比特率和 Top-1 Accuracy 之间的权衡性能，为未来的 CPCC 研究提供参考。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Eurasip Journal on Image and Video Processing Engineering-Electrical and Electronic Engineering

CiteScore

7.10

自引率

0.00%

发文量

审稿时长

6.8 months

期刊介绍： EURASIP Journal on Image and Video Processing is intended for researchers from both academia and industry, who are active in the multidisciplinary field of image and video processing. The scope of the journal covers all theoretical and practical aspects of the domain, from basic research to development of application.