SOA：用于点云室内 3D 物体检测的种子点偏移注意事项

IF 2.5 4区计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Computers & Graphics-Uk Pub Date : 2024-07-06 DOI:10.1016/j.cag.2024.103992

Jun Shu , Shiqi Yu , Xinyi Shu , Jiewen Hu

{"title":"SOA：用于点云室内 3D 物体检测的种子点偏移注意事项","authors":"Jun Shu , Shiqi Yu , Xinyi Shu , Jiewen Hu","doi":"10.1016/j.cag.2024.103992","DOIUrl":null,"url":null,"abstract":"<div><p>Three-dimensional object detection plays a pivotal role in scene understanding and holds significant importance in various indoor perception applications. Traditional methods based on Hough voting are susceptible to interference from background points or neighboring objects when casting votes for the target’s center from each seed point. Moreover, fixed-size set abstraction modules may result in the loss of structural information for large objects. To address these challenges, this paper proposes a three-dimensional object detection model based on seed point offset attention. The objective of this model is to enhance the model’s resilience to voting noise interference and alleviate feature loss for large-scale objects. Specifically, a seed point offset tensor is first defined, and then the offset tensor self-attention network is employed to learn the weights between votes, thereby establishing a correlation between the voting semantic features and the object structural information. Furthermore, an object surface perception module is introduced, which incorporates detailed features of local object surfaces into global feature representations through vote backtracking and surface mapping. Experimental results indicate that the model achieved excellent performance on the ScanNet-V2 ([email protected], 60.3%) and SUN RGB-D ([email protected], 64.0%) datasets, respectively improving by 2.6% ([email protected]) and 5.4% ([email protected]) compared to VoteNet.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 103992"},"PeriodicalIF":2.5000,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SOA: Seed point offset attention for indoor 3D object detection in point clouds\",\"authors\":\"Jun Shu , Shiqi Yu , Xinyi Shu , Jiewen Hu\",\"doi\":\"10.1016/j.cag.2024.103992\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Three-dimensional object detection plays a pivotal role in scene understanding and holds significant importance in various indoor perception applications. Traditional methods based on Hough voting are susceptible to interference from background points or neighboring objects when casting votes for the target’s center from each seed point. Moreover, fixed-size set abstraction modules may result in the loss of structural information for large objects. To address these challenges, this paper proposes a three-dimensional object detection model based on seed point offset attention. The objective of this model is to enhance the model’s resilience to voting noise interference and alleviate feature loss for large-scale objects. Specifically, a seed point offset tensor is first defined, and then the offset tensor self-attention network is employed to learn the weights between votes, thereby establishing a correlation between the voting semantic features and the object structural information. Furthermore, an object surface perception module is introduced, which incorporates detailed features of local object surfaces into global feature representations through vote backtracking and surface mapping. Experimental results indicate that the model achieved excellent performance on the ScanNet-V2 ([email protected], 60.3%) and SUN RGB-D ([email protected], 64.0%) datasets, respectively improving by 2.6% ([email protected]) and 5.4% ([email protected]) compared to VoteNet.</p></div>\",\"PeriodicalId\":50628,\"journal\":{\"name\":\"Computers & Graphics-Uk\",\"volume\":\"123 \",\"pages\":\"Article 103992\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Graphics-Uk\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0097849324001274\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Graphics-Uk","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0097849324001274","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

三维物体检测在场景理解中起着关键作用，在各种室内感知应用中具有重要意义。传统的基于 Hough 投票的方法在对每个种子点的目标中心进行投票时，容易受到背景点或邻近物体的干扰。此外，固定大小的集合抽象模块可能会导致大型物体结构信息的丢失。为了应对这些挑战，本文提出了一种基于种子点偏移注意力的三维物体检测模型。该模型的目标是增强模型对投票噪声干扰的抗干扰能力，并减少大型物体的特征丢失。具体来说，首先定义种子点偏移张量，然后利用偏移张量自注意力网络学习投票之间的权重，从而建立投票语义特征与物体结构信息之间的相关性。此外，还引入了物体表面感知模块，通过投票回溯和表面映射，将局部物体表面的细节特征纳入全局特征表征。实验结果表明，该模型在 ScanNet-V2 数据集（[email protected]，60.3%）和 SUN RGB-D 数据集（[email protected]，64.0%）上取得了优异的性能，与 VoteNet 相比，分别提高了 2.6% （[email protected]）和 5.4% （[email protected]）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

SOA: Seed point offset attention for indoor 3D object detection in point clouds

Three-dimensional object detection plays a pivotal role in scene understanding and holds significant importance in various indoor perception applications. Traditional methods based on Hough voting are susceptible to interference from background points or neighboring objects when casting votes for the target’s center from each seed point. Moreover, fixed-size set abstraction modules may result in the loss of structural information for large objects. To address these challenges, this paper proposes a three-dimensional object detection model based on seed point offset attention. The objective of this model is to enhance the model’s resilience to voting noise interference and alleviate feature loss for large-scale objects. Specifically, a seed point offset tensor is first defined, and then the offset tensor self-attention network is employed to learn the weights between votes, thereby establishing a correlation between the voting semantic features and the object structural information. Furthermore, an object surface perception module is introduced, which incorporates detailed features of local object surfaces into global feature representations through vote backtracking and surface mapping. Experimental results indicate that the model achieved excellent performance on the ScanNet-V2 ([email protected], 60.3%) and SUN RGB-D ([email protected], 64.0%) datasets, respectively improving by 2.6% ([email protected]) and 5.4% ([email protected]) compared to VoteNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers & Graphics-Uk 工程技术-计算机：软件工程

CiteScore

5.30

自引率

12.00%

发文量

173

审稿时长

38 days

期刊介绍： Computers & Graphics is dedicated to disseminate information on research and applications of computer graphics (CG) techniques. The journal encourages articles on: 1. Research and applications of interactive computer graphics. We are particularly interested in novel interaction techniques and applications of CG to problem domains. 2. State-of-the-art papers on late-breaking, cutting-edge research on CG. 3. Information on innovative uses of graphics principles and technologies. 4. Tutorial papers on both teaching CG principles and innovative uses of CG in education.