利用点云在非结构化场景中生成 7-DoF 无模型抓手的并行图网络

IF 11.4 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Robotics and Computer-integrated Manufacturing Pub Date : 2025-04-01 Epub Date: 2024-09-17 DOI:10.1016/j.rcim.2024.102879

Chungang Zhuang, Haowen Wang, Wanhao Niu, Han Ding

{"title":"利用点云在非结构化场景中生成 7-DoF 无模型抓手的并行图网络","authors":"Chungang Zhuang, Haowen Wang, Wanhao Niu, Han Ding","doi":"10.1016/j.rcim.2024.102879","DOIUrl":null,"url":null,"abstract":"<div><p>Generating model-free grasps in complex scattered scenes remains a challenging task. Most current methods adopt PointNet++ as the backbone to extract structural features, while the relative associations of geometry are underexplored, leading to non-optimal grasp prediction results. In this work, a parallelized graph-based pipeline is developed to solve the 7-DoF grasp pose generation problem with point cloud as input. Using the non-textured information of the grasping scene, the proposed pipeline simultaneously performs feature embedding and grasping location focusing in two branches, avoiding the mutual influence of the two learning processes. In the feature learning branch, the geometric features of the whole scene will be fully learned. In the location focusing branch, the high-value grasping locations on the surface of objects will be strategically selected. Using the learned graph features at these locations, the pipeline will eventually output refined grasping directions and widths in conjunction with local spatial features. To strengthen the positional features in the grasping problem, a graph convolution operator based on the positional attention mechanism is designed, and a graph residual network based on this operator is applied in two branches. The above pipeline abstracts the grasping location selection task from the main process of grasp generation, which lowers the learning difficulty while avoiding the performance degradation problem of deep graph networks. The established pipeline is evaluated on the GraspNet-1Billion dataset, demonstrating much better performance and stronger generalization capabilities than the benchmark approach. In robotic bin-picking experiments, the proposed method can effectively understand scattered grasping scenarios and grasp multiple types of unknown objects with a high success rate.</p></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"92 ","pages":"Article 102879"},"PeriodicalIF":11.4000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A parallel graph network for generating 7-DoF model-free grasps in unstructured scenes using point cloud\",\"authors\":\"Chungang Zhuang, Haowen Wang, Wanhao Niu, Han Ding\",\"doi\":\"10.1016/j.rcim.2024.102879\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Generating model-free grasps in complex scattered scenes remains a challenging task. Most current methods adopt PointNet++ as the backbone to extract structural features, while the relative associations of geometry are underexplored, leading to non-optimal grasp prediction results. In this work, a parallelized graph-based pipeline is developed to solve the 7-DoF grasp pose generation problem with point cloud as input. Using the non-textured information of the grasping scene, the proposed pipeline simultaneously performs feature embedding and grasping location focusing in two branches, avoiding the mutual influence of the two learning processes. In the feature learning branch, the geometric features of the whole scene will be fully learned. In the location focusing branch, the high-value grasping locations on the surface of objects will be strategically selected. Using the learned graph features at these locations, the pipeline will eventually output refined grasping directions and widths in conjunction with local spatial features. To strengthen the positional features in the grasping problem, a graph convolution operator based on the positional attention mechanism is designed, and a graph residual network based on this operator is applied in two branches. The above pipeline abstracts the grasping location selection task from the main process of grasp generation, which lowers the learning difficulty while avoiding the performance degradation problem of deep graph networks. The established pipeline is evaluated on the GraspNet-1Billion dataset, demonstrating much better performance and stronger generalization capabilities than the benchmark approach. In robotic bin-picking experiments, the proposed method can effectively understand scattered grasping scenarios and grasp multiple types of unknown objects with a high success rate.</p></div>\",\"PeriodicalId\":21452,\"journal\":{\"name\":\"Robotics and Computer-integrated Manufacturing\",\"volume\":\"92 \",\"pages\":\"Article 102879\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Computer-integrated Manufacturing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0736584524001662\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/9/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584524001662","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

在复杂的零散场景中生成无模型抓取仍然是一项具有挑战性的任务。目前的大多数方法都采用 PointNet++ 作为提取结构特征的骨干，而对几何图形的相对关联探索不足，导致抓取预测结果不理想。在这项工作中，开发了一种基于图的并行化流水线，用于解决以点云为输入的 7-DoF 抓姿生成问题。利用抓取场景的非纹理信息，所提出的流水线在两个分支中同时执行特征嵌入和抓取位置聚焦，避免了两个学习过程的相互影响。在特征学习分支中，整个场景的几何特征将被完全学习。在位置聚焦分支中，将战略性地选择物体表面的高价值抓取位置。利用在这些位置学习到的图形特征，管道将结合局部空间特征，最终输出细化的抓取方向和宽度。为了强化抓取问题中的位置特征，我们设计了基于位置注意力机制的图卷积算子，并在两个分支中应用了基于该算子的图残差网络。上述管道将抓取位置选择任务从抓取生成的主要过程中抽象出来，降低了学习难度，同时避免了深度图网络的性能下降问题。在 GraspNet-1Billion 数据集上对所建立的管道进行了评估，结果表明其性能和泛化能力远远优于基准方法。在机器人分拣实验中，所提出的方法能有效地理解分散的抓取场景，并以较高的成功率抓取多种类型的未知物体。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A parallel graph network for generating 7-DoF model-free grasps in unstructured scenes using point cloud

Generating model-free grasps in complex scattered scenes remains a challenging task. Most current methods adopt PointNet++ as the backbone to extract structural features, while the relative associations of geometry are underexplored, leading to non-optimal grasp prediction results. In this work, a parallelized graph-based pipeline is developed to solve the 7-DoF grasp pose generation problem with point cloud as input. Using the non-textured information of the grasping scene, the proposed pipeline simultaneously performs feature embedding and grasping location focusing in two branches, avoiding the mutual influence of the two learning processes. In the feature learning branch, the geometric features of the whole scene will be fully learned. In the location focusing branch, the high-value grasping locations on the surface of objects will be strategically selected. Using the learned graph features at these locations, the pipeline will eventually output refined grasping directions and widths in conjunction with local spatial features. To strengthen the positional features in the grasping problem, a graph convolution operator based on the positional attention mechanism is designed, and a graph residual network based on this operator is applied in two branches. The above pipeline abstracts the grasping location selection task from the main process of grasp generation, which lowers the learning difficulty while avoiding the performance degradation problem of deep graph networks. The established pipeline is evaluated on the GraspNet-1Billion dataset, demonstrating much better performance and stronger generalization capabilities than the benchmark approach. In robotic bin-picking experiments, the proposed method can effectively understand scattered grasping scenarios and grasp multiple types of unknown objects with a high success rate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Robotics and Computer-integrated Manufacturing 工程技术-工程：制造

CiteScore

24.10

自引率

13.50%

发文量

160

审稿时长

50 days

期刊介绍： The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.