Dexterous Pre-Grasp Manipulation for Human-Like Functional Categorical Grasping: Deep Reinforcement Learning and Grasp Representations

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2025-02-13 DOI:10.1109/TASE.2025.3541768
Dmytro Pavlichenko;Sven Behnke
{"title":"Dexterous Pre-Grasp Manipulation for Human-Like Functional Categorical Grasping: Deep Reinforcement Learning and Grasp Representations","authors":"Dmytro Pavlichenko;Sven Behnke","doi":"10.1109/TASE.2025.3541768","DOIUrl":null,"url":null,"abstract":"Many objects, such as tools and household items, can be used only if grasped in a very specific way—grasped functionally. Often, a direct functional grasp is not possible, though. We propose a method for learning a dexterous pre-grasp manipulation policy to achieve human-like functional grasps using deep reinforcement learning. We introduce a dense multi-component reward function that enables learning a single policy, capable of dexterous pre-grasp manipulation of novel instances of several known object categories with an anthropomorphic hand. The policy is learned purely by means of reinforcement learning from scratch, without any expert demonstrations. It implicitly learns to reposition and reorient objects of complex shapes to achieve given functional grasps. In addition, we explore two different ways to represent a desired grasp: explicit and more abstract, constraint-based. We show that our method consistently learns to successfully manipulate and achieve desired grasps on previously unseen object instances of known categories using both grasp representations. Training is completed on a single GPU in under three hours. Note to Practitioners—This work was motivated by the increasing popularity of robots equipped with dexterous human-like hands. Operating in environments designed for humans necessitates the ability to use human tools. That requires grasping these tools in specific ways for effective use. We propose a learning-based method to train such behaviors in highly parallelized simulation. We explore two possible ways to represent a target functional grasp: an explicit and a more abstract, constraint-based, each with its own advantages and disadvantages. Our method learns to achieve human-like behaviors in under three hours on a single computer. It successfully manipulates previously unseen object instances with both target grasp representations. Such policies could be useful for robots with human-like hands in a broad range of scenarios: household, factory or search-and-rescue, whenever there is a necessity to grasp objects in a very specific way. The main limitation of this work is that the learned behaviors were not tested in the real world. Thus, closing the sim-to-real gap is a viable direction for future work.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"23 ","pages":"2231-2244"},"PeriodicalIF":6.4000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10884934/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Many objects, such as tools and household items, can be used only if grasped in a very specific way—grasped functionally. Often, a direct functional grasp is not possible, though. We propose a method for learning a dexterous pre-grasp manipulation policy to achieve human-like functional grasps using deep reinforcement learning. We introduce a dense multi-component reward function that enables learning a single policy, capable of dexterous pre-grasp manipulation of novel instances of several known object categories with an anthropomorphic hand. The policy is learned purely by means of reinforcement learning from scratch, without any expert demonstrations. It implicitly learns to reposition and reorient objects of complex shapes to achieve given functional grasps. In addition, we explore two different ways to represent a desired grasp: explicit and more abstract, constraint-based. We show that our method consistently learns to successfully manipulate and achieve desired grasps on previously unseen object instances of known categories using both grasp representations. Training is completed on a single GPU in under three hours. Note to Practitioners—This work was motivated by the increasing popularity of robots equipped with dexterous human-like hands. Operating in environments designed for humans necessitates the ability to use human tools. That requires grasping these tools in specific ways for effective use. We propose a learning-based method to train such behaviors in highly parallelized simulation. We explore two possible ways to represent a target functional grasp: an explicit and a more abstract, constraint-based, each with its own advantages and disadvantages. Our method learns to achieve human-like behaviors in under three hours on a single computer. It successfully manipulates previously unseen object instances with both target grasp representations. Such policies could be useful for robots with human-like hands in a broad range of scenarios: household, factory or search-and-rescue, whenever there is a necessity to grasp objects in a very specific way. The main limitation of this work is that the learned behaviors were not tested in the real world. Thus, closing the sim-to-real gap is a viable direction for future work.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
灵巧的预抓取操作的类人功能分类抓取:深度强化学习和抓取表征
许多物品,如工具和家居用品,只有以一种非常特定的方式——功能性地抓住——才能使用。通常,直接掌握功能是不可能的。我们提出了一种方法来学习灵巧的预抓取操作策略,以实现类似人类的功能抓取使用深度强化学习。我们引入了一个密集的多组件奖励函数,它可以学习单个策略,能够用拟人化的手灵巧地预先掌握几个已知对象类别的新实例。该策略完全是通过从头开始的强化学习来学习的,没有任何专家的演示。它隐式地学习重新定位和重新定向复杂形状的对象,以实现给定的功能掌握。此外,我们探索了两种不同的方式来表示期望的把握:明确的和更抽象的,基于约束的。我们表明,我们的方法始终如一地学习成功地操纵和实现对已知类别的先前未见过的对象实例的所需抓取,同时使用两种抓取表示。训练是在一个单一的GPU在三个小时内完成。从业人员注意事项——这项工作的动机是由于配备了灵巧的人手的机器人越来越受欢迎。在为人类设计的环境中操作,需要使用人类工具的能力。这需要以特定的方式掌握这些工具,以便有效地使用。我们提出了一种基于学习的方法来训练这种高度并行模拟的行为。我们探索了两种可能的方式来表示目标功能把握:一种是明确的,另一种是更抽象的,基于约束的,每种都有自己的优点和缺点。我们的方法在一台计算机上学习在三小时内实现类似人类的行为。它成功地用两个目标抓取表示操作了以前看不见的对象实例。这样的策略对于拥有类似人类双手的机器人在各种场景中都很有用:家庭、工厂或搜救,只要有必要以一种非常特定的方式抓住物体。这项工作的主要限制是学习行为没有在现实世界中进行测试。因此,缩小模拟与现实之间的差距是未来工作的一个可行方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Adaptive Neural Boundary Control of a Rotating Flexible Beam with Input Dead-Zone and Output Constraint TacMan-Turbo: Proactive Tactile Control for Robust and Efficient Articulated Object Manipulation Reduced Maximum Curvature-Based Linear Path Smoothing Method with Enhanced Kinematic Performance for 6R Robot Manipulators Multi-View Stereo with Holistic Guidance of Collaborative Geometric Priors Sensorless Robotic External Force Estimation in Uncertain Interactive Environments: A Hybrid Adaptive-Robust Kalman Filtering Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1