T-TD3: A Reinforcement Learning Framework for Stable Grasping of Deformable Objects Using Tactile Prior

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-08-14 DOI:10.1109/TASE.2024.3440047
Yanmin Zhou;Yiyang Jin;Ping Lu;Shuo Jiang;Zhipeng Wang;Bin He
{"title":"T-TD3: A Reinforcement Learning Framework for Stable Grasping of Deformable Objects Using Tactile Prior","authors":"Yanmin Zhou;Yiyang Jin;Ping Lu;Shuo Jiang;Zhipeng Wang;Bin He","doi":"10.1109/TASE.2024.3440047","DOIUrl":null,"url":null,"abstract":"Human tactile perception enables rapid assessment of deformable objects and the application of appropriate force to prevent slip or excessive deformation. However, this task remains challenging for robots. To address this issue, we propose the T-TD3 algorithm, which utilizes a multi-scale fusion neural network (MSF-Net) for the fused perception of multi-scale features, including the tactile prior information obtained through preprocessing. Our approach decomposes the robot task of grasping deformable objects into three subtasks: slip detection, stable grasping evaluation, and minimum grasping force tracking. We develop a simulation environment called CR5GraspStable-Env using PyBullet and TACTO for the network training. Our work reports a success rate of 94.81% in the robot task of grasping deformable objects in real, demonstrating an excellent sim-to-real capability. Moreover, the proposed approach has the potential to be extended to other stable grasping tasks that utilize tactile perception. Note to Practitioners—Traditional grasping strategies in stable grasping tasks typically apply significant grasping force to prevent slip. However, excessive grasping force for deformable objects may lead to excessive deformation and damage. While humans can flexibly control grasping force through tactile perception, it poses a significant challenge for robots. To overcome this challenge, we propose a novel method for the robot to learn an improved stable grasping strategy by incorporating tactile priors. Our method establishes a unified tactile prior representation for the visual-based tactile sensors mounted on the robot grippers, which enables them to sense the contact state. Additionally, it utilizes sensor distortion correction based on spatial symmetry to ensure the applicability of the tactile prior representation to any kind of visual-based tactile sensor. Furthermore, this method integrates multi-scale tactile priors and robot states, and utilizes reinforcement learning to autonomously make real-time decisions, aiming to minimize the grasping force while maintaining stable grasps on deformable objects. The primary research objective of this article is to address the challenge of achieving stable grasps on variable objects, while also being applicable to grasping rigid objects. We validated the practicality of this method by successfully achieving a 94.81% success rate in stably grasping deformable objects using the robotic arm CR5 in both simulated and real-world environments. Furthermore, this method can be readily applied to other robot systems. In the future, we plan to extend the application of our proposed method to more dexterous manipulators and perform more complex manipulation tasks. Additionally, we aim to introduce new fusion algorithms and decision-making strategies to further enhance the applicability of this method.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"6208-6222"},"PeriodicalIF":6.4000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10636851/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Human tactile perception enables rapid assessment of deformable objects and the application of appropriate force to prevent slip or excessive deformation. However, this task remains challenging for robots. To address this issue, we propose the T-TD3 algorithm, which utilizes a multi-scale fusion neural network (MSF-Net) for the fused perception of multi-scale features, including the tactile prior information obtained through preprocessing. Our approach decomposes the robot task of grasping deformable objects into three subtasks: slip detection, stable grasping evaluation, and minimum grasping force tracking. We develop a simulation environment called CR5GraspStable-Env using PyBullet and TACTO for the network training. Our work reports a success rate of 94.81% in the robot task of grasping deformable objects in real, demonstrating an excellent sim-to-real capability. Moreover, the proposed approach has the potential to be extended to other stable grasping tasks that utilize tactile perception. Note to Practitioners—Traditional grasping strategies in stable grasping tasks typically apply significant grasping force to prevent slip. However, excessive grasping force for deformable objects may lead to excessive deformation and damage. While humans can flexibly control grasping force through tactile perception, it poses a significant challenge for robots. To overcome this challenge, we propose a novel method for the robot to learn an improved stable grasping strategy by incorporating tactile priors. Our method establishes a unified tactile prior representation for the visual-based tactile sensors mounted on the robot grippers, which enables them to sense the contact state. Additionally, it utilizes sensor distortion correction based on spatial symmetry to ensure the applicability of the tactile prior representation to any kind of visual-based tactile sensor. Furthermore, this method integrates multi-scale tactile priors and robot states, and utilizes reinforcement learning to autonomously make real-time decisions, aiming to minimize the grasping force while maintaining stable grasps on deformable objects. The primary research objective of this article is to address the challenge of achieving stable grasps on variable objects, while also being applicable to grasping rigid objects. We validated the practicality of this method by successfully achieving a 94.81% success rate in stably grasping deformable objects using the robotic arm CR5 in both simulated and real-world environments. Furthermore, this method can be readily applied to other robot systems. In the future, we plan to extend the application of our proposed method to more dexterous manipulators and perform more complex manipulation tasks. Additionally, we aim to introduce new fusion algorithms and decision-making strategies to further enhance the applicability of this method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
T-TD3:利用触觉先验稳定抓取可变形物体的强化学习框架
人类的触觉感知能够快速评估可变形的物体,并应用适当的力来防止滑动或过度变形。然而,这项任务对机器人来说仍然具有挑战性。为了解决这一问题,我们提出了T-TD3算法,该算法利用多尺度融合神经网络(MSF-Net)对多尺度特征进行融合感知,包括通过预处理获得的触觉先验信息。该方法将机器人抓取可变形物体的任务分解为三个子任务:滑移检测、稳定抓取评估和最小抓取力跟踪。我们使用PyBullet和TACTO开发了一个名为cr5grasstable - env的仿真环境,用于网络训练。我们的工作报告了机器人在现实中抓取可变形物体任务的成功率为94.81%,展示了出色的模拟到真实的能力。此外,所提出的方法有可能扩展到其他利用触觉感知的稳定抓取任务。从业人员注意:在稳定的抓取任务中,传统的抓取策略通常应用显著的抓取力来防止滑动。然而,对于可变形的物体,过大的抓取力可能会导致过度变形和损坏。虽然人类可以通过触觉感知灵活地控制抓取力,但这对机器人提出了重大挑战。为了克服这一挑战,我们提出了一种新的方法,让机器人通过结合触觉先验来学习一种改进的稳定抓取策略。我们的方法为安装在机器人抓手上的基于视觉的触觉传感器建立了统一的触觉先验表示,使它们能够感知接触状态。此外,它利用基于空间对称性的传感器畸变校正,以确保触觉先验表征对任何一种基于视觉的触觉传感器的适用性。此外,该方法集成了多尺度触觉先验和机器人状态,并利用强化学习进行自主实时决策,旨在最小化抓取力的同时保持对可变形物体的稳定抓取。本文的主要研究目标是解决在抓取可变物体时实现稳定抓取的挑战,同时也适用于抓取刚性物体。在模拟和现实环境中,CR5机械臂稳定抓取可变形物体的成功率均达到94.81%,验证了该方法的实用性。此外,该方法可以很容易地应用于其他机器人系统。在未来,我们计划将我们的方法扩展到更灵巧的机械手和执行更复杂的操作任务。此外,我们的目标是引入新的融合算法和决策策略,以进一步提高该方法的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Optimal Non-anticipative All-scenario-feasible Scheduling Strategy for Community Microgrid with Vehicle-to-grid Energy Storages Incremental Q-Learning for Data-Driven Adaptive Critic Control of Wastewater Treatment Processes With State Constraints Automated Instance Segmentation Network for Overlapping Cells in Cleavage-Stage Embryos Fixed-Time Performance Fault-Tolerant Control for Cluster Synchronization of Spatiotemporal Networks with Sign-Based Coupling Distributed Coverage Control for Air-Ground Robot Systems with Heterogeneous Sensing Capabilities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1