首页 > 最新文献

2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)最新文献

英文 中文
Simplified Modeling of Hybrid Soft Robots with Constant Stiffness Assumption 具有恒定刚度假设的混合软机器人简化模型
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10355009
Umer Huzaifa, Dimuthu D. K. Arachchige, Muhammad Aneeq uz Zaman, Usman Syed
Soft robots have shown their value as alternatives or supplements to rigid robots in applications like search and rescue missions and complex precise tasks. Their ability to take on various shapes and apply adaptable force gives them an advantage over stiff robots. However, sometimes their soft structure doesn’t offer enough force for the task. Hybrid soft robots (HSRs) combine a soft body with a stronger backbone to handle tasks needing more strength. This rigid part lets us use rigid body dynamics to estimate HSR behavior. Here, we introduce a simplified N-link rigid body dynamic model with constant stiffness to mimic HSR behavior. While soft robots’ stiffness varies, the backbone in HSRs makes it similar to having constant stiffness. Comparing experiments supports the effectiveness of our N-link model for HSR modeling.
在搜救任务和复杂精密任务等应用中,软体机器人已显示出其作为刚性机器人替代品或补充的价值。与刚性机器人相比,软体机器人能够呈现出各种形状,并能施加适应性强的力,这使它们更具优势。然而,有时它们的软结构并不能为任务提供足够的力。混合软体机器人(HSR)结合了软体和更坚固的骨架,以处理需要更多力量的任务。我们可以利用刚体动力学来估计混合软体机器人的行为。在这里,我们引入了一个简化的具有恒定刚度的 N 连杆刚体动力学模型来模拟 HSR 的行为。软体机器人的刚度是变化的,而 HSR 中的骨架使其类似于具有恒定刚度。对比实验证明了我们的 N-连杆模型在 HSR 建模中的有效性。
{"title":"Simplified Modeling of Hybrid Soft Robots with Constant Stiffness Assumption","authors":"Umer Huzaifa, Dimuthu D. K. Arachchige, Muhammad Aneeq uz Zaman, Usman Syed","doi":"10.1109/ROBIO58561.2023.10355009","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10355009","url":null,"abstract":"Soft robots have shown their value as alternatives or supplements to rigid robots in applications like search and rescue missions and complex precise tasks. Their ability to take on various shapes and apply adaptable force gives them an advantage over stiff robots. However, sometimes their soft structure doesn’t offer enough force for the task. Hybrid soft robots (HSRs) combine a soft body with a stronger backbone to handle tasks needing more strength. This rigid part lets us use rigid body dynamics to estimate HSR behavior. Here, we introduce a simplified N-link rigid body dynamic model with constant stiffness to mimic HSR behavior. While soft robots’ stiffness varies, the backbone in HSRs makes it similar to having constant stiffness. Comparing experiments supports the effectiveness of our N-link model for HSR modeling.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"57 11","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved ORB-GMS image feature extraction and matching algorithm* 改进的 ORB-GMS 图像特征提取和匹配算法*
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10355043
Zhiying Tan, Wenbo Fan, Weifeng Kong, Xu Tao, Linsen Xu, Xiaobin Xu
Feature point extraction and matching is the key technology in object detection and simultaneous localization and mapping (SLAM). Aiming at the problems such as easy redundancy of feature points extracted by traditional ORB algorithm, low matching accuracy of mainstream robust estimation algorithms and low real-time performance, an improved ORB-GMS image feature extraction and matching algorithm is proposed. Firstly, the algorithm uses the gray value of the image to calculate the adaptive extraction threshold of the feature points. Then the image pyramid is constructed according to the image size. The set number of total feature points to be extracted is evenly distributed to each layer image according to the area ratio; Extract feature points from each layer of the image pyramid, and count the extracted feature points from each layer. If the number of feature points extracted from each layer meets the set number of images from each layer, the extraction ends. Then the quadtree algorithm is used to homogenize the feature points. Finally, the network scoring model is optimized from 8 neighborhood to 4 neighborhood, which reduces the computing time. Experimental results show that the matching accuracy of the proposed algorithm is 14% higher than that of the original algorithm, and the running time is 12% lower.
特征点提取与匹配是物体检测和同步定位与映射(SLAM)的关键技术。针对传统 ORB 算法提取的特征点易冗余、主流鲁棒估计算法匹配精度低、实时性差等问题,提出了一种改进的 ORB-GMS 图像特征提取与匹配算法。首先,该算法利用图像的灰度值计算特征点的自适应提取阈值。然后根据图像大小构建图像金字塔。根据面积比将设定的总特征点数平均分配到各层图像中;从图像金字塔的各层提取特征点,并统计各层提取的特征点数。如果每层提取的特征点数量达到设定的每层图像数量,则提取结束。然后使用四叉树算法对特征点进行均匀化处理。最后,网络评分模型从 8 个邻域优化为 4 个邻域,从而减少了计算时间。实验结果表明,建议算法的匹配准确率比原始算法高 14%,运行时间减少 12%。
{"title":"An improved ORB-GMS image feature extraction and matching algorithm*","authors":"Zhiying Tan, Wenbo Fan, Weifeng Kong, Xu Tao, Linsen Xu, Xiaobin Xu","doi":"10.1109/ROBIO58561.2023.10355043","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10355043","url":null,"abstract":"Feature point extraction and matching is the key technology in object detection and simultaneous localization and mapping (SLAM). Aiming at the problems such as easy redundancy of feature points extracted by traditional ORB algorithm, low matching accuracy of mainstream robust estimation algorithms and low real-time performance, an improved ORB-GMS image feature extraction and matching algorithm is proposed. Firstly, the algorithm uses the gray value of the image to calculate the adaptive extraction threshold of the feature points. Then the image pyramid is constructed according to the image size. The set number of total feature points to be extracted is evenly distributed to each layer image according to the area ratio; Extract feature points from each layer of the image pyramid, and count the extracted feature points from each layer. If the number of feature points extracted from each layer meets the set number of images from each layer, the extraction ends. Then the quadtree algorithm is used to homogenize the feature points. Finally, the network scoring model is optimized from 8 neighborhood to 4 neighborhood, which reduces the computing time. Experimental results show that the matching accuracy of the proposed algorithm is 14% higher than that of the original algorithm, and the running time is 12% lower.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"78 4","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Path-planning for the Human-arm-like Collaborative Robot with the Capability of Infinite Rotation 具有无限旋转能力的仿人协作机器人的路径规划
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354764
Yusheng Yang, Jiajia Liu, Qiaoni Yang, Hang Shi, Yangmin Xie
With the advantages of high safety and scalability, collaborative robots are widely used in the fields of Human-Robot Collaboration and Interaction. However, the joint limits of the robot restrict its flexibility and workspace, especially in a cluttered environment. Inspired by the motion of the human arm, whose elbow joint and shoulder joint can rotate infinitely, a collaborative robot with the capability of infinite rotation of its first and fourth joints is constructed in this paper and named the IR-Robot. With the breakthrough of the joint limit, the corresponding dimension in the robot’s configuration space changes from a bounded dimension to an unbounded dimension. The high-dimensional torus configuration space (HTCS) is presented to describe the bounded-unbounded dimensions hybrid property of the IR-Robot’s configuration space. Additionally, an IR-RRT* algorithm is proposed to conduct path-planning in HTCS. The experimental results in simulation and the real world demonstrate the feasibility and superiority of the IR-Robot in path-following and path-planning tasks.
协作机器人具有安全性高、可扩展性强等优点,被广泛应用于人机协作与交互领域。然而,机器人关节的局限性限制了其灵活性和工作空间,尤其是在杂乱的环境中。受人类手臂肘关节和肩关节可以无限旋转的运动启发,本文构建了一种第一关节和第四关节可以无限旋转的协作机器人,并将其命名为 IR-Robot。随着关节极限的突破,机器人配置空间的相应维度也从有界维度变为无界维度。本文提出了高维环形配置空间(HTCS)来描述 IR-Robot 配置空间的有界-无界维度混合特性。此外,还提出了在 HTCS 中进行路径规划的 IR-RRT* 算法。仿真和真实世界的实验结果证明了红外机器人在路径跟踪和路径规划任务中的可行性和优越性。
{"title":"Path-planning for the Human-arm-like Collaborative Robot with the Capability of Infinite Rotation","authors":"Yusheng Yang, Jiajia Liu, Qiaoni Yang, Hang Shi, Yangmin Xie","doi":"10.1109/ROBIO58561.2023.10354764","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354764","url":null,"abstract":"With the advantages of high safety and scalability, collaborative robots are widely used in the fields of Human-Robot Collaboration and Interaction. However, the joint limits of the robot restrict its flexibility and workspace, especially in a cluttered environment. Inspired by the motion of the human arm, whose elbow joint and shoulder joint can rotate infinitely, a collaborative robot with the capability of infinite rotation of its first and fourth joints is constructed in this paper and named the IR-Robot. With the breakthrough of the joint limit, the corresponding dimension in the robot’s configuration space changes from a bounded dimension to an unbounded dimension. The high-dimensional torus configuration space (HTCS) is presented to describe the bounded-unbounded dimensions hybrid property of the IR-Robot’s configuration space. Additionally, an IR-RRT* algorithm is proposed to conduct path-planning in HTCS. The experimental results in simulation and the real world demonstrate the feasibility and superiority of the IR-Robot in path-following and path-planning tasks.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"55 10","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Sample Repetitive Manipulation Mechanism (SRMM) for Lunar Regolith In-Situ Analysis: Design and Validation 用于月球岩石原位分析的样品重复操纵装置(SRMM):设计与验证
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354957
Yi Liu, Junyue Tang, Yafang Liu, Gongbo Ma, Feng Sun, Ye Li, Shengyuan Jiang
To exactly detect the water ice of the South Pole of the moon, a lunar regolith in-situ analysis payload deploying a mass spectrometer is proposed for China future lunar exploration missions. In order to receive the lunar regolith sample from a robotic arm with a soil sampler and transfer it into a furnace for further analysis, a sample manipulation mechanism is required during the above work flow. To solve the problems of adapting the sampler’s docking accuracy, receiving and transferring two different types of lunar soil sample under times of in-situ analysis, etc., a sample repetitive manipulation mechanism (SRMM) is proposed in this paper. By using a floating adjustable docking components and a flexible hopper, two types of encapsulated regolith sample and bulk material sample can be received with minimal sample loss, respectively. In order to receive and transfer two types of samples multiple times, two sample receiving methods have been designed that can be repeatedly transferred. A worm and worm wheel combined with a ball screw is designed in SRMM. To verify the above mechanism design, validation experiments were conducted. It indicates that this novel SRMM can be deployed in the future mission after further environmental tests.
为了准确探测月球南极的水冰,建议为中国未来的月球探测任务配备一个装有质谱仪的月球残积岩原位分析有效载荷。为了从带有土壤采样器的机械臂上接收月球岩石样品并将其转移到熔炉中进行进一步分析,在上述工作流程中需要一个样品操纵机构。为了解决采样器对接精度的调整、原位分析时两种不同类型月壤样品的接收和转移等问题,本文提出了一种样品重复操作机构(SRMM)。通过使用浮动可调对接组件和柔性料斗,可分别接收两种类型的封装碎屑样品和散装材料样品,且样品损失最小。为了多次接收和转移两种类型的样品,设计了两种可重复转移的样品接收方法。在 SRMM 中设计了一个蜗杆、蜗轮与滚珠丝杠相结合的机构。为了验证上述机构设计,我们进行了验证实验。这表明,经过进一步的环境测试后,这种新型 SRMM 可以部署在未来的任务中。
{"title":"A Sample Repetitive Manipulation Mechanism (SRMM) for Lunar Regolith In-Situ Analysis: Design and Validation","authors":"Yi Liu, Junyue Tang, Yafang Liu, Gongbo Ma, Feng Sun, Ye Li, Shengyuan Jiang","doi":"10.1109/ROBIO58561.2023.10354957","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354957","url":null,"abstract":"To exactly detect the water ice of the South Pole of the moon, a lunar regolith in-situ analysis payload deploying a mass spectrometer is proposed for China future lunar exploration missions. In order to receive the lunar regolith sample from a robotic arm with a soil sampler and transfer it into a furnace for further analysis, a sample manipulation mechanism is required during the above work flow. To solve the problems of adapting the sampler’s docking accuracy, receiving and transferring two different types of lunar soil sample under times of in-situ analysis, etc., a sample repetitive manipulation mechanism (SRMM) is proposed in this paper. By using a floating adjustable docking components and a flexible hopper, two types of encapsulated regolith sample and bulk material sample can be received with minimal sample loss, respectively. In order to receive and transfer two types of samples multiple times, two sample receiving methods have been designed that can be repeatedly transferred. A worm and worm wheel combined with a ball screw is designed in SRMM. To verify the above mechanism design, validation experiments were conducted. It indicates that this novel SRMM can be deployed in the future mission after further environmental tests.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"41 5","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stroke Synchronization of Underwater Modular Robot through Physical Interaction 通过物理交互实现水下模块机器人的行程同步
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354986
Kohei Nishikawa, Y. Origane, Daisuke Kurabayashi
Modular robots are expected to be used in extreme environments owing to their adaptability, and various modular robots have been developed. Most studies have focused on the expandability of capabilities or the integration of modules, whereas only a few studies have investigated autonomous decentralized control, in which each module harmonizes its own movements for overall functionality. We developed an underwater modular robot that synchronizes its paddle strokes; the robot is based on the motif of Gonium, a multicellular alga. We built a reduced system model of modules to represent the state of an oscillator by using a phase with attractive interactions with others. Because the model is similar to the Kuramoto model, we applied analysis methods. Real robotic modules were built, and experiments were conducted using a colony of the modules. The experimental results confirmed that the colony exhibited stroke synchronization ability by compensating for individual differences. The stroke synchronization is expected to stabilize the movements of robot colonies and improve their overall propulsion.
模块化机器人因其适应性强,有望在极端环境中使用,目前已开发出各种模块化机器人。大多数研究都集中在功能的可扩展性或模块的集成上,只有少数研究调查了自主分散控制,即每个模块协调自己的动作以实现整体功能。我们开发了一种水下模块化机器人,可以同步桨叶的划动;该机器人以多细胞藻类 Gonium 为原型。我们建立了一个简化的模块系统模型,通过使用与其他模块有吸引力相互作用的相位来表示振荡器的状态。由于该模型类似于仓本模型,因此我们采用了分析方法。我们制作了真实的机器人模块,并使用模块群进行了实验。实验结果证实,聚落通过补偿个体差异表现出了划水同步能力。冲程同步有望稳定机器人群落的运动,并提高其整体推进力。
{"title":"Stroke Synchronization of Underwater Modular Robot through Physical Interaction","authors":"Kohei Nishikawa, Y. Origane, Daisuke Kurabayashi","doi":"10.1109/ROBIO58561.2023.10354986","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354986","url":null,"abstract":"Modular robots are expected to be used in extreme environments owing to their adaptability, and various modular robots have been developed. Most studies have focused on the expandability of capabilities or the integration of modules, whereas only a few studies have investigated autonomous decentralized control, in which each module harmonizes its own movements for overall functionality. We developed an underwater modular robot that synchronizes its paddle strokes; the robot is based on the motif of Gonium, a multicellular alga. We built a reduced system model of modules to represent the state of an oscillator by using a phase with attractive interactions with others. Because the model is similar to the Kuramoto model, we applied analysis methods. Real robotic modules were built, and experiments were conducted using a colony of the modules. The experimental results confirmed that the colony exhibited stroke synchronization ability by compensating for individual differences. The stroke synchronization is expected to stabilize the movements of robot colonies and improve their overall propulsion.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"51 8","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A soft hydraulic endorectal actuator for prostate radiotherapy 用于前列腺放射治疗的软液压肛门直肠内推杆
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10355037
Aryan Niknam Maleki, Alexander Thompson, M. Runciman, Julia Murray, G. Mylonas
Despite advances in radiotherapy, motion error remains a challenge in prostate radiotherapy. Rectal obturators and endorectal balloons may reduce motion error and improve outcomes but have limitations. We aimed to create a deployable rectal obturator with precise angle control to personalise to a patient’s rectal anatomy, by using an antagonistic pair of "muscle" actuators to flex and extend the device. Results on deployability, angle control, and radial stiffness are presented here. The device can be compressed down to 16 x 3 x 91 mm, and be deployed to maximum dimensions of 24 x 25.5 x 77 mm. The device provides radial stiffness that may be sufficient to stabilise the rectum during radiotherapy. Angle control can be achieved with an average change of 7.5°/ml inflation in the extensor actuator.
尽管放疗技术不断进步,但运动误差仍是前列腺放疗的一大难题。直肠闭塞器和肛门直肠内球囊可减少运动误差并改善治疗效果,但也有局限性。我们的目标是通过使用一对拮抗的 "肌肉 "致动器来弯曲和伸展装置,创造出一种具有精确角度控制功能的可展开直肠闭锁器,以便根据患者的直肠解剖结构进行个性化设计。本文介绍了该装置的可展开性、角度控制和径向刚度结果。该装置可压缩至 16 x 3 x 91 毫米,展开后的最大尺寸为 24 x 25.5 x 77 毫米。该装置提供的径向硬度足以在放疗期间稳定直肠。角度控制可通过伸展致动器 7.5°/ml 的平均充气变化来实现。
{"title":"A soft hydraulic endorectal actuator for prostate radiotherapy","authors":"Aryan Niknam Maleki, Alexander Thompson, M. Runciman, Julia Murray, G. Mylonas","doi":"10.1109/ROBIO58561.2023.10355037","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10355037","url":null,"abstract":"Despite advances in radiotherapy, motion error remains a challenge in prostate radiotherapy. Rectal obturators and endorectal balloons may reduce motion error and improve outcomes but have limitations. We aimed to create a deployable rectal obturator with precise angle control to personalise to a patient’s rectal anatomy, by using an antagonistic pair of \"muscle\" actuators to flex and extend the device. Results on deployability, angle control, and radial stiffness are presented here. The device can be compressed down to 16 x 3 x 91 mm, and be deployed to maximum dimensions of 24 x 25.5 x 77 mm. The device provides radial stiffness that may be sufficient to stabilise the rectum during radiotherapy. Angle control can be achieved with an average change of 7.5°/ml inflation in the extensor actuator.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"49 3","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty in Bayesian Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards 针对奖励稀疏的机器人操纵任务的贝叶斯强化学习中的不确定性
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354785
Li Zheng, Yanghong Li, Yahao Wang, Guangrui Bai, Haiyang He, Erbao Dong
This paper aims to explore the application of Bayesian deep reinforcement learning (BDRL) in robot manipulation tasks with sparse rewards, focusing on addressing the uncertainty in complex and sparsely rewarded environments. Conventional deep reinforcement learning (DRL) algorithms still face significant challenges in the context of robot manipulation tasks. To address this issue, this paper proposes a general algorithm framework called BDRL that combines reinforcement learning algorithms with Bayesian networks to quantify the model uncertainty, aleatoric uncertainty in neural networks, and uncertainty in the reward function. The effectiveness and generality of the proposed algorithm are validated through simulation experiments on multiple sets of different sparsely rewarded tasks, employing various advanced DRL algorithms. The research results demonstrate that the DRL algorithm based on the Bayesian network mechanism significantly improves the convergence speed of the algorithms in sparse reward tasks by accurately estimating the model uncertainty.
本文旨在探索贝叶斯深度强化学习(BDRL)在奖励稀疏的机器人操纵任务中的应用,重点是解决复杂和奖励稀疏环境中的不确定性问题。传统的深度强化学习(DRL)算法在机器人操纵任务中仍面临巨大挑战。为解决这一问题,本文提出了一种名为 BDRL 的通用算法框架,该框架将强化学习算法与贝叶斯网络相结合,以量化模型的不确定性、神经网络的不确定性以及奖励函数的不确定性。本文采用各种先进的 DRL 算法,通过对多组不同的稀疏奖励任务进行模拟实验,验证了所提算法的有效性和通用性。研究结果表明,基于贝叶斯网络机制的 DRL 算法通过准确估计模型的不确定性,显著提高了稀疏奖励任务中算法的收敛速度。
{"title":"Uncertainty in Bayesian Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards","authors":"Li Zheng, Yanghong Li, Yahao Wang, Guangrui Bai, Haiyang He, Erbao Dong","doi":"10.1109/ROBIO58561.2023.10354785","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354785","url":null,"abstract":"This paper aims to explore the application of Bayesian deep reinforcement learning (BDRL) in robot manipulation tasks with sparse rewards, focusing on addressing the uncertainty in complex and sparsely rewarded environments. Conventional deep reinforcement learning (DRL) algorithms still face significant challenges in the context of robot manipulation tasks. To address this issue, this paper proposes a general algorithm framework called BDRL that combines reinforcement learning algorithms with Bayesian networks to quantify the model uncertainty, aleatoric uncertainty in neural networks, and uncertainty in the reward function. The effectiveness and generality of the proposed algorithm are validated through simulation experiments on multiple sets of different sparsely rewarded tasks, employing various advanced DRL algorithms. The research results demonstrate that the DRL algorithm based on the Bayesian network mechanism significantly improves the convergence speed of the algorithms in sparse reward tasks by accurately estimating the model uncertainty.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"48 6","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DynGraspVS: Servoing Aided Grasping for Dynamic Environments DynGraspVS:为动态环境提供伺服辅助抓取功能
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354813
Gunjan Gupta, Vedansh Mittal, K. M. Krishna
Visual servoing has been gaining popularity in various real-world vision-centric robotic applications. Autonomous robotic grasping often deals with unseen and unstructured environments, and in this task, Visual Servoing has been able to generate improved end-effector control by providing visual feedback. However, existing Servoing-aided grasping methods tend to fail at the task of grasping in dynamic environments i.e. - moving objects.In this paper, we introduce DynGraspVS, a novel Image-based Visual Servoing-aided Grasping approach that models the motion of moving objects in its interaction matrix. Leveraging a single-step rollout strategy, our approach achieves a remarkable increase in success rate, while converging faster and achieving a smoother trajectory, while maintaining precise alignments in six degrees of freedom. By integrating the velocity information into the interaction matrix, our method is able to successfully complete the challenging task of robotic grasping in the case of dynamic objects, while outperforming existing deep Model Predictive Control (MPC) based methods in the PyBullet simulation environment. We test it with a range of objects in the YCB dataset with varying range of shapes, sizes, and material properties. We report various evaluation metrics such as photometric error, success rate, time taken, and trajectory length.
视觉伺服在现实世界中各种以视觉为中心的机器人应用中越来越受欢迎。自主机器人抓取通常要面对看不见的非结构化环境,而在这项任务中,视觉伺服技术能够通过提供视觉反馈来改进末端执行器的控制。然而,现有的伺服辅助抓取方法往往无法在动态环境(即移动物体)中完成抓取任务。在本文中,我们介绍了基于图像的新型视觉伺服辅助抓取方法 DynGraspVS,它在交互矩阵中对移动物体的运动进行建模。利用单步推出策略,我们的方法显著提高了成功率,同时收敛速度更快,轨迹更平滑,并保持六个自由度的精确对准。通过将速度信息整合到交互矩阵中,我们的方法能够在动态物体的情况下成功完成机器人抓取这一具有挑战性的任务,同时在 PyBullet 仿真环境中优于现有的基于深度模型预测控制(MPC)的方法。我们用 YCB 数据集中一系列形状、大小和材料属性各不相同的物体对其进行了测试。我们报告了各种评估指标,如光度误差、成功率、耗时和轨迹长度。
{"title":"DynGraspVS: Servoing Aided Grasping for Dynamic Environments","authors":"Gunjan Gupta, Vedansh Mittal, K. M. Krishna","doi":"10.1109/ROBIO58561.2023.10354813","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354813","url":null,"abstract":"Visual servoing has been gaining popularity in various real-world vision-centric robotic applications. Autonomous robotic grasping often deals with unseen and unstructured environments, and in this task, Visual Servoing has been able to generate improved end-effector control by providing visual feedback. However, existing Servoing-aided grasping methods tend to fail at the task of grasping in dynamic environments i.e. - moving objects.In this paper, we introduce DynGraspVS, a novel Image-based Visual Servoing-aided Grasping approach that models the motion of moving objects in its interaction matrix. Leveraging a single-step rollout strategy, our approach achieves a remarkable increase in success rate, while converging faster and achieving a smoother trajectory, while maintaining precise alignments in six degrees of freedom. By integrating the velocity information into the interaction matrix, our method is able to successfully complete the challenging task of robotic grasping in the case of dynamic objects, while outperforming existing deep Model Predictive Control (MPC) based methods in the PyBullet simulation environment. We test it with a range of objects in the YCB dataset with varying range of shapes, sizes, and material properties. We report various evaluation metrics such as photometric error, success rate, time taken, and trajectory length.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"37 2","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Path Planning for Robotic Arm Based on Reinforcement Learning under the Train 基于列车下强化学习的机械臂路径规划
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354783
Guanhao Xie, Duo Zhao, Qichao Tang, Muhua Zhang, Wenjie Zhao, Yewen Wang
Due to the widespread use of robotic arms, path planning for them has always been a hot research topic. However, traditional path planning algorithms struggle to ensure low disparity in each path, making them unsuitable for operation scenarios with high safety requirements, such as the undercarriage environment of train. A Reinforcement Learning (RL) framework is proposed in this article to address this challenge. The Proximal Policy Optimization (PPO) algorithm has been enhanced, resulting in a variant referred to as Randomized PPO (RPPO), which demonstrates slightly accelerated convergence speed. Additionally, a reward model is proposed to assist the agent in escaping local optima. For modeling application environment, lidar is employed for obtaining obstacle point cloud information, which is then transformed into an octree grid map for maneuvering the robotic arm to avoid obstacles. According to the experimental results, the paths planned by our system are superior to those of RRT* in terms of both average length and standard deviation, and RPPO exhibits better convergence speed and path standard deviation compared to PPO.
由于机械臂的广泛应用,其路径规划一直是热门研究课题。然而,传统的路径规划算法难以确保每条路径的低差异,因此不适合安全要求较高的操作场景,如列车底盘环境。本文提出了一种强化学习(RL)框架来应对这一挑战。本文对近端策略优化(PPO)算法进行了改进,形成了一种称为随机 PPO(RPPO)的变体,其收敛速度略有加快。此外,还提出了一个奖励模型,以帮助代理摆脱局部最优状态。在模拟应用环境时,采用激光雷达获取障碍物点云信息,然后将其转换成八叉网格图,用于操纵机械臂避开障碍物。实验结果表明,我们的系统规划的路径在平均长度和标准偏差方面都优于 RRT*,与 PPO 相比,RPPO 表现出更好的收敛速度和路径标准偏差。
{"title":"Path Planning for Robotic Arm Based on Reinforcement Learning under the Train","authors":"Guanhao Xie, Duo Zhao, Qichao Tang, Muhua Zhang, Wenjie Zhao, Yewen Wang","doi":"10.1109/ROBIO58561.2023.10354783","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354783","url":null,"abstract":"Due to the widespread use of robotic arms, path planning for them has always been a hot research topic. However, traditional path planning algorithms struggle to ensure low disparity in each path, making them unsuitable for operation scenarios with high safety requirements, such as the undercarriage environment of train. A Reinforcement Learning (RL) framework is proposed in this article to address this challenge. The Proximal Policy Optimization (PPO) algorithm has been enhanced, resulting in a variant referred to as Randomized PPO (RPPO), which demonstrates slightly accelerated convergence speed. Additionally, a reward model is proposed to assist the agent in escaping local optima. For modeling application environment, lidar is employed for obtaining obstacle point cloud information, which is then transformed into an octree grid map for maneuvering the robotic arm to avoid obstacles. According to the experimental results, the paths planned by our system are superior to those of RRT* in terms of both average length and standard deviation, and RPPO exhibits better convergence speed and path standard deviation compared to PPO.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"34 12","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Reinforcement Learning for a Humanoid Robot Basketball Player 仿人机器人篮球运动员的深度强化学习
Pub Date : 2023-12-04 DOI: 10.1109/ROBIO58561.2023.10354565
Shuaiqi Zhang, Guodong Zhao, Peng Lin, Mingshuo Liu, Jianhua Dong, Haoyu Zhang
Currently, the majority of research on humanoid robot basketball shooting focuses on traditional control methods. However, these methods primarily rely on human-robot interaction and fixed shooting patterns to control the robot’s shooting actions, resulting in limited autonomy for the robot. They often require extensive manual design and coding operations, and face challenges in adapting to different shooting scenarios. To address these problems, this paper applies deep reinforcement learning to the basketball shooting task for a humanoid robot. The task environment is based on the basketball shooting competition defined in the FIRA HuroCup. This paper uses the Double DQN algorithm to train the humanoid robot to master end-to-end basketball shooting skills, specifically: The robot takes RGB images captured by its own head camera as input, then decides to take one of three discrete actions, including turning left, turning right, and shooting. In the experimental section, we validate the effectiveness of our approach and conduct an analysis and discussion on the setup of important parameters that influence the experimental results.
目前,有关仿人机器人篮球投篮的研究大多集中在传统的控制方法上。然而,这些方法主要依靠人机交互和固定的投篮模式来控制机器人的投篮动作,导致机器人的自主性有限。它们通常需要大量的人工设计和编码操作,在适应不同投篮场景方面也面临挑战。为了解决这些问题,本文将深度强化学习应用于仿人机器人的篮球投篮任务。任务环境基于 FIRA HuroCup 中定义的篮球投篮比赛。本文使用双 DQN 算法训练仿人机器人掌握端到端篮球投篮技能,具体来说,机器人在投篮时会捕捉 RGB 图像,并将其转换为 RGB 图像:机器人将自己头部摄像头捕捉到的 RGB 图像作为输入,然后决定从左转、右转和投篮三个离散动作中选择一个。在实验部分,我们验证了我们方法的有效性,并对影响实验结果的重要参数设置进行了分析和讨论。
{"title":"Deep Reinforcement Learning for a Humanoid Robot Basketball Player","authors":"Shuaiqi Zhang, Guodong Zhao, Peng Lin, Mingshuo Liu, Jianhua Dong, Haoyu Zhang","doi":"10.1109/ROBIO58561.2023.10354565","DOIUrl":"https://doi.org/10.1109/ROBIO58561.2023.10354565","url":null,"abstract":"Currently, the majority of research on humanoid robot basketball shooting focuses on traditional control methods. However, these methods primarily rely on human-robot interaction and fixed shooting patterns to control the robot’s shooting actions, resulting in limited autonomy for the robot. They often require extensive manual design and coding operations, and face challenges in adapting to different shooting scenarios. To address these problems, this paper applies deep reinforcement learning to the basketball shooting task for a humanoid robot. The task environment is based on the basketball shooting competition defined in the FIRA HuroCup. This paper uses the Double DQN algorithm to train the humanoid robot to master end-to-end basketball shooting skills, specifically: The robot takes RGB images captured by its own head camera as input, then decides to take one of three discrete actions, including turning left, turning right, and shooting. In the experimental section, we validate the effectiveness of our approach and conduct an analysis and discussion on the setup of important parameters that influence the experimental results.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"45 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139187376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1