首页 > 最新文献

2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)最新文献

英文 中文
Structure-Aware Policy to Improve Generalization among Various Robots and Environments 结构感知策略在各种机器人和环境中提高泛化
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011945
Wei-qing Xu, Yue Gao, Buqing Nie
Recently, Deep Reinforcement Learning (DRL) has been used to solve complex robot control tasks with outstanding success. However, previous DRL methods still exist some shortcomings, such as poor generalization performance, which makes policy performance quite sensitive to small vari-ations of the task settings. Besides, it is quite time-consuming and computationally expensive to retrain a new policy from scratch for new tasks, hence restricts the applications of DRL-based methods in the real world. In this work, we propose a novel DRL generalization method called GNN-embedding, which incorporates the robot hardware and the environment simultaneously with GNN-based policy network and learnable embedding vectors of tasks. Thus, it can learn a unified policy for different robots under different environment conditions, which improves the generalization performance of existing DRL robot policies. Multiple experiments on the Hopper-v2 robot are conducted. The experimental results demonstrate the effectiveness and efficiency of GNN-embedding on generalization, including multi-task learning and transfer learning problems.
近年来,深度强化学习(DRL)被用于解决复杂的机器人控制任务,并取得了显著的成功。然而,以往的DRL方法仍然存在一些缺点,如泛化性能差,使得策略性能对任务设置的微小变化非常敏感。此外,为新任务重新训练新策略非常耗时和计算成本高,从而限制了基于drl的方法在现实世界中的应用。在这项工作中,我们提出了一种新的DRL泛化方法,称为gnn嵌入,该方法将机器人硬件和环境同时与基于gnn的策略网络和可学习的任务嵌入向量结合起来。从而可以在不同的环境条件下学习到针对不同机器人的统一策略,提高了现有DRL机器人策略的泛化性能。对Hopper-v2机器人进行了多次实验。实验结果证明了gnn嵌入在多任务学习和迁移学习等泛化问题上的有效性和高效性。
{"title":"Structure-Aware Policy to Improve Generalization among Various Robots and Environments","authors":"Wei-qing Xu, Yue Gao, Buqing Nie","doi":"10.1109/ROBIO55434.2022.10011945","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011945","url":null,"abstract":"Recently, Deep Reinforcement Learning (DRL) has been used to solve complex robot control tasks with outstanding success. However, previous DRL methods still exist some shortcomings, such as poor generalization performance, which makes policy performance quite sensitive to small vari-ations of the task settings. Besides, it is quite time-consuming and computationally expensive to retrain a new policy from scratch for new tasks, hence restricts the applications of DRL-based methods in the real world. In this work, we propose a novel DRL generalization method called GNN-embedding, which incorporates the robot hardware and the environment simultaneously with GNN-based policy network and learnable embedding vectors of tasks. Thus, it can learn a unified policy for different robots under different environment conditions, which improves the generalization performance of existing DRL robot policies. Multiple experiments on the Hopper-v2 robot are conducted. The experimental results demonstrate the effectiveness and efficiency of GNN-embedding on generalization, including multi-task learning and transfer learning problems.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122630566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Legless Squamate Reptiles Inspired Design: Simple Soft Crawling Actuator 灵感设计:简单的软爬行驱动器
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011788
Huichen Ma, Junjie Zhou, Lijun Meng, Jianghao Jiang, Sanxi Ma
This paper presents a novel pneumatic soft crawling actuator that exploits scale-like chassis to move. Based on the lateral undulation movement, bellows-type actuators are designed with embedded fluidic chambers that produce bidirectional bending when pressurized. Three chassis structures are created and manufactured to simulate the anisotropy friction by analyzing the legless squamate reptile motion principle. Inspired by the rigid snake robot modeling, a framework to solve the dynamic behavior problem of a soft crawling actuator is further modeled. Particularly, the expected movement has been achieved. Through quantitative analysis, the horizontal belt type shows a more effective drive. Locomotion experimental results of the soft crawling actuator prototype on a carpeted surface show good agreement with model predictions. The demonstrations of terrain adaptability prove movement ability in complicated and constrained environments such as a steep slope, ladders surface, and step surface.
本文提出了一种新型的气动软爬行执行机构,该机构利用鳞片状底盘进行移动。基于横向波动运动,设计了波纹管型执行器,其内嵌式流体室在加压时产生双向弯曲。通过分析无腿爬行动物的运动原理,制作了三种底盘结构,模拟了底盘的各向异性摩擦。受刚体蛇形机器人建模的启发,进一步建立了求解柔性爬行作动器动态行为问题的框架。特别是,预期的运动已经实现。通过定量分析,水平带式传动更为有效。软爬行作动器样机在地毯表面上的运动实验结果与模型预测结果吻合较好。地形适应性的演示证明了在陡坡、梯面、台阶面等复杂约束环境下的移动能力。
{"title":"Legless Squamate Reptiles Inspired Design: Simple Soft Crawling Actuator","authors":"Huichen Ma, Junjie Zhou, Lijun Meng, Jianghao Jiang, Sanxi Ma","doi":"10.1109/ROBIO55434.2022.10011788","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011788","url":null,"abstract":"This paper presents a novel pneumatic soft crawling actuator that exploits scale-like chassis to move. Based on the lateral undulation movement, bellows-type actuators are designed with embedded fluidic chambers that produce bidirectional bending when pressurized. Three chassis structures are created and manufactured to simulate the anisotropy friction by analyzing the legless squamate reptile motion principle. Inspired by the rigid snake robot modeling, a framework to solve the dynamic behavior problem of a soft crawling actuator is further modeled. Particularly, the expected movement has been achieved. Through quantitative analysis, the horizontal belt type shows a more effective drive. Locomotion experimental results of the soft crawling actuator prototype on a carpeted surface show good agreement with model predictions. The demonstrations of terrain adaptability prove movement ability in complicated and constrained environments such as a steep slope, ladders surface, and step surface.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122800597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Deep Meta-Metric Learning Method for Few-Shot Weld Seam Visual Detection 基于深度元度量学习的少射焊缝视觉检测方法
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10012017
Tianchen Zhu, Shiqiang Zhu, Jiakai Zhu, Wei Song, Cunjun Li, Hongjiang Ge, Jianjun Gu
Deep learning-based object detection algorithms are gradually promoted in industrial visual detection due to their versatility and high accuracy. These algorithms usually require large amounts of training data, however there is a problem of lack of training samples in actual weld seam detection tasks that challenges the weld seam visual detection task. To improve the performance on weld seam detection, especially for those few-shot tasks, this paper proposes a meta-metric learning method for few-shot weld seam detection. The method introduces a distance metric-learning module besides the meta-learning algorithm. By optimizing the training strategy and classification mode of the base detection model, the method accelerates the training process and improves the learning capability on few-shot weld seam samples. Compared with the base model, the mAP of the method proposed in this paper on the weld seam dataset is improved by about 8.9%.
基于深度学习的目标检测算法因其通用性和准确性高,在工业视觉检测中逐渐得到推广。这些算法通常需要大量的训练数据,但在实际的焊缝检测任务中存在训练样本不足的问题,给焊缝视觉检测任务带来了挑战。为了提高焊缝检测的性能,特别是针对少射点任务,本文提出了一种用于少射点焊缝检测的元度量学习方法。该方法在元学习算法的基础上引入了距离度量学习模块。该方法通过优化基检测模型的训练策略和分类模式,加快了训练过程,提高了对少射焊缝样本的学习能力。与基本模型相比,本文方法在焊缝数据集上的mAP提高了约8.9%。
{"title":"A Deep Meta-Metric Learning Method for Few-Shot Weld Seam Visual Detection","authors":"Tianchen Zhu, Shiqiang Zhu, Jiakai Zhu, Wei Song, Cunjun Li, Hongjiang Ge, Jianjun Gu","doi":"10.1109/ROBIO55434.2022.10012017","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10012017","url":null,"abstract":"Deep learning-based object detection algorithms are gradually promoted in industrial visual detection due to their versatility and high accuracy. These algorithms usually require large amounts of training data, however there is a problem of lack of training samples in actual weld seam detection tasks that challenges the weld seam visual detection task. To improve the performance on weld seam detection, especially for those few-shot tasks, this paper proposes a meta-metric learning method for few-shot weld seam detection. The method introduces a distance metric-learning module besides the meta-learning algorithm. By optimizing the training strategy and classification mode of the base detection model, the method accelerates the training process and improves the learning capability on few-shot weld seam samples. Compared with the base model, the mAP of the method proposed in this paper on the weld seam dataset is improved by about 8.9%.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121275146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-sensor Fusion for Stiffness Estimation to Assist Legged Robot Control in Unstructured Environment 多传感器融合刚度估计辅助腿式机器人非结构环境控制
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011688
Yue Gao, Huajian Wu, Mingdong Sun
Legged robot is designed for more flexibility when navigating in complex unstructured environment. When the end-effectors of the robot contacting non-rigid ground, the robot sinks due to different stiffness of the ground. This presents a challenge for accurate and robust control of the upper platform. In this paper, a real-time muti-sensor fusion method Dual Parallelizable Particle Filter (DPPF) is proposed to estimate ground stiffness. DPPF utilized RGB-D camera, IMU and 3-DoF force sensors. Meanwhile, we established a ground material database and trained a real-time ground segmentation network to assist the stiffness estimation of the ground. Then the information of ground material is utilized as a prior distribution for DPPF to achieve faster stiffness estimation. The experiments on synthetic data and on six-legged robot show that DPPF has faster computing speed, fewer convergent steps than previous state estimation methods. The estimated stiffness can be utilized for legged robot impedance control, posture control and trajectory planning.
腿式机器人是为了在复杂的非结构化环境中更灵活地导航而设计的。当机器人末端执行器接触非刚性地面时,由于地面刚度的不同,机器人会发生下沉。这对上部平台的精确和鲁棒控制提出了挑战。提出了一种基于双并行粒子滤波(Dual Parallelizable Particle Filter, DPPF)的实时多传感器融合地面刚度估计方法。DPPF采用RGB-D摄像头、IMU和3自由度力传感器。同时,我们建立了地面材料数据库,训练了实时地面分割网络,以辅助地面刚度估计。然后利用地面材料信息作为DPPF的先验分布,实现更快的刚度估计。在合成数据和六足机器人上的实验表明,DPPF算法比以往的状态估计方法具有更快的计算速度和更少的收敛步骤。估计的刚度可用于腿式机器人的阻抗控制、姿态控制和轨迹规划。
{"title":"Multi-sensor Fusion for Stiffness Estimation to Assist Legged Robot Control in Unstructured Environment","authors":"Yue Gao, Huajian Wu, Mingdong Sun","doi":"10.1109/ROBIO55434.2022.10011688","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011688","url":null,"abstract":"Legged robot is designed for more flexibility when navigating in complex unstructured environment. When the end-effectors of the robot contacting non-rigid ground, the robot sinks due to different stiffness of the ground. This presents a challenge for accurate and robust control of the upper platform. In this paper, a real-time muti-sensor fusion method Dual Parallelizable Particle Filter (DPPF) is proposed to estimate ground stiffness. DPPF utilized RGB-D camera, IMU and 3-DoF force sensors. Meanwhile, we established a ground material database and trained a real-time ground segmentation network to assist the stiffness estimation of the ground. Then the information of ground material is utilized as a prior distribution for DPPF to achieve faster stiffness estimation. The experiments on synthetic data and on six-legged robot show that DPPF has faster computing speed, fewer convergent steps than previous state estimation methods. The estimated stiffness can be utilized for legged robot impedance control, posture control and trajectory planning.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"696 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132795592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Robust Invariant Extended Kalman filtering for Biped Robot* 双足机器人的自适应鲁棒不变扩展卡尔曼滤波
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011668
Chengzhi Gao, Ye Xie, Shiqiang Zhu, Guanyu Huang, Lingyu Kong, Anhuan Xie, J. Gu, Dan Zhang, Jun Shao, Haofu Qian
The precise estimation of system states is essential for the locomotion control of biped robots to maintain balance. Currently, the estimation of system states is either based on vision data that is susceptible to the environment, or fusing data from Inertial Measurement Unit (IMU) and the kinematic calculation. Invariant Extended Kalman Filtering (IEKF) is one of the most successful fusing algorithms to estimate system state. Generally, in IEKF, the noise covariance of system state is assumed to be known. However, the noise covariance of contact point for biped is not available since the ground-contact situation normally varies and not previously known. This paper presents a new fusing algorithm-Adaptive Robust Invariant Extended Kalman Filtering (ARIEKF) to adaptively adjust the noise parameter of contact point. The proposed algorithm applied the principle of robust estimation to resist outlier effects of state, and introduced an adaptive factor for the noise covariance of state to control its outlying disturbance influences. This paper firstly completed the full state estimation of biped robot using the theory of Lie groups and invariant observer. Then, the adaptive scale factor evaluated by three-segment approach was adopted to adjust the noise covariance of contact point. Finally, both IEKF and proposed ARIEKF are applied to our biped robot-Cosmos and the accuracy of two algorithms are compared. The mean square errors of the velocity of two algorithms were evaluated using the measurements from motion capture system. Experiments demonstrated that the mean square errors of the velocity are decreased 50 percent when compared with IEKF.
系统状态的精确估计是双足机器人保持平衡运动控制的关键。目前,系统状态的估计要么是基于易受环境影响的视觉数据,要么是将惯性测量单元(IMU)的数据与运动计算相融合。不变扩展卡尔曼滤波(IEKF)是一种最成功的估计系统状态的融合算法。通常,在IEKF中,假设系统状态的噪声协方差是已知的。然而,由于地面接触情况通常是变化的,并且事先不知道,因此无法获得两足动物接触点的噪声协方差。提出了一种新的融合算法——自适应鲁棒不变扩展卡尔曼滤波(ARIEKF),用于自适应调整接触点的噪声参数。该算法应用鲁棒估计原理来抵抗状态的离群效应,并引入状态噪声协方差自适应因子来控制状态的离群干扰影响。本文首先利用李群理论和不变观测器完成了双足机器人的全状态估计。然后,采用三段法评估的自适应尺度因子对接触点的噪声协方差进行调整;最后,将IEKF算法和ARIEKF算法应用于我们的双足机器人- cosmos,并比较了两种算法的精度。利用运动捕捉系统的测量结果,评估了两种算法的速度均方误差。实验表明,与IEKF相比,速度的均方误差减小了50%。
{"title":"Adaptive Robust Invariant Extended Kalman filtering for Biped Robot*","authors":"Chengzhi Gao, Ye Xie, Shiqiang Zhu, Guanyu Huang, Lingyu Kong, Anhuan Xie, J. Gu, Dan Zhang, Jun Shao, Haofu Qian","doi":"10.1109/ROBIO55434.2022.10011668","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011668","url":null,"abstract":"The precise estimation of system states is essential for the locomotion control of biped robots to maintain balance. Currently, the estimation of system states is either based on vision data that is susceptible to the environment, or fusing data from Inertial Measurement Unit (IMU) and the kinematic calculation. Invariant Extended Kalman Filtering (IEKF) is one of the most successful fusing algorithms to estimate system state. Generally, in IEKF, the noise covariance of system state is assumed to be known. However, the noise covariance of contact point for biped is not available since the ground-contact situation normally varies and not previously known. This paper presents a new fusing algorithm-Adaptive Robust Invariant Extended Kalman Filtering (ARIEKF) to adaptively adjust the noise parameter of contact point. The proposed algorithm applied the principle of robust estimation to resist outlier effects of state, and introduced an adaptive factor for the noise covariance of state to control its outlying disturbance influences. This paper firstly completed the full state estimation of biped robot using the theory of Lie groups and invariant observer. Then, the adaptive scale factor evaluated by three-segment approach was adopted to adjust the noise covariance of contact point. Finally, both IEKF and proposed ARIEKF are applied to our biped robot-Cosmos and the accuracy of two algorithms are compared. The mean square errors of the velocity of two algorithms were evaluated using the measurements from motion capture system. Experiments demonstrated that the mean square errors of the velocity are decreased 50 percent when compared with IEKF.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133973682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vascular Environment Identification Based on Multi-dimensional Data Fusion for Interventional Surgical Robots 基于多维数据融合的介入手术机器人血管环境识别
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011932
Dongni. Yang, Wei Wei, Jiaqian Li, Nan Xiao
Vascular interventional surgery is the most commonly used method for the treatment of cardio-vascular and cerebrovascular diseases. Master-slave interventional surgical robot is a promising technology, which can further improve the accuracy and safety of surgery. However, imperfect measurement of catheter force remains a surgical risk. Inspired by the function of insect antennae, a thin-film force sensing device was installed in the catheter head. Combined with the pressure sensor in the catheter clamping device, the LSTM network was used to predict and classify the curvature of the current passing vessel, and the recognition accuracy was 97%. In the process of robotic surgery, real-time feedback of current pressure information and vascular curvature information can enhance the doctor's judgment of the operation state and improve the safety of surgery.
血管介入手术是治疗心脑血管疾病最常用的方法。主从式介入手术机器人是一种很有前途的技术,它可以进一步提高手术的准确性和安全性。然而,不完善的导管力测量仍然是一个手术风险。受昆虫触角功能的启发,在导管头部安装了薄膜力传感装置。结合导管夹紧装置中的压力传感器,利用LSTM网络对流经血管的曲率进行预测和分类,识别准确率达到97%。在机器人手术过程中,实时反馈当前压力信息和血管曲率信息,可以增强医生对手术状态的判断,提高手术的安全性。
{"title":"Vascular Environment Identification Based on Multi-dimensional Data Fusion for Interventional Surgical Robots","authors":"Dongni. Yang, Wei Wei, Jiaqian Li, Nan Xiao","doi":"10.1109/ROBIO55434.2022.10011932","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011932","url":null,"abstract":"Vascular interventional surgery is the most commonly used method for the treatment of cardio-vascular and cerebrovascular diseases. Master-slave interventional surgical robot is a promising technology, which can further improve the accuracy and safety of surgery. However, imperfect measurement of catheter force remains a surgical risk. Inspired by the function of insect antennae, a thin-film force sensing device was installed in the catheter head. Combined with the pressure sensor in the catheter clamping device, the LSTM network was used to predict and classify the curvature of the current passing vessel, and the recognition accuracy was 97%. In the process of robotic surgery, real-time feedback of current pressure information and vascular curvature information can enhance the doctor's judgment of the operation state and improve the safety of surgery.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131838099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 学习使用深度强化学习和锚定信息操纵工具
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10012027
Junhang Wei, Shaowei Cui, Peng Hao, Shuo Wang
Endowing robots with tool manipulation skills helps them accomplish challenging tasks. While robots manipulate tools to achieve goals, the alignment of tools and targets is a noise-sensitive and contact-rich task. However, it is difficult to access the accurate pose of the tool and the target. When there is unknown noise in the observations, reinforcement learning can't be sure to perform well. In this paper, we define the easier-to-obtain accurate task-related information as anchor information and introduce a tool manipulation method based on reinforcement learning and anchor information, which can perform well when the observations include unknown noise. To evaluate the method, we build a simulated environment ToolGym, which includes four different kinds of tools and different noise sampling functions for each tool. Finally, we compare our method with baseline methods to show the effectiveness of the proposed method.
赋予机器人工具操作技能有助于它们完成具有挑战性的任务。当机器人操纵工具来实现目标时,工具和目标的对齐是一项噪声敏感和接触丰富的任务。然而,很难获得刀具和目标的准确位姿。当观察结果中存在未知噪声时,强化学习不能保证表现良好。本文将较容易获得准确的任务相关信息定义为锚点信息,并引入了一种基于强化学习和锚点信息的工具操作方法,该方法可以在观测值中包含未知噪声时表现良好。为了评估该方法,我们构建了一个模拟环境ToolGym,其中包括四种不同的工具和每种工具不同的噪声采样函数。最后,我们将我们的方法与基线方法进行了比较,以显示所提出方法的有效性。
{"title":"Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information","authors":"Junhang Wei, Shaowei Cui, Peng Hao, Shuo Wang","doi":"10.1109/ROBIO55434.2022.10012027","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10012027","url":null,"abstract":"Endowing robots with tool manipulation skills helps them accomplish challenging tasks. While robots manipulate tools to achieve goals, the alignment of tools and targets is a noise-sensitive and contact-rich task. However, it is difficult to access the accurate pose of the tool and the target. When there is unknown noise in the observations, reinforcement learning can't be sure to perform well. In this paper, we define the easier-to-obtain accurate task-related information as anchor information and introduce a tool manipulation method based on reinforcement learning and anchor information, which can perform well when the observations include unknown noise. To evaluate the method, we build a simulated environment ToolGym, which includes four different kinds of tools and different noise sampling functions for each tool. Finally, we compare our method with baseline methods to show the effectiveness of the proposed method.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134189177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-stage Precision Landing Method for Autonomous eVTOL Based on Multi-marker Joint Localization 基于多标记关节定位的自主eVTOL多级精密着陆方法
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011837
Senwei Xiang, Minxiang Ye, Shiqiang Zhu, J. Gu, Anhuan Xie, Zehua Men
Electric vertical takeoff and landing aircraft (eVTOL) has drawn more and more attention from home and abroad in recent years. It is believed autonomous eVTOL will create a new era of Urban Air Mobility (UAM) and Advanced Air Mobility (AAM). To autonomous eVTOL, precision landing is an extremely critical operation for it directly affects flight safety. In this paper, we analyze the special issues an eVTOL will encounter when it lands in the UAM applications. A multi-stage precision landing method based on multi-marker joint localization is proposed accordingly. Our method contains three key elements: a compatible vertiport with multiple visual markers; an accurate, fast detection and localization algorithm for the vertiport and a multi-stage landing strategy. We implement our method on an eVTOL prototype named ZJ-Copter developed by Zhejiang Lab. A series of real-world experiments have been conducted to validate the effectiveness and accuracy of the proposed method. Experiment results show that our method works well in real-world scenarios for autonomous eVTOL without GPS signal during landing process. The positioning accuracy is less than 0.1m (altitude $< mathbf{10}mathbf{m}$), while the landing accuracy is less than 0.5m.
电动垂直起降飞机(eVTOL)近年来越来越受到国内外的关注。人们相信,自动eVTOL将开创城市空中交通(UAM)和先进空中交通(AAM)的新时代。对于自主eVTOL来说,精确着陆是一项极其关键的操作,它直接影响到飞行安全。本文分析了eVTOL在UAM应用中遇到的一些特殊问题。据此,提出了一种基于多标记关节定位的多级精密着陆方法。我们的方法包含三个关键元素:具有多个视觉标记的兼容垂直端口;一种准确、快速的垂直起降检测与定位算法及多级降落策略。我们在浙江实验室开发的eVTOL原型机ZJ-Copter上实现了我们的方法。一系列的实际实验验证了该方法的有效性和准确性。实验结果表明,该方法在无GPS信号的自主eVTOL着陆过程中效果良好。定位精度小于0.1m(高度$< mathbf{10}mathbf{m}$),着陆精度小于0.5m。
{"title":"A Multi-stage Precision Landing Method for Autonomous eVTOL Based on Multi-marker Joint Localization","authors":"Senwei Xiang, Minxiang Ye, Shiqiang Zhu, J. Gu, Anhuan Xie, Zehua Men","doi":"10.1109/ROBIO55434.2022.10011837","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011837","url":null,"abstract":"Electric vertical takeoff and landing aircraft (eVTOL) has drawn more and more attention from home and abroad in recent years. It is believed autonomous eVTOL will create a new era of Urban Air Mobility (UAM) and Advanced Air Mobility (AAM). To autonomous eVTOL, precision landing is an extremely critical operation for it directly affects flight safety. In this paper, we analyze the special issues an eVTOL will encounter when it lands in the UAM applications. A multi-stage precision landing method based on multi-marker joint localization is proposed accordingly. Our method contains three key elements: a compatible vertiport with multiple visual markers; an accurate, fast detection and localization algorithm for the vertiport and a multi-stage landing strategy. We implement our method on an eVTOL prototype named ZJ-Copter developed by Zhejiang Lab. A series of real-world experiments have been conducted to validate the effectiveness and accuracy of the proposed method. Experiment results show that our method works well in real-world scenarios for autonomous eVTOL without GPS signal during landing process. The positioning accuracy is less than 0.1m (altitude $< mathbf{10}mathbf{m}$), while the landing accuracy is less than 0.5m.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130308015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Road Defect Detection Based on Semantic Transformed Disparity Image Segmentation 基于语义变换视差图像分割的道路缺陷检测
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011748
Li Wang, Wenbo Shi, Haozhe Zhu, D. Zhang, Yikang Zhang, Jiahe Fan, M. J. Bocus
Road defects can severely affect the safety of road users and vehicle conditions. Over the past decade, due to the limited amount of labeled training data, machine vision-based road defect detection approaches have been mainly used, while machine/deep learning-based methods were merely discussed. With the recent development of artificial intelligence, convolutional neural network (CNN)-based road defect detection systems for automated road condition assessment have become an active sphere of study. In this regard, this paper presents a comprehensive road defect detection system based on computer stereo vision, non-linear regression, and CNN. A dense disparity image is first estimated from a pair of stereo road images using an efficient stereo matching algorithm. The estimated disparity image is then transformed to better identify road defects by minimizing a global energy function w.r.t. road disparity projection model coefficients and stereo rig roll angle, using the non-linear regression approach. Finally, three popular semantic segmentation CNNs are trained using the transformed disparity images. Extensive experiments are conducted to demonstrate the performance of our proposed road defect detection approach. The achieved pixel-level accuracy and intersection over union (IoU) are 98.37% and 67.65%, respectively.
道路缺陷会严重影响道路使用者的安全和车辆状况。在过去的十年中,由于标记训练数据的数量有限,基于机器视觉的道路缺陷检测方法被主要使用,而基于机器/深度学习的方法仅被讨论。随着人工智能的发展,基于卷积神经网络(CNN)的道路缺陷检测系统已成为一个活跃的研究领域。为此,本文提出了一种基于计算机立体视觉、非线性回归和CNN的综合道路缺陷检测系统。首先利用一种高效的立体匹配算法从一对立体道路图像中估计出密集的视差图像。然后使用非线性回归方法,通过最小化全局能量函数w.r.t.道路视差投影模型系数和立体钻机侧倾角,对估计的视差图像进行转换,以更好地识别道路缺陷。最后,利用变换后的视差图像训练了三种常用的语义分割cnn。大量的实验证明了我们提出的道路缺陷检测方法的性能。所获得的像素级精度和IoU分别为98.37%和67.65%。
{"title":"Road Defect Detection Based on Semantic Transformed Disparity Image Segmentation","authors":"Li Wang, Wenbo Shi, Haozhe Zhu, D. Zhang, Yikang Zhang, Jiahe Fan, M. J. Bocus","doi":"10.1109/ROBIO55434.2022.10011748","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011748","url":null,"abstract":"Road defects can severely affect the safety of road users and vehicle conditions. Over the past decade, due to the limited amount of labeled training data, machine vision-based road defect detection approaches have been mainly used, while machine/deep learning-based methods were merely discussed. With the recent development of artificial intelligence, convolutional neural network (CNN)-based road defect detection systems for automated road condition assessment have become an active sphere of study. In this regard, this paper presents a comprehensive road defect detection system based on computer stereo vision, non-linear regression, and CNN. A dense disparity image is first estimated from a pair of stereo road images using an efficient stereo matching algorithm. The estimated disparity image is then transformed to better identify road defects by minimizing a global energy function w.r.t. road disparity projection model coefficients and stereo rig roll angle, using the non-linear regression approach. Finally, three popular semantic segmentation CNNs are trained using the transformed disparity images. Extensive experiments are conducted to demonstrate the performance of our proposed road defect detection approach. The achieved pixel-level accuracy and intersection over union (IoU) are 98.37% and 67.65%, respectively.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130354545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual manipulator collaborative shaft slot assembly via MADDPG 双机械手协同轴槽装配madpg
Pub Date : 2022-12-05 DOI: 10.1109/ROBIO55434.2022.10011768
Junying Yao, Xiaojuan Wang, Renqiang Li, Wenxiao Wang, X. Ping, Yongkui Liu
The traditional dual manipulator control systems have not only complex motion coupling problems, but also larger computational burden, and hence it is difficult to meet the requirements of intelligent assembly. In this paper, based on multi-agent reinforcement learning theory, Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm is investigated in the collaborative assembly shaft slot assembly via dual manipulator system. For the collaborative shaft slot assembly in the dual manipulator system, sparse rewards in traditional multi-agent reinforcement learning often exist because of the long sequence decision-making problem. For the above problems, this paper considers the influence of the decision-making of a single manipulator on the overall task rewards when the overall rewards of multi -agent reinforcement learning are designed. In the proposed algorithm, by calculating the difference before and after the state of each manipulator, and applying the difference as the internal state excitation to the overall task rewards, the traditional reward function of multi-agent reinforcement learning is improved. In order to verify the designed algorithm, the dual manipulator shaft slot assembly system and test scenario are established on the CoppeliaSim simulation platform. Simulation results show that the success rate of the shaft slot assembly via the improved MADDPG algorithm is about 83 % *
传统的双机械手控制系统不仅存在复杂的运动耦合问题,而且计算量较大,难以满足智能装配的要求。基于多智能体强化学习理论,研究了双机械臂协同装配轴槽装配中的多智能体深度确定性策略梯度(madpg)算法。对于双机械臂系统下的协作轴槽装配,传统的多智能体强化学习由于决策过程的长序列问题,往往存在奖励稀疏的问题。针对上述问题,本文在设计多智能体强化学习的整体奖励时,考虑了单个机械手的决策对整体任务奖励的影响。该算法通过计算各机械手状态前后的差值,并将差值作为整体任务奖励的内部状态激励,对传统的多智能体强化学习奖励函数进行了改进。为了验证所设计的算法,在CoppeliaSim仿真平台上建立了双机械手轴槽装配系统和测试场景。仿真结果表明,改进的madpg算法对轴槽装配的成功率约为83% *
{"title":"Dual manipulator collaborative shaft slot assembly via MADDPG","authors":"Junying Yao, Xiaojuan Wang, Renqiang Li, Wenxiao Wang, X. Ping, Yongkui Liu","doi":"10.1109/ROBIO55434.2022.10011768","DOIUrl":"https://doi.org/10.1109/ROBIO55434.2022.10011768","url":null,"abstract":"The traditional dual manipulator control systems have not only complex motion coupling problems, but also larger computational burden, and hence it is difficult to meet the requirements of intelligent assembly. In this paper, based on multi-agent reinforcement learning theory, Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm is investigated in the collaborative assembly shaft slot assembly via dual manipulator system. For the collaborative shaft slot assembly in the dual manipulator system, sparse rewards in traditional multi-agent reinforcement learning often exist because of the long sequence decision-making problem. For the above problems, this paper considers the influence of the decision-making of a single manipulator on the overall task rewards when the overall rewards of multi -agent reinforcement learning are designed. In the proposed algorithm, by calculating the difference before and after the state of each manipulator, and applying the difference as the internal state excitation to the overall task rewards, the traditional reward function of multi-agent reinforcement learning is improved. In order to verify the designed algorithm, the dual manipulator shaft slot assembly system and test scenario are established on the CoppeliaSim simulation platform. Simulation results show that the success rate of the shaft slot assembly via the improved MADDPG algorithm is about 83 % *","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"318 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115026022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1