Deep Reinforcement Learning-Based Collision-Free Navigation for Magnetic Helical Microrobots in Dynamic Environments

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-08 DOI:10.1109/TASE.2024.3470810
Huaping Wang;Yukang Qiu;Yaozhen Hou;Qing Shi;Hen-Wei Huang;Qiang Huang;Toshio Fukuda
{"title":"Deep Reinforcement Learning-Based Collision-Free Navigation for Magnetic Helical Microrobots in Dynamic Environments","authors":"Huaping Wang;Yukang Qiu;Yaozhen Hou;Qing Shi;Hen-Wei Huang;Qiang Huang;Toshio Fukuda","doi":"10.1109/TASE.2024.3470810","DOIUrl":null,"url":null,"abstract":"Magnetic helical microrobots have great potential in biomedical applications due to their ability to access confined and enclosed environments via remote manipulation by magnetic fields. However, achieving collision-free navigation for microrobots in complex and unstructured environments, particularly in highly dynamic settings, remains a challenge. In this paper, we present a novel deep reinforcement learning-based control framework for magnetic helical microrobots, focusing on the tasks of goal-reaching and dynamic obstacle avoidance. To streamline data collection, a specialized training environment capturing essential aspects of navigation for magnetic helical microrobots is devised. The robustness and adaptability of the trained policy are supported using a randomization technique within the training environment. To facilitate seamless integration with real-world magnetic actuation systems, a visual processing algorithm based on OpenCV is devised and incorporated to collect policy observations. Simulations and experiments in various scenarios validate the high robustness and adaptability of the method. The performance assessment revealed a success rate of 99% in navigating the microrobot around 4 dynamic obstacles of comparable speeds and a success rate of 90% in environments with 14 dynamic obstacles. The results indicate the potential for future applications of our method in unstructured, confined, and dynamic living environments.Note to Practitioners—The motivation of this work is to develop a robust and effective control scheme for collision-free navigation of magnetic helical microrobots in dynamic environments. The conventional navigation strategies in dynamic environments mainly include global path planning and local path replanning; thus, highly dynamic environments require frequent updates to the planned path, making it difficult to apply in highly dynamic environments. In this work, a deep reinforcement learning-based control framework is proposed that can guide microrobots through many dynamic obstacles to a series of locations without collisions. The simulation and experimental results validate the efficacy of the proposed control framework and the robustness and adaptability of the trained policy. The proposed control scheme enables better understanding of advanced motion control methods for magnetic microrobots.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"7810-7820"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10709338/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Magnetic helical microrobots have great potential in biomedical applications due to their ability to access confined and enclosed environments via remote manipulation by magnetic fields. However, achieving collision-free navigation for microrobots in complex and unstructured environments, particularly in highly dynamic settings, remains a challenge. In this paper, we present a novel deep reinforcement learning-based control framework for magnetic helical microrobots, focusing on the tasks of goal-reaching and dynamic obstacle avoidance. To streamline data collection, a specialized training environment capturing essential aspects of navigation for magnetic helical microrobots is devised. The robustness and adaptability of the trained policy are supported using a randomization technique within the training environment. To facilitate seamless integration with real-world magnetic actuation systems, a visual processing algorithm based on OpenCV is devised and incorporated to collect policy observations. Simulations and experiments in various scenarios validate the high robustness and adaptability of the method. The performance assessment revealed a success rate of 99% in navigating the microrobot around 4 dynamic obstacles of comparable speeds and a success rate of 90% in environments with 14 dynamic obstacles. The results indicate the potential for future applications of our method in unstructured, confined, and dynamic living environments.Note to Practitioners—The motivation of this work is to develop a robust and effective control scheme for collision-free navigation of magnetic helical microrobots in dynamic environments. The conventional navigation strategies in dynamic environments mainly include global path planning and local path replanning; thus, highly dynamic environments require frequent updates to the planned path, making it difficult to apply in highly dynamic environments. In this work, a deep reinforcement learning-based control framework is proposed that can guide microrobots through many dynamic obstacles to a series of locations without collisions. The simulation and experimental results validate the efficacy of the proposed control framework and the robustness and adaptability of the trained policy. The proposed control scheme enables better understanding of advanced motion control methods for magnetic microrobots.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的磁螺旋微机器人在动态环境中的无碰撞导航
磁性螺旋微型机器人由于能够通过磁场的远程操纵进入密闭和封闭的环境,在生物医学应用中具有很大的潜力。然而,在复杂和非结构化环境中,特别是在高动态环境中,实现微型机器人的无碰撞导航仍然是一个挑战。在本文中,我们提出了一种新的基于深度强化学习的磁性螺旋微型机器人控制框架,重点研究了目标到达和动态避障任务。为了简化数据收集,设计了一个专门的训练环境,以捕获磁螺旋微型机器人导航的基本方面。在训练环境中使用随机化技术来支持训练策略的鲁棒性和适应性。为了促进与现实世界磁致动系统的无缝集成,设计了一种基于OpenCV的视觉处理算法,并将其集成到策略观察中。各种场景下的仿真和实验验证了该方法的鲁棒性和适应性。性能评估显示,微型机器人在4个速度相当的动态障碍物周围导航的成功率为99%,在14个动态障碍物的环境中导航的成功率为90%。结果表明,我们的方法在非结构化、受限和动态生活环境中的未来应用潜力。从业人员注意:本工作的动机是开发一种鲁棒和有效的控制方案,用于磁性螺旋微型机器人在动态环境中的无碰撞导航。动态环境下传统的导航策略主要包括全局路径规划和局部路径重新规划;因此,高度动态的环境需要频繁地更新规划的路径,这使得在高度动态的环境中应用变得困难。在这项工作中,提出了一种基于深度强化学习的控制框架,可以引导微型机器人通过许多动态障碍物到达一系列位置而不会发生碰撞。仿真和实验结果验证了所提控制框架的有效性以及所训练策略的鲁棒性和自适应性。所提出的控制方案能够更好地理解磁性微型机器人的先进运动控制方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Sensorless Robotic External Force Estimation in Uncertain Interactive Environments: A Hybrid Adaptive-Robust Kalman Filtering Approach Prescribed-Time Critic-Only Consensus for Constrained Multiagent Systems Through ADP Hierarchical Distributed Optimal Safety-Critical Consensus of Multi-Robot Systems in Dynamic Environments HECTOR: Human-centric Hierarchical Coordination and Supervision of Robotic Fleets under Continual Temporal Tasks Finite-Time Flexible Performance-Based Formation Control for Wheeled Humanoid Robots with Dynamic Obstacle Avoidance
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1