Path Generation with Reinforcement Learning for Surgical Robot Control

Junhong Chen, Zeyu Wang, Ruiqi Zhu, Rui Zhang, Weibang Bai, Benny P. L. Lo
{"title":"Path Generation with Reinforcement Learning for Surgical Robot Control","authors":"Junhong Chen, Zeyu Wang, Ruiqi Zhu, Rui Zhang, Weibang Bai, Benny P. L. Lo","doi":"10.1109/BHI56158.2022.9926849","DOIUrl":null,"url":null,"abstract":"In the field of robotic surgery, Robot-Assisted Minimally Invasive Surgery(RAMIS) has shown its great potential of benefiting both surgeons and patients in the past few decades of research and practice. The current trend of RAMIS targets towards a higher level of autonomy in carrying out surgical tasks. However, most real RAMIS tasks still rely on manual control, thus the performance mostly depends on the dexterity of the surgeon. Their fatigue or small errors could cause life-threatening damages to the patients, especially high-workload surgeons. Since corrections and errors are inevitable in manual control, the actual tool paths in real operations are often deviated from ideal trajectories. For robot Learning from Demonstrations(LfD), these sub-optimal paths would eventually affect the robot's learning performance. Therefore, much research is being explored in enhancing the performance of robot-generated instrument tool paths and at the same time reducing the reliance on manual manipulation demonstrations in surgical robot learning. In this paper, both Reinforcement Learning and Learning from Demonstration are used to generate a smooth moving trajectory without the use of manual robotic control kinematics data. Two tasks, peg transfer and pattern cutting, were chosen to verify the performance. The method was trained and validated in simulations, namely Asynchronous Multi-Body Framework (AMBF) and Moveit. Then da Vinci Research Kit is used to validate the real case performance. The results have shown that this path generation framework could automate given repetitive surgical tasks, and potentially adapted to other surgical tasks.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI56158.2022.9926849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the field of robotic surgery, Robot-Assisted Minimally Invasive Surgery(RAMIS) has shown its great potential of benefiting both surgeons and patients in the past few decades of research and practice. The current trend of RAMIS targets towards a higher level of autonomy in carrying out surgical tasks. However, most real RAMIS tasks still rely on manual control, thus the performance mostly depends on the dexterity of the surgeon. Their fatigue or small errors could cause life-threatening damages to the patients, especially high-workload surgeons. Since corrections and errors are inevitable in manual control, the actual tool paths in real operations are often deviated from ideal trajectories. For robot Learning from Demonstrations(LfD), these sub-optimal paths would eventually affect the robot's learning performance. Therefore, much research is being explored in enhancing the performance of robot-generated instrument tool paths and at the same time reducing the reliance on manual manipulation demonstrations in surgical robot learning. In this paper, both Reinforcement Learning and Learning from Demonstration are used to generate a smooth moving trajectory without the use of manual robotic control kinematics data. Two tasks, peg transfer and pattern cutting, were chosen to verify the performance. The method was trained and validated in simulations, namely Asynchronous Multi-Body Framework (AMBF) and Moveit. Then da Vinci Research Kit is used to validate the real case performance. The results have shown that this path generation framework could automate given repetitive surgical tasks, and potentially adapted to other surgical tasks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于强化学习的手术机器人控制路径生成
在机器人手术领域,机器人辅助微创手术(RAMIS)在过去几十年的研究和实践中显示出了巨大的潜力,使外科医生和患者都受益。RAMIS目前的趋势是在执行手术任务时实现更高水平的自主性。然而,大多数真实的RAMIS任务仍然依赖于手动控制,因此性能主要取决于外科医生的灵巧性。他们的疲劳或小失误可能会对病人造成危及生命的损害,尤其是高工作量的外科医生。由于修正和误差在人工控制中是不可避免的,实际操作中的实际刀具轨迹经常偏离理想轨迹。对于机器人从演示中学习(LfD),这些次优路径最终会影响机器人的学习性能。因此,如何提高机器人生成的仪器工具路径的性能,同时减少手术机器人学习中对人工操作演示的依赖,是目前研究的热点。在本文中,使用强化学习和演示学习来生成平滑的运动轨迹,而不使用手动机器人控制运动学数据。选择两个任务,钉转移和图案切割,以验证性能。采用异步多体框架(AMBF)和Moveit进行了仿真训练和验证。然后使用达芬奇研究工具包来验证真实案例的性能。结果表明,该路径生成框架可以自动执行重复性手术任务,并可能适应其他手术任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
BEBOP: Bidirectional dEep Brain cOnnectivity maPping Stabilizing Skeletal Pose Estimation using mmWave Radar via Dynamic Model and Filtering Behavioral Data Categorization for Transformers-based Models in Digital Health Gender Difference in Prognosis of Patients with Heart Failure: A Propensity Score Matching Analysis Influence of Sensor Position and Body Movements on Radar-Based Heart Rate Monitoring
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1