Deepak Raina;Mythra V. Balakuntala;Byung Wook Kim;Juan Wachs;Richard Voyles
{"title":"Coaching a Robotic Sonographer: Learning Robotic Ultrasound With Sparse Expert’s Feedback","authors":"Deepak Raina;Mythra V. Balakuntala;Byung Wook Kim;Juan Wachs;Richard Voyles","doi":"10.1109/TMRB.2024.3464698","DOIUrl":null,"url":null,"abstract":"Ultrasound is widely employed for clinical intervention and diagnosis, due to its advantages of offering non-invasive, radiation-free, and real-time imaging. However, the accessibility of this dexterous procedure is limited due to the substantial training and expertise required of operators. The robotic ultrasound (RUS) offers a viable solution to address this limitation; nonetheless, achieving human-level proficiency remains challenging. Learning from demonstrations (LfD) methods have been explored in RUS, which learns the policy prior from a dataset of offline demonstrations to encode the mental model of the expert sonographer. However, active engagement of experts, i.e., Coaching, during the training of RUS has not been explored thus far. Coaching is known for enhancing efficiency and performance in human training. This paper proposes a coaching framework for RUS to amplify its performance. The framework combines DRL (self-supervised practice) with sparse expert’s feedback through coaching. The DRL employs an off-policy Soft Actor-Critic (SAC) network, with a reward based on image quality rating. The coaching by experts is modeled as a Partially Observable Markov Decision Process (POMDP), which updates the policy parameters based on the correction by the expert. The validation study on phantoms showed that coaching increases the learning rate by 25% and the number of high-quality image acquisition by 74.5%.","PeriodicalId":73318,"journal":{"name":"IEEE transactions on medical robotics and bionics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical robotics and bionics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10684826/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Ultrasound is widely employed for clinical intervention and diagnosis, due to its advantages of offering non-invasive, radiation-free, and real-time imaging. However, the accessibility of this dexterous procedure is limited due to the substantial training and expertise required of operators. The robotic ultrasound (RUS) offers a viable solution to address this limitation; nonetheless, achieving human-level proficiency remains challenging. Learning from demonstrations (LfD) methods have been explored in RUS, which learns the policy prior from a dataset of offline demonstrations to encode the mental model of the expert sonographer. However, active engagement of experts, i.e., Coaching, during the training of RUS has not been explored thus far. Coaching is known for enhancing efficiency and performance in human training. This paper proposes a coaching framework for RUS to amplify its performance. The framework combines DRL (self-supervised practice) with sparse expert’s feedback through coaching. The DRL employs an off-policy Soft Actor-Critic (SAC) network, with a reward based on image quality rating. The coaching by experts is modeled as a Partially Observable Markov Decision Process (POMDP), which updates the policy parameters based on the correction by the expert. The validation study on phantoms showed that coaching increases the learning rate by 25% and the number of high-quality image acquisition by 74.5%.
超声波具有无创、无辐射和实时成像的优点,被广泛用于临床干预和诊断。然而,由于操作人员需要大量的培训和专业知识,这种灵巧程序的可及性受到了限制。机器人超声(RUS)为解决这一局限性提供了可行的解决方案;然而,要达到人类水平的熟练程度仍具有挑战性。从演示中学习(LfD)方法已在 RUS 中进行了探索,该方法从离线演示数据集中学习先验策略,以编码超声波专家的心智模型。但是,在 RUS 的训练过程中,专家的积极参与(即辅导)迄今为止还没有被探索过。众所周知,教练可以提高人类培训的效率和绩效。本文为 RUS 提出了一个教练框架,以提高其性能。该框架将 DRL(自我监督练习)与稀疏专家反馈(通过辅导)相结合。DRL 采用非政策软演员-批评家(SAC)网络,奖励基于图像质量评级。专家的指导被建模为部分可观测马尔可夫决策过程(POMDP),并根据专家的纠正更新策略参数。在模型上进行的验证研究表明,辅导使学习率提高了 25%,高质量图像获取数量提高了 74.5%。