Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks

IF 2.8 Q3 ENGINEERING, BIOMEDICAL Healthcare Technology Letters Pub Date : 2024-02-17 DOI:10.1049/htl2.12078
Ummey Tanin, Adrienne Duimering, Christine Law, Jessica Ruzicki, Gabriela Luna, Matthew Holden
{"title":"Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks","authors":"Ummey Tanin,&nbsp;Adrienne Duimering,&nbsp;Christine Law,&nbsp;Jessica Ruzicki,&nbsp;Gabriela Luna,&nbsp;Matthew Holden","doi":"10.1049/htl2.12078","DOIUrl":null,"url":null,"abstract":"<p>An important part of surgical training in ophthalmology is understanding how to proficiently perform cataract surgery. Operating skill in cataract surgery is typically assessed by real-time or video-based expert review using a rating scale. This is time-consuming, subjective and labour-intensive. A typical trainee graduates with over 100 complete surgeries, each of which requires review by the surgical educators. Due to the consistently repetitive nature of this task, it lends itself well to machine learning-based evaluation. Recent studies utilize deep learning models trained on tool motion trajectories obtained using additional equipment or robotic systems. However, the process of tool recognition by extracting frames from the videos to perform phase recognition followed by skill assessment is exhaustive. This project proposes a deep learning model for skill evaluation using raw surgery videos that is cost-effective and end-to-end trainable. An advanced ensemble of convolutional neural network models is leveraged to model technical skills in cataract surgeries and is evaluated using a large dataset comprising almost 200 surgical trials. The highest accuracy of 0.8494 is observed on the phacoemulsification step data. Our model yielded an average accuracy of 0.8200 and an average AUC score of 0.8800 for all four phase datasets of cataract surgery proving its robustness against different data. The proposed ensemble model with 2D and 3D convolutional neural networks demonstrated a promising result without using tool motion trajectories to evaluate surgery expertise.</p>","PeriodicalId":37474,"journal":{"name":"Healthcare Technology Letters","volume":"11 2-3","pages":"189-195"},"PeriodicalIF":2.8000,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/htl2.12078","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/htl2.12078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

An important part of surgical training in ophthalmology is understanding how to proficiently perform cataract surgery. Operating skill in cataract surgery is typically assessed by real-time or video-based expert review using a rating scale. This is time-consuming, subjective and labour-intensive. A typical trainee graduates with over 100 complete surgeries, each of which requires review by the surgical educators. Due to the consistently repetitive nature of this task, it lends itself well to machine learning-based evaluation. Recent studies utilize deep learning models trained on tool motion trajectories obtained using additional equipment or robotic systems. However, the process of tool recognition by extracting frames from the videos to perform phase recognition followed by skill assessment is exhaustive. This project proposes a deep learning model for skill evaluation using raw surgery videos that is cost-effective and end-to-end trainable. An advanced ensemble of convolutional neural network models is leveraged to model technical skills in cataract surgeries and is evaluated using a large dataset comprising almost 200 surgical trials. The highest accuracy of 0.8494 is observed on the phacoemulsification step data. Our model yielded an average accuracy of 0.8200 and an average AUC score of 0.8800 for all four phase datasets of cataract surgery proving its robustness against different data. The proposed ensemble model with 2D and 3D convolutional neural networks demonstrated a promising result without using tool motion trajectories to evaluate surgery expertise.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 2D-3D 卷积神经网络集合进行白内障手术的性能评估
眼科手术培训的一个重要部分是了解如何熟练地进行白内障手术。白内障手术的操作技能通常是通过实时或视频专家评审,使用评分表进行评估。这种方法耗时、主观且劳动密集。一名典型的受训人员在毕业时需要完成 100 多例完整的手术,而每例手术都需要由手术教育者进行审查。由于这项任务具有持续重复的性质,因此非常适合基于机器学习的评估。最近的研究利用使用附加设备或机器人系统获得的工具运动轨迹来训练深度学习模型。然而,通过从视频中提取帧来进行阶段识别,然后再进行技能评估,这样的工具识别过程非常繁琐。本项目提出了一种利用原始手术视频进行技能评估的深度学习模型,该模型具有成本效益高、端到端可训练的特点。利用先进的卷积神经网络模型集合对白内障手术中的技术技能进行建模,并使用包含近 200 个手术试验的大型数据集进行评估。在超声乳化步骤数据中观察到的最高准确率为 0.8494。在白内障手术的所有四个阶段数据集上,我们的模型得出了 0.8200 的平均准确率和 0.8800 的平均 AUC 分数,这证明了它对不同数据的鲁棒性。所提出的二维和三维卷积神经网络集合模型在不使用工具运动轨迹来评估手术专业性的情况下取得了很好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Healthcare Technology Letters
Healthcare Technology Letters Health Professions-Health Information Management
CiteScore
6.10
自引率
4.80%
发文量
12
审稿时长
22 weeks
期刊介绍: Healthcare Technology Letters aims to bring together an audience of biomedical and electrical engineers, physical and computer scientists, and mathematicians to enable the exchange of the latest ideas and advances through rapid online publication of original healthcare technology research. Major themes of the journal include (but are not limited to): Major technological/methodological areas: Biomedical signal processing Biomedical imaging and image processing Bioinstrumentation (sensors, wearable technologies, etc) Biomedical informatics Major application areas: Cardiovascular and respiratory systems engineering Neural engineering, neuromuscular systems Rehabilitation engineering Bio-robotics, surgical planning and biomechanics Therapeutic and diagnostic systems, devices and technologies Clinical engineering Healthcare information systems, telemedicine, mHealth.
期刊最新文献
Differential analysis of brain functional network parameters in MHE patients The Feasibility of Ambulatory Heart Rate Variability Monitoring in Non-Suicidal Self-Injury Signal-quality-aware multisensor fusion for atrial fibrillation detection Deep regression 2D-3D ultrasound registration for liver motion correction in focal tumour thermal ablation Writing the Signs: An Explainable Machine Learning Approach for Alzheimer's Disease Classification from Handwriting
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1