白内障手术术中瞳孔分析计算框架

Binh Duong Giap, Karthik Srinivasan,, Ossama Mahmoud, Dena Ballouz, Jefferson Lustre, Keely Likosky, Shahzad I. Mian, Bradford L. Tannen, Nambi Nallasamy
{"title":"白内障手术术中瞳孔分析计算框架","authors":"Binh Duong Giap, Karthik Srinivasan,, Ossama Mahmoud, Dena Ballouz, Jefferson Lustre, Keely Likosky, Shahzad I. Mian, Bradford L. Tannen, Nambi Nallasamy","doi":"10.1101/2024.03.13.24304223","DOIUrl":null,"url":null,"abstract":"Purpose: Pupillary instability is a known risk factor for complications in cataract surgery. This study aims to develop and validate an innovative and reliable computational framework for the automated assessment of pupil morphologic changes during the various phases of cataract surgery. Design: Retrospective surgical video analysis. Subjects: Two hundred forty complete surgical video recordings, among which 190 surgeries were conducted without the use of pupil expansion devices and 50 were performed with the use of a pupil expansion device. Methods: The proposed framework consists of three stages: feature extraction, deep learning (DL)-based anatomy recognition, and obstruction detection/compensation. In the first stage, surgical video frames undergo noise reduction using a tensor-based wavelet feature extraction method. In the second stage, DL-based segmentation models are trained and employed to segment the pupil, limbus, and palpebral fissure. In the third stage, obstructed visualization of the pupil is detected and compensated for using a DL-based algorithm. A dataset of 5,700 intraoperative video frames across 190 cataract surgeries in the BigCat database was collected for validating algorithm performance. Main Outcome Measures: The pupil analysis framework was assessed on the basis of segmentation performance for both obstructed and unobstructed pupils. Classification performance of models utilizing the segmented pupil time series to predict surgeon use of a pupil expansion device was also assessed. Results: An architecture based on the FPN model with VGG16 backbone integrated with the AWTFE feature extraction method demonstrated the highest performance in anatomy segmentation, with Dice coefficient of 96.52%. Incorporation of an obstruction compensation algorithm improved performance further (Dice 96.82%). Downstream analysis of framework output enabled the development of an SVM-based classifier that could predict surgeon usage of a pupil expansion device prior to its placement with 96.67% accuracy and AUC of 99.44%. Conclusions: The experimental results demonstrate that the proposed framework 1) provides high accuracy in pupil analysis compared to human-annotated ground truth, 2) substantially outperforms isolated use of a DL segmentation model, and 3) can enable downstream analytics with clinically valuable predictive capacity.","PeriodicalId":501390,"journal":{"name":"medRxiv - Ophthalmology","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Computational Framework for Intraoperative Pupil Analysis in Cataract Surgery\",\"authors\":\"Binh Duong Giap, Karthik Srinivasan,, Ossama Mahmoud, Dena Ballouz, Jefferson Lustre, Keely Likosky, Shahzad I. Mian, Bradford L. Tannen, Nambi Nallasamy\",\"doi\":\"10.1101/2024.03.13.24304223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: Pupillary instability is a known risk factor for complications in cataract surgery. This study aims to develop and validate an innovative and reliable computational framework for the automated assessment of pupil morphologic changes during the various phases of cataract surgery. Design: Retrospective surgical video analysis. Subjects: Two hundred forty complete surgical video recordings, among which 190 surgeries were conducted without the use of pupil expansion devices and 50 were performed with the use of a pupil expansion device. Methods: The proposed framework consists of three stages: feature extraction, deep learning (DL)-based anatomy recognition, and obstruction detection/compensation. In the first stage, surgical video frames undergo noise reduction using a tensor-based wavelet feature extraction method. In the second stage, DL-based segmentation models are trained and employed to segment the pupil, limbus, and palpebral fissure. In the third stage, obstructed visualization of the pupil is detected and compensated for using a DL-based algorithm. A dataset of 5,700 intraoperative video frames across 190 cataract surgeries in the BigCat database was collected for validating algorithm performance. Main Outcome Measures: The pupil analysis framework was assessed on the basis of segmentation performance for both obstructed and unobstructed pupils. Classification performance of models utilizing the segmented pupil time series to predict surgeon use of a pupil expansion device was also assessed. Results: An architecture based on the FPN model with VGG16 backbone integrated with the AWTFE feature extraction method demonstrated the highest performance in anatomy segmentation, with Dice coefficient of 96.52%. Incorporation of an obstruction compensation algorithm improved performance further (Dice 96.82%). Downstream analysis of framework output enabled the development of an SVM-based classifier that could predict surgeon usage of a pupil expansion device prior to its placement with 96.67% accuracy and AUC of 99.44%. Conclusions: The experimental results demonstrate that the proposed framework 1) provides high accuracy in pupil analysis compared to human-annotated ground truth, 2) substantially outperforms isolated use of a DL segmentation model, and 3) can enable downstream analytics with clinically valuable predictive capacity.\",\"PeriodicalId\":501390,\"journal\":{\"name\":\"medRxiv - Ophthalmology\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Ophthalmology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.03.13.24304223\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Ophthalmology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.03.13.24304223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:瞳孔不稳定是白内障手术并发症的一个已知风险因素。本研究旨在开发并验证一种创新、可靠的计算框架,用于自动评估白内障手术各阶段的瞳孔形态变化。设计:回顾性手术视频分析。研究对象240 个完整的手术视频记录,其中 190 例手术未使用瞳孔扩大装置,50 例手术使用了瞳孔扩大装置。研究方法:采用瞳孔扩大装置进行手术:提议的框架包括三个阶段:特征提取、基于深度学习(DL)的解剖识别和阻塞检测/补偿。在第一阶段,使用基于张量的小波特征提取方法对手术视频帧进行降噪处理。在第二阶段,对基于 DL 的分割模型进行训练,并将其用于分割瞳孔、瞳孔边缘和睑裂。在第三阶段,使用基于 DL 的算法检测并补偿瞳孔的视觉障碍。在 BigCat 数据库中收集了 190 例白内障手术的 5,700 个术中视频帧数据集,用于验证算法性能。主要结果测量:评估瞳孔分析框架的依据是对有障碍和无障碍瞳孔的分割性能。还评估了利用瞳孔时间序列分割预测外科医生使用瞳孔扩大装置的模型的分类性能。结果:基于带有 VGG16 主干网的 FPN 模型的结构与 AWTFE 特征提取方法相结合,在解剖分割方面表现最佳,Dice 系数达到 96.52%。加入阻塞补偿算法后,性能进一步提高(Dice 96.82%)。通过对框架输出的下游分析,开发出了基于 SVM 的分类器,该分类器可在瞳孔扩大装置放置前预测外科医生的使用情况,准确率为 96.67%,AUC 为 99.44%。结论实验结果表明,所提出的框架 1) 与人类标注的基本事实相比,瞳孔分析具有很高的准确性;2) 远远优于单独使用 DL 分割模型的效果;3) 可以实现具有临床价值预测能力的下游分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Computational Framework for Intraoperative Pupil Analysis in Cataract Surgery
Purpose: Pupillary instability is a known risk factor for complications in cataract surgery. This study aims to develop and validate an innovative and reliable computational framework for the automated assessment of pupil morphologic changes during the various phases of cataract surgery. Design: Retrospective surgical video analysis. Subjects: Two hundred forty complete surgical video recordings, among which 190 surgeries were conducted without the use of pupil expansion devices and 50 were performed with the use of a pupil expansion device. Methods: The proposed framework consists of three stages: feature extraction, deep learning (DL)-based anatomy recognition, and obstruction detection/compensation. In the first stage, surgical video frames undergo noise reduction using a tensor-based wavelet feature extraction method. In the second stage, DL-based segmentation models are trained and employed to segment the pupil, limbus, and palpebral fissure. In the third stage, obstructed visualization of the pupil is detected and compensated for using a DL-based algorithm. A dataset of 5,700 intraoperative video frames across 190 cataract surgeries in the BigCat database was collected for validating algorithm performance. Main Outcome Measures: The pupil analysis framework was assessed on the basis of segmentation performance for both obstructed and unobstructed pupils. Classification performance of models utilizing the segmented pupil time series to predict surgeon use of a pupil expansion device was also assessed. Results: An architecture based on the FPN model with VGG16 backbone integrated with the AWTFE feature extraction method demonstrated the highest performance in anatomy segmentation, with Dice coefficient of 96.52%. Incorporation of an obstruction compensation algorithm improved performance further (Dice 96.82%). Downstream analysis of framework output enabled the development of an SVM-based classifier that could predict surgeon usage of a pupil expansion device prior to its placement with 96.67% accuracy and AUC of 99.44%. Conclusions: The experimental results demonstrate that the proposed framework 1) provides high accuracy in pupil analysis compared to human-annotated ground truth, 2) substantially outperforms isolated use of a DL segmentation model, and 3) can enable downstream analytics with clinically valuable predictive capacity.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Prediction of the ectasia screening index from raw Casia2 volume data for keratoconus identification by using convolutional neural networks Utilizing AI-Generated Plain Language Summaries to Enhance Interdisciplinary Understanding of Ophthalmology Notes: A Randomized Trial Deep Learning-Based Detection of Reticular Pseudodrusen in Age-Related Macular Degeneration on Optical Coherence Tomography Photoreceptor outer segment reflectivity with ultrahigh resolution visible light optical coherence tomography in systemic hydroxychloroquine use Comparison of visual function analysis of people with low vision using three different models of augmented reality devices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1