Surgical phase classification and operative skill assessment through spatial context aware CNNs and time-invariant feature extracting autoencoders

IF 5.3 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biocybernetics and Biomedical Engineering Pub Date : 2023-10-01 DOI:10.1016/j.bbe.2023.10.001
Chakka Sai Pradeep, Neelam Sinha
{"title":"Surgical phase classification and operative skill assessment through spatial context aware CNNs and time-invariant feature extracting autoencoders","authors":"Chakka Sai Pradeep,&nbsp;Neelam Sinha","doi":"10.1016/j.bbe.2023.10.001","DOIUrl":null,"url":null,"abstract":"<div><p>Automated surgical video analysis promises improved healthcare. We propose novel spatial context aware combined loss function for end-to-end Encoder-Decoder training for Surgical Phase Classification (SPC) on laparoscopic cholecystectomy (LC) videos. Proposed loss function leverages on fine-grained class activation maps obtained from fused multi-layer Layer-CAM for supervised learning of SPC, obtaining improved Layer-CAM explanations. Post classification, we introduce graph theory to incorporate known hierarchies of surgical phases. We report peak SPC accuracy of 96.16%, precision of 94.08% and recall of 90.02% on public dataset Cholec80, with 7 phases. Our proposed method utilizes just 73.5% of parameters as against existing state-of-the-art methodology, achieving improvement of 0.5% in accuracy, 1.76% in precision with comparable recall, with an order less standard deviation. We also propose DNN based surgical skill assessment methodology. This approach utilizes surgical phase prediction scores from the final fully-connected layer of spatial-context aware classifier to form multi-channel temporal signal of surgical phases. Time-invariant representation is obtained from this temporal signal through time- and frequency-domain analyses. Autoencoder based time-invariant features are utilized for reconstruction and identification of prominent peaks in dissimilarity curves. We devise a surgical skill measure (SSM) based on spatial-context aware temporal-prominence-of-peaks curve. SSM values are expected to be high when executed skillfully, aligning with expert assessed GOALS metric. We illustrate this trend on Cholec80 and m2cai16-tool datasets, in comparison with GOALS metric. Concurrence in the trend of SSM with respect to GOALS metric is obtained on these test videos, making it a promising step towards automated surgical skill assessment.</p></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"43 4","pages":"Pages 700-724"},"PeriodicalIF":5.3000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521623000554","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Automated surgical video analysis promises improved healthcare. We propose novel spatial context aware combined loss function for end-to-end Encoder-Decoder training for Surgical Phase Classification (SPC) on laparoscopic cholecystectomy (LC) videos. Proposed loss function leverages on fine-grained class activation maps obtained from fused multi-layer Layer-CAM for supervised learning of SPC, obtaining improved Layer-CAM explanations. Post classification, we introduce graph theory to incorporate known hierarchies of surgical phases. We report peak SPC accuracy of 96.16%, precision of 94.08% and recall of 90.02% on public dataset Cholec80, with 7 phases. Our proposed method utilizes just 73.5% of parameters as against existing state-of-the-art methodology, achieving improvement of 0.5% in accuracy, 1.76% in precision with comparable recall, with an order less standard deviation. We also propose DNN based surgical skill assessment methodology. This approach utilizes surgical phase prediction scores from the final fully-connected layer of spatial-context aware classifier to form multi-channel temporal signal of surgical phases. Time-invariant representation is obtained from this temporal signal through time- and frequency-domain analyses. Autoencoder based time-invariant features are utilized for reconstruction and identification of prominent peaks in dissimilarity curves. We devise a surgical skill measure (SSM) based on spatial-context aware temporal-prominence-of-peaks curve. SSM values are expected to be high when executed skillfully, aligning with expert assessed GOALS metric. We illustrate this trend on Cholec80 and m2cai16-tool datasets, in comparison with GOALS metric. Concurrence in the trend of SSM with respect to GOALS metric is obtained on these test videos, making it a promising step towards automated surgical skill assessment.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于空间上下文感知cnn和时不变特征提取自编码器的手术阶段分类和手术技能评估
自动化手术视频分析有望改善医疗保健。我们提出了一种新的空间上下文感知组合损失函数,用于腹腔镜胆囊切除术(LC)视频的手术阶段分类(SPC)的端到端编码器-解码器训练。所提出的损失函数利用从融合的多层层CAM获得的细粒度类激活图来监督SPC的学习,从而获得改进的层CAM解释。在分类后,我们引入图论来合并手术阶段的已知层次。我们在公共数据集Cholec80上报告了峰值SPC准确率为96.16%,准确率为94.08%,召回率为90.02%,共有7个阶段。与现有最先进的方法相比,我们提出的方法仅使用了73.5%的参数,准确率提高了0.5%,在可比召回的情况下,准确度提高了1.76%,标准偏差减少了一个数量级。我们还提出了基于DNN的手术技能评估方法。该方法利用来自空间上下文感知分类器的最终完全连接层的手术阶段预测分数来形成手术阶段的多通道时间信号。通过时域和频域分析,从该时间信号中获得了时不变表示。基于自动编码器的时不变特征被用于相异度曲线中显著峰值的重建和识别。我们设计了一种基于峰值曲线的空间上下文感知时间突出度的手术技能测量(SSM)。当熟练执行时,SSM值预计会很高,与专家评估的目标指标一致。我们在Cholec80和m2cai16工具数据集上说明了这一趋势,并与GOALS指标进行了比较。在这些测试视频中,SSM与GOALS指标的趋势是一致的,这使其成为自动化手术技能评估的一个有希望的步骤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
16.50
自引率
6.20%
发文量
77
审稿时长
38 days
期刊介绍: Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.
期刊最新文献
Automating synaptic plasticity analysis: A deep learning approach to segmenting hippocampal field potential signal Probabilistic and explainable modeling of Phase–Phase Cross-Frequency Coupling patterns in EEG. Application to dyslexia diagnosis Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms Profiled delivery of bicarbonate during weekly cycle of hemodialysis Lightweight beat score map method for electrocardiogram-based arrhythmia classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1