Digital twins as a unifying framework for surgical data science: the enabling role of geometric scene understanding

Hao Ding, Lalithkumar Seenivasan, Benjamin Killeen, Sue Min Cho, Mathias Unberath
{"title":"Digital twins as a unifying framework for surgical data science: the enabling role of geometric scene understanding","authors":"Hao Ding, Lalithkumar Seenivasan, Benjamin Killeen, Sue Min Cho, Mathias Unberath","doi":"10.20517/ais.2024.16","DOIUrl":null,"url":null,"abstract":"Surgical data science is devoted to enhancing the quality, safety, and efficacy of interventional healthcare. While the use of powerful machine learning algorithms is becoming the standard approach for surgical data science, the underlying end-to-end task models directly infer high-level concepts (e.g., surgical phase or skill) from low-level observations (e.g., endoscopic video). This end-to-end nature of contemporary approaches makes the models vulnerable to non-causal relationships in the data and requires the re-development of all components if new surgical data science tasks are to be solved. The digital twin (DT) paradigm, an approach to building and maintaining computational representations of real-world scenarios, offers a framework for separating low-level processing from high-level inference. In surgical data science, the DT paradigm would allow for the development of generalist surgical data science approaches on top of the universal DT representation, deferring DT model building to low-level computer vision algorithms. In this latter effort of DT model creation, geometric scene understanding plays a central role in building and updating the digital model. In this work, we visit existing geometric representations, geometric scene understanding tasks, and successful applications for building primitive DT frameworks. Although the development of advanced methods is still hindered in surgical data science by the lack of annotations, the complexity and limited observability of the scene, emerging works on synthetic data generation, sim-to-real generalization, and foundation models offer new directions for overcoming these challenges and advancing the DT paradigm.","PeriodicalId":72305,"journal":{"name":"Artificial intelligence surgery","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20517/ais.2024.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Surgical data science is devoted to enhancing the quality, safety, and efficacy of interventional healthcare. While the use of powerful machine learning algorithms is becoming the standard approach for surgical data science, the underlying end-to-end task models directly infer high-level concepts (e.g., surgical phase or skill) from low-level observations (e.g., endoscopic video). This end-to-end nature of contemporary approaches makes the models vulnerable to non-causal relationships in the data and requires the re-development of all components if new surgical data science tasks are to be solved. The digital twin (DT) paradigm, an approach to building and maintaining computational representations of real-world scenarios, offers a framework for separating low-level processing from high-level inference. In surgical data science, the DT paradigm would allow for the development of generalist surgical data science approaches on top of the universal DT representation, deferring DT model building to low-level computer vision algorithms. In this latter effort of DT model creation, geometric scene understanding plays a central role in building and updating the digital model. In this work, we visit existing geometric representations, geometric scene understanding tasks, and successful applications for building primitive DT frameworks. Although the development of advanced methods is still hindered in surgical data science by the lack of annotations, the complexity and limited observability of the scene, emerging works on synthetic data generation, sim-to-real generalization, and foundation models offer new directions for overcoming these challenges and advancing the DT paradigm.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数字孪生作为外科数据科学的统一框架:几何场景理解的促进作用
手术数据科学致力于提高介入医疗的质量、安全性和有效性。虽然使用功能强大的机器学习算法已成为外科数据科学的标准方法,但底层的端到端任务模型直接从低层次的观察结果(如内窥镜视频)中推断出高层次的概念(如手术阶段或技能)。当代方法的这种端到端性质使模型容易受到数据中非因果关系的影响,如果要解决新的手术数据科学任务,就需要重新开发所有组件。数字孪生(DT)范式是一种构建和维护真实世界场景计算表征的方法,它提供了一个将低级处理与高级推理分离开来的框架。在手术数据科学中,DT 范式允许在通用 DT 表示之上开发通用手术数据科学方法,将 DT 模型的构建推迟到低级计算机视觉算法。在后一种 DT 模型创建工作中,几何场景理解在数字模型的构建和更新中发挥着核心作用。在这项工作中,我们访问了现有的几何表示法、几何场景理解任务以及构建原始 DT 框架的成功应用。虽然在外科数据科学中,缺乏注释、场景的复杂性和有限的可观测性仍然阻碍着先进方法的发展,但合成数据生成、模拟到真实的泛化和基础模型等新兴工作为克服这些挑战和推进 DT 范式提供了新的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
0.40
自引率
0.00%
发文量
0
期刊最新文献
Robotic caudo-peripheral approach for liver parenchymal transection in anatomical liver resections for hepatocellular carcinoma Digital twins as a unifying framework for surgical data science: the enabling role of geometric scene understanding The health technology assessment in the artificial intelligence era: the AI surgical department The 1st Orsi Innotech Surgical AI Day congress report The 1st Orsi Innotech Surgical AI Day congress report
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1