Cystoscopic depth estimation using gated adversarial domain adaptation

IF 3.2 | Zone 4 (Medicine) | JCR Q2 (Engineering, Biomedical) | Biomedical Engineering Letters | Pub Date: 2023-05-01 | DOI: 10.1007/s13534-023-00261-3
Peter Somers, Simon Holdenried-Krafft, Johannes Zahn, Johannes Schüle, Carina Veil, Niklas Harland, Simon Walz, Arnulf Stenzl, Oliver Sawodny, Cristina Tarín, Hendrik P A Lensch
Cystoscopic depth estimation using gated adversarial domain adaptation. Biomedical Engineering Letters, vol. 13, no. 2, pp. 141-151, 2023-05-01. Journal Article. Record source: Semantic Scholar. Full-text PDF (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10130294/pdf/
Citations: 0

Abstract



Monocular depth estimation from camera images is important for evaluating the surrounding scene in many technical fields, from automotive to medicine. However, traditional triangulation methods that use stereo cameras, or multiple views under the assumption of a rigid environment, are not applicable in endoscopic domains. In cystoscopy in particular, it is not possible to produce ground-truth depth information with which to directly train machine learning algorithms to predict depth from a single monocular image. This work first creates a synthetic cystoscopic environment for the initial encoding of depth information from synthetically rendered images. Next, the task of predicting pixel-wise depth values for real images is constrained to a domain adaptation between the synthetic and real image domains. This adaptation is done through added gated residual blocks in order to simplify the network task and maintain stability during adversarial training. Training is done on an internally collected cystoscopy dataset from human patients. The results after training demonstrate the ability to predict reasonable depth estimates from actual cystoscopic videos, and the added stability from using gated residual blocks is shown to prevent mode collapse during adversarial training.
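
The abstract does not give the form of the gated residual blocks, so the following is only a minimal sketch under an assumed formulation: the block output is the input plus a learned correction scaled by a sigmoid gate, y = x + sigmoid(g) * relu(Wx + b). The class name `GatedResidualBlock`, the parameter `gate_logit`, and the toy feature vector are all hypothetical and are not taken from the paper. The sketch illustrates one plausible reason such a gate could help stability: with the gate nearly closed, the block is close to an identity mapping, so a network pre-trained on synthetic images would start adversarial training almost unchanged.

```python
import math
import random


def sigmoid(z: float) -> float:
    """Logistic function mapping a real number to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


class GatedResidualBlock:
    """Assumed form: y = x + sigmoid(gate_logit) * relu(W x + b).

    This is an illustrative formulation, not the authors' architecture.
    """

    def __init__(self, dim: int, gate_logit: float = -10.0, seed: int = 0):
        rng = random.Random(seed)
        # Small random weights stand in for learned parameters.
        self.weight = [[rng.uniform(-0.5, 0.5) for _ in range(dim)]
                       for _ in range(dim)]
        self.bias = [0.0] * dim
        self.gate_logit = gate_logit

    def residual(self, x):
        """Correction branch: one linear layer followed by ReLU."""
        out = []
        for row, b in zip(self.weight, self.bias):
            pre = sum(w * xi for w, xi in zip(row, x)) + b
            out.append(max(0.0, pre))
        return out

    def forward(self, x):
        """Add the gated correction to the unchanged input features."""
        gate = sigmoid(self.gate_logit)
        return [xi + gate * ri for xi, ri in zip(x, self.residual(x))]


# Hypothetical feature vector from a depth network pre-trained on synthetic data.
features = [0.2, -1.0, 0.7, 0.05]

closed = GatedResidualBlock(dim=4, gate_logit=-10.0)  # gate almost closed
opened = GatedResidualBlock(dim=4, gate_logit=10.0)   # gate almost open

# Largest change each block makes to the input features.
max_shift_closed = max(abs(a - b)
                       for a, b in zip(closed.forward(features), features))
max_shift_open = max(abs(a - b)
                     for a, b in zip(opened.forward(features), features))
```

In this toy setup both blocks share the same weights (same seed), so the only difference is the gate: the nearly closed gate leaves the features essentially untouched, while the open gate applies the full correction. In an actual adversarial setup the gate would be a trainable parameter, which is again an assumption here rather than something the abstract states.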

Source journal
Biomedical Engineering Letters (Engineering, Biomedical)
CiteScore: 6.80
Self-citation rate: 0.00%
Articles published: 34
About the journal: Biomedical Engineering Letters (BMEL) aims to present innovative experimental science and technological developments in the biomedical field, as well as clinical applications of new developments. Articles must contain original biomedical engineering content, defined as the development, theoretical analysis, and evaluation/validation of a new technique. BMEL publishes the following types of papers: original articles, review articles, editorials, and letters to the editor. All papers undergo single-blind review.
Latest articles from this journal
CT synthesis with deep learning for MR-only radiotherapy planning: a review.
A comprehensive review on Compton camera image reconstruction: from principles to AI innovations.
A review of deep learning-based reconstruction methods for accelerated MRI using spatiotemporal and multi-contrast redundancies.
Strategies for mitigating inter-crystal scattering effects in positron emission tomography: a comprehensive review.
Self-supervised learning for CT image denoising and reconstruction: a review.