基于链式残差池化和梯度加权损失的单目深度估计

Jiajun Han, Zhengang Jiang, Guanyuan Feng
{"title":"基于链式残差池化和梯度加权损失的单目深度估计","authors":"Jiajun Han, Zhengang Jiang, Guanyuan Feng","doi":"10.1109/ICCECE58074.2023.10135509","DOIUrl":null,"url":null,"abstract":"In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.","PeriodicalId":120030,"journal":{"name":"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Monocular depth estimation based on Chained Residual Pooling and Gradient Weighted Loss\",\"authors\":\"Jiajun Han, Zhengang Jiang, Guanyuan Feng\",\"doi\":\"10.1109/ICCECE58074.2023.10135509\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.\",\"PeriodicalId\":120030,\"journal\":{\"name\":\"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCECE58074.2023.10135509\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE58074.2023.10135509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

近年来,自动驾驶领域的自监督单目深度估计任务取得了显著的成果。采用亮度一致性假设指导网络训练,相邻帧图像亮度需要保持恒定。然而,这种假设并不适用于腹腔镜手术,因为在手术过程中,同一组织的光强度会随着时间的推移而变化。此外,腹腔镜中定义的感受野导致结构化线索的利用率低,当腹腔镜框架被输入深度估计网络时,预测的深度图在组织轮廓上表现不佳。本文针对亮度一致性问题,提出将图像的二阶梯度和视差图的二阶梯度整合到光度重建误差中来引导网络。针对腹腔镜图像上下文线索利用率低的问题,在网络解码器部分对以下线索进行加权,以提高组织轮廓线索的低分辨率特征图的重用性。在SCARED数据集上进行了实验,将新的损失和新的模块分别放入网络中进行训练,验证了它们的有效性,结果在四个常用指标上都表现出良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Monocular depth estimation based on Chained Residual Pooling and Gradient Weighted Loss
In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Clutter Edge and Target Detection Method Based on Central Moment Feature Adaptive short-time Fourier transform based on reinforcement learning Design and implementation of carrier aggregation and secure communication in distribution field network Power data attribution revocation searchable encrypted cloud storage Research of Intrusion Detection Based on Neural Network Optimized by Sparrow Search Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1