基于链式残差池化和梯度加权损失的单目深度估计

2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE) Pub Date : 2023-01-06 DOI:10.1109/ICCECE58074.2023.10135509

Jiajun Han, Zhengang Jiang, Guanyuan Feng

{"title":"基于链式残差池化和梯度加权损失的单目深度估计","authors":"Jiajun Han, Zhengang Jiang, Guanyuan Feng","doi":"10.1109/ICCECE58074.2023.10135509","DOIUrl":null,"url":null,"abstract":"In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.","PeriodicalId":120030,"journal":{"name":"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Monocular depth estimation based on Chained Residual Pooling and Gradient Weighted Loss\",\"authors\":\"Jiajun Han, Zhengang Jiang, Guanyuan Feng\",\"doi\":\"10.1109/ICCECE58074.2023.10135509\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.\",\"PeriodicalId\":120030,\"journal\":{\"name\":\"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCECE58074.2023.10135509\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE58074.2023.10135509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

近年来，自动驾驶领域的自监督单目深度估计任务取得了显著的成果。采用亮度一致性假设指导网络训练，相邻帧图像亮度需要保持恒定。然而，这种假设并不适用于腹腔镜手术，因为在手术过程中，同一组织的光强度会随着时间的推移而变化。此外，腹腔镜中定义的感受野导致结构化线索的利用率低，当腹腔镜框架被输入深度估计网络时，预测的深度图在组织轮廓上表现不佳。本文针对亮度一致性问题，提出将图像的二阶梯度和视差图的二阶梯度整合到光度重建误差中来引导网络。针对腹腔镜图像上下文线索利用率低的问题，在网络解码器部分对以下线索进行加权，以提高组织轮廓线索的低分辨率特征图的重用性。在SCARED数据集上进行了实验，将新的损失和新的模块分别放入网络中进行训练，验证了它们的有效性，结果在四个常用指标上都表现出良好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Monocular depth estimation based on Chained Residual Pooling and Gradient Weighted Loss

In recent years, the self-supervised monocular depth estimation task in the field of autonomous driving has achieved remarkable results. The brightness consistency assumption is adopted to guide network training, The image brightness needs to be kept constant in adjacent frames. However, this assumption does not apply to laparoscopic scenarios, where the intensity of light for the same tissue changes over time during surgery. In addition, the defined receptive fields in laparoscopy lead to low utilization of structured cues, the predicted depth map performs poorly on the tissue contour when the laparoscopic frame is fed into the depth estimation network. In this work, aiming at the problem of luminance consistency, it is proposed to integrate the second-order gradient of the image and the second-order gradient of the parallax map into the photometric reconstruction error to guide the network. In view of the problem of the low utilization rate of laparoscopic image context clues, the following clues are weighted in the decoder part of the network to improve the reuse of low-resolution feature maps for tissue contour clues. Experiments were performed on the SCARED dataset, and new losses and new modules were put into the network separately to train to verify their effectiveness, the results showed good performance on all four commonly used indicators.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE)

自引率

0.00%

发文量