利用注意力 U-Net 优化深度估计

IF 1.6 Q2 ENGINEERING, MULTIDISCIPLINARY International Journal of System Assurance Engineering and Management Pub Date : 2024-07-20 DOI:10.1007/s13198-024-02431-7
Huma Farooq, Manzoor Ahmad Chachoo, Sajid Yousuf Bhat
{"title":"利用注意力 U-Net 优化深度估计","authors":"Huma Farooq, Manzoor Ahmad Chachoo, Sajid Yousuf Bhat","doi":"10.1007/s13198-024-02431-7","DOIUrl":null,"url":null,"abstract":"<p>Depth maps (DMs) are invaluable tools encapsulating scene information in a three-dimensional context. They have a crucial part in reconstructing the spatial layout of a scene, enabling a comprehensive understanding of object geometry. These DMs can originate from either a single image or a combination of multiple images, with the former approach referred to as monocular depth mapping. However, deriving accurate depth maps is a complex and ill-posed problem that often necessitates intricate calibration. Recent advances have turned to deep learning (DL) techniques to address these challenges. In the context of monocular depth estimation, we propose a novel methodology utilizing an Attention U-Net architecture (Attention UNet). By incorporating attention mechanisms, we bolster the network’s ability to extract salient features, particularly along object boundaries. Critically, this enhancement is achieved without introducing additional parameters to the networks, ensuring efficient model training. Our proposed approach is effective in producing high-quality depth maps with notable advantages. By leveraging the Attention UNet architecture, we substantially improve depth map accuracy, reducing the root mean square error (RMSE) by 0.23 on the benchmark NYU V2 dataset, Highlighting its supremacy compared to current state-of-the-art techniques.</p>","PeriodicalId":14463,"journal":{"name":"International Journal of System Assurance Engineering and Management","volume":"45 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing depth estimation with attention U-Net\",\"authors\":\"Huma Farooq, Manzoor Ahmad Chachoo, Sajid Yousuf Bhat\",\"doi\":\"10.1007/s13198-024-02431-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Depth maps (DMs) are invaluable tools encapsulating scene information in a three-dimensional context. They have a crucial part in reconstructing the spatial layout of a scene, enabling a comprehensive understanding of object geometry. These DMs can originate from either a single image or a combination of multiple images, with the former approach referred to as monocular depth mapping. However, deriving accurate depth maps is a complex and ill-posed problem that often necessitates intricate calibration. Recent advances have turned to deep learning (DL) techniques to address these challenges. In the context of monocular depth estimation, we propose a novel methodology utilizing an Attention U-Net architecture (Attention UNet). By incorporating attention mechanisms, we bolster the network’s ability to extract salient features, particularly along object boundaries. Critically, this enhancement is achieved without introducing additional parameters to the networks, ensuring efficient model training. Our proposed approach is effective in producing high-quality depth maps with notable advantages. By leveraging the Attention UNet architecture, we substantially improve depth map accuracy, reducing the root mean square error (RMSE) by 0.23 on the benchmark NYU V2 dataset, Highlighting its supremacy compared to current state-of-the-art techniques.</p>\",\"PeriodicalId\":14463,\"journal\":{\"name\":\"International Journal of System Assurance Engineering and Management\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of System Assurance Engineering and Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s13198-024-02431-7\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of System Assurance Engineering and Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13198-024-02431-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

深度图(DM)是在三维环境中封装场景信息的宝贵工具。它们在重建场景空间布局、全面了解物体几何形状方面发挥着至关重要的作用。这些深度图可以来自单张图像,也可以来自多张图像的组合,前者被称为单眼深度图。然而,推导精确的深度图是一个复杂且难以解决的问题,通常需要进行复杂的校准。最近,深度学习(DL)技术在应对这些挑战方面取得了进展。在单目深度估算方面,我们提出了一种利用注意力 U-Net 架构(Attention UNet)的新方法。通过加入注意力机制,我们增强了网络提取显著特征的能力,尤其是沿物体边界提取特征的能力。重要的是,这种增强无需为网络引入额外参数,从而确保了高效的模型训练。我们提出的方法在生成高质量深度图方面具有显著优势。通过利用注意力 UNet 架构,我们大幅提高了深度图的准确性,在基准 NYU V2 数据集上将均方根误差 (RMSE) 降低了 0.23,与当前最先进的技术相比,凸显了其优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Optimizing depth estimation with attention U-Net

Depth maps (DMs) are invaluable tools encapsulating scene information in a three-dimensional context. They have a crucial part in reconstructing the spatial layout of a scene, enabling a comprehensive understanding of object geometry. These DMs can originate from either a single image or a combination of multiple images, with the former approach referred to as monocular depth mapping. However, deriving accurate depth maps is a complex and ill-posed problem that often necessitates intricate calibration. Recent advances have turned to deep learning (DL) techniques to address these challenges. In the context of monocular depth estimation, we propose a novel methodology utilizing an Attention U-Net architecture (Attention UNet). By incorporating attention mechanisms, we bolster the network’s ability to extract salient features, particularly along object boundaries. Critically, this enhancement is achieved without introducing additional parameters to the networks, ensuring efficient model training. Our proposed approach is effective in producing high-quality depth maps with notable advantages. By leveraging the Attention UNet architecture, we substantially improve depth map accuracy, reducing the root mean square error (RMSE) by 0.23 on the benchmark NYU V2 dataset, Highlighting its supremacy compared to current state-of-the-art techniques.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.30
自引率
10.00%
发文量
252
期刊介绍: This Journal is established with a view to cater to increased awareness for high quality research in the seamless integration of heterogeneous technologies to formulate bankable solutions to the emergent complex engineering problems. Assurance engineering could be thought of as relating to the provision of higher confidence in the reliable and secure implementation of a system’s critical characteristic features through the espousal of a holistic approach by using a wide variety of cross disciplinary tools and techniques. Successful realization of sustainable and dependable products, systems and services involves an extensive adoption of Reliability, Quality, Safety and Risk related procedures for achieving high assurancelevels of performance; also pivotal are the management issues related to risk and uncertainty that govern the practical constraints encountered in their deployment. It is our intention to provide a platform for the modeling and analysis of large engineering systems, among the other aforementioned allied goals of systems assurance engineering, leading to the enforcement of performance enhancement measures. Achieving a fine balance between theory and practice is the primary focus. The Journal only publishes high quality papers that have passed the rigorous peer review procedure of an archival scientific Journal. The aim is an increasing number of submissions, wide circulation and a high impact factor.
期刊最新文献
Vision-based gait analysis to detect Parkinson’s disease using hybrid Harris hawks and Arithmetic optimization algorithm with Random Forest classifier Zero crossing point detection in a distorted sinusoidal signal using random forest classifier FL-XGBTC: federated learning inspired with XG-boost tuned classifier for YouTube spam content detection A generalized product adoption model under random marketing conditions Assessing e-learning platforms in higher education with reference to student satisfaction: a PLS-SEM approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1