用于自动驾驶的具有自校正功能的自监督双目深度估计算法

IF 2.3 4区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IET Intelligent Transport Systems Pub Date : 2024-06-19 DOI:10.1049/itr2.12522

Jingyao Bao, Hongfei Yu, Yongjia Zou, Jin Lv, Wei Liu, Yang Cao

{"title":"用于自动驾驶的具有自校正功能的自监督双目深度估计算法","authors":"Jingyao Bao, Hongfei Yu, Yongjia Zou, Jin Lv, Wei Liu, Yang Cao","doi":"10.1049/itr2.12522","DOIUrl":null,"url":null,"abstract":"<p>Aiming to address the challenge where existing methods struggle to predict accurate disparities for imperfectly rectified stereo images, and that supervised training requires a considerable amount of ground truth, a self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving is proposed. Firstly, a subnetwork dedicated to stereo rectification, aiming to estimate the homography between stereo images is developed. This homography facilitates the transformation of stereo image pairs, aligning their corresponding pixels horizontally. Secondly, a foundational self-supervised framework primarily centred on minimizing errors in stereo image reconstruction, combined with the generative-adversarial strategy is introduced. Finally, a vertical offset prediction module (VOPM) is incorporated into the basic framework to further enhance the resistance of the stereo matching network to pixel-level vertical offset errors. Experimental results on the public KITTI dataset for autonomous driving demonstrate the effectiveness of this approach in improving the disparity prediction performance for imperfectly rectified stereo images. Moreover, the self-supervised training framework exhibits superiority over state-of-the-art methods.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12522","citationCount":"0","resultStr":"{\"title\":\"Self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving\",\"authors\":\"Jingyao Bao, Hongfei Yu, Yongjia Zou, Jin Lv, Wei Liu, Yang Cao\",\"doi\":\"10.1049/itr2.12522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Aiming to address the challenge where existing methods struggle to predict accurate disparities for imperfectly rectified stereo images, and that supervised training requires a considerable amount of ground truth, a self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving is proposed. Firstly, a subnetwork dedicated to stereo rectification, aiming to estimate the homography between stereo images is developed. This homography facilitates the transformation of stereo image pairs, aligning their corresponding pixels horizontally. Secondly, a foundational self-supervised framework primarily centred on minimizing errors in stereo image reconstruction, combined with the generative-adversarial strategy is introduced. Finally, a vertical offset prediction module (VOPM) is incorporated into the basic framework to further enhance the resistance of the stereo matching network to pixel-level vertical offset errors. Experimental results on the public KITTI dataset for autonomous driving demonstrate the effectiveness of this approach in improving the disparity prediction performance for imperfectly rectified stereo images. Moreover, the self-supervised training framework exhibits superiority over state-of-the-art methods.</p>\",\"PeriodicalId\":50381,\"journal\":{\"name\":\"IET Intelligent Transport Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12522\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Intelligent Transport Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12522\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12522","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

现有的方法很难预测未完全矫正的立体图像的准确差距，而且监督训练需要大量的地面实况，为了解决这一难题，我们提出了一种用于自动驾驶的具有自矫正功能的自监督双目深度估计算法。首先，开发了一个专门用于立体矫正的子网络，旨在估算立体图像之间的同源性。这种同源性有助于立体图像对的转换，使其相应像素水平对齐。其次，结合生成-对抗策略，引入了一个基础性自监督框架，其主要核心是最大限度地减少立体图像重建中的误差。最后，在基本框架中加入了垂直偏移预测模块（VOPM），以进一步增强立体匹配网络对像素级垂直偏移误差的抵抗能力。在用于自动驾驶的公共 KITTI 数据集上的实验结果表明，这种方法能有效提高不完全校正立体图像的差距预测性能。此外，自监督训练框架比最先进的方法更具优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving

Aiming to address the challenge where existing methods struggle to predict accurate disparities for imperfectly rectified stereo images, and that supervised training requires a considerable amount of ground truth, a self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving is proposed. Firstly, a subnetwork dedicated to stereo rectification, aiming to estimate the homography between stereo images is developed. This homography facilitates the transformation of stereo image pairs, aligning their corresponding pixels horizontally. Secondly, a foundational self-supervised framework primarily centred on minimizing errors in stereo image reconstruction, combined with the generative-adversarial strategy is introduced. Finally, a vertical offset prediction module (VOPM) is incorporated into the basic framework to further enhance the resistance of the stereo matching network to pixel-level vertical offset errors. Experimental results on the public KITTI dataset for autonomous driving demonstrate the effectiveness of this approach in improving the disparity prediction performance for imperfectly rectified stereo images. Moreover, the self-supervised training framework exhibits superiority over state-of-the-art methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Intelligent Transport Systems 工程技术-运输科技

CiteScore

6.50

自引率

7.40%

发文量

159

审稿时长

3 months

期刊介绍： IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following: Sustainable traffic solutions Deployments with enabling technologies Pervasive monitoring Applications; demonstrations and evaluation Economic and behavioural analyses of ITS services and scenario Data Integration and analytics Information collection and processing; image processing applications in ITS ITS aspects of electric vehicles Autonomous vehicles; connected vehicle systems; In-vehicle ITS, safety and vulnerable road user aspects Mobility as a service systems Traffic management and control Public transport systems technologies Fleet and public transport logistics Emergency and incident management Demand management and electronic payment systems Traffic related air pollution management Policy and institutional issues Interoperability, standards and architectures Funding scenarios Enforcement Human machine interaction Education, training and outreach Current Special Issue Call for papers: Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf