Mohamed Aladem, Sumanth Chennupati, Zaid A. El-Shair, S. Rawashdeh
{"title":"A Comparative Study of Different CNN Encoders for Monocular Depth Prediction","authors":"Mohamed Aladem, Sumanth Chennupati, Zaid A. El-Shair, S. Rawashdeh","doi":"10.1109/NAECON46414.2019.9057857","DOIUrl":null,"url":null,"abstract":"Depth estimation of an observed scene is an important task for many domains such as mobile robotics, autonomous driving, and augmented reality. Traditionally, specialized sensors such as stereo cameras and structured light (RGB-D) ones are used to obtain depth along with color information of the environment. However, extending typical monocular cameras with the ability to infer depth information is an attractive solution. In this paper, we will demonstrate a Convolutional Neural Network (CNN) in an encoder-decoder architecture to perform monocular depth prediction. Additionally, we will evaluate and compare different CNN encoders’ performance.","PeriodicalId":193529,"journal":{"name":"2019 IEEE National Aerospace and Electronics Conference (NAECON)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE National Aerospace and Electronics Conference (NAECON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAECON46414.2019.9057857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Depth estimation of an observed scene is an important task for many domains such as mobile robotics, autonomous driving, and augmented reality. Traditionally, specialized sensors such as stereo cameras and structured light (RGB-D) ones are used to obtain depth along with color information of the environment. However, extending typical monocular cameras with the ability to infer depth information is an attractive solution. In this paper, we will demonstrate a Convolutional Neural Network (CNN) in an encoder-decoder architecture to perform monocular depth prediction. Additionally, we will evaluate and compare different CNN encoders’ performance.