{"title":"在静态图像中提取图像特征进行深度估计","authors":"M. Ogino, Junji Suzuki, M. Asada","doi":"10.1109/DEVLRN.2013.6652551","DOIUrl":null,"url":null,"abstract":"Human feels three-dimensional effect for static image with the cues of various kinds of image features such as relative sizes of objects, up and down, rules of perspective, texture gradient, and shadow. The features are called pictorial depth cues. Human is thought to learn to extract these features as important cues for depth estimation in the developmental process. In this paper, we make a hypothesis that pictorial depth cues are acquired so that disparities can be predicted well and make a model that extracts features appropriate for depth estimation from static images. Random forest network is trained to extract important ones among a large amount image features so as to estimate motion and stereo disparities. The experiments with simulation and real environments show high correlation between estimated and real disparities.","PeriodicalId":106997,"journal":{"name":"2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Extracting image features in static images for depth estimation\",\"authors\":\"M. Ogino, Junji Suzuki, M. Asada\",\"doi\":\"10.1109/DEVLRN.2013.6652551\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human feels three-dimensional effect for static image with the cues of various kinds of image features such as relative sizes of objects, up and down, rules of perspective, texture gradient, and shadow. The features are called pictorial depth cues. Human is thought to learn to extract these features as important cues for depth estimation in the developmental process. In this paper, we make a hypothesis that pictorial depth cues are acquired so that disparities can be predicted well and make a model that extracts features appropriate for depth estimation from static images. Random forest network is trained to extract important ones among a large amount image features so as to estimate motion and stereo disparities. The experiments with simulation and real environments show high correlation between estimated and real disparities.\",\"PeriodicalId\":106997,\"journal\":{\"name\":\"2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEVLRN.2013.6652551\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2013.6652551","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extracting image features in static images for depth estimation
Human feels three-dimensional effect for static image with the cues of various kinds of image features such as relative sizes of objects, up and down, rules of perspective, texture gradient, and shadow. The features are called pictorial depth cues. Human is thought to learn to extract these features as important cues for depth estimation in the developmental process. In this paper, we make a hypothesis that pictorial depth cues are acquired so that disparities can be predicted well and make a model that extracts features appropriate for depth estimation from static images. Random forest network is trained to extract important ones among a large amount image features so as to estimate motion and stereo disparities. The experiments with simulation and real environments show high correlation between estimated and real disparities.