Vahid Reza Khazaie, Alireza Akhavanpour, R. Ebrahimpour
{"title":"Occluded Visual Object Recognition Using Deep Conditional Generative Adversarial Nets and Feedforward Convolutional Neural Networks","authors":"Vahid Reza Khazaie, Alireza Akhavanpour, R. Ebrahimpour","doi":"10.1109/MVIP49855.2020.9116887","DOIUrl":null,"url":null,"abstract":"Core object recognition is the task of recognizing objects without regard to any variations in the conditions like pose, illumination or any other structural modifications. This task is solved through the feedforward processing of information in the human visual system. Deep neural networks can perform like humans in this task. However, we do not know how object recognition under more challenging conditions like occlusion is solved. Some computational models imply that recurrent processing might be a solution to the beyond core object recognition task. The other potential mechanism for solving occlusion is to reconstruct the occluded part of the object taking advantage of generative models. Here we used Conditional Generative Adversarial Networks for reconstruction. For reasonable size occlusion, we were able to remove the effect of occlusion and we recovered the performance of the base model. We showed getting the benefit of GANs for reconstruction and adding information by generative models can cause a better performance in the object recognition task under occlusion.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MVIP49855.2020.9116887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Core object recognition is the task of recognizing objects without regard to any variations in the conditions like pose, illumination or any other structural modifications. This task is solved through the feedforward processing of information in the human visual system. Deep neural networks can perform like humans in this task. However, we do not know how object recognition under more challenging conditions like occlusion is solved. Some computational models imply that recurrent processing might be a solution to the beyond core object recognition task. The other potential mechanism for solving occlusion is to reconstruct the occluded part of the object taking advantage of generative models. Here we used Conditional Generative Adversarial Networks for reconstruction. For reasonable size occlusion, we were able to remove the effect of occlusion and we recovered the performance of the base model. We showed getting the benefit of GANs for reconstruction and adding information by generative models can cause a better performance in the object recognition task under occlusion.