Occluded Visual Object Recognition Using Deep Conditional Generative Adversarial Nets and Feedforward Convolutional Neural Networks

Vahid Reza Khazaie, Alireza Akhavanpour, R. Ebrahimpour
{"title":"Occluded Visual Object Recognition Using Deep Conditional Generative Adversarial Nets and Feedforward Convolutional Neural Networks","authors":"Vahid Reza Khazaie, Alireza Akhavanpour, R. Ebrahimpour","doi":"10.1109/MVIP49855.2020.9116887","DOIUrl":null,"url":null,"abstract":"Core object recognition is the task of recognizing objects without regard to any variations in the conditions like pose, illumination or any other structural modifications. This task is solved through the feedforward processing of information in the human visual system. Deep neural networks can perform like humans in this task. However, we do not know how object recognition under more challenging conditions like occlusion is solved. Some computational models imply that recurrent processing might be a solution to the beyond core object recognition task. The other potential mechanism for solving occlusion is to reconstruct the occluded part of the object taking advantage of generative models. Here we used Conditional Generative Adversarial Networks for reconstruction. For reasonable size occlusion, we were able to remove the effect of occlusion and we recovered the performance of the base model. We showed getting the benefit of GANs for reconstruction and adding information by generative models can cause a better performance in the object recognition task under occlusion.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MVIP49855.2020.9116887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Core object recognition is the task of recognizing objects without regard to any variations in the conditions like pose, illumination or any other structural modifications. This task is solved through the feedforward processing of information in the human visual system. Deep neural networks can perform like humans in this task. However, we do not know how object recognition under more challenging conditions like occlusion is solved. Some computational models imply that recurrent processing might be a solution to the beyond core object recognition task. The other potential mechanism for solving occlusion is to reconstruct the occluded part of the object taking advantage of generative models. Here we used Conditional Generative Adversarial Networks for reconstruction. For reasonable size occlusion, we were able to remove the effect of occlusion and we recovered the performance of the base model. We showed getting the benefit of GANs for reconstruction and adding information by generative models can cause a better performance in the object recognition task under occlusion.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度条件生成对抗网络和前馈卷积神经网络的遮挡视觉目标识别
核心对象识别的任务是识别对象,而不考虑任何变化的条件,如姿势,照明或任何其他结构的修改。这个任务是通过人类视觉系统对信息的前馈处理来解决的。深度神经网络可以像人类一样完成这项任务。然而,我们不知道在遮挡等更具挑战性的条件下如何解决目标识别问题。一些计算模型暗示循环处理可能是超核心对象识别任务的解决方案。另一种解决遮挡的潜在机制是利用生成模型重建被遮挡的物体部分。这里我们使用条件生成对抗网络进行重建。对于合理大小的遮挡,我们能够去除遮挡的影响,并恢复基础模型的性能。研究结果表明,利用gan进行重建并通过生成模型添加信息可以在遮挡下的目标识别任务中获得更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Offline Handwritten Signature Verification and Recognition Based on Deep Transfer Learning A High-Accuracy, Cost-Effective People Counting Solution Based on Visual Depth Data Source Camera Identification Using WLBP Descriptor Convolutional Neural Network for Building Extraction from High-Resolution Remote Sensing Images PCB Defect Detection Using Denoising Convolutional Autoencoders
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1