{"title":"Object recognition in primates: what can early visual areas contribute?","authors":"Christian Quaia, Richard J. Krauzlis","doi":"10.3389/fnbeh.2024.1425496","DOIUrl":null,"url":null,"abstract":"IntroductionIf neuroscientists were asked which brain area is responsible for object recognition in primates, most would probably answer infero-temporal (IT) cortex. While IT is likely responsible for fine discriminations, and it is accordingly dominated by foveal visual inputs, there is more to object recognition than fine discrimination. Importantly, foveation of an object of interest usually requires recognizing, with reasonable confidence, its presence in the periphery. Arguably, IT plays a secondary role in such peripheral recognition, and other visual areas might instead be more critical.MethodsTo investigate how signals carried by early visual processing areas (such as LGN and V1) could be used for object recognition in the periphery, we focused here on the task of distinguishing faces from non-faces. We tested how sensitive various models were to nuisance parameters, such as changes in scale and orientation of the image, and the type of image background.ResultsWe found that a model of V1 simple or complex cells could provide quite reliable information, resulting in performance better than 80% in realistic scenarios. An LGN model performed considerably worse.DiscussionBecause peripheral recognition is both crucial to enable fine recognition (by bringing an object of interest on the fovea), and probably sufficient to account for a considerable fraction of our daily recognition-guided behavior, we think that the current focus on area IT and foveal processing is too narrow. We propose that rather than a hierarchical system with IT-like properties as its primary aim, object recognition should be seen as a parallel process, with high-accuracy foveal modules operating in parallel with lower-accuracy and faster modules that can operate across the visual field.","PeriodicalId":12368,"journal":{"name":"Frontiers in Behavioral Neuroscience","volume":"38 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Behavioral Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fnbeh.2024.1425496","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
IntroductionIf neuroscientists were asked which brain area is responsible for object recognition in primates, most would probably answer infero-temporal (IT) cortex. While IT is likely responsible for fine discriminations, and it is accordingly dominated by foveal visual inputs, there is more to object recognition than fine discrimination. Importantly, foveation of an object of interest usually requires recognizing, with reasonable confidence, its presence in the periphery. Arguably, IT plays a secondary role in such peripheral recognition, and other visual areas might instead be more critical.MethodsTo investigate how signals carried by early visual processing areas (such as LGN and V1) could be used for object recognition in the periphery, we focused here on the task of distinguishing faces from non-faces. We tested how sensitive various models were to nuisance parameters, such as changes in scale and orientation of the image, and the type of image background.ResultsWe found that a model of V1 simple or complex cells could provide quite reliable information, resulting in performance better than 80% in realistic scenarios. An LGN model performed considerably worse.DiscussionBecause peripheral recognition is both crucial to enable fine recognition (by bringing an object of interest on the fovea), and probably sufficient to account for a considerable fraction of our daily recognition-guided behavior, we think that the current focus on area IT and foveal processing is too narrow. We propose that rather than a hierarchical system with IT-like properties as its primary aim, object recognition should be seen as a parallel process, with high-accuracy foveal modules operating in parallel with lower-accuracy and faster modules that can operate across the visual field.
导言如果问神经科学家哪个脑区负责灵长类动物的物体识别,大多数人可能会回答是颞下皮层(IT)。虽然颞叶下皮层很可能负责精细辨别,而且它也相应地受到眼窝视觉输入的支配,但物体识别并不只限于精细辨别。重要的是,要对感兴趣的物体进行眼窝识别,通常需要以合理的信心识别其在外围的存在。为了研究早期视觉处理区域(如 LGN 和 V1)所携带的信号如何用于外围物体识别,我们重点研究了区分面孔和非面孔的任务。我们测试了各种模型对干扰参数(如图像比例和方向的变化以及图像背景类型)的敏感程度。结果我们发现,V1 简单或复杂细胞模型可以提供相当可靠的信息,在现实场景中的表现优于 80%。讨论由于外围识别对于实现精细识别(通过将感兴趣的对象置于眼窝)至关重要,而且可能足以解释我们日常识别引导行为的相当一部分,因此我们认为目前对 IT 区和眼窝处理的关注过于狭隘。我们建议,与其将一个具有类似 IT 特性的分层系统作为其主要目标,不如将物体识别视为一个并行过程,即高精度的眼窝模块与可在整个视野中运作的低精度和快速模块并行运作。
期刊介绍:
Frontiers in Behavioral Neuroscience is a leading journal in its field, publishing rigorously peer-reviewed research that advances our understanding of the neural mechanisms underlying behavior. Field Chief Editor Nuno Sousa at the Instituto de Pesquisa em Ciências da Vida e da Saúde (ICVS) is supported by an outstanding Editorial Board of international experts. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.
This journal publishes major insights into the neural mechanisms of animal and human behavior, and welcomes articles studying the interplay between behavior and its neurobiological basis at all levels: from molecular biology and genetics, to morphological, biochemical, neurochemical, electrophysiological, neuroendocrine, pharmacological, and neuroimaging studies.