Optimal estimation of local motion-in-depth with naturalistic stimuli.

IF 4.4 2区 医学 Q1 NEUROSCIENCES Journal of Neuroscience Pub Date : 2024-11-26 DOI:10.1523/JNEUROSCI.0490-24.2024
Daniel Herrera-Esposito, Johannes Burge
{"title":"Optimal estimation of local motion-in-depth with naturalistic stimuli.","authors":"Daniel Herrera-Esposito, Johannes Burge","doi":"10.1523/JNEUROSCI.0490-24.2024","DOIUrl":null,"url":null,"abstract":"<p><p>Estimating the motion of objects in depth is important for behavior, and is strongly supported by binocular visual cues. To understand both how the brain should estimate motion in depth and how natural constraints shape and limit performance in two local 3D motion tasks, we develop image-computable ideal observers from a large number of binocular video clips created from a dataset of natural images. The observers spatio-temporally filter the videos, and non-linearly decode 3D motion from the filter responses. The optimal filters and decoder are dictated by the task-relevant image statistics, and are specific to each task. Multiple findings emerge. First, two distinct filter subpopulations are spontaneously learned for each task. For 3D speed estimation, filters emerge for processing either changing disparities over time (CDOT) or interocular velocity differences (IOVD), cues that are used by humans. For 3D direction estimation, filters emerge for discriminating either left-right or towards-away motion. Second, the filter responses, conditioned on the latent variable, are well-described as jointly Gaussian, and the covariance of the filter responses carries the information about the task-relevant latent variable. Quadratic combination is thus necessary for optimal decoding, which can be implemented by biologically plausible neural computations. Finally, the ideal observer yields non-obvious-and in some cases counter-intuitive-patterns of performance like those exhibited by humans. Important characteristics of human 3D motion processing and estimation may therefore result from optimal information processing in the early visual system.<b>Significance statement</b> Humans and other animals extract and process features of natural images that are useful for estimating motion-in-depth, an ability that is crucial for successful interaction with the environment. But the enormous diversity of natural visual inputs that are consistent with a given 3D motion-natural stimulus variability-presents a challenging computational problem. The neural populations that support the estimation of motion-in-depth are under active investigation. Here, we study how to optimally estimate local 3D motion with naturalistic stimulus variability. We show that the optimal computations are biologically plausible, and that they reproduce sometimes counterintuitive performance patterns independently reported in the human psychophysical literature. Novel testable hypotheses for future neurophysiological and psychophysical research are discussed.</p>","PeriodicalId":50114,"journal":{"name":"Journal of Neuroscience","volume":" ","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1523/JNEUROSCI.0490-24.2024","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Estimating the motion of objects in depth is important for behavior, and is strongly supported by binocular visual cues. To understand both how the brain should estimate motion in depth and how natural constraints shape and limit performance in two local 3D motion tasks, we develop image-computable ideal observers from a large number of binocular video clips created from a dataset of natural images. The observers spatio-temporally filter the videos, and non-linearly decode 3D motion from the filter responses. The optimal filters and decoder are dictated by the task-relevant image statistics, and are specific to each task. Multiple findings emerge. First, two distinct filter subpopulations are spontaneously learned for each task. For 3D speed estimation, filters emerge for processing either changing disparities over time (CDOT) or interocular velocity differences (IOVD), cues that are used by humans. For 3D direction estimation, filters emerge for discriminating either left-right or towards-away motion. Second, the filter responses, conditioned on the latent variable, are well-described as jointly Gaussian, and the covariance of the filter responses carries the information about the task-relevant latent variable. Quadratic combination is thus necessary for optimal decoding, which can be implemented by biologically plausible neural computations. Finally, the ideal observer yields non-obvious-and in some cases counter-intuitive-patterns of performance like those exhibited by humans. Important characteristics of human 3D motion processing and estimation may therefore result from optimal information processing in the early visual system.Significance statement Humans and other animals extract and process features of natural images that are useful for estimating motion-in-depth, an ability that is crucial for successful interaction with the environment. But the enormous diversity of natural visual inputs that are consistent with a given 3D motion-natural stimulus variability-presents a challenging computational problem. The neural populations that support the estimation of motion-in-depth are under active investigation. Here, we study how to optimally estimate local 3D motion with naturalistic stimulus variability. We show that the optimal computations are biologically plausible, and that they reproduce sometimes counterintuitive performance patterns independently reported in the human psychophysical literature. Novel testable hypotheses for future neurophysiological and psychophysical research are discussed.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用自然刺激对局部运动深度进行优化估计。
估计物体在深度方向上的运动对行为非常重要,并且得到双目视觉线索的有力支持。为了了解大脑应如何估计深度运动,以及自然约束如何塑造和限制两个局部三维运动任务的表现,我们从自然图像数据集中创建的大量双目视频片段中开发了可进行图像计算的理想观察者。观测器对视频进行时空过滤,并根据过滤响应对三维运动进行非线性解码。最佳滤波器和解码器由与任务相关的图像统计数据决定,并且针对每项任务。研究得出了多项发现。首先,每个任务都会自发学习到两个不同的滤波器子群。对于三维速度估计,滤波器用于处理随时间变化的差距(CDOT)或眼间速度差(IOVD),这些都是人类使用的线索。对于三维方向估计,滤波器的出现是为了区分左右运动或朝向远方的运动。其次,以潜在变量为条件的滤波器响应被很好地描述为联合高斯,滤波器响应的协方差包含了任务相关潜在变量的信息。因此,二次组合是最佳解码的必要条件,这可以通过生物学上可信的神经计算来实现。最后,理想观察者会产生非显而易见的性能模式,在某些情况下甚至与人类表现出的性能模式背道而驰。因此,人类三维运动处理和估计的重要特征可能来自于早期视觉系统的最佳信息处理。意义声明 人类和其他动物提取并处理自然图像的特征,这些特征有助于估计深度运动,这种能力对于成功与环境互动至关重要。但是,与特定三维运动相一致的自然视觉输入的巨大多样性--自然刺激的可变性--提出了一个具有挑战性的计算问题。目前正在积极研究支持深度运动估计的神经群。在这里,我们研究了如何利用自然刺激变异性来优化局部三维运动的估计。我们的研究表明,最优计算在生物学上是可行的,而且它们重现了人类心理物理文献中独立报道的有时与直觉相反的表现模式。我们还讨论了未来神经生理学和心理物理学研究的新的可检验假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Neuroscience
Journal of Neuroscience 医学-神经科学
CiteScore
9.30
自引率
3.80%
发文量
1164
审稿时长
12 months
期刊介绍: JNeurosci (ISSN 0270-6474) is an official journal of the Society for Neuroscience. It is published weekly by the Society, fifty weeks a year, one volume a year. JNeurosci publishes papers on a broad range of topics of general interest to those working on the nervous system. Authors now have an Open Choice option for their published articles
期刊最新文献
A "Conscious" Loss of Balance: Directing Attention to Movement Can Impair the Cortical Response to Postural Perturbations. Haploinsufficiency of Syngap1 in Striatal Indirect Pathway Neurons Alters Motor and Goal-Directed Behaviors in Mice. How Distributed Subcortical Integration of Reward and Threat May Inform Subsequent Approach-Avoidance Decisions. The Spatiotemporal Dynamics of Bottom-Up and Top-Down Processing during At-a-Glance Reading. The Alzheimer's Disease Risk Gene CD2AP Functions in Dendritic Spines by Remodeling F-Actin.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1