纹理统计编码模型揭示了人类视觉皮层的层次特征选择性

Margaret M. Henderson, M. Tarr, Leila Wehbe
{"title":"纹理统计编码模型揭示了人类视觉皮层的层次特征选择性","authors":"Margaret M. Henderson, M. Tarr, Leila Wehbe","doi":"10.1101/2022.09.23.509292","DOIUrl":null,"url":null,"abstract":"Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P–S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1–V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P–S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system. SIGNIFICANCE STATEMENT Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.","PeriodicalId":22786,"journal":{"name":"The Journal of Neuroscience","volume":"48 1","pages":"4144 - 4161"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Texture Statistics Encoding Model Reveals Hierarchical Feature Selectivity across Human Visual Cortex\",\"authors\":\"Margaret M. Henderson, M. Tarr, Leila Wehbe\",\"doi\":\"10.1101/2022.09.23.509292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P–S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1–V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P–S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system. SIGNIFICANCE STATEMENT Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.\",\"PeriodicalId\":22786,\"journal\":{\"name\":\"The Journal of Neuroscience\",\"volume\":\"48 1\",\"pages\":\"4144 - 4161\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Neuroscience\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2022.09.23.509292\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Neuroscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2022.09.23.509292","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

中级特征,如轮廓和纹理,提供了低级和高级视觉表示之间的计算链接。虽然大脑中中层表征的性质尚未完全被理解,但过去的工作已经提出了一种纹理统计模型,称为P-S模型(Portilla和Simoncelli, 2000),是预测V1-V4区域神经反应以及人类行为数据的候选模型。然而,目前尚不清楚该模型如何很好地解释高级视觉皮层对自然场景图像的反应。为了验证这一点,我们基于P-S统计构建了单体素编码模型,并将模型拟合到来自自然场景数据集(Allen et al., 2022)的人类受试者(男女)的fMRI数据中。我们证明了纹理统计编码模型可以预测单个体素在早期视网膜病变区域和更高级别类别选择区域的hold - down响应。该模型可靠地预测高级视觉皮层信号的能力表明,纹理统计特征的表征在整个大脑中广泛存在。此外,使用方差划分分析,我们确定了哪些特征是最独特的预测大脑反应,并表明高阶纹理特征的贡献从早期区域增加到腹侧表面的高级区域。我们还证明,纹理统计的敏感性模式可用于恢复视觉皮层内的广泛组织轴,包括捕获语义图像内容的维度。这些结果为描述中级特征表征如何在视觉系统中分层出现提供了关键的一步。中间视觉特征,如纹理,在皮质计算中起着重要作用,可能有助于物体和场景识别等任务。在这里,我们使用过去工作中提出的纹理模型来构建编码模型,预测人类视觉皮层神经群对自然场景刺激的反应(用fMRI测量)。我们发现,该模型可以预测视觉系统多个层次的神经群体的反应,并且该模型能够揭示从早期视网膜异位皮层到腹侧和外侧视觉皮层更高区域的特征表征复杂性的增加。这些结果支持了纹理表征可能在视觉处理中发挥广泛潜在作用的观点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Texture Statistics Encoding Model Reveals Hierarchical Feature Selectivity across Human Visual Cortex
Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P–S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1–V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P–S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system. SIGNIFICANCE STATEMENT Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Erratum: Schlüter et al., “Rabphilin Knock-Out Mice Reveal That Rabphilin Is Not Required for Rab3 Function in Regulating Neurotransmitter Release” Category-selective representation of relationships in visual cortex Phosphorylation of RPT6 controls its ability to bind DNA and regulate gene expression in the hippocampus of male rats during memory formation Neural network connectivity following opioid dependence is altered by a common genetic variant in the mu-opioid receptor,OPRM1A118G An Ascending Excitatory Circuit from the Dorsal Raphe for Sensory Modulation of Pain
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1