利用计算机视觉估计自然场景中数量级和非数量级的视觉星等分布。

IF 2.2 3区 心理学 Q2 PSYCHOLOGY, EXPERIMENTAL Psychological Research-Psychologische Forschung Pub Date : 2024-12-03 DOI:10.1007/s00426-024-02064-2
Kuinan Hou, Marco Zorzi, Alberto Testolin
{"title":"利用计算机视觉估计自然场景中数量级和非数量级的视觉星等分布。","authors":"Kuinan Hou, Marco Zorzi, Alberto Testolin","doi":"10.1007/s00426-024-02064-2","DOIUrl":null,"url":null,"abstract":"<p><p>Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such \"ecological\" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.</p>","PeriodicalId":48184,"journal":{"name":"Psychological Research-Psychologische Forschung","volume":"89 1","pages":"31"},"PeriodicalIF":2.2000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision.\",\"authors\":\"Kuinan Hou, Marco Zorzi, Alberto Testolin\",\"doi\":\"10.1007/s00426-024-02064-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such \\\"ecological\\\" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.</p>\",\"PeriodicalId\":48184,\"journal\":{\"name\":\"Psychological Research-Psychologische Forschung\",\"volume\":\"89 1\",\"pages\":\"31\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychological Research-Psychologische Forschung\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1007/s00426-024-02064-2\",\"RegionNum\":3,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY, EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological Research-Psychologische Forschung","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1007/s00426-024-02064-2","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

人类和许多动物一样具有感知和近似表示视觉场景中物体数量的能力。这种能力在整个童年时期都在提高,这表明学习和发展在塑造我们的数字感方面起着关键作用。基于深度学习的计算研究进一步支持了这一假设,这些研究表明,在学习具有不同数量的图像的统计结构的神经网络中,数字感知可以自发地出现。然而,神经网络模型通常使用合成数据集进行训练,这些数据集可能不能忠实地反映自然环境的统计结构,并且人们对使用更多的生态视觉刺激来研究人类的数字感知也越来越感兴趣。在这项工作中,我们利用计算机视觉算法的最新进展来设计和实现一个原始管道,该管道可用于估计包含数千个描绘日常生活中物体的真实图像的大规模数据集中的数量级和非数值量级的分布。我们证明了在自然视觉场景中,不同数字出现的频率遵循幂律分布。此外,我们表明,在数据集和场景类型(同质与异构对象集)中,数量和连续数量级的相关结构是稳定的。我们认为,考虑这种“生态”的协方差模式对于理解非数字视觉线索对数量判断的影响是重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision.

Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such "ecological" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.10
自引率
8.70%
发文量
137
期刊介绍: Psychological Research/Psychologische Forschung publishes articles that contribute to a basic understanding of human perception, attention, memory, and action. The Journal is devoted to the dissemination of knowledge based on firm experimental ground, but not to particular approaches or schools of thought. Theoretical and historical papers are welcome to the extent that they serve this general purpose; papers of an applied nature are acceptable if they contribute to basic understanding or serve to bridge the often felt gap between basic and applied research in the field covered by the Journal.
期刊最新文献
Decomposing delta plots: exploring the time course of the congruency effect using inhibition and facilitation curves. Grounded cognition and the representation of momentum: abstract concepts modulate mislocalization. Can't help processing numbers with text: Eye-tracking evidence for simultaneous instead of sequential processing of text and numbers in arithmetic word problems. Effect of spatial training on space-number mapping: a situated cognition account. Action toward sound sources enhances auditory spatial confidence: on the metacognitive consequences of reaching to sounds.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1