用于动作识别的人类视觉通路与深度卷积神经网络:后期层而非早期层的表征对应性。

IF 3.1 3区 医学 Q2 NEUROSCIENCES Journal of Cognitive Neuroscience Pub Date : 2024-10-29 DOI:10.1162/jocn_a_02233
Yujia Peng;Xizi Gong;Hongjing Lu;Fang Fang
{"title":"用于动作识别的人类视觉通路与深度卷积神经网络:后期层而非早期层的表征对应性。","authors":"Yujia Peng;Xizi Gong;Hongjing Lu;Fang Fang","doi":"10.1162/jocn_a_02233","DOIUrl":null,"url":null,"abstract":"Deep convolutional neural networks (DCNNs) have attained human-level performance for object categorization and exhibited representation alignment between network layers and brain regions. Does such representation alignment naturally extend to other visual tasks beyond recognizing objects in static images? In this study, we expanded the exploration to the recognition of human actions from videos and assessed the representation capabilities and alignment of two-stream DCNNs in comparison with brain regions situated along ventral and dorsal pathways. Using decoding analysis and representational similarity analysis, we show that DCNN models do not show hierarchical representation alignment to human brain across visual regions when processing action videos. Instead, later layers of DCNN models demonstrate greater representation similarities to the human visual cortex. These findings were revealed for two display formats: photorealistic avatars with full-body information and simplified stimuli in the point-light display. The discrepancies in representation alignment suggest fundamental differences in how DCNNs and the human brain represent dynamic visual information related to actions.","PeriodicalId":51081,"journal":{"name":"Journal of Cognitive Neuroscience","volume":"36 11","pages":"2458-2480"},"PeriodicalIF":3.1000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10738325","citationCount":"0","resultStr":"{\"title\":\"Human Visual Pathways for Action Recognition versus Deep Convolutional Neural Networks: Representation Correspondence in Late but Not Early Layers\",\"authors\":\"Yujia Peng;Xizi Gong;Hongjing Lu;Fang Fang\",\"doi\":\"10.1162/jocn_a_02233\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep convolutional neural networks (DCNNs) have attained human-level performance for object categorization and exhibited representation alignment between network layers and brain regions. Does such representation alignment naturally extend to other visual tasks beyond recognizing objects in static images? In this study, we expanded the exploration to the recognition of human actions from videos and assessed the representation capabilities and alignment of two-stream DCNNs in comparison with brain regions situated along ventral and dorsal pathways. Using decoding analysis and representational similarity analysis, we show that DCNN models do not show hierarchical representation alignment to human brain across visual regions when processing action videos. Instead, later layers of DCNN models demonstrate greater representation similarities to the human visual cortex. These findings were revealed for two display formats: photorealistic avatars with full-body information and simplified stimuli in the point-light display. The discrepancies in representation alignment suggest fundamental differences in how DCNNs and the human brain represent dynamic visual information related to actions.\",\"PeriodicalId\":51081,\"journal\":{\"name\":\"Journal of Cognitive Neuroscience\",\"volume\":\"36 11\",\"pages\":\"2458-2480\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10738325\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cognitive Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10738325/\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cognitive Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://ieeexplore.ieee.org/document/10738325/","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

深度卷积神经网络(DCNN)在物体分类方面的表现已达到人类水平,并显示出网络层与大脑区域之间的表征一致性。除了识别静态图像中的物体,这种表征一致性是否还能自然扩展到其他视觉任务?在这项研究中,我们将探索范围扩大到从视频中识别人类动作,并评估了双流 DCNN 与位于腹侧和背侧通路的脑区的表征能力和一致性。通过解码分析和表征相似性分析,我们发现 DCNN 模型在处理动作视频时并没有显示出与人脑各视觉区域的分层表征一致性。相反,DCNN 模型的后几层与人类视觉皮层表现出更大的表征相似性。这些发现针对两种显示格式:具有全身信息的逼真化身和点光源显示屏中的简化刺激。表征一致性的差异表明,DCNN 和人类大脑在如何表征与动作相关的动态视觉信息方面存在根本差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Human Visual Pathways for Action Recognition versus Deep Convolutional Neural Networks: Representation Correspondence in Late but Not Early Layers
Deep convolutional neural networks (DCNNs) have attained human-level performance for object categorization and exhibited representation alignment between network layers and brain regions. Does such representation alignment naturally extend to other visual tasks beyond recognizing objects in static images? In this study, we expanded the exploration to the recognition of human actions from videos and assessed the representation capabilities and alignment of two-stream DCNNs in comparison with brain regions situated along ventral and dorsal pathways. Using decoding analysis and representational similarity analysis, we show that DCNN models do not show hierarchical representation alignment to human brain across visual regions when processing action videos. Instead, later layers of DCNN models demonstrate greater representation similarities to the human visual cortex. These findings were revealed for two display formats: photorealistic avatars with full-body information and simplified stimuli in the point-light display. The discrepancies in representation alignment suggest fundamental differences in how DCNNs and the human brain represent dynamic visual information related to actions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Cognitive Neuroscience
Journal of Cognitive Neuroscience 医学-神经科学
CiteScore
5.30
自引率
3.10%
发文量
151
审稿时长
3-8 weeks
期刊介绍: Journal of Cognitive Neuroscience investigates brain–behavior interaction and promotes lively interchange among the mind sciences.
期刊最新文献
How Can Graph Theory Inform the Dual-stream Model of Speech Processing? A Resting-state Functional Magnetic Resonance Imaging Study of Stroke and Aphasia Symptomology. Neural Correlates of Retrieval Success and Precision: A Functional Magnetic Resonance Imaging Study. Neurophysiological and Autonomic Dynamics of Threat Processing during Sustained Social Fear Generalization. Attentional Refreshing in Working Memory and Its Interplay with Long-term Memory: A Behavioral and EEG Study. Experimental Manipulation of the Bilateral Posterior Parietal Cortex Strengthens Associative Memory in Healthy Participants: A Continuous Theta-burst Stimulation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1