实现人类水平的视觉系统

IF 4.4 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Science China Technological Sciences Pub Date : 2024-07-30 DOI:10.1007/s11431-024-2762-5
JianHao Ding, TieJun Huang
{"title":"实现人类水平的视觉系统","authors":"JianHao Ding, TieJun Huang","doi":"10.1007/s11431-024-2762-5","DOIUrl":null,"url":null,"abstract":"<p>The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.</p>","PeriodicalId":21612,"journal":{"name":"Science China Technological Sciences","volume":"49 1","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards human-leveled vision systems\",\"authors\":\"JianHao Ding, TieJun Huang\",\"doi\":\"10.1007/s11431-024-2762-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.</p>\",\"PeriodicalId\":21612,\"journal\":{\"name\":\"Science China Technological Sciences\",\"volume\":\"49 1\",\"pages\":\"\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science China Technological Sciences\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11431-024-2762-5\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science China Technological Sciences","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11431-024-2762-5","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

人类视觉系统是一个由数十亿个神经元组成的复杂且相互关联的网络。它在将环境光刺激转化为引导和塑造人类感知和行动的信息方面发挥着至关重要的作用。视觉系统研究旨在揭示人类视觉感知的基本神经结构原理及其可能的应用。目前主要有两种方法:生物系统分析与模拟、基于深度学习的人工智能模型。在此,我们旨在讨论人类级视觉系统的两种方法。深度学习在表征、建模和硬件设计方面取得的成就极大地影响了视觉领域。然而,深度学习模型与人类视觉系统在可扩展性、可移植性和可持续性方面仍有很大差距。通过进一步了解生物视觉系统不同组成部分的特性和功能,生物视觉系统的研究进展有助于填补这一空白。我们以重建视网膜的努力为例,说明即使我们现在无法在计算机上复制视觉系统,但结合神经科学领域现有的研究成果,我们仍然可以学到很多东西。在文章的最后,我们建议从生物结构的计算对应物出发,逐步回溯建立视觉系统,以便在未来实现人类水平的视觉系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Towards human-leveled vision systems

The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Science China Technological Sciences
Science China Technological Sciences ENGINEERING, MULTIDISCIPLINARY-MATERIALS SCIENCE, MULTIDISCIPLINARY
CiteScore
8.40
自引率
10.90%
发文量
4380
审稿时长
3.3 months
期刊介绍: Science China Technological Sciences, an academic journal cosponsored by the Chinese Academy of Sciences and the National Natural Science Foundation of China, and published by Science China Press, is committed to publishing high-quality, original results in both basic and applied research. Science China Technological Sciences is published in both print and electronic forms. It is indexed by Science Citation Index. Categories of articles: Reviews summarize representative results and achievements in a particular topic or an area, comment on the current state of research, and advise on the research directions. The author’s own opinion and related discussion is requested. Research papers report on important original results in all areas of technological sciences. Brief reports present short reports in a timely manner of the latest important results.
期刊最新文献
A novel method for extracting and optimizing the complex permittivity of paper-based composites based on an artificial neural network model A systematic framework of constructing surrogate model for slider track peeling strength prediction Bridging the Fabry–Perot cavity and asymmetric Berreman mode for long-wave infrared nonreciprocal thermal emitters Unveiling the protective role of biofilm formation on the photoaging of microplastics Adhesive hydrogel interface for enhanced epidermal signal
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1