Single-chip photonic deep neural network with forward-only training

IF 32.9 1区 物理与天体物理 Q1 OPTICS Nature Photonics Pub Date : 2024-12-02 DOI:10.1038/s41566-024-01567-z
Saumil Bandyopadhyay, Alexander Sludds, Stefan Krastanov, Ryan Hamerly, Nicholas Harris, Darius Bunandar, Matthew Streshinsky, Michael Hochberg, Dirk Englund
{"title":"Single-chip photonic deep neural network with forward-only training","authors":"Saumil Bandyopadhyay, Alexander Sludds, Stefan Krastanov, Ryan Hamerly, Nicholas Harris, Darius Bunandar, Matthew Streshinsky, Michael Hochberg, Dirk Englund","doi":"10.1038/s41566-024-01567-z","DOIUrl":null,"url":null,"abstract":"As deep neural networks revolutionize machine learning, energy consumption and throughput are emerging as fundamental limitations of complementary metal–oxide–semiconductor (CMOS) electronics. This has motivated a search for new hardware architectures optimized for artificial intelligence, such as electronic systolic arrays, memristor crossbar arrays and optical accelerators. Optical systems can perform linear matrix operations at an exceptionally high rate and efficiency, motivating recent demonstrations of low-latency matrix accelerators and optoelectronic image classifiers. However, demonstrating coherent, ultralow-latency optical processing of deep neural networks has remained an outstanding challenge. Here we realize such a system in a scalable photonic integrated circuit that monolithically integrates multiple coherent optical processor units for matrix algebra and nonlinear activation functions into a single chip. We experimentally demonstrate this fully integrated coherent optical neural network architecture for a deep neural network with six neurons and three layers that optically computes both linear and nonlinear functions with a latency of 410 ps, unlocking new applications that require ultrafast, direct processing of optical signals. We implement backpropagation-free in situ training on this system, achieving 92.5% accuracy on a six-class vowel classification task, which is comparable to the accuracy obtained on a digital computer. This work lends experimental evidence to theoretical proposals for in situ training, enabling orders of magnitude improvements in the throughput of training data. Moreover, the fully integrated coherent optical neural network opens the path to inference at nanosecond latency and femtojoule per operation energy efficiency. Researchers experimentally demonstrate a fully integrated coherent optical neural network. The system, with six neurons and three layers, operates with a latency of 410 ps.","PeriodicalId":18926,"journal":{"name":"Nature Photonics","volume":"18 12","pages":"1335-1343"},"PeriodicalIF":32.9000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Photonics","FirstCategoryId":"101","ListUrlMain":"https://www.nature.com/articles/s41566-024-01567-z","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

Abstract

As deep neural networks revolutionize machine learning, energy consumption and throughput are emerging as fundamental limitations of complementary metal–oxide–semiconductor (CMOS) electronics. This has motivated a search for new hardware architectures optimized for artificial intelligence, such as electronic systolic arrays, memristor crossbar arrays and optical accelerators. Optical systems can perform linear matrix operations at an exceptionally high rate and efficiency, motivating recent demonstrations of low-latency matrix accelerators and optoelectronic image classifiers. However, demonstrating coherent, ultralow-latency optical processing of deep neural networks has remained an outstanding challenge. Here we realize such a system in a scalable photonic integrated circuit that monolithically integrates multiple coherent optical processor units for matrix algebra and nonlinear activation functions into a single chip. We experimentally demonstrate this fully integrated coherent optical neural network architecture for a deep neural network with six neurons and three layers that optically computes both linear and nonlinear functions with a latency of 410 ps, unlocking new applications that require ultrafast, direct processing of optical signals. We implement backpropagation-free in situ training on this system, achieving 92.5% accuracy on a six-class vowel classification task, which is comparable to the accuracy obtained on a digital computer. This work lends experimental evidence to theoretical proposals for in situ training, enabling orders of magnitude improvements in the throughput of training data. Moreover, the fully integrated coherent optical neural network opens the path to inference at nanosecond latency and femtojoule per operation energy efficiency. Researchers experimentally demonstrate a fully integrated coherent optical neural network. The system, with six neurons and three layers, operates with a latency of 410 ps.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
单片光子深度神经网络的正向训练
随着深度神经网络彻底改变机器学习,能量消耗和吞吐量正在成为互补金属氧化物半导体(CMOS)电子产品的基本限制。这促使人们寻找针对人工智能优化的新硬件架构,如电子收缩阵列、忆阻交叉栅阵列和光学加速器。光学系统可以以极高的速率和效率执行线性矩阵运算,这激发了最近低延迟矩阵加速器和光电图像分类器的演示。然而,展示深度神经网络的相干、超低延迟光学处理仍然是一个突出的挑战。我们在一个可扩展的光子集成电路中实现了这样一个系统,该系统将用于矩阵代数和非线性激活函数的多个相干光处理器单元单片集成到单个芯片中。我们通过实验展示了这种完全集成的相干光神经网络架构,用于具有六个神经元和三层的深度神经网络,该网络以410 ps的延迟光学计算线性和非线性函数,解锁了需要超快速,直接处理光信号的新应用。我们在该系统上实现了无反向传播的原位训练,在六类元音分类任务上实现了92.5%的准确率,与数字计算机上获得的准确率相当。这项工作为现场训练的理论建议提供了实验证据,使训练数据的吞吐量得到了数量级的提高。此外,完全集成的相干光神经网络开辟了以纳秒延迟和飞焦耳每操作能量效率进行推理的道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Nature Photonics
Nature Photonics 物理-光学
CiteScore
54.20
自引率
1.70%
发文量
158
审稿时长
12 months
期刊介绍: Nature Photonics is a monthly journal dedicated to the scientific study and application of light, known as Photonics. It publishes top-quality, peer-reviewed research across all areas of light generation, manipulation, and detection. The journal encompasses research into the fundamental properties of light and its interactions with matter, as well as the latest developments in optoelectronic devices and emerging photonics applications. Topics covered include lasers, LEDs, imaging, detectors, optoelectronic devices, quantum optics, biophotonics, optical data storage, spectroscopy, fiber optics, solar energy, displays, terahertz technology, nonlinear optics, plasmonics, nanophotonics, and X-rays. In addition to research papers and review articles summarizing scientific findings in optoelectronics, Nature Photonics also features News and Views pieces and research highlights. It uniquely includes articles on the business aspects of the industry, such as technology commercialization and market analysis, offering a comprehensive perspective on the field.
期刊最新文献
Ytterbium spectroscopy for new physics searches Colloidal quantum dots shine in the mid-infrared Unveiling the nanoscale dynamics of electroluminescence Fourier ptychography sharpens terahertz imaging Hard disk drive capacity limits smashed by nanofocusing of plasmons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1