神经常微分方程的光学计算

Applied Optics and Photonics China Pub Date : 2023-12-18 DOI:10.1117/12.3005767

Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai

{"title":"神经常微分方程的光学计算","authors":"Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai","doi":"10.1117/12.3005767","DOIUrl":null,"url":null,"abstract":"Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.","PeriodicalId":298662,"journal":{"name":"Applied Optics and Photonics China","volume":" 84","pages":"129660P - 129660P-5"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optical computing for neural ordinary differential equations\",\"authors\":\"Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai\",\"doi\":\"10.1117/12.3005767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.\",\"PeriodicalId\":298662,\"journal\":{\"name\":\"Applied Optics and Photonics China\",\"volume\":\" 84\",\"pages\":\"129660P - 129660P-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Optics and Photonics China\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.3005767\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Optics and Photonics China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3005767","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

增加层数可以提高片上光神经网络（ONN）的模型性能。然而，由于网络隐藏层的连续级联，这会导致集成光子芯片面积增大。我们介绍了一种基于神经常微分方程（ODE）的新型光学计算架构，该架构采用光学 ODE 求解器对隐藏层的连续动态进行参数化。该架构由神经网络（ONN）、光子积分器和光反馈回路组成，可配置为代表残差神经网络（ResNets），并在有效减少芯片面积占用的情况下实现递归神经网络的功能。对于基于干涉的光电非线性隐藏层，我们证明了在图像分类任务中，单隐藏层架构可以达到与双层光学 ResNets 大致相同的精度。此外，该架构还提高了基于衍射的全光学线性隐藏层的模型分类精度。我们还利用该架构的时变动态特性进行了高精度轨迹预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optical computing for neural ordinary differential equations

Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Optics and Photonics China

自引率

0.00%

发文量