Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai
{"title":"神经常微分方程的光学计算","authors":"Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai","doi":"10.1117/12.3005767","DOIUrl":null,"url":null,"abstract":"Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.","PeriodicalId":298662,"journal":{"name":"Applied Optics and Photonics China","volume":" 84","pages":"129660P - 129660P-5"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optical computing for neural ordinary differential equations\",\"authors\":\"Yun Zhao, Hang Chen, Min Lin, Haiou Zhang, Tao Yan, Xing Lin, Ruqi Huang, Qionghai Dai\",\"doi\":\"10.1117/12.3005767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.\",\"PeriodicalId\":298662,\"journal\":{\"name\":\"Applied Optics and Photonics China\",\"volume\":\" 84\",\"pages\":\"129660P - 129660P-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Optics and Photonics China\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.3005767\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Optics and Photonics China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3005767","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optical computing for neural ordinary differential equations
Increasing the layer number can improve the model performance of on-chip optical neural networks (ONNs). However, this results in larger integrated photonic chip areas due to the successive cascading of network hidden layers. We introduce a novel architecture for optical computing based on neural ordinary differential equations (ODEs) that employing optical ODE solvers to parameterize the continuous dynamics of hidden layers. The architecture comprises ONNs followed by a photonic integrator and an optical feedback loop, which can be configured to represent residual neural networks (ResNets) and implement the function of recurrent neural networks with effectively reduced chip area occupancy. For the interference-based optoelectronic nonlinear hidden layer, we demonstrate that the single hidden layer architecture can achieve approximately the same accuracy as the two-layer optical ResNets in image classification tasks. Furthermore, the architecture improves the model classification accuracy for the diffraction-based all-optical linear hidden layer. We also utilize the time-dependent dynamics property of architecture for trajectory prediction with high accuracy.