硅光学神经网络芯片的混合精度量化

IF 2.2 3区物理与天体物理 Q2 OPTICS Optics Communications Pub Date : 2024-10-23 DOI:10.1016/j.optcom.2024.131231

Ye Zhang , Ruiting Wang , Yejin Zhang , Jiaoqing Pan

{"title":"硅光学神经网络芯片的混合精度量化","authors":"Ye Zhang , Ruiting Wang , Yejin Zhang , Jiaoqing Pan","doi":"10.1016/j.optcom.2024.131231","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the field of neural network research has witnessed remarkable advancements in various domains. One of the emerging approaches is the integration of photonic computing, which leverages the unique properties of light for ultra-fast information processing. In this article, we establish a mixed precision quantization model to silicon-based optical neural networks and evaluates their performance on the MNIST and Fashion-MNIST datasets. Through a genetic algorithm-based optimization process, we achieve significant parameter compression while maintaining competitive accuracy. Our findings demonstrate that with an average quantization bitwidth of 4.5 bits on the MNIST dataset, we achieve an impressive 85.94% reduction in parameter size compared to traditional 32-bit networks, with only a marginal accuracy drop of 0.65%. Similarly, on the Fashion-MNIST dataset, we achieve an average quantization bitwidth of 5.67 bits, resulting in an 82.28% reduction in parameter size with a slight accuracy drop of 0.8%.</div></div>","PeriodicalId":19586,"journal":{"name":"Optics Communications","volume":"574 ","pages":"Article 131231"},"PeriodicalIF":2.2000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mixed precision quantization of silicon optical neural network chip\",\"authors\":\"Ye Zhang , Ruiting Wang , Yejin Zhang , Jiaoqing Pan\",\"doi\":\"10.1016/j.optcom.2024.131231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, the field of neural network research has witnessed remarkable advancements in various domains. One of the emerging approaches is the integration of photonic computing, which leverages the unique properties of light for ultra-fast information processing. In this article, we establish a mixed precision quantization model to silicon-based optical neural networks and evaluates their performance on the MNIST and Fashion-MNIST datasets. Through a genetic algorithm-based optimization process, we achieve significant parameter compression while maintaining competitive accuracy. Our findings demonstrate that with an average quantization bitwidth of 4.5 bits on the MNIST dataset, we achieve an impressive 85.94% reduction in parameter size compared to traditional 32-bit networks, with only a marginal accuracy drop of 0.65%. Similarly, on the Fashion-MNIST dataset, we achieve an average quantization bitwidth of 5.67 bits, resulting in an 82.28% reduction in parameter size with a slight accuracy drop of 0.8%.</div></div>\",\"PeriodicalId\":19586,\"journal\":{\"name\":\"Optics Communications\",\"volume\":\"574 \",\"pages\":\"Article 131231\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optics Communications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0030401824009684\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030401824009684","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

摘要

近年来，神经网络研究领域在各个领域都取得了显著进展。光子计算是新兴的方法之一，它利用光的独特特性进行超快信息处理。在本文中，我们为硅基光学神经网络建立了一个混合精度量化模型，并评估了它们在 MNIST 和 Fashion-MNIST 数据集上的性能。通过基于遗传算法的优化过程，我们实现了显著的参数压缩，同时保持了具有竞争力的精度。我们的研究结果表明，与传统的 32 位网络相比，在平均量化位宽为 4.5 位的 MNIST 数据集上，我们实现了令人印象深刻的 85.94% 的参数缩减，而准确率仅下降了 0.65%。同样，在时尚-MNIST 数据集上，我们实现了 5.67 比特的平均量化位宽，从而将参数大小减少了 82.28%，准确率却略微下降了 0.8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Mixed precision quantization of silicon optical neural network chip

In recent years, the field of neural network research has witnessed remarkable advancements in various domains. One of the emerging approaches is the integration of photonic computing, which leverages the unique properties of light for ultra-fast information processing. In this article, we establish a mixed precision quantization model to silicon-based optical neural networks and evaluates their performance on the MNIST and Fashion-MNIST datasets. Through a genetic algorithm-based optimization process, we achieve significant parameter compression while maintaining competitive accuracy. Our findings demonstrate that with an average quantization bitwidth of 4.5 bits on the MNIST dataset, we achieve an impressive 85.94% reduction in parameter size compared to traditional 32-bit networks, with only a marginal accuracy drop of 0.65%. Similarly, on the Fashion-MNIST dataset, we achieve an average quantization bitwidth of 5.67 bits, resulting in an 82.28% reduction in parameter size with a slight accuracy drop of 0.8%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Optics Communications 物理-光学

CiteScore

5.10

自引率

8.30%

发文量

681

审稿时长

38 days

期刊介绍： Optics Communications invites original and timely contributions containing new results in various fields of optics and photonics. The journal considers theoretical and experimental research in areas ranging from the fundamental properties of light to technological applications. Topics covered include classical and quantum optics, optical physics and light-matter interactions, lasers, imaging, guided-wave optics and optical information processing. Manuscripts should offer clear evidence of novelty and significance. Papers concentrating on mathematical and computational issues, with limited connection to optics, are not suitable for publication in the Journal. Similarly, small technical advances, or papers concerned only with engineering applications or issues of materials science fall outside the journal scope.