Digital Modeling on Large Kernel Metamaterial Neural Network.

IF 0.5 4区计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Journal of Imaging Science and Technology Pub Date : 2023-11-01 DOI:10.2352/j.imagingsci.technol.2023.67.6.060404

Quan Liu, Hanyu Zheng, Brandon T Swartz, Ho Hin Lee, Zuhayr Asad, Ivan Kravchenko, Jason G Valentine, Yuankai Huo

{"title":"Digital Modeling on Large Kernel Metamaterial Neural Network.","authors":"Quan Liu, Hanyu Zheng, Brandon T Swartz, Ho Hin Lee, Zuhayr Asad, Ivan Kravchenko, Jason G Valentine, Yuankai Huo","doi":"10.2352/j.imagingsci.technol.2023.67.6.060404","DOIUrl":null,"url":null,"abstract":"<p><p>Deep neural networks (DNNs) utilized recently are physically deployed with computational units (e.g., CPUs and GPUs). Such a design might lead to a heavy computational burden, significant latency, and intensive power consumption, which are critical limitations in applications such as the Internet of Things (IoT), edge computing, and the usage of drones. Recent advances in optical computational units (e.g., metamaterial) have shed light on energy-free and light-speed neural networks. However, the digital design of the metamaterial neural network (MNN) is fundamentally limited by its physical limitations, such as precision, noise, and bandwidth during fabrication. Moreover, the unique advantages of MNN's (e.g., light-speed computation) are not fully explored via standard 3×3 convolution kernels. In this paper, we propose a novel large kernel metamaterial neural network (LMNN) that maximizes the digital capacity of the state-of-the-art (SOTA) MNN with model re-parametrization and network compression, while also considering the optical limitation explicitly. The new digital learning scheme can maximize the learning capacity of MNN while modeling the physical restrictions of meta-optic. With the proposed LMNN, the computation cost of the convolutional front-end can be offloaded into fabricated optical hardware. The experimental results on two publicly available datasets demonstrate that the optimized hybrid design improved classification accuracy while reducing computational latency. The development of the proposed LMNN is a promising step towards the ultimate goal of energy-free and light-speed AI.</p>","PeriodicalId":15924,"journal":{"name":"Journal of Imaging Science and Technology","volume":"67 6","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10970463/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Imaging Science and Technology","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.2352/j.imagingsci.technol.2023.67.6.060404","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Deep neural networks (DNNs) utilized recently are physically deployed with computational units (e.g., CPUs and GPUs). Such a design might lead to a heavy computational burden, significant latency, and intensive power consumption, which are critical limitations in applications such as the Internet of Things (IoT), edge computing, and the usage of drones. Recent advances in optical computational units (e.g., metamaterial) have shed light on energy-free and light-speed neural networks. However, the digital design of the metamaterial neural network (MNN) is fundamentally limited by its physical limitations, such as precision, noise, and bandwidth during fabrication. Moreover, the unique advantages of MNN's (e.g., light-speed computation) are not fully explored via standard 3×3 convolution kernels. In this paper, we propose a novel large kernel metamaterial neural network (LMNN) that maximizes the digital capacity of the state-of-the-art (SOTA) MNN with model re-parametrization and network compression, while also considering the optical limitation explicitly. The new digital learning scheme can maximize the learning capacity of MNN while modeling the physical restrictions of meta-optic. With the proposed LMNN, the computation cost of the convolutional front-end can be offloaded into fabricated optical hardware. The experimental results on two publicly available datasets demonstrate that the optimized hybrid design improved classification accuracy while reducing computational latency. The development of the proposed LMNN is a promising step towards the ultimate goal of energy-free and light-speed AI.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

大内核超材料神经网络的数字建模

最近使用的深度神经网络（DNN）是与计算单元（如 CPU 和 GPU）一起物理部署的。这种设计可能会导致沉重的计算负担、明显的延迟和密集的功耗，这些都是物联网（IoT）、边缘计算和无人机等应用的关键限制因素。光学计算单元（如超材料）的最新进展为无能耗和光速神经网络带来了曙光。然而，超材料神经网络（MNN）的数字设计从根本上受限于其物理限制，如制造过程中的精度、噪声和带宽。此外，MNN 的独特优势（如光速计算）并没有通过标准 3×3 卷积核得到充分发挥。在本文中，我们提出了一种新型大核超材料神经网络（LMNN），通过模型重参数化和网络压缩，最大限度地提高了最先进（SOTA）MNN 的数字容量，同时还明确考虑了光学限制。新的数字学习方案可以最大限度地提高 MNN 的学习能力，同时模拟元光学的物理限制。利用所提出的 LMNN，卷积前端的计算成本可被卸载到制造的光学硬件中。在两个公开数据集上的实验结果表明，优化的混合设计提高了分类精度，同时降低了计算延迟。拟议 LMNN 的开发是实现无能耗、光速人工智能终极目标的重要一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Imaging Science and Technology 工程技术-成像科学与照相技术

CiteScore

2.00

自引率

10.00%

发文量

审稿时长

>12 weeks

期刊介绍： Typical issues include research papers and/or comprehensive reviews from a variety of topical areas. In the spirit of fostering constructive scientific dialog, the Journal accepts Letters to the Editor commenting on previously published articles. Periodically the Journal features a Special Section containing a group of related— usually invited—papers introduced by a Guest Editor. Imaging research topics that have coverage in JIST include: Digital fabrication and biofabrication; Digital printing technologies; 3D imaging: capture, display, and print; Augmented and virtual reality systems; Mobile imaging; Computational and digital photography; Machine vision and learning; Data visualization and analysis; Image and video quality evaluation; Color image science; Image archiving, permanence, and security; Imaging applications including astronomy, medicine, sports, and autonomous vehicles.