Direct Gradient Calculation: Simple and Variation‐Tolerant On‐Chip Training Method for Neural Networks
Hyungyo Kim, Joon Hwang, D. Kwon, Jangsaeng Kim, Min-Kyu Park, Ji-Young Im, Byung-Gook Park, Jong-Ho Lee
Advanced Intelligent Systems, published 5 July 2021. DOI: 10.1002/aisy.202100064
Citations: 2
Abstract
On‐chip training of neural networks (NNs) is regarded as a promising training method for neuromorphic systems with analog synaptic devices. Herein, a novel on‐chip training method called direct gradient calculation (DGC) is proposed as a substitute for conventional backpropagation (BP). In this method, the gradients of a cost function with respect to the weights are calculated directly by sequentially applying a small temporal change to each weight and then measuring the resulting change in the cost value. DGC achieves accuracy similar to that of BP on a handwritten digit classification task, validating its training feasibility. In particular, DGC can be applied to analog hardware‐based convolutional NNs (CNNs), for which on‐chip training is considered challenging, enabling appropriate on‐chip training. A hybrid method that efficiently combines DGC and BP for training CNNs is also proposed; it achieves accuracy similar to that of BP and DGC while improving the training speed. Furthermore, in the presence of hardware variations (such as synaptic device conductance and neuron circuit component variations), networks trained with DGC maintain a higher level of accuracy than those trained with BP, while requiring fewer circuit components.
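The core idea described above (perturb each weight slightly, measure the change in cost, and take the ratio as the gradient) can be illustrated with a minimal software-level sketch. This is only an assumption-laden illustration of the perturbation principle: the function name dgc_gradients, the perturbation size delta, and the toy cost function are hypothetical, and the paper performs these steps with analog synaptic devices and on-chip cost measurement rather than in software.

```python
import numpy as np

def dgc_gradients(cost_fn, weights, delta=1e-3):
    """Estimate dC/dw for each weight by applying a small temporary change
    to that weight and measuring the change in the cost value.
    (Illustrative sketch of the DGC idea, not the paper's hardware method.)"""
    base_cost = cost_fn(weights)
    grads = np.zeros_like(weights)
    for idx in np.ndindex(weights.shape):
        perturbed = weights.copy()
        perturbed[idx] += delta  # small temporary change to one weight
        grads[idx] = (cost_fn(perturbed) - base_cost) / delta
    return grads

# Toy usage: quadratic cost whose exact gradient is 2*(w - target)
target = np.array([0.5, -1.0, 2.0])
cost = lambda w: np.sum((w - target) ** 2)
w = np.zeros(3)
print(dgc_gradients(cost, w))  # approximately [-1.0, 2.0, -4.0]
```

Because each weight is perturbed sequentially, the cost must be evaluated once per weight, which is why the paper pairs DGC with BP in a hybrid scheme to keep training of larger CNNs fast.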