{"title":"On the Control of Computing-in-memory Devices with Resource-efficient Digital Circuits towards their On-chip Learning","authors":"Tatsuya Kaneko, Hiroshi Momose, Hitoshi Suwa, Takashi Ono, Yuriko Hayata, Kazuyuki Kouno, Tetsuya Asai","doi":"10.1587/nolta.14.639","DOIUrl":null,"url":null,"abstract":"Computing-in-memory (CIM) devices have attracted attention because of their high operation efficiency in edge AI, which requires low power operation. This paper proposed a digital circuit architecture controlling the inference and learning of CIM devices such as the RAND chip, which utilizes the non-linearity of ReRAM as memory elements. The RAND chip is used as the CIM device for inference and as external memory for training. The system performance in the XOR identification test achieves the same convergence as the software implementation of the learning core. The proposed learning core achieved efficiency of 7.77 GOPS/W, thereby verifying the effectiveness of the proposed architecture for on-line CIM device learning.","PeriodicalId":54110,"journal":{"name":"IEICE Nonlinear Theory and Its Applications","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Nonlinear Theory and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/nolta.14.639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Computing-in-memory (CIM) devices have attracted attention because of their high operation efficiency in edge AI, which requires low power operation. This paper proposed a digital circuit architecture controlling the inference and learning of CIM devices such as the RAND chip, which utilizes the non-linearity of ReRAM as memory elements. The RAND chip is used as the CIM device for inference and as external memory for training. The system performance in the XOR identification test achieves the same convergence as the software implementation of the learning core. The proposed learning core achieved efficiency of 7.77 GOPS/W, thereby verifying the effectiveness of the proposed architecture for on-line CIM device learning.