AICNN: Implementing Typical CNN Algorithms with Analog-to-Information Conversion Architecture

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) Pub Date : 2017-07-03 DOI:10.1109/ISVLSI.2017.23

Kaige Jia, Zheyu Liu, F. Qiao, Xinjun Liu, Qi Wei, Huazhong Yang

{"title":"AICNN: Implementing Typical CNN Algorithms with Analog-to-Information Conversion Architecture","authors":"Kaige Jia, Zheyu Liu, F. Qiao, Xinjun Liu, Qi Wei, Huazhong Yang","doi":"10.1109/ISVLSI.2017.23","DOIUrl":null,"url":null,"abstract":"AICNN architecture is presented in this work to map the state-of-the-art machine-learning algorithms of CNN to power-constrained embedded hardware. As the combination of analog-to-information conversion and typical CNN algorithms, AICNN can realize ultra-highly efficient computation by using massive parallel analog signal processing circuits, which could also significantly reduce ADC devices cost of converting sensors' outputs. As a design example, the specific AICNN-3 implementation is evaluated, which realize the minimum system of typical CNN task using AICNN architecture, with SMIC 0.18 µm CMOS process. Simulation results show that the AICNN-3 can classify a 28x28 MNIST image with only 1.47nJ. Compared with baseline implementation on CPU, the AICNN-3 could achieve 67000x energy-efficiency improvement, however the accuracy loss is less than 1%. Moreover, the influences of devices mismatch and process variations are evaluated using Monte Carlo statistical method, for the imperfection of analog processing paradigm, as well as the scalability of AICNN architecture is also discussed.","PeriodicalId":187936,"journal":{"name":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISVLSI.2017.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

AICNN architecture is presented in this work to map the state-of-the-art machine-learning algorithms of CNN to power-constrained embedded hardware. As the combination of analog-to-information conversion and typical CNN algorithms, AICNN can realize ultra-highly efficient computation by using massive parallel analog signal processing circuits, which could also significantly reduce ADC devices cost of converting sensors' outputs. As a design example, the specific AICNN-3 implementation is evaluated, which realize the minimum system of typical CNN task using AICNN architecture, with SMIC 0.18 µm CMOS process. Simulation results show that the AICNN-3 can classify a 28x28 MNIST image with only 1.47nJ. Compared with baseline implementation on CPU, the AICNN-3 could achieve 67000x energy-efficiency improvement, however the accuracy loss is less than 1%. Moreover, the influences of devices mismatch and process variations are evaluated using Monte Carlo statistical method, for the imperfection of analog processing paradigm, as well as the scalability of AICNN architecture is also discussed.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AICNN:用模拟-信息转换架构实现典型的CNN算法

本文提出了AICNN架构，将CNN最先进的机器学习算法映射到功率受限的嵌入式硬件。AICNN是模拟-信息转换与典型CNN算法的结合，通过使用大量并行模拟信号处理电路，可以实现超高效率的计算，也可以显著降低ADC器件转换传感器输出的成本。以AICNN-3为设计实例，采用中芯国际0.18µm CMOS工艺，采用AICNN架构实现了典型CNN任务的最小系统。仿真结果表明，AICNN-3对28x28 MNIST图像的分类精度仅为1.47nJ。与CPU上的基准实现相比，AICNN-3可以实现67000x的能效提升，但精度损失小于1%。此外，针对模拟处理范式的不完善性，利用蒙特卡罗统计方法评估了器件失配和工艺变化对AICNN结构的影响，并讨论了AICNN结构的可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

自引率

0.00%

发文量