Sing-Yu Pan, Shuenn-Yuh Lee, Yi-Wen Hung, Chou-Ching K. Lin, G. Shieh
{"title":"基于RISC-V内核的可编程CNN加速器在实时可穿戴应用中的应用","authors":"Sing-Yu Pan, Shuenn-Yuh Lee, Yi-Wen Hung, Chou-Ching K. Lin, G. Shieh","doi":"10.1109/RASSE54974.2022.9989732","DOIUrl":null,"url":null,"abstract":"This paper has proposed an epilepsy detection algorithm to identify the seizure attack. The algorithm includes a simplified signal preprocessing process and an 8 layers Convolution Neural Network (CNN). This paper has also proposed an architecture, including a CNN accelerator and a 2-stage reduced instruction set computer-V (RISC-V) CPU, to implement the detection algorithm in real-time. The accelerator is implemented in SystemVerilog and validated on the Xilinx PYNQ-Z2. The implementation consumes 3411 LUTs, 2262 flip-flops, 84 KB block random access memory (BRAM), and only 6 DSPs. The total power consumption is 0.118 W in 10-MHz operation frequency. The detection algorithm provides 99.16% accuracy on fixed-point operations with detection latency of 0.137 ms/class. Moreover, the CNN accelerator has the programable ability, so the accelerator can execute different CNN models to fit various wearable applications for different biomedical acquisition systems.","PeriodicalId":382440,"journal":{"name":"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Programmable CNN Accelerator with RISC-V Core in Real-Time Wearable Application\",\"authors\":\"Sing-Yu Pan, Shuenn-Yuh Lee, Yi-Wen Hung, Chou-Ching K. Lin, G. Shieh\",\"doi\":\"10.1109/RASSE54974.2022.9989732\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper has proposed an epilepsy detection algorithm to identify the seizure attack. The algorithm includes a simplified signal preprocessing process and an 8 layers Convolution Neural Network (CNN). This paper has also proposed an architecture, including a CNN accelerator and a 2-stage reduced instruction set computer-V (RISC-V) CPU, to implement the detection algorithm in real-time. The accelerator is implemented in SystemVerilog and validated on the Xilinx PYNQ-Z2. The implementation consumes 3411 LUTs, 2262 flip-flops, 84 KB block random access memory (BRAM), and only 6 DSPs. The total power consumption is 0.118 W in 10-MHz operation frequency. The detection algorithm provides 99.16% accuracy on fixed-point operations with detection latency of 0.137 ms/class. Moreover, the CNN accelerator has the programable ability, so the accelerator can execute different CNN models to fit various wearable applications for different biomedical acquisition systems.\",\"PeriodicalId\":382440,\"journal\":{\"name\":\"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RASSE54974.2022.9989732\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RASSE54974.2022.9989732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种癫痫检测算法来识别癫痫发作。该算法包括一个简化的信号预处理过程和一个8层卷积神经网络(CNN)。本文还提出了一种包括CNN加速器和二级精简指令集计算机- v (RISC-V) CPU的架构,以实现实时检测算法。该加速器在SystemVerilog中实现,并在Xilinx PYNQ-Z2上进行了验证。该实现消耗3411个lut、2262个触发器、84 KB块随机存取存储器(BRAM)和仅6个dsp。在10mhz工作频率下,总功耗为0.118 W。该算法对定点运算的检测准确率为99.16%,检测延迟为0.137 ms/class。此外,CNN加速器具有可编程能力,因此加速器可以执行不同的CNN模型,以适应不同生物医学采集系统的各种可穿戴应用。
A Programmable CNN Accelerator with RISC-V Core in Real-Time Wearable Application
This paper has proposed an epilepsy detection algorithm to identify the seizure attack. The algorithm includes a simplified signal preprocessing process and an 8 layers Convolution Neural Network (CNN). This paper has also proposed an architecture, including a CNN accelerator and a 2-stage reduced instruction set computer-V (RISC-V) CPU, to implement the detection algorithm in real-time. The accelerator is implemented in SystemVerilog and validated on the Xilinx PYNQ-Z2. The implementation consumes 3411 LUTs, 2262 flip-flops, 84 KB block random access memory (BRAM), and only 6 DSPs. The total power consumption is 0.118 W in 10-MHz operation frequency. The detection algorithm provides 99.16% accuracy on fixed-point operations with detection latency of 0.137 ms/class. Moreover, the CNN accelerator has the programable ability, so the accelerator can execute different CNN models to fit various wearable applications for different biomedical acquisition systems.