A. Kyriakos, V. Kitsakis, Alexandros Louropoulos, E. Papatheofanous, I. Patronas, D. Reisis
{"title":"High Performance Accelerator for CNN Applications","authors":"A. Kyriakos, V. Kitsakis, Alexandros Louropoulos, E. Papatheofanous, I. Patronas, D. Reisis","doi":"10.1109/PATMOS.2019.8862166","DOIUrl":null,"url":null,"abstract":"The continuing advancement of the neural networks based techniques led to their exploitation in many applications, such as computer vision and the natural language processing systems where they provide high accuracy results at the cost of their high computational complexity. Hardware implemented AI accelerators provide the needed performance improvement for applications in specific areas, including robotics, autonomous systems and internet of things. The current study presents an FPGA based accelerator for Convolutional Neural Networks (CNN). The CNN model is trained for the MNIST dataset and the VHDL design targets high throughput, low power while using only on chip memory. The architecture uses parallel computations at the convolutional and fully connected layers and it has a highly pipelined output layer. The architecture implementation on a Xilinx Virtex VC707 validates the results.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PATMOS.2019.8862166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
The continuing advancement of the neural networks based techniques led to their exploitation in many applications, such as computer vision and the natural language processing systems where they provide high accuracy results at the cost of their high computational complexity. Hardware implemented AI accelerators provide the needed performance improvement for applications in specific areas, including robotics, autonomous systems and internet of things. The current study presents an FPGA based accelerator for Convolutional Neural Networks (CNN). The CNN model is trained for the MNIST dataset and the VHDL design targets high throughput, low power while using only on chip memory. The architecture uses parallel computations at the convolutional and fully connected layers and it has a highly pipelined output layer. The architecture implementation on a Xilinx Virtex VC707 validates the results.