{"title":"A CNN/FPGA Co-Exploration Method Based on Hourglass Network","authors":"Zhong Qiu, Jianli Li, Dongbao Liang, Tao Su","doi":"10.1109/ICET51757.2021.9450895","DOIUrl":null,"url":null,"abstract":"Field Programming Gate Array (FPGA) has become a popular choice for neural network inference for its great potential of parallel computing and abundant on-chip buffer resource. Both software and hardware researchers are trying to explore the design space to obtain the best network model and the best accelerator design. Different from the existing sequential exploration method, we propose a CNN/FPGA co-exploration method to search for the best network model and hardware design simultaneously. We applied our method to Hourglass Network, a widely used model for pose estimation, and the top 5 experimental results show that our method can achieve an average of 22.44% faster in computation, 16.48% faster in memory access, and 17.81% faster in total, with a minor accuracy loss of 0.75%. The whole search and training process only cost 75 GPU hours.","PeriodicalId":316980,"journal":{"name":"2021 IEEE 4th International Conference on Electronics Technology (ICET)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Electronics Technology (ICET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICET51757.2021.9450895","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Field Programming Gate Array (FPGA) has become a popular choice for neural network inference for its great potential of parallel computing and abundant on-chip buffer resource. Both software and hardware researchers are trying to explore the design space to obtain the best network model and the best accelerator design. Different from the existing sequential exploration method, we propose a CNN/FPGA co-exploration method to search for the best network model and hardware design simultaneously. We applied our method to Hourglass Network, a widely used model for pose estimation, and the top 5 experimental results show that our method can achieve an average of 22.44% faster in computation, 16.48% faster in memory access, and 17.81% faster in total, with a minor accuracy loss of 0.75%. The whole search and training process only cost 75 GPU hours.