An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks

Zhijian Lin, Meng Zhang, Dongpeng Weng, Fei Liu
{"title":"An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks","authors":"Zhijian Lin, Meng Zhang, Dongpeng Weng, Fei Liu","doi":"10.1109/iccss55260.2022.9802420","DOIUrl":null,"url":null,"abstract":"In recent years, the current trend of Convolutional Neural Networks (CNNs) is toward lower computational cost to achieve lightweight. In lightweight convolutional neural networks, the depthwise separable convolution (DSC) is becoming the mainstream method. But in DSC, the pointwise convolution (PWC) with $1\\times 1$ filters still has abundant parameters and computation. In this paper, an more efficient convolution algorithm is proposed to replace PWC, named kernel shared group convolution (KSGC). KSGC is used to combine channel information, which can be seen as the same convolution kernel sliding on the channel. In addition, Winograd algorithm is used to mitigate the number of multiplications required by KSGC in this paper. A CNN accelerator using a novel processing element (PE) performs 1-D Winograd in KSGC was implemented on a Ultra96-V2 field-programmable gate array (FPGA).At 200MHz clock frequency, the accelerator achieved computational performance of 52. 7GOPS and performance-power ratio of 10.42GOPS/W.","PeriodicalId":254992,"journal":{"name":"2022 5th International Conference on Circuits, Systems and Simulation (ICCSS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Circuits, Systems and Simulation (ICCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iccss55260.2022.9802420","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In recent years, the current trend of Convolutional Neural Networks (CNNs) is toward lower computational cost to achieve lightweight. In lightweight convolutional neural networks, the depthwise separable convolution (DSC) is becoming the mainstream method. But in DSC, the pointwise convolution (PWC) with $1\times 1$ filters still has abundant parameters and computation. In this paper, an more efficient convolution algorithm is proposed to replace PWC, named kernel shared group convolution (KSGC). KSGC is used to combine channel information, which can be seen as the same convolution kernel sliding on the channel. In addition, Winograd algorithm is used to mitigate the number of multiplications required by KSGC in this paper. A CNN accelerator using a novel processing element (PE) performs 1-D Winograd in KSGC was implemented on a Ultra96-V2 field-programmable gate array (FPGA).At 200MHz clock frequency, the accelerator achieved computational performance of 52. 7GOPS and performance-power ratio of 10.42GOPS/W.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Winograd的新型卷积神经网络加速器
近年来,卷积神经网络(cnn)的发展趋势是降低计算成本,实现轻量化。在轻量级卷积神经网络中,深度可分离卷积(DSC)正逐渐成为主流方法。但在DSC中,1 × 1滤波器的点向卷积(PWC)仍然具有丰富的参数和计算量。本文提出了一种更有效的卷积算法来代替PWC,称为核共享群卷积(kernel shared group convolution, KSGC)。KSGC用于合并信道信息,这可以看作是在信道上滑动的相同卷积核。此外,本文还使用Winograd算法来减少KSGC所需的乘法次数。在Ultra96-V2现场可编程门阵列(FPGA)上实现了一种使用新型处理元件(PE)在KSGC中执行一维Winograd的CNN加速器。在200MHz时钟频率下,加速器的计算性能达到52。7GOPS,性能功率比10.42GOPS/W。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Coordination and Optimization of Virtual Power Plant Based on Multi-agent System Thermal-Aware IC Chip Design by Combining High Thermal Conductivity Materials and GAA MOSFET A Novel Compact LC-Based Balun Combiner with 2nd and 3rd Harmonic Suppression A High Linearity and Low Load Regulation LDO with SATEC and TIR Compensation Design and Implementation of Intelligent-pharmaceutical-delivery-system Based on Loongson 1B
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1