Yalong Pang, Jun Han, Jianmin Zeng, Yujie Huang, Xiaoyang Zeng
{"title":"Instruction set extension and hardware acceleration for SVM application toward a vector processor","authors":"Yalong Pang, Jun Han, Jianmin Zeng, Yujie Huang, Xiaoyang Zeng","doi":"10.1109/ISOCC.2017.8368818","DOIUrl":null,"url":null,"abstract":"This paper presents instruction set extension and hardware acceleration for SVM application toward a vector processor. Based on the nyuzi processor, we customize the corresponding hardware acceleration unit, namely, kernel function processing unit (KPU), both supporting the linear kernel function and radial basis function (RBF) kernel. This work we utilize the mask vector to realize the exponential computation, and the total RBF kernel is completed with only approximately 35 basic instructions. The design is synthesized with SMIC 65nm CMOS technology, requiring 887 equivalent kGates and the max frequency is 540MHz. The simulation results show that with KPU the cycles of SVM training is obviously decreased and speedup is 2.62.","PeriodicalId":248826,"journal":{"name":"2017 International SoC Design Conference (ISOCC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International SoC Design Conference (ISOCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISOCC.2017.8368818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents instruction set extension and hardware acceleration for SVM application toward a vector processor. Based on the nyuzi processor, we customize the corresponding hardware acceleration unit, namely, kernel function processing unit (KPU), both supporting the linear kernel function and radial basis function (RBF) kernel. This work we utilize the mask vector to realize the exponential computation, and the total RBF kernel is completed with only approximately 35 basic instructions. The design is synthesized with SMIC 65nm CMOS technology, requiring 887 equivalent kGates and the max frequency is 540MHz. The simulation results show that with KPU the cycles of SVM training is obviously decreased and speedup is 2.62.