{"title":"基于FPGA的残数系统MAC单元快速实现","authors":"Bhavik Mohindroo, Atharv Paliwal, Kriti Suneja","doi":"10.1109/incet49848.2020.9154105","DOIUrl":null,"url":null,"abstract":"In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.","PeriodicalId":174411,"journal":{"name":"2020 International Conference for Emerging Technology (INCET)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"FPGA based Faster Implementation of MAC Unit in Residual Number System\",\"authors\":\"Bhavik Mohindroo, Atharv Paliwal, Kriti Suneja\",\"doi\":\"10.1109/incet49848.2020.9154105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.\",\"PeriodicalId\":174411,\"journal\":{\"name\":\"2020 International Conference for Emerging Technology (INCET)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/incet49848.2020.9154105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/incet49848.2020.9154105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FPGA based Faster Implementation of MAC Unit in Residual Number System
In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.