基于FPGA的残数系统MAC单元快速实现

2020 International Conference for Emerging Technology (INCET) Pub Date : 2020-06-01 DOI:10.1109/incet49848.2020.9154105

Bhavik Mohindroo, Atharv Paliwal, Kriti Suneja

{"title":"基于FPGA的残数系统MAC单元快速实现","authors":"Bhavik Mohindroo, Atharv Paliwal, Kriti Suneja","doi":"10.1109/incet49848.2020.9154105","DOIUrl":null,"url":null,"abstract":"In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.","PeriodicalId":174411,"journal":{"name":"2020 International Conference for Emerging Technology (INCET)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"FPGA based Faster Implementation of MAC Unit in Residual Number System\",\"authors\":\"Bhavik Mohindroo, Atharv Paliwal, Kriti Suneja\",\"doi\":\"10.1109/incet49848.2020.9154105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.\",\"PeriodicalId\":174411,\"journal\":{\"name\":\"2020 International Conference for Emerging Technology (INCET)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/incet49848.2020.9154105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/incet49848.2020.9154105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在这个快速发展的世界里，每个人都很匆忙，即使在电子世界里，速度也成为了一个关键因素。这不是答案，但一个快速的答案是时间的需要。人工智能几乎触及了我们生活的方方面面。但是，机器学习算法的软件实现还不能满足在纳秒内解决问题的期望，特别是在涉及大量计算和算术计算的神经网络时。乘法累加单元是许多信号处理和机器学习算法的组成部分。本文实现了一个基于残数系统的分布式算法的算法模块，以证明硬件设计比软件实现更有效。目的是利用fpga的并行性来加速计算。使用的目标设备是Xilinx公司Virtex- 6系列的xc6vlx75t3ff484。在MATLAB和ModelSim中使用相关数据进行了仿真测试，以证明其可行性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

FPGA based Faster Implementation of MAC Unit in Residual Number System

In this fast-growing world, where everyone is in hurry, speed has become a critical factor even in the electronics world. It’s not the answer but a fast answer is the need of the time. Artificial intelligence has touched almost all aspects of our lives. But the software implementation of machine learning algorithms has not been able to meet the expectations of solutions in nanoseconds, especially where neural networks with extensive calculations and arithmetic computations are involved. Multiplication and accumulation unit is an integral part of many signal processing and machine learning algorithms. Here we implement an arithmetic module based on distributed arithmetic imbibed with Residual Number System to prove the efficacy of hardware design over software implementation. The purpose is to exploit the parallelism property of FPGAs to accelerate the computations. The target device used is xc6vlx75t3ff484 from Virtex- 6 family in Xilinx. Simulations are tested in MATLAB and ModelSim with associated data to justify its feasibility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Conference for Emerging Technology (INCET)

自引率

0.00%

发文量