{"title":"Asymmetric large size multiplication using embedded blocks with efficient compression technique in FPGAs","authors":"Shuli Gao, D. Al-Khalili, N. Chabini","doi":"10.1109/ICECS.2011.6122233","DOIUrl":null,"url":null,"abstract":"In this paper, asymmetric non-pipelined large size unsigned multipliers are implemented using symmetric and asymmetric embedded multipliers in FPGAs. The decomposition of the operands used in this approach is based on the sizes of the embedded blocks. The partial products of the segmented operands are organized in rows, and the additions of the products are performed through optimized compression dictated by the architecture of the CLB. The optimization algorithm has led to the minimization of the total critical path delay with reduced utilization of FPGA embedded blocks. The multipliers are implemented in Xilinx' FPGAs using 18×18-bit and 25×18-bit embedded signed multipliers. The implementation results have demonstrated the effectiveness of the proposed approach achieving speed improvement of 29% and reduction of embedded block utilization of 28% for operand size of up to 192 bits compared to the Standard scheme.","PeriodicalId":251525,"journal":{"name":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","volume":"1609 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 18th IEEE International Conference on Electronics, Circuits, and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECS.2011.6122233","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, asymmetric non-pipelined large size unsigned multipliers are implemented using symmetric and asymmetric embedded multipliers in FPGAs. The decomposition of the operands used in this approach is based on the sizes of the embedded blocks. The partial products of the segmented operands are organized in rows, and the additions of the products are performed through optimized compression dictated by the architecture of the CLB. The optimization algorithm has led to the minimization of the total critical path delay with reduced utilization of FPGA embedded blocks. The multipliers are implemented in Xilinx' FPGAs using 18×18-bit and 25×18-bit embedded signed multipliers. The implementation results have demonstrated the effectiveness of the proposed approach achieving speed improvement of 29% and reduction of embedded block utilization of 28% for operand size of up to 192 bits compared to the Standard scheme.