{"title":"On the improved implementations and performance evaluation of digit-by-digit integer restoring and non-restoring cube root algorithms","authors":"Yamin Li, Wanming Chu","doi":"10.1109/CITS.2016.7546386","DOIUrl":null,"url":null,"abstract":"This paper describes digit-by-digit integer restoring and non-restoring algorithms for computing the cube root of a 33-bit radicand and gives improved methods for the implementations of the algorithms to speed up the computations. The methods include calculating multiplication products in advance with additions and using carry save adders (CSAs) to calculate the partial remainder at each iteration. The cost/performance is evaluated by implementing the algorithms on an FPGA chip. The results show that the CSA methods achieve performance speedups of 199.63% and 177.06%, for restoring and non-restoring cube root algorithms, respectively.","PeriodicalId":340958,"journal":{"name":"2016 International Conference on Computer, Information and Telecommunication Systems (CITS)","volume":"197 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Computer, Information and Telecommunication Systems (CITS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CITS.2016.7546386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
This paper describes digit-by-digit integer restoring and non-restoring algorithms for computing the cube root of a 33-bit radicand and gives improved methods for the implementations of the algorithms to speed up the computations. The methods include calculating multiplication products in advance with additions and using carry save adders (CSAs) to calculate the partial remainder at each iteration. The cost/performance is evaluated by implementing the algorithms on an FPGA chip. The results show that the CSA methods achieve performance speedups of 199.63% and 177.06%, for restoring and non-restoring cube root algorithms, respectively.