A New Multiplication Algorithm for Extended Precision Using Floating-Point Expansions

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH) Pub Date : 2016-07-10 DOI:10.1109/ARITH.2016.18

J. Muller, Valentina Popescu, P. T. P. Tang

{"title":"A New Multiplication Algorithm for Extended Precision Using Floating-Point Expansions","authors":"J. Muller, Valentina Popescu, P. T. P. Tang","doi":"10.1109/ARITH.2016.18","DOIUrl":null,"url":null,"abstract":"Some important computational problems must use a floating-point (FP) precision several times higher than the hardware-implemented available one. These computations critically rely on software libraries for high-precision FP arithmetic. The representation of a high-precision data type crucially influences the corresponding arithmetic algorithms. Recent work showed that algorithms for FP expansions, that is, a representation based on unevaluated sum of standard FP types, benefit from various high-performance support for native FP, such as low latency, high throughput, vectorization, threading, etc. Bailey's QD library and its corresponding Graphics Processing Unit (GPU) version, GQD, are such examples. Despite using native FP arithmetic as the key operations, QD and GQD algorithms are focused on double-double or quad-double representations and do not generalize efficiently or naturally to a flexible number of components in the FP expansion. In this paper, we introduce a new multiplication algorithm for FP expansion with flexible precision, up to the order of tens of FP elements in mind. The main feature consists in the partial products being accumulated in a special designed data structure that has the regularity of a fixed-point representation while allowing the computation to be naturally carried out using native FP types. This allows us to easily avoid unnecessary computation and to present rigorous accuracy analysis transparently. The algorithm, its correctness and accuracy proofs and some performance comparisons with existing libraries are all contributions of this paper.","PeriodicalId":145448,"journal":{"name":"2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ARITH.2016.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Some important computational problems must use a floating-point (FP) precision several times higher than the hardware-implemented available one. These computations critically rely on software libraries for high-precision FP arithmetic. The representation of a high-precision data type crucially influences the corresponding arithmetic algorithms. Recent work showed that algorithms for FP expansions, that is, a representation based on unevaluated sum of standard FP types, benefit from various high-performance support for native FP, such as low latency, high throughput, vectorization, threading, etc. Bailey's QD library and its corresponding Graphics Processing Unit (GPU) version, GQD, are such examples. Despite using native FP arithmetic as the key operations, QD and GQD algorithms are focused on double-double or quad-double representations and do not generalize efficiently or naturally to a flexible number of components in the FP expansion. In this paper, we introduce a new multiplication algorithm for FP expansion with flexible precision, up to the order of tens of FP elements in mind. The main feature consists in the partial products being accumulated in a special designed data structure that has the regularity of a fixed-point representation while allowing the computation to be naturally carried out using native FP types. This allows us to easily avoid unnecessary computation and to present rigorous accuracy analysis transparently. The algorithm, its correctness and accuracy proofs and some performance comparisons with existing libraries are all contributions of this paper.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种利用浮点展开扩展精度的新乘法算法

一些重要的计算问题必须使用比硬件实现的精度高几倍的浮点(FP)精度。这些计算严重依赖于高精度FP算法的软件库。高精度数据类型的表示对相应的算法影响很大。最近的研究表明，FP展开算法(即基于标准FP类型的未求值和的表示)受益于对原生FP的各种高性能支持，如低延迟、高吞吐量、向量化、线程化等。Bailey的QD库及其相应的图形处理单元(GPU)版本GQD就是这样的例子。尽管使用原生FP算法作为关键操作，但QD和GQD算法侧重于双双或四双表示，并且不能有效或自然地推广到FP扩展中的灵活数量的组件。本文介绍了一种新的FP展开的乘法算法，其精度可达到几十个FP元素的数量级。其主要特性在于部分积在一个特殊设计的数据结构中，该数据结构具有定点表示的规律性，同时允许使用本机FP类型自然地执行计算。这使我们可以轻松地避免不必要的计算，并透明地呈现严格的精度分析。该算法的正确性和准确性证明以及与现有库的性能比较都是本文的贡献。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)

自引率

0.00%

发文量

期刊最新文献

Accuracy and Performance Trade-Offs of Logarithmic Number Units in Multi-Core Clusters Recovering Numerical Reproducibility in Hydrodynamic Simulations Multi-fault Attack Detection for RNS Cryptographic Architecture Accelerating Big Integer Arithmetic Using Intel IFMA Extensions A CRC-Based Concurrent Fault Detection Architecture for Galois/Counter Mode (GCM)