1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)最新文献

英文中文

On error analysis in arithmetic with varying relative precision 变相对精度算法的误差分析

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 2018-02-08 DOI: 10.1109/ARITH.1987.6158694

J. Demmel

Recently Clenshaw/Olver and Iri/Matsui proposed new floating point arithmetics which seek to eliminate overflows and underflows from most computations. Their common approach is to redistribute the available numbers to spread out the largest and smallest numbers much more thinly than in standard floating point, thus achieving a larger range at the cost of lower precision at the ends of the range. The goal of these arithmetics is to eliminate much of the effort needed to write code which is reliable despite over/under flow. In this paper we argue that for many codes this eliminated effort will reappear in the error analyses needed to ascertain or guarantee the accuracy of the computed solution. Thus reliability with respect to over/under flow has been traded for reliability with respect to roundoff. We also propose a hardware flag, analogous to the “sticky flags” of the IEEE binary floating point standard, to do some of this extra error analysis automatically.

最近，Clenshaw/Olver和Iri/Matsui提出了新的浮点算法，旨在消除大多数计算中的溢出和下溢。它们的常用方法是重新分配可用数，使最大和最小的数比标准浮点数更稀疏地展开，从而以范围末端较低的精度为代价获得更大的范围。这些算法的目标是消除编写可靠的代码所需的大部分工作，尽管流上/流下。在本文中，我们认为对于许多代码，这种消除的努力将在确定或保证计算解的准确性所需的错误分析中重新出现。因此，关于过流/下流的可靠性已经被交换为关于舍入的可靠性。我们还提出了一个硬件标志，类似于IEEE二进制浮点标准的“粘性标志”，可以自动进行一些额外的错误分析。

引用次数: 10

Systolic solution of linear systems over GF(p) with partial pivoting GF(p)上具有部分旋转的线性系统的收缩解

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158700

B. Hochet, P. Quinton, Y. Robert

We propose two systolic architectures for the Gaussian triangularization and the Gauss-Jordan diagonalization of large dense nxn matrices over GF(p), where p is a prime number. The solution of large dense linear systems over GF(p) is the major computational step in various algorithms issued from arithmetic number theory and computer algebra. The two proposed architectures implement the elimination with partial pivoting, although the operation of the array remains purely systolic. The last section is devoted to the design and layout of a CMOS 8 by 8 Gauss-Jordan diagonalization systolic chip over GF(2).

本文提出了GF(p)上的大密度nxn矩阵的高斯三角化和高斯-乔丹对角化的两种收缩结构，其中p是素数。GF(p)上的大型密集线性系统的解是算术数论和计算机代数中各种算法的主要计算步骤。尽管阵列的操作仍然是纯粹的收缩，但这两种提出的架构实现了部分枢轴的消除。最后一节致力于在GF(2)上设计和布局CMOS 8 × 8高斯-乔丹对角化收缩芯片。

引用次数: 7

A design of time-optimum and register-number-minimum systolic convolver 一种时间最优和寄存器数最小的收缩卷积器设计

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158720

H. Umeo

We present an optimum bit-parallel/word-sequential systolic convolver. Our design is the best one among the previous many convolvers in the sense that its optimality in time and space performances is simultaneously attained without augmenting any global control, broadcasting, preloading, and/or multi sequential or parallel I/O ports, which were allowed in most of the previous designs. As an application of our convolver we give a systolic polynomial divider which can compute the polynomial division in exactly n + 0(1) steps on [min (n−m, m)/2] + 0(1) systolic cells, for the division of any degree n polynomial by any degree m polynomial(n ≧ m).

我们提出了一个最优的位并行/字序收缩卷积器。我们的设计在之前的许多卷积器中是最好的，因为它同时达到了时间和空间性能的最优性，而不需要增加任何全局控制、广播、预加载和/或多个顺序或并行I/O端口，而这些在之前的大多数设计中是允许的。对于任意n次多项式除以任意m次多项式(n≧m)的除法，我们给出了一个收缩多项式除法，它可以在[min (n−m, m)/2] + 0(1)个收缩细胞上精确地n + 0(1)步内计算出多项式的除法。

引用次数: 0

A bit-serial arithmetic unit for rational arithmetic 有理数算术的位串行算术单元

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158705

Peter Kornerup, D. Matula

We describe a binary implementation of an algorithm of Gosper to compute the sum, difference, product, quotient and certain rational functions of two rational operands applicable to integrated approximate and exact rational computation. The arithmetic unit we propose is an eight register computation cell with bit serial input and output employing the binary lexicographic continued fraction (LCF) representation of the rational operands. The operands and results are processed in a most-significant-bit first on-line fashion with bit level logic leading to less delay in the computation cell when compared to operation on the full partial quotients of the standard continued fraction representation. Minimization of delay is investigated with the aim of supporting greater throughput in cascaded parallel computation with such computation cells.

描述了一种计算两个有理数的和、差、积、商和某些有理数函数的算法的二进制实现，适用于积分近似和精确有理数计算。我们提出的算术单元是一个8寄存器的计算单元，具有位串行输入和输出，采用二进制字典连续分数(LCF)表示有理数。操作数和结果以最有效位优先在线方式处理，与对标准连分数表示的全部分商的操作相比，位级逻辑导致计算单元中的延迟更小。研究了延迟最小化的问题，目的是在级联并行计算中支持更大的吞吐量。

引用次数: 4

The FELIN arithmetic coprocessor chip FELIN算术协处理器芯片

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158691

M. Cosnard, A. Guyot, B. Hochet, J. Muller, H. Ouaouicha, P. Paul, E. Zysman

We describe a general VLSI architecture for the computation of arithmetic expressions including floating-point trancendental functions. This architecture is divided in three parts: a communication machine, the control part of a computation machine and the operative part of this computation machine. In order to compute the most usual trancendental functions, we introduced some general algorithms, presented briefly here, including as a particular case the CORDIC scheme. Our major architecture goals were regularity, parametrization and automatic design. The final chip is designed in a 2-Alu CMOS technology, and its name is FELIN (“Fonctions ELémentaires INtégrées is the french for integrated elementary functions”). This work was supported in part by the GRECO C3 and the GCIS of the French CNRS.

我们描述了一种通用的VLSI架构，用于计算包括浮点超越函数在内的算术表达式。该体系结构分为三部分:通信机、计算机的控制部分和计算机的操作部分。为了计算最常见的超越函数，我们介绍了一些通用算法，在这里简要介绍，其中包括CORDIC方案。我们的主要架构目标是规则化、参数化和自动化设计。最后的芯片采用2-Alu CMOS技术设计，它的名字是FELIN(“functions elsammentaires intsamgrimes是法语的integrated elementary functions”)。这项工作得到了GRECO C3和法国CNRS GCIS的部分支持。

引用次数: 11

Synthesis of area-efficient VLSI architectures for vector and matrix multiplication 用于矢量和矩阵乘法的面积高效VLSI架构的合成

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158718

Steven G. Smith, P. Denyer

A methodology is presented for synthesis of area-efficient, high-performance VLSI modules for vector and matrix multiplication. Three fundamental computational elements are employed in the composition of these architectures: memory register, multiplexer (1-from-2 data selecter), and carry-save add-shift computer. Two's complement serial/parallel carry-save accumulation provides performance, while the use of symmetric-coded distributed arithmetic eliminates redundant computation to effect area-savings.

提出了一种用于矢量和矩阵乘法的面积高效、高性能VLSI模块的合成方法。在这些体系结构的组成中使用了三个基本的计算元素:存储器寄存器、多路复用器(1-from-2数据选择器)和进位保存加移位计算机。两个互补的串行/并行进位节省积累提供了性能，而使用对称编码的分布式算法消除了冗余计算，以达到节省面积的效果。

引用次数: 1

Fast multiply and divide for a VLSI floating-point unit VLSI浮点单元的快速乘法和除法

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158684

B. K. Bose, Li-fan Pei, G. Taylor, D. Patterson

This paper presents the design of a fast and area-efficient multiply-divide unit used in building a VLSI floating-point processor (FPU), conforming to the IEEE standard 754. Details of the algorithms, implementation techniques and design tradeoffs are presented, The multiplier and divider are implemented in 2 micron CMOS technology with two layers of metal, and occupy 23 square mm (23% of the entire FPU). We expect to perform extended-precision multiplication and division in 1.1 and 2.8 microseconds, respectively.

本文设计了一种用于构建符合IEEE标准754的VLSI浮点处理器(FPU)的快速、面积高效的乘除单元。该乘法器和除法器采用两层金属的2微米CMOS技术实现，占地23平方毫米(占整个FPU的23%)。我们期望分别在1.1微秒和2.8微秒内执行扩展精度的乘法和除法。

引用次数: 15

Error detection and correction for addition and subtraction, through use of higher radix extensions of hamming codes 错误检测和修正的加法和减法，通过使用更高的基数扩展的汉明码

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158714

J. E. Robertson

The properties of Hamming codes for error detection and correction can be extended from the binary parity check to addition, modulo 2r. Malfunctions in hardware during addition, modulo 2r, can be detected and corrected. Since carry-save and signed-digit addition, radix r, are included in addition, modulo 2r, this extension of Hamming codes makes possible new techniques for detection and correction of hardware malfunctions during signed-digit and carry-save addition.

汉明码的错误检测和纠错特性可以从二进制奇偶校验扩展到加法、模2r。在加法期间，模2r的硬件故障，可以检测和纠正。由于进位保存和带符号数字加法，基数r，包括加法，模2r，这种汉明码的扩展使得在有符号数字和带符号数字保存加法期间检测和纠正硬件故障的新技术成为可能。

引用次数: 1

A radix-4 on-line division algorithm 一种基数-4在线除法算法

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158719

P. Tu, M. Ercegovac

We present an on-line algorithm for radix-4 floating point division. The divisor is first transformed in to a range such that the quotient digits are computed as a function of the scaled partial remainder only.

提出了一种4基浮点除法的在线算法。除数首先被转换成一个范围，使得商数字仅作为缩放后的部分余数的函数来计算。

引用次数: 19

On the implementation of shifters, multipliers, and dividers in VLSI floating point units VLSI浮点单元中移位器、乘法器和除法器的实现

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

Pub Date : 1987-05-18 DOI: 10.1109/ARITH.1987.6158711

V. Peng, S. Samudrala, M. Gavrielov

Several options for the implementation of combinatorial shifters, multipliers, and dividers for a VLSI floating point unit are presented and compared. The comparisons are made in the context of a single chip implementation in light of the constraints imposed by currently available MOS technology.

对VLSI浮点单元的组合移位器、乘法器和除法器的几种实现方案进行了介绍和比较。比较是在单芯片实现的背景下进行的，考虑到目前可用的MOS技术所施加的限制。

引用次数: 14

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

1987 IEEE 8th Symposium on Computer Arithmetic (ARITH)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀