{"title":"Pain versus Gain in the Hardware Design of FPUs and Supercomputers","authors":"Roger A. Golliver, S. M. Müller, S. Oberman, M. Schmookler, Debjit Das Sarma, A. Beaumont-Smith","doi":"10.1109/ARITH.2005.33","DOIUrl":"https://doi.org/10.1109/ARITH.2005.33","url":null,"abstract":"In 1990 there was a dramatic change in the overall design of floating-point units (FPUs) with the introduction of the fused multiply-add dataflow. This design is common today due to its performance advantage over separate units. Recently the constraining parameters have been changing for sub 10 micron technologies, and the resulting designs focus on increasing frequency at the cost of pipeline depth. Wire lengths are a crucial design parameter, and a great deal of effort is spent in floorplanning the execution elements to be very close together. It is now typical that a signal sent across an FPU takes one or more clock cycles. Thus the physical design is very important and requires global optimization of macro placement as well as complex power reduction. Additionally, technology scaling continues to decrease feature sizes, so more execution units, or even processor cores, can be placed on a chip. Execution units such as decimal FPUs are in product plans. There are single-chip designs with 8 vector processing units which are used to accelerate the video games we play. The processing power of these single-chip game processors rivals that of supercomputers. What is the next trendsetting design or key problem in computer arithmetic? We have asked a panel of expert arithmetic unit hardware designers to discuss the current pain-versus-gain tradeoffs and to speculate on the future of arithmetic design. Panelists:","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"119 1","pages":"39"},"PeriodicalIF":0.0,"publicationDate":"2005-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80393076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Computer Arithmetic - A Programmer's Perspective","authors":"R. Brent","doi":"10.1109/ARITH.1999.10005","DOIUrl":"https://doi.org/10.1109/ARITH.1999.10005","url":null,"abstract":"Advances in computer hardware often have little impact until they become accessible to programmers using high-level languages. For example, the IEEE floating-point arithmetic standard provides various rounding modes and exceptions, but it is difficult or impossible to take advantage of these from most high-level languages, so the full capabilities of IEEE-compatible hardware are seldom used. When they are used by writing in machine or assembly language, there is a high cost in program development and testing time, lack of portability, and difficulty of software maintenance. In this talk we discuss several areas in which computer hardware, especially arithmetic hardware, can or should significantly influence programming language design. These include: vector units, floating-point exception handling, floating-point rounding modes, high/extended precision registers/arithmetic, and use of unusual number systems. Relevant application areas include interval arithmetic, high-precision integer arithmetic for computer algebra and cryptography, and testing of hardware by comparison with software simulations.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"23 1","pages":"2-"},"PeriodicalIF":0.0,"publicationDate":"1999-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84440949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
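Brent's point about language-level access to rounding modes can be made concrete in Python: its binary floats expose no rounding-mode control, but the standard-library decimal module does. The enclosure idiom below is an illustrative sketch of what becomes possible once directed rounding is reachable from a high-level language; it is not taken from the talk.

```python
# Illustrative sketch, not from the talk: Python's binary floats offer no
# rounding-mode control, but the stdlib decimal module does -- the kind of
# high-level access to rounding modes that Brent argues for.
from decimal import Decimal, localcontext, ROUND_CEILING, ROUND_FLOOR

def enclose_div(a, b, prec=6):
    """Bound a/b from below and above using directed rounding."""
    with localcontext() as ctx:
        ctx.prec = prec
        ctx.rounding = ROUND_FLOOR
        lo = Decimal(a) / Decimal(b)
        ctx.rounding = ROUND_CEILING
        hi = Decimal(a) / Decimal(b)
    return lo, hi

lo, hi = enclose_div(1, 3)
# The true value 1/3 is enclosed: lo = 0.333333, hi = 0.333334
```

This two-sided enclosure is the basic building block of interval arithmetic, one of the application areas the talk lists.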
{"title":"Fast hardware units for the computation of accurate dot products","authors":"A. Knöfel","doi":"10.1109/ARITH.1991.145536","DOIUrl":"https://doi.org/10.1109/ARITH.1991.145536","url":null,"abstract":"Matrix and vector operations based on dot product expressions occur in almost all scientific and engineering applications. The failure of popular programming languages and computer architectures to provide operators for these data types, together with the lack of corresponding accurate hardware instructions, has forced users to emulate vector operations with loops of scalar floating point instructions. Cancellation and immediate rounding in these loops cause uncertain and inaccurate numerical results and complicate error analysis.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"2 1","pages":"70-74"},"PeriodicalIF":0.0,"publicationDate":"1991-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85300733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
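The loop-of-scalar-instructions problem the abstract describes is easy to reproduce. A minimal Python sketch (not Knöfel's hardware unit): the naive loop rounds after every addition and loses a term to cancellation, while math.fsum stands in for an accurate accumulator that rounds only once, at the end.

```python
# Sketch of the problem, not of Knöfel's hardware: emulating a dot
# product with a loop of scalar operations rounds after every addition.
import math

def dot_naive(xs, ys):
    s = 0.0
    for x, y in zip(xs, ys):
        s += x * y              # intermediate rounding at every step
    return s

xs = [1e16, 1.0, -1e16]
ys = [1.0, 1.0, 1.0]

naive = dot_naive(xs, ys)       # the 1.0 is absorbed: 1e16 + 1.0 == 1e16
exact = math.fsum(x * y for x, y in zip(xs, ys))
# naive == 0.0, exact == 1.0: the exact accumulator rounds only once
```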
{"title":"Systolic arrays for integer Chinese remaindering","authors":"Ç. Koç, P. Cappello","doi":"10.1109/ARITH.1989.72829","DOIUrl":"https://doi.org/10.1109/ARITH.1989.72829","url":null,"abstract":"The authors present several time-optimal and space-time-optimal systolic arrays for computing a process dependence graph corresponding to the mixed-radix conversion algorithm. The arrays are particularly suitable for software implementations of algorithms from the applications of residue number systems on a programmable systolic/wavefront array. Examples of such applications are the exact solution of linear systems and matrix problems over integral domains. The authors also describe a decomposition strategy for treating a mixed-radix conversion problem whose size exceeds the array size.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"1 1","pages":"216-223"},"PeriodicalIF":0.0,"publicationDate":"1989-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74979429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
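For readers unfamiliar with the underlying algorithm, a sequential Python sketch of mixed-radix conversion (the computation the systolic arrays parallelize) may help. Function names are illustrative, not from the paper.

```python
# Sequential sketch of mixed-radix conversion, the computation the
# paper's systolic arrays perform. Given residues x_i = X mod m_i for
# pairwise coprime moduli, recover digits a_i with
#   X = a_0 + a_1*m_0 + a_2*m_0*m_1 + ...
def mixed_radix(residues, moduli):
    digits = []
    for i, (x, m) in enumerate(zip(residues, moduli)):
        for j in range(i):
            # peel off digit j, then divide by m_j modulo m
            x = (x - digits[j]) * pow(moduli[j], -1, m) % m
        digits.append(x)
    return digits

def from_mixed_radix(digits, moduli):
    value, weight = 0, 1
    for d, m in zip(digits, moduli):
        value += d * weight
        weight *= m
    return value

moduli = [3, 5, 7]
residues = [23 % m for m in moduli]     # [2, 3, 2]
digits = mixed_radix(residues, moduli)  # [2, 2, 1]
# from_mixed_radix(digits, moduli) == 23
```

(The three-argument pow(b, -1, m) used for the modular inverse requires Python 3.8 or later.)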
{"title":"A family of CMOS floating point arithmetic chips","authors":"J. Eldon","doi":"10.1109/ARITH.1985.6158945","DOIUrl":"https://doi.org/10.1109/ARITH.1985.6158945","url":null,"abstract":"Although the advantages of floating point arithmetic have long been recognized, hardware complexity and expense have impeded its use in high speed digital signal processing (DSP). Now, however, the availability of a growing number of fast dedicated floating point adder and multiplier chips is spurring renewed interest in floating point for real time filtering and spectral analysis.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"19 2 1","pages":"101-107"},"PeriodicalIF":0.0,"publicationDate":"1985-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78410174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Foundations of finite precision arithmetic","authors":"D. Matula","doi":"10.1109/ARITH.1972.6153887","DOIUrl":"https://doi.org/10.1109/ARITH.1972.6153887","url":null,"abstract":"Completeness and uniqueness properties for the representation of base β digital numbers by finite length radix polynomials with various digit sets are studied. The digit sets guaranteeing completeness and uniqueness are characterized. A digital conversion algorithm is introduced for determining a base β radix polynomial with digits from a specified set D having a particular value whenever such a radix polynomial exists. The notion of precision of a radix polynomial is formalized, and the determination of the precision from the given base β, digit set D, and real value a of the radix polynomial is investigated.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"7 1","pages":"1-35"},"PeriodicalIF":0.0,"publicationDate":"1972-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74561158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
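One concrete instance of this framework: base β = 3 with digit set D = {−1, 0, 1} (balanced ternary) is a complete digit set giving each integer a unique finite representation. A hedged Python sketch of the corresponding digital conversion, not Matula's own algorithm:

```python
# Hedged sketch: base beta = 3 with digit set D = {-1, 0, 1} (balanced
# ternary), one digit set for which integer representation is complete
# and unique. Not the paper's general algorithm.
def to_balanced_ternary(n):
    """Digits d_0, d_1, ... with n == sum(d_i * 3**i) and d_i in {-1,0,1}."""
    digits = []
    while n != 0:
        r = n % 3
        if r == 2:
            r = -1              # choose digit -1 and carry into the next place
        digits.append(r)
        n = (n - r) // 3
    return digits or [0]

def value(digits, beta=3):
    return sum(d * beta**i for i, d in enumerate(digits))

d = to_balanced_ternary(5)      # [-1, -1, 1], i.e. 5 = -1 - 1*3 + 1*9
```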
{"title":"Analog techniques for residue operations","authors":"T. L. Cauthen, T. Rao","doi":"10.1109/ARITH.1972.6153892","DOIUrl":"https://doi.org/10.1109/ARITH.1972.6153892","url":null,"abstract":"The precision required for residue operations is primarily associated with resistive components and operational amplifiers. However, technology has advanced to the point that analog components can be built to an accuracy of less than one tenth of one percent (at least with discrete components). Companies are attempting to build DACs with fifteen (15) bit accuracy which will settle to the least significant bit in one hundred (100) nanoseconds. None are commercially available, but seven or eight bit DACs with settling times on the order of one hundred (100) nanoseconds are not uncommon. Sixteen (16) bit ADCs have been announced recently as a result of new analog components. It is not uncommon to find operational amplifiers on the market with gain bandwidth products in excess of one hundred fifty (150) megahertz and linearities on the order of one tenth of one percent. One can also find analog voltage comparators (such as the Motorola MC1650) with a hysteresis of ten millivolts and a switching speed of less than two nanoseconds. Schottky diodes have been introduced which allow transistors to switch in less than one and one-half nanoseconds, and manufacturers have learned to build both active and passive components to a tolerance of less than one-tenth of one percent.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"1 1","pages":"1-23"},"PeriodicalIF":0.0,"publicationDate":"1972-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88001938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiresidue codes for double error correction","authors":"P. Monteiro, T. Rao","doi":"10.1109/ARITH.1972.6153889","DOIUrl":"https://doi.org/10.1109/ARITH.1972.6153889","url":null,"abstract":"A new class of (separate) multiresidue codes has been proposed, which is capable of double error correction. The codes are derived from a class of AN codes where A is of the form ∏_{i=1}^{3} (2^{k_i} − 1). Previously all discussions on separate code implementation had restricted themselves to single error correcting codes only. We have shown that these multiple-error correcting separate codes can be relatively easily implemented as the check bases are of the form 2^k − 1. A comparison with the multiresidue codes derived from Barrows-Mandelbaum codes has shown that these codes have in general a higher information rate and an easier implementation.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"162 1","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"1972-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76758809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
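The low-cost check bases are the reason these codes are easy to implement: X mod (2^k − 1) reduces to end-around-carry addition of the k-bit chunks of X. The Python sketch below shows residue checking with such moduli for error detection only; it is not the paper's double-error-correcting decoder.

```python
# Detection-only sketch (not the paper's double-error-correcting
# decoder): residue checks with moduli of the form 2**k - 1 are cheap
# because X mod (2**k - 1) is end-around-carry addition of k-bit chunks.
def residue_mod_2k_minus_1(x, k):
    m = (1 << k) - 1
    while x > m:
        x = (x >> k) + (x & m)  # fold the next k-bit chunk back in
    return 0 if x == m else x

def passes_checks(data, checks, ks):
    return all(residue_mod_2k_minus_1(data, k) == c
               for c, k in zip(checks, ks))

ks = [3, 5]                                # check bases 7 and 31
data = 0b1011010                           # 90
checks = [residue_mod_2k_minus_1(data, k) for k in ks]
corrupted = data ^ (1 << 4)                # a single flipped bit
# passes_checks(data, ...) is True; passes_checks(corrupted, ...) is False
```

Any single bit flip changes the value by ±2^i, which is never divisible by 2^k − 1, so it is always caught by a single check; correcting double errors requires the multiresidue machinery of the paper.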
{"title":"A simulative study of correlated error propagation in various finite arithmetics","authors":"J. Marasa, D. Matula","doi":"10.1109/ARITH.1972.6153915","DOIUrl":"https://doi.org/10.1109/ARITH.1972.6153915","url":null,"abstract":"The accumulated round-off error incurred in long arithmetic computations involving a randomized mixture of addition, subtraction, multiplication and division operations applied to an initial randomly generated data base is studied via simulation. Truncated and rounded floating-point arithmetic and truncated and rounded logarithmic arithmetic are simultaneously utilized for each of the computation sequences and the resulting round-off error accumulations for these four systems are compared. Fundamental results related to the nature of the correlated errors incurred under various arithmetic operator mixes are discussed.","PeriodicalId":6526,"journal":{"name":"2015 IEEE 22nd Symposium on Computer Arithmetic","volume":"128 1","pages":"1-44"},"PeriodicalIF":0.0,"publicationDate":"1972-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76775717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
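The two floating-point quantizers being compared can be stated precisely. A small Python sketch (helper names assumed, and binary arithmetic only; the study also covers logarithmic arithmetic) of chopping versus round-to-nearest to p significant bits:

```python
# Sketch of the two binary quantizers compared in the study (helper
# names assumed): truncation versus round-to-nearest, both keeping p
# significant bits.
import math

def chop(x, p):
    """Truncate x toward zero to p significant binary digits."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)        # x = m * 2**e with 0.5 <= |m| < 1
    return math.trunc(m * 2**p) * 2.0**(e - p)

def rnd(x, p):
    """Round x to nearest with p significant binary digits."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)
    return round(m * 2**p) * 2.0**(e - p)

low, near = chop(0.1, 4), rnd(0.1, 4)
# low == 0.09375 (biased toward zero), near == 0.1015625 (closer to 0.1)
```

The one-sided bias of chopping is what makes the accumulated errors of the four systems diverge over long operation sequences.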