Proceedings of the 12th Symposium on Computer Arithmetic最新文献

英文中文

Redundant binary Booth recoding 冗余二进制布斯重新编码

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465377

Chung Nan Lyu, D. Matula

We investigate the efficiencies attainable pursuing Booth recoding directly from redundant binary input with limited carry propagation. As a digit conversion problem we extend the important result that each radix 4 Booth recoded digit can be determined from 5 consecutive input signed bits to obtain that each radix 2/sup k/ Booth recoded digit can be determined from 2k+1 consecutive input signed bits and prove this to be the minimum possible for any k/spl ges/2. Analysis of alternative bit pair encodings of signed bits yields the improved result that each radix 2/sup k/ Booth recoded digit can be determined from only 2k encoded bit pairs employing sign and magnitude bit encoding, a result which does not extend to conventional borrow-save or carry-save redundant binary digit encodings. Radices 4 and 8 gate level designs are illustrated for alternative encodings, with our signed bit design shown to yield smaller depth and fewer gates than existing redundant binary Booth recoding circuits from the literature.<>

我们研究了在有限进位传播的情况下，直接从冗余二进制输入进行布斯编码所能达到的效率。作为一个数字转换问题，我们将每个基数4 Booth编码的数字可以从5个连续输入的有符号比特中确定的重要结果推广到每个基数2/sup k/ Booth编码的数字可以从2k+1个连续输入的有符号比特中确定，并证明了这是任意k/spl ges/2的最小可能值。对有符号位的替代位对编码的分析得到改进的结果，即每个基数2/sup k/ Booth编码数字可以从使用符号位和幅度位编码的2k个编码位对中确定，这一结果不能扩展到传统的借位节省或进位节省冗余二进制数字编码。根号4和8门电平设计用于替代编码，我们的符号位设计显示比文献中现有的冗余二进制布斯编码电路产生更小的深度和更少的门。

引用次数: 66

Redundant Cordic rotator based on parallel prediction 基于并行预测的冗余Cordic旋转器

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465362

E. Antelo, J. Bruguera, J. Villalba, E. Zapata

We present a Cordic rotator, using carry-save arithmetic, based on the prediction of all the coefficients into which the rotation angle is decomposed. The prediction algorithm is based on the use of radix-2 microrotations with multiple shifts in the first iterations and the use of a redundant radix-2 and radix-4 representation for the coefficients in the rest of the microrotations. The use of multiple shifts facilitates the prediction of the coefficients in the case of microrotations where i/spl les/n/4, being n the precision of the algorithm, and the use of radix-4 microrotations helps to reduce the total number of iterations. The prediction is carried out using the redundant representation of the z coordinate, without any need for conversions to a non-redundant representation. Finally, we present a VLSI architecture based on this algorithm. As the production of the coefficients is very fast, and they are known before starting each microrotation, the resulting architecture can be highly pipelined and consequently appropriate for applications where high speeds are required.<>

基于对旋转角度分解后的所有系数的预测，我们提出了一种Cordic旋转器，该旋转器采用免进位算法。该预测算法基于在第一次迭代中使用具有多次移位的基数-2微旋转，以及在其余微旋转中使用冗余的基数-2和基数-4表示系数。使用多重移位有助于在微旋转的情况下预测系数，其中i/spl小于/n/4，这是算法的精度，并且使用基数为4的微旋转有助于减少总迭代次数。预测是使用z坐标的冗余表示进行的，不需要转换为非冗余表示。最后，我们提出了一个基于此算法的VLSI架构。由于系数的产生非常快，并且在开始每次微旋转之前就知道它们，因此所得架构可以高度流水线化，因此适用于需要高速的应用

引用次数: 13

Very-high radix combined division and square root with prescaling and selection by rounding 非常高的基数结合除法和平方根与预缩放和四舍五入选择

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465367

T. Lang, P. Montuschi

An algorithm for square root with prescaling is developed and combined with a similar scheme for division. An implementation is described, evaluated and compared with other combined div/sqrt implementations.<>

提出了一种带有预标度的平方根算法，并将其与一种类似的除法算法相结合。对一个实现进行描述、评估，并与其他div/sqrt组合实现进行比较。

引用次数: 24

The K5 transcendental functions K5超越函数

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465368

Thomas W. Lynch, Ashraf Ahmed, M. Schulte, T. K. Callaway, Robert Tisdale

This paper describes the development of the transcendental instructions for the K5, AMD's recently completed x86 compatible superscalar microprocessor. A multi-level development cycle, with testing between levels, facilitated the early detection of errors and limited their effect on the design schedule. The algorithms for the transcendental functions use table-driven reductions followed by polynomial approximations. Multiprecision arithmetic operations are used when necessary to maintain sufficient accuracy and to ensure that the transcendental functions have a maximum error of one unit in the last place.<>

本文描述了AMD最近完成的兼容x86的超标量微处理器K5的超越指令的开发。多层次的开发周期，在不同级别之间进行测试，有助于及早发现错误，并限制它们对设计进度的影响。超越函数的算法使用表驱动约简，然后是多项式近似。当需要保持足够的精度并确保超越函数在最后一个位置的最大误差为一个单位时，使用多精度算术运算

引用次数: 16

30-ns 55-b shared radix 2 division and square root using a self-timed circuit 30-ns 55-b使用自定时电路共享基数2除法和平方根

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465371

G. Matsubara, N. Ide, H. Tago, Seigo Suzuki, N. Goto

A shared radix 2 division and square root implementation using a self-timed circuit is presented. The same execution time for division and square root is achieved by using an on-the-fly digit decoding and a root multiple generation technique. Most of the hardware is shared, and only several multiplexers are required to exchange a divisor multiple and a root multiple. Moreover, quotient selection logic is accelerated by a new algorithm using a 3-b carry propagation adder. The implementation of the shared division and square root unit is realized by assuming 0.3 /spl mu/m CMOS technology. The wiring capacitance and other parasitic parameters are taken into account. The execution time of floating point 55-b full mantissa division and square root is expected to be less than 30 ns in the worst case of an input vector determined by an intensive circuit simulation.<>

提出了一种利用自定时电路实现共享根除和平方根的方法。除法和平方根的执行时间相同是通过使用动态数字解码和根多重生成技术实现的。大多数硬件是共享的，并且只需要几个多路复用器来交换一个除数倍数和一个根倍数。此外，采用3b进位传播加法器的新算法加快了商选择逻辑的速度。通过假设0.3 /spl mu/m CMOS技术实现了共享除法和平方根的实现。考虑了布线电容和其他寄生参数。在由密集电路模拟确定的输入向量的最坏情况下，浮点55-b全尾数除法和平方根的执行时间预计小于30 ns。

引用次数: 19

O(n)-depth circuit algorithm for modular exponentiation O(n)深度电路模求幂算法

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465360

T. Hamano, N. Takagi, S. Yajima, F. Preparata

An O(n)-depth polynomial-size combinational circuit algorithm is proposed for n-bit modular exponentiation, i.e., for the computation of "x/sup y/ mod m" for arbitrary integers x, y and m. Represented as n-bit binary integers, within bounds 2/sup n-1//spl les/m<2/sup n/ and 0/spl les/x,y>

提出了一种O(n)深度多项式大小的组合电路算法，用于n位模求幂，即对任意整数x, y, m进行“x/sup y/ mod m”的计算。表示为n位二进制整数，在2/sup n-1//spl les/m>

引用次数: 18

Arithmetic for relative accuracy 相对精度算法

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465357

R. V. Drunen, L. Spaanenburg, P. Lucassen, J. Nijhuis, J. T. Udding

Of the three factors named in Moore's first Law that drive the advance of computational systems, circuit design receives relatively little mention. We introduce here a circuit variety that allows to include accuracy considerations. It is shown that accuracy-drive can be effectively realised and leads to 60% speed improvement. Details are given of a floating-point unit with full hardware support of complex calculations, specifically tailored to speed-up MD-simulations on the GROMACS scientific parallel computer.<>

在摩尔第一定律中提到的推动计算系统进步的三个因素中，电路设计相对较少被提及。我们在这里介绍一种允许包括精度考虑的电路品种。结果表明，该方法可以有效地实现精度驱动，使转速提高60%。详细介绍了一个浮点单元，具有复杂计算的完整硬件支持，专门为加速GROMACS科学并行计算机上的md模拟而量身定制。

引用次数: 5

Semi-logarithmic number systems 半对数数系

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465358

J. Muller, A. Tisserand, Alexandre Scherbyna

We present a new class of number systems, called semi-logarithmic number systems, that constitute a family of various compromises between floating-point and logarithmic number systems. We propose arithmetic algorithms for the semi-logarithmic number systems, and we compare these number systems to the classical floating-point or logarithmic number systems.<>

我们提出了一类新的数字系统，称为半对数数系统，它构成了浮点数系统和对数数系统之间的各种折衷。我们提出了半对数数系统的算法，并将这些数系统与经典浮点数或对数数系统进行了比较

引用次数: 18

Function evaluation by table look-up and addition 通过表查找和加法计算函数

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465382

Hannes Hassler, N. Takagi

We describe a general approach decomposing a function into a sum of functions, each with a smaller input site than the original. Hence we can map such functions with essentially the same precision using small ROM tables and adders. We derive an easy method to compute the worst case error for many elementary functions and an error bound for the rest. Important applications are reciprocals, logarithms, exponentials and others.<>

我们描述了一种将函数分解为函数和的一般方法，每个函数的输入位置都比原始的小。因此，我们可以使用小的ROM表和加法器以基本相同的精度映射这些函数。我们推导出一种简单的方法来计算许多初等函数的最坏情况误差和其他函数的误差界。重要的应用是倒数、对数、指数等。

引用次数: 111

A new VLSI vector arithmetic coprocessor for the PC 一种新型的VLSI矢量算术协处理器

Proceedings of the 12th Symposium on Computer Arithmetic

Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465356

C. Baumhof

A new vector arithmetic coprocessor MIM XPA3233 with integrated PCI bus interface has been developed in CMOS VLSI technology. The chip performs dot products of vectors with components of the IEEE DOUBLE data format to full accuracy or with only one final rounding. Details on the realisation of the multiplication, accumulation and carry resolution processes are discussed. Performance data and some details about the actual VLSI realisation are presented. Software support for the coprocessor is available in the programming languages PASCAL-XSC and C-XSC or from a special C subroutine library. Programming examples are shown using PASCAL-XSC and C.<>

基于CMOS VLSI技术，开发了一种集成PCI总线接口的矢量算法协处理器MIM XPA3233。该芯片与IEEE DOUBLE数据格式的组件进行向量的点积，以达到完全精度或仅最后一次四舍五入。详细讨论了乘法、累加和进位解析过程的实现。给出了性能数据和实际VLSI实现的一些细节。协处理器的软件支持可以通过PASCAL-XSC和C- xsc编程语言或特殊的C子程序库获得。程序示例显示使用PASCAL-XSC和c。

引用次数: 17

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 12th Symposium on Computer Arithmetic

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀