Proceedings of IEEE 11th Symposium on Computer Arithmetic最新文献

英文中文

n /spl times/ n carry-save multipliers without final addition N /spl倍/ N个无最终加法的进位乘法器

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378109

P. Montuschi, L. Ciminiera

Carry-save multipliers require an adder at the last step to convert the carry-sum representation of the most significant half of the result into an irredundant form. A multiplication scheme where by this conversion is performed with a circuit operating in parallel with the carry-save array is presented. The resulting implementation, when a radix-2 adder array is used, produces a result on 2n bits with a delay comparable to that of the multiplier proposed by M.D. Ercegovac and T. Lang (1990). When a radix-4 array is used, the proposed unit is almost twice as fast as units proposed previously.<>

免进位乘法器在最后一步需要加法器来将结果的最有效一半的进位和表示转换为不冗余的形式。提出了一种乘法方案，其中通过与进位保存阵列并行操作的电路来执行这种转换。当使用基数为2的加法器阵列时，产生的结果为2n位，其延迟与M.D. Ercegovac和T. Lang(1990)提出的乘法器相当。当使用基数为4的数组时，建议的单位几乎是以前建议的单位的两倍。

引用次数: 5

Integer mapping architectures for the polynomial ring engine 多项式环引擎的整数映射体系结构

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378110

S. Bizzan, G. Jullien, N. Wigley, W. Miller

A finite polynomial ring structure for mapping inner product computations to parallel independent ring computations over 3-b moduli has been introduced by N.M. Wigley et al. (1992). The main algorithmic computation architecture can be implemented using well-established systolic array mapping principles, and a project to construct a Polynomial Ring Engine (PRE) is underway to exploit the VLSI implementation properties of such computations. A semi-systolic architecture for the input and output conversion mappings that are required in the engine is introduced here. It is shown that the entire mappings procedure can be carried out with pipelined six-input logic blocks and small, fast, binary adders. CMOS implementation techniques for the pipelined blocks are discussed, and the design procedure is illustrated with results from a recently completed module generator.<>

N.M. Wigley等人(1992)引入了一种有限多项式环结构，用于将内积计算映射到3-b模上的并行独立环计算。主要的算法计算架构可以使用完善的收缩阵列映射原理来实现，并且一个构建多项式环引擎(PRE)的项目正在进行中，以利用这种计算的VLSI实现特性。这里将介绍用于引擎中所需的输入和输出转换映射的半收缩架构。结果表明，整个映射过程可以用流水线的六输入逻辑块和小型、快速的二进制加法器来完成。讨论了流水线模块的CMOS实现技术，并用最近完成的模块生成器的结果说明了设计过程。

引用次数: 4

A lazy exact arithmetic 一个懒惰的精确算术

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378086

Mohand Ourabah Benouamer, P. Jaillon, D. Michelucci, J. Moreau

Systems based on exact arithmetic are very slow. In practical situations, very few computations need be performed exactly as approximating the results is very often sufficient. Unfortunately, it is impossible to know at the time when the computation is called for whether an exact evaluation will be necessary or not. The arithmetic library presented here achieves laziness by postponing any exact computation until it is proved to be indispensable. This yields very substantial gains in performance while allowing exact decisions. The lazy arithmetic techniques are presented in the context of rational computations, using the field of computational geometry as a background.<>

基于精确算术的系统非常慢。在实际情况中，很少需要精确地执行计算，因为通常近似结果就足够了。不幸的是，在调用计算时不可能知道是否需要精确的求值。这里提供的算术库通过延迟任何精确计算，直到证明它是必不可少的来实现惰性。这在允许做出精确决策的同时，在性能上产生了非常可观的收益。以计算几何领域为背景，在理性计算的背景下提出了懒惰算术技术

引用次数: 11

Algorithms and multi-valued circuits for the multioperand addition in the binary stored-carry number system 二进制存储进位系统中多操作数加法的算法和多值电路

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378092

D. Etiemble, K. Navi

Algorithms for the sum of two (three and four) digits in the binary stored-carry number system, using the smallest set of values for the positional sum, are presented. The corresponding adders, which use multivalued current-mode circuits, are also presented. The implementation of multioperand additions using these adders is compared with the usual binary implementation.<>

在二进制存储进位数系统中，使用最小值集进行位置和，给出了两位数(三位数和四位数)和的算法。相应的加法器，采用多值电流模式电路，也被介绍。将使用这些加法器的多操作数加法的实现与通常的二进制实现进行比较。

引用次数: 17

Comparing several GCD algorithms 比较几种GCD算法

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378094

T. Jebelean

The execution times of several algorithms for computing the GCD of arbitrary precision integers are compared. These algorithms are the known ones (Euclidean, binary, plus-minus), and the improved variants of these for multidigit computation (Lehmer and similar), as well as new algorithms introduced by the author: an improved Lehmer algorithm using two digits in partial consequence computation, and a generation of the binary algorithm using a new concept of modular conjugates. The last two algorithms prove to be the fastest of all, giving a speedup of six to eight times over the classical Euclidean scheme, and two times over the best currently known algorithms. Also, the generalized binary algorithm is suitable for systolic parallelization in a least-significant digits first pipelined manner.<>

比较了几种计算任意精度整数GCD的算法的执行时间。这些算法是已知的算法(欧几里得，二进制，正负)，以及这些算法的改进变体(Lehmer和类似的)，以及作者介绍的新算法:在部分结果计算中使用两位数的改进Lehmer算法，以及使用模共轭新概念的二进制算法的生成。最后两种算法被证明是所有算法中最快的，比经典的欧几里得方案加快了6到8倍，比目前已知的最好的算法快了2倍。此外，广义二进制算法适用于以最低有效数字优先的流水线方式进行收缩并行化。

引用次数: 28

On digit-recurrence division implementations for field programmable gate arrays 现场可编程门阵列的数字递归除法实现

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378091

M. E. Louie, M. Ercegovac

The flexibility of field programmable gate arrays (FPGAs) can provide arithmetic-intensive programs with the benefits of custom hardware but without the high cost of custom silicon implementations. Efficient mappings are key to fast arithmetic implementations on FPGAs. A process for developing such mappings with lookup table based FPGAs is explored. The development process is illustrated with SRT division and the Xilinx XC4010 FPGA. With this mapping process a linear sequential array design that avoids the common problem of large fanout delay in the critical path is created. This approach has a cycle time that is independent of precision, yet it requires approximately the same number of logic blocks as a conventional implementation.<>

现场可编程门阵列(fpga)的灵活性可以提供具有定制硬件优势的算术密集型程序，但没有定制硅实现的高成本。高效的映射是fpga快速算法实现的关键。探讨了用基于查找表的fpga开发这种映射的过程。通过SRT划分和Xilinx XC4010 FPGA说明了开发过程。通过这种映射过程，创建了一种线性顺序阵列设计，避免了关键路径中常见的大扇出延迟问题。这种方法的周期时间与精度无关，但它需要的逻辑块数量与传统实现大致相同

引用次数: 28

A 17 /spl times/ 69 bit multiply and add unit with redundant binary feedback and single cycle latency 具有冗余二进制反馈和单周期延迟的17 /spl倍/ 69位乘法和加法单元

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378096

W. S. Briggs, D. Matula

The authors describe a numeric processor with a kernel that is a tree of redundant binary adders and effects either a 17 /spl times/ 69-b multiply-and-add or a 19 /spl times/ 69-b multiply with exact redundant binary output and single cycle latency. Feedback paths selectively allow a high-order or low-order part of the adder tree output to be fed back in redundant binary form to the multiplicand and/or addend inputs to the adder tree. The authors describe algorithms iteratively using this adder tree kernel for IEEE double extended multiplication, division, and square root; conversions between 18-digit BCD integers and 64-b binary integers; and transcendental function evaluation. The multiplier design described was implemented in the Cyrix 83D87 numeric coprocessor (typically 33 MHz). Results for this coprocessor as compared with competitive x87 units are included.<>

作者描述了一个数字处理器，其内核是一个冗余二进制加法器树，具有精确冗余二进制输出和单周期延迟，可以实现17 /spl次/ 69-b的乘法和加法或19 /spl次/ 69-b的乘法。反馈路径选择性地允许加法器树输出的高阶或低阶部分以冗余二进制形式反馈到乘数和/或加法器树的加数输入。作者描述了算法迭代使用该加法树核的IEEE双扩展乘法，除法和平方根;18位BCD整数与64-b二进制整数之间的转换;和超越函数求值。所描述的乘法器设计是在Cyrix 83D87数字协处理器(通常为33 MHz)中实现的。本文还包括了该协处理器与竞争机型x87的比较结果。

引用次数: 23

Adaptive beamforming using RNS arithmetic 基于RNS算法的自适应波束形成

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378111

B. Kirsch, P. Turner

The adaptive beamforming problem is solved using an algorithm-architecture-arithmetic combination that can be used for a small platform such as are found on aircraft or sonobuoys. The arithmetic used is the RNS system implemented on an array of processors that can be reassigned as the algorithm proceeds. The underlying algorithm is a modified Gaussian elimination. The (non-RNS) division operations are eliminated in favor of some scaling and the adaptive use of the processor array to accommodate the growth in dynamic range.<>

自适应波束形成问题采用一种算法-体系结构-算法组合的方法来解决，该方法可用于飞机或声纳浮标等小型平台。所使用的算法是在一组处理器上实现的RNS系统，这些处理器可以随着算法的进行而重新分配。底层算法是一种改进的高斯消去法。(非rns)除法操作被消除，有利于一些缩放和自适应使用处理器阵列来适应动态范围的增长

引用次数: 9

Exploiting trivial and redundant computation 利用琐碎和冗余的计算

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378089

Stephen Richardson

The notion of trivial computation, in which the appearance of simple operands renders potentially complex operations simple, is discussed. An example of a trivial operation is integer division, where the divisor is two; the division becomes a simple shift operation. The concept of redundant computation, in which some operation repeatedly does the same function because it repeatedly sees the same operands, is also discussed. Experiments on two separate benchmark suites, the SPEC benchmarks and the Perfect Club, find a surprising amount of trivial and redundant operation. Various architectural means of exploiting this knowledge to improve computational efficiency include detection of trivial operands and the result cache. Further experimentation shows significant speedup from these techniques, as measured on three different styles of machine architecture.<>

讨论了平凡计算的概念，其中简单操作数的出现使潜在的复杂操作变得简单。一个简单的例子是整数除法，除数是2;除法变成了简单的移位操作。还讨论了冗余计算的概念，冗余计算是指某些操作由于重复看到相同的操作数而重复执行相同的函数。在两个独立的基准测试套件(SPEC基准测试和Perfect Club)上进行的实验发现了大量琐碎和冗余的操作。利用这些知识来提高计算效率的各种体系结构方法包括检测琐碎操作数和结果缓存。进一步的实验表明，在三种不同风格的机器架构上，这些技术显著提高了速度。

引用次数: 81

Combined system-level redundancy and modular arithmetic for fault tolerant digital signal processing 结合系统级冗余和模块化算法的容错数字信号处理

Proceedings of IEEE 11th Symposium on Computer Arithmetic

Pub Date : 1993-06-29 DOI: 10.1109/ARITH.1993.378112

W. Jenkins, B. Schnaufer, A. Mansen

This paper proposes combining system-level modular redundancy with the arithmetic modularity of residue number system (RNS) arithmetic to achieve fault tolerance in high speed digital signal processing (DSP) systems. Double, triple, and quadruple modular redundancy are combined with RNS modularity for realizing important DSP computational kernels. The discussion includes the development of the serial-by-modulus (SBM) RNS architecture in which residue digits are processed sequentially in circuits that handle only one modular operation at a given time, thereby sacrificing speed for circuit simplicity. As a potential application of the SBM concept, a variable-word-length sum-of-products signal processing kernel is developed based on a serial-by-modulus RNS architecture. Because the RNS is not a weighted number representation, if the instantaneous dynamic range requirement can be estimated it may be possible to perform the computation with only enough residue digits to provide the necessary dynamic range.<>

本文提出将系统级模块化冗余与残数系统(RNS)算法的算术模块化相结合，实现高速数字信号处理(DSP)系统的容错。将双、三、四重模块冗余与RNS模块化相结合，实现重要的DSP计算内核。讨论包括串行模数(SBM) RNS架构的发展，其中在电路中顺序处理残数，在给定时间内只处理一个模块操作，从而牺牲电路简单性的速度。作为SBM概念的潜在应用，基于串行-模RNS架构开发了变字长积和信号处理内核。由于RNS不是加权数字表示，如果可以估计瞬时动态范围要求，则可能只使用足够的剩余数字来执行计算以提供必要的动态范围。

引用次数: 10

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of IEEE 11th Symposium on Computer Arithmetic

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀