首页 > 最新文献

Proceedings of the 12th Symposium on Computer Arithmetic最新文献

英文 中文
Redundant binary Booth recoding 冗余二进制布斯重新编码
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465377
Chung Nan Lyu, D. Matula
We investigate the efficiencies attainable pursuing Booth recoding directly from redundant binary input with limited carry propagation. As a digit conversion problem we extend the important result that each radix 4 Booth recoded digit can be determined from 5 consecutive input signed bits to obtain that each radix 2/sup k/ Booth recoded digit can be determined from 2k+1 consecutive input signed bits and prove this to be the minimum possible for any k/spl ges/2. Analysis of alternative bit pair encodings of signed bits yields the improved result that each radix 2/sup k/ Booth recoded digit can be determined from only 2k encoded bit pairs employing sign and magnitude bit encoding, a result which does not extend to conventional borrow-save or carry-save redundant binary digit encodings. Radices 4 and 8 gate level designs are illustrated for alternative encodings, with our signed bit design shown to yield smaller depth and fewer gates than existing redundant binary Booth recoding circuits from the literature.<>
我们研究了在有限进位传播的情况下,直接从冗余二进制输入进行布斯编码所能达到的效率。作为一个数字转换问题,我们将每个基数4 Booth编码的数字可以从5个连续输入的有符号比特中确定的重要结果推广到每个基数2/sup k/ Booth编码的数字可以从2k+1个连续输入的有符号比特中确定,并证明了这是任意k/spl ges/2的最小可能值。对有符号位的替代位对编码的分析得到改进的结果,即每个基数2/sup k/ Booth编码数字可以从使用符号位和幅度位编码的2k个编码位对中确定,这一结果不能扩展到传统的借位节省或进位节省冗余二进制数字编码。根号4和8门电平设计用于替代编码,我们的符号位设计显示比文献中现有的冗余二进制布斯编码电路产生更小的深度和更少的门。
{"title":"Redundant binary Booth recoding","authors":"Chung Nan Lyu, D. Matula","doi":"10.1109/ARITH.1995.465377","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465377","url":null,"abstract":"We investigate the efficiencies attainable pursuing Booth recoding directly from redundant binary input with limited carry propagation. As a digit conversion problem we extend the important result that each radix 4 Booth recoded digit can be determined from 5 consecutive input signed bits to obtain that each radix 2/sup k/ Booth recoded digit can be determined from 2k+1 consecutive input signed bits and prove this to be the minimum possible for any k/spl ges/2. Analysis of alternative bit pair encodings of signed bits yields the improved result that each radix 2/sup k/ Booth recoded digit can be determined from only 2k encoded bit pairs employing sign and magnitude bit encoding, a result which does not extend to conventional borrow-save or carry-save redundant binary digit encodings. Radices 4 and 8 gate level designs are illustrated for alternative encodings, with our signed bit design shown to yield smaller depth and fewer gates than existing redundant binary Booth recoding circuits from the literature.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131194331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 66
Redundant Cordic rotator based on parallel prediction 基于并行预测的冗余Cordic旋转器
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465362
E. Antelo, J. Bruguera, J. Villalba, E. Zapata
We present a Cordic rotator, using carry-save arithmetic, based on the prediction of all the coefficients into which the rotation angle is decomposed. The prediction algorithm is based on the use of radix-2 microrotations with multiple shifts in the first iterations and the use of a redundant radix-2 and radix-4 representation for the coefficients in the rest of the microrotations. The use of multiple shifts facilitates the prediction of the coefficients in the case of microrotations where i/spl les/n/4, being n the precision of the algorithm, and the use of radix-4 microrotations helps to reduce the total number of iterations. The prediction is carried out using the redundant representation of the z coordinate, without any need for conversions to a non-redundant representation. Finally, we present a VLSI architecture based on this algorithm. As the production of the coefficients is very fast, and they are known before starting each microrotation, the resulting architecture can be highly pipelined and consequently appropriate for applications where high speeds are required.<>
基于对旋转角度分解后的所有系数的预测,我们提出了一种Cordic旋转器,该旋转器采用免进位算法。该预测算法基于在第一次迭代中使用具有多次移位的基数-2微旋转,以及在其余微旋转中使用冗余的基数-2和基数-4表示系数。使用多重移位有助于在微旋转的情况下预测系数,其中i/spl小于/n/4,这是算法的精度,并且使用基数为4的微旋转有助于减少总迭代次数。预测是使用z坐标的冗余表示进行的,不需要转换为非冗余表示。最后,我们提出了一个基于此算法的VLSI架构。由于系数的产生非常快,并且在开始每次微旋转之前就知道它们,因此所得架构可以高度流水线化,因此适用于需要高速的应用
{"title":"Redundant Cordic rotator based on parallel prediction","authors":"E. Antelo, J. Bruguera, J. Villalba, E. Zapata","doi":"10.1109/ARITH.1995.465362","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465362","url":null,"abstract":"We present a Cordic rotator, using carry-save arithmetic, based on the prediction of all the coefficients into which the rotation angle is decomposed. The prediction algorithm is based on the use of radix-2 microrotations with multiple shifts in the first iterations and the use of a redundant radix-2 and radix-4 representation for the coefficients in the rest of the microrotations. The use of multiple shifts facilitates the prediction of the coefficients in the case of microrotations where i/spl les/n/4, being n the precision of the algorithm, and the use of radix-4 microrotations helps to reduce the total number of iterations. The prediction is carried out using the redundant representation of the z coordinate, without any need for conversions to a non-redundant representation. Finally, we present a VLSI architecture based on this algorithm. As the production of the coefficients is very fast, and they are known before starting each microrotation, the resulting architecture can be highly pipelined and consequently appropriate for applications where high speeds are required.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132896545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Very-high radix combined division and square root with prescaling and selection by rounding 非常高的基数结合除法和平方根与预缩放和四舍五入选择
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465367
T. Lang, P. Montuschi
An algorithm for square root with prescaling is developed and combined with a similar scheme for division. An implementation is described, evaluated and compared with other combined div/sqrt implementations.<>
提出了一种带有预标度的平方根算法,并将其与一种类似的除法算法相结合。对一个实现进行描述、评估,并与其他div/sqrt组合实现进行比较。
{"title":"Very-high radix combined division and square root with prescaling and selection by rounding","authors":"T. Lang, P. Montuschi","doi":"10.1109/ARITH.1995.465367","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465367","url":null,"abstract":"An algorithm for square root with prescaling is developed and combined with a similar scheme for division. An implementation is described, evaluated and compared with other combined div/sqrt implementations.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128179394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
The K5 transcendental functions K5超越函数
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465368
Thomas W. Lynch, Ashraf Ahmed, M. Schulte, T. K. Callaway, Robert Tisdale
This paper describes the development of the transcendental instructions for the K5, AMD's recently completed x86 compatible superscalar microprocessor. A multi-level development cycle, with testing between levels, facilitated the early detection of errors and limited their effect on the design schedule. The algorithms for the transcendental functions use table-driven reductions followed by polynomial approximations. Multiprecision arithmetic operations are used when necessary to maintain sufficient accuracy and to ensure that the transcendental functions have a maximum error of one unit in the last place.<>
本文描述了AMD最近完成的兼容x86的超标量微处理器K5的超越指令的开发。多层次的开发周期,在不同级别之间进行测试,有助于及早发现错误,并限制它们对设计进度的影响。超越函数的算法使用表驱动约简,然后是多项式近似。当需要保持足够的精度并确保超越函数在最后一个位置的最大误差为一个单位时,使用多精度算术运算
{"title":"The K5 transcendental functions","authors":"Thomas W. Lynch, Ashraf Ahmed, M. Schulte, T. K. Callaway, Robert Tisdale","doi":"10.1109/ARITH.1995.465368","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465368","url":null,"abstract":"This paper describes the development of the transcendental instructions for the K5, AMD's recently completed x86 compatible superscalar microprocessor. A multi-level development cycle, with testing between levels, facilitated the early detection of errors and limited their effect on the design schedule. The algorithms for the transcendental functions use table-driven reductions followed by polynomial approximations. Multiprecision arithmetic operations are used when necessary to maintain sufficient accuracy and to ensure that the transcendental functions have a maximum error of one unit in the last place.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130556083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
30-ns 55-b shared radix 2 division and square root using a self-timed circuit 30-ns 55-b使用自定时电路共享基数2除法和平方根
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465371
G. Matsubara, N. Ide, H. Tago, Seigo Suzuki, N. Goto
A shared radix 2 division and square root implementation using a self-timed circuit is presented. The same execution time for division and square root is achieved by using an on-the-fly digit decoding and a root multiple generation technique. Most of the hardware is shared, and only several multiplexers are required to exchange a divisor multiple and a root multiple. Moreover, quotient selection logic is accelerated by a new algorithm using a 3-b carry propagation adder. The implementation of the shared division and square root unit is realized by assuming 0.3 /spl mu/m CMOS technology. The wiring capacitance and other parasitic parameters are taken into account. The execution time of floating point 55-b full mantissa division and square root is expected to be less than 30 ns in the worst case of an input vector determined by an intensive circuit simulation.<>
提出了一种利用自定时电路实现共享根除和平方根的方法。除法和平方根的执行时间相同是通过使用动态数字解码和根多重生成技术实现的。大多数硬件是共享的,并且只需要几个多路复用器来交换一个除数倍数和一个根倍数。此外,采用3b进位传播加法器的新算法加快了商选择逻辑的速度。通过假设0.3 /spl mu/m CMOS技术实现了共享除法和平方根的实现。考虑了布线电容和其他寄生参数。在由密集电路模拟确定的输入向量的最坏情况下,浮点55-b全尾数除法和平方根的执行时间预计小于30 ns。
{"title":"30-ns 55-b shared radix 2 division and square root using a self-timed circuit","authors":"G. Matsubara, N. Ide, H. Tago, Seigo Suzuki, N. Goto","doi":"10.1109/ARITH.1995.465371","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465371","url":null,"abstract":"A shared radix 2 division and square root implementation using a self-timed circuit is presented. The same execution time for division and square root is achieved by using an on-the-fly digit decoding and a root multiple generation technique. Most of the hardware is shared, and only several multiplexers are required to exchange a divisor multiple and a root multiple. Moreover, quotient selection logic is accelerated by a new algorithm using a 3-b carry propagation adder. The implementation of the shared division and square root unit is realized by assuming 0.3 /spl mu/m CMOS technology. The wiring capacitance and other parasitic parameters are taken into account. The execution time of floating point 55-b full mantissa division and square root is expected to be less than 30 ns in the worst case of an input vector determined by an intensive circuit simulation.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130935076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
O(n)-depth circuit algorithm for modular exponentiation O(n)深度电路模求幂算法
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465360
T. Hamano, N. Takagi, S. Yajima, F. Preparata
An O(n)-depth polynomial-size combinational circuit algorithm is proposed for n-bit modular exponentiation, i.e., for the computation of "x/sup y/ mod m" for arbitrary integers x, y and m. Represented as n-bit binary integers, within bounds 2/sup n-1//spl les/m<2/sup n/ and 0/spl les/x,y>
提出了一种O(n)深度多项式大小的组合电路算法,用于n位模求幂,即对任意整数x, y, m进行“x/sup y/ mod m”的计算。表示为n位二进制整数,在2/sup n-1//spl les/m>
{"title":"O(n)-depth circuit algorithm for modular exponentiation","authors":"T. Hamano, N. Takagi, S. Yajima, F. Preparata","doi":"10.1109/ARITH.1995.465360","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465360","url":null,"abstract":"An O(n)-depth polynomial-size combinational circuit algorithm is proposed for n-bit modular exponentiation, i.e., for the computation of \"x/sup y/ mod m\" for arbitrary integers x, y and m. Represented as n-bit binary integers, within bounds 2/sup n-1//spl les/m<2/sup n/ and 0/spl les/x,y<m. The algorithm is a generalization of the square-and-multiply method. An obvious implementation of the square-and-multiply method yields a circuit of depth O(nlogn) and size O(n/sup 3/). In the proposed algorithm, the terms x/sup 2/ mod m's for all i's /spl epsiv.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122182316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Arithmetic for relative accuracy 相对精度算法
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465357
R. V. Drunen, L. Spaanenburg, P. Lucassen, J. Nijhuis, J. T. Udding
Of the three factors named in Moore's first Law that drive the advance of computational systems, circuit design receives relatively little mention. We introduce here a circuit variety that allows to include accuracy considerations. It is shown that accuracy-drive can be effectively realised and leads to 60% speed improvement. Details are given of a floating-point unit with full hardware support of complex calculations, specifically tailored to speed-up MD-simulations on the GROMACS scientific parallel computer.<>
在摩尔第一定律中提到的推动计算系统进步的三个因素中,电路设计相对较少被提及。我们在这里介绍一种允许包括精度考虑的电路品种。结果表明,该方法可以有效地实现精度驱动,使转速提高60%。详细介绍了一个浮点单元,具有复杂计算的完整硬件支持,专门为加速GROMACS科学并行计算机上的md模拟而量身定制。
{"title":"Arithmetic for relative accuracy","authors":"R. V. Drunen, L. Spaanenburg, P. Lucassen, J. Nijhuis, J. T. Udding","doi":"10.1109/ARITH.1995.465357","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465357","url":null,"abstract":"Of the three factors named in Moore's first Law that drive the advance of computational systems, circuit design receives relatively little mention. We introduce here a circuit variety that allows to include accuracy considerations. It is shown that accuracy-drive can be effectively realised and leads to 60% speed improvement. Details are given of a floating-point unit with full hardware support of complex calculations, specifically tailored to speed-up MD-simulations on the GROMACS scientific parallel computer.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116251775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Semi-logarithmic number systems 半对数数系
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465358
J. Muller, A. Tisserand, Alexandre Scherbyna
We present a new class of number systems, called semi-logarithmic number systems, that constitute a family of various compromises between floating-point and logarithmic number systems. We propose arithmetic algorithms for the semi-logarithmic number systems, and we compare these number systems to the classical floating-point or logarithmic number systems.<>
我们提出了一类新的数字系统,称为半对数数系统,它构成了浮点数系统和对数数系统之间的各种折衷。我们提出了半对数数系统的算法,并将这些数系统与经典浮点数或对数数系统进行了比较
{"title":"Semi-logarithmic number systems","authors":"J. Muller, A. Tisserand, Alexandre Scherbyna","doi":"10.1109/ARITH.1995.465358","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465358","url":null,"abstract":"We present a new class of number systems, called semi-logarithmic number systems, that constitute a family of various compromises between floating-point and logarithmic number systems. We propose arithmetic algorithms for the semi-logarithmic number systems, and we compare these number systems to the classical floating-point or logarithmic number systems.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115858578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A GaAs IEEE floating point standard single precision multiplier 一个GaAs IEEE浮点标准单精度乘法器
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465372
S. Cui, N. Burgess, M. Liebelt, K. Eshraghian
This paper presents a GaAs IEEE floating point standard single precision multiplier. A modified carry save array is used in conjunction with Booth's algorithm to reduce the partial product addition and interconnection. A special rounding technique called Trailing-1's Predictor is used to speed up the final addition and rounding. The combination of the fast arithmetic architecture and compact layout style achieves 4 ns multiplication time with 3.5 W power dissipation at 75/spl deg/C giving 14 mW/MHz. The area is 2.43 mm by 3.77 mm (excluding pads) and uses 28,000 transistors to give a density of 3056 transistors/mm/sup 2/ for 0.8-/spl mu/m GaAs technology.<>
提出了一种GaAs IEEE浮点标准单精度乘法器。采用改进的进位保存阵列与Booth算法相结合,减少了部分乘积的加法和互连。一种特殊的四舍五入技术称为尾1的预测器,用于加快最后的加法和四舍五入。快速的算法架构和紧凑的布局风格相结合,在75/spl度/C下,在14 mW/MHz下,实现了4 ns的乘法时间和3.5 W的功耗。面积为2.43 mm × 3.77 mm(不包括焊盘),使用28,000个晶体管,为0.8-/spl mu/m GaAs技术提供3056个晶体管/mm/sup / 2/的密度。
{"title":"A GaAs IEEE floating point standard single precision multiplier","authors":"S. Cui, N. Burgess, M. Liebelt, K. Eshraghian","doi":"10.1109/ARITH.1995.465372","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465372","url":null,"abstract":"This paper presents a GaAs IEEE floating point standard single precision multiplier. A modified carry save array is used in conjunction with Booth's algorithm to reduce the partial product addition and interconnection. A special rounding technique called Trailing-1's Predictor is used to speed up the final addition and rounding. The combination of the fast arithmetic architecture and compact layout style achieves 4 ns multiplication time with 3.5 W power dissipation at 75/spl deg/C giving 14 mW/MHz. The area is 2.43 mm by 3.77 mm (excluding pads) and uses 28,000 transistors to give a density of 3056 transistors/mm/sup 2/ for 0.8-/spl mu/m GaAs technology.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122797459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A new VLSI vector arithmetic coprocessor for the PC 一种新型的VLSI矢量算术协处理器
Pub Date : 1995-07-19 DOI: 10.1109/ARITH.1995.465356
C. Baumhof
A new vector arithmetic coprocessor MIM XPA3233 with integrated PCI bus interface has been developed in CMOS VLSI technology. The chip performs dot products of vectors with components of the IEEE DOUBLE data format to full accuracy or with only one final rounding. Details on the realisation of the multiplication, accumulation and carry resolution processes are discussed. Performance data and some details about the actual VLSI realisation are presented. Software support for the coprocessor is available in the programming languages PASCAL-XSC and C-XSC or from a special C subroutine library. Programming examples are shown using PASCAL-XSC and C.<>
基于CMOS VLSI技术,开发了一种集成PCI总线接口的矢量算法协处理器MIM XPA3233。该芯片与IEEE DOUBLE数据格式的组件进行向量的点积,以达到完全精度或仅最后一次四舍五入。详细讨论了乘法、累加和进位解析过程的实现。给出了性能数据和实际VLSI实现的一些细节。协处理器的软件支持可以通过PASCAL-XSC和C- xsc编程语言或特殊的C子程序库获得。程序示例显示使用PASCAL-XSC和c。
{"title":"A new VLSI vector arithmetic coprocessor for the PC","authors":"C. Baumhof","doi":"10.1109/ARITH.1995.465356","DOIUrl":"https://doi.org/10.1109/ARITH.1995.465356","url":null,"abstract":"A new vector arithmetic coprocessor MIM XPA3233 with integrated PCI bus interface has been developed in CMOS VLSI technology. The chip performs dot products of vectors with components of the IEEE DOUBLE data format to full accuracy or with only one final rounding. Details on the realisation of the multiplication, accumulation and carry resolution processes are discussed. Performance data and some details about the actual VLSI realisation are presented. Software support for the coprocessor is available in the programming languages PASCAL-XSC and C-XSC or from a special C subroutine library. Programming examples are shown using PASCAL-XSC and C.<<ETX>>","PeriodicalId":332829,"journal":{"name":"Proceedings of the 12th Symposium on Computer Arithmetic","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1995-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128868144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
Proceedings of the 12th Symposium on Computer Arithmetic
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1