首页 > 最新文献

Proceedings of IEEE 11th Symposium on Computer Arithmetic最新文献

英文 中文
Efficient complex matrix transformations with CORDIC 用CORDIC进行有效的复矩阵变换
Pub Date : 1993-06-20 DOI: 10.1109/ARITH.1993.378101
N. D. Hemkumar, Joseph R. Cavallaro
A two-sided unitary transformation (Q transformation) structured to permit integrated evaluation and application using CORDIC primitives is introduced. The Q transformation is shown to be useful as an atomic operation in parallel arrays for computing the eigenvalue/singular value decomposition of Hermitian/arbitrary matrices, and three specific Q transformations that are needed in such arrays are identified. Issues related to the use of CORDIC for complex arithmetic are addressed, and implementations in both conventional (nonredundant) CORDIC and redundant and online modifications to CORDIC are described. If the time to compute a CORDIC operation in nonredundant CORDIC is T/sub c/, the Q transformations identified here can be evaluated and/or applied in 2T/sub c/ using four CORDIC modules for maximum concurrency. In either case, 0.5 T/sub c/ is required to account for scale factor correction. It is shown that a Q transformation can be evaluated and/or applied in /spl ap/10n, where n is the desired bit-precision.<>
介绍了一种允许使用CORDIC原语进行综合评价和应用的双边酉变换(Q变换)。对于计算厄米矩阵/任意矩阵的特征值/奇异值分解,Q变换作为并行数组中的原子运算是有用的,并且确定了在这种数组中需要的三个特定的Q变换。讨论了与使用CORDIC进行复杂算法相关的问题,并描述了传统(非冗余)CORDIC以及对CORDIC进行冗余和在线修改的实现。如果在非冗余的CORDIC中计算CORDIC操作的时间为T/sub / c/,则可以使用四个CORDIC模块在2T/sub / c/中评估和/或应用这里确定的Q转换,以获得最大的并发性。在任何一种情况下,都需要0.5 T/sub c/来考虑尺度因子校正。证明了Q变换可以在/spl / ap/10n中求值和/或应用,其中n为所需的位精度
{"title":"Efficient complex matrix transformations with CORDIC","authors":"N. D. Hemkumar, Joseph R. Cavallaro","doi":"10.1109/ARITH.1993.378101","DOIUrl":"https://doi.org/10.1109/ARITH.1993.378101","url":null,"abstract":"A two-sided unitary transformation (Q transformation) structured to permit integrated evaluation and application using CORDIC primitives is introduced. The Q transformation is shown to be useful as an atomic operation in parallel arrays for computing the eigenvalue/singular value decomposition of Hermitian/arbitrary matrices, and three specific Q transformations that are needed in such arrays are identified. Issues related to the use of CORDIC for complex arithmetic are addressed, and implementations in both conventional (nonredundant) CORDIC and redundant and online modifications to CORDIC are described. If the time to compute a CORDIC operation in nonredundant CORDIC is T/sub c/, the Q transformations identified here can be evaluated and/or applied in 2T/sub c/ using four CORDIC modules for maximum concurrency. In either case, 0.5 T/sub c/ is required to account for scale factor correction. It is shown that a Q transformation can be evaluated and/or applied in /spl ap/10n, where n is the desired bit-precision.<<ETX>>","PeriodicalId":414758,"journal":{"name":"Proceedings of IEEE 11th Symposium on Computer Arithmetic","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132438583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Estimating the power consumption of CMOS adders CMOS加法器的功耗估算
Pub Date : 1900-01-01 DOI: 10.1109/ARITH.1993.378090
T. K. Callaway, E. Swartzlander
Six types of adders are examined in an attempt to model their power dissipation. It is shown that the use of a relatively simple model provides results that are qualitatively accurate, when compared to more sophisticated models and to physical implementations of the circuits. The main discrepancy between the simple model and the physical measurements seems to be the assumption that all gates will consume the same amount of power when they switch, regardless of their fan-in or fanout. Because the carry lookahead adder has several gates with a fan out and fan-in higher than two, the simple model underestimates its power dissipation.<>
本文研究了六种类型的加法器,试图对其功耗进行建模。结果表明,与更复杂的模型和电路的物理实现相比,使用相对简单的模型提供了定性准确的结果。简单模型和物理测量之间的主要差异似乎是假设所有门在开关时消耗相同的功率,而不管它们是风扇输入还是风扇输出。由于进位前瞻加法器有多个扇出和扇入大于2的门,因此简单模型低估了其功耗。
{"title":"Estimating the power consumption of CMOS adders","authors":"T. K. Callaway, E. Swartzlander","doi":"10.1109/ARITH.1993.378090","DOIUrl":"https://doi.org/10.1109/ARITH.1993.378090","url":null,"abstract":"Six types of adders are examined in an attempt to model their power dissipation. It is shown that the use of a relatively simple model provides results that are qualitatively accurate, when compared to more sophisticated models and to physical implementations of the circuits. The main discrepancy between the simple model and the physical measurements seems to be the assumption that all gates will consume the same amount of power when they switch, regardless of their fan-in or fanout. Because the carry lookahead adder has several gates with a fan out and fan-in higher than two, the simple model underestimates its power dissipation.<<ETX>>","PeriodicalId":414758,"journal":{"name":"Proceedings of IEEE 11th Symposium on Computer Arithmetic","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115116208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
The design of a 64-bit integer multiplier/divider unit 64位整数乘数/除法单元的设计
Pub Date : 1900-01-01 DOI: 10.1109/ARITH.1993.378095
David Eisig, Josh Rotstain, I. Koren
The highlights of the design of an integer multiplier/divider unit for a 64-b processor are presented. The final design is the result of a compromise between performance, complexity, and transistor count. It is optimized for two specific operations with the same hardware being shared by the remaining operations. Thus, for example, the multiplier can be configured for the execution of several different multiply operations and its hardware is also heavily utilized in division. The divider design is optimized for repetitive division by small numbers, since this is a characteristic of several important applications planned for the processor. For such small divisors, the reciprocal is calculated and stored in a content-addressable memory. The stored reciprocals can then be used to generate quotients through fast multiplication. Simulations of the planned applications show a 20% to 30% performance increase over alternative designs.<>
介绍了64-b处理器整数乘法器/除法器的设计要点。最终的设计是性能、复杂性和晶体管数量之间折衷的结果。它针对两个特定的操作进行了优化,其余操作共享相同的硬件。因此,例如,乘数可以配置为执行几种不同的乘法操作,其硬件也在除法中得到大量利用。除法器设计针对小数的重复除法进行了优化,因为这是为处理器计划的几个重要应用程序的特征。对于这样小的除数,计算其倒数并将其存储在内容可寻址存储器中。然后可以使用存储的倒数通过快速乘法生成商。计划应用的模拟表明,与其他设计相比,性能提高了20%至30%。
{"title":"The design of a 64-bit integer multiplier/divider unit","authors":"David Eisig, Josh Rotstain, I. Koren","doi":"10.1109/ARITH.1993.378095","DOIUrl":"https://doi.org/10.1109/ARITH.1993.378095","url":null,"abstract":"The highlights of the design of an integer multiplier/divider unit for a 64-b processor are presented. The final design is the result of a compromise between performance, complexity, and transistor count. It is optimized for two specific operations with the same hardware being shared by the remaining operations. Thus, for example, the multiplier can be configured for the execution of several different multiply operations and its hardware is also heavily utilized in division. The divider design is optimized for repetitive division by small numbers, since this is a characteristic of several important applications planned for the processor. For such small divisors, the reciprocal is calculated and stored in a content-addressable memory. The stored reciprocals can then be used to generate quotients through fast multiplication. Simulations of the planned applications show a 20% to 30% performance increase over alternative designs.<<ETX>>","PeriodicalId":414758,"journal":{"name":"Proceedings of IEEE 11th Symposium on Computer Arithmetic","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124784074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
New algorithms and VLSI architectures for SRT division and square root SRT除法和平方根的新算法和VLSI架构
Pub Date : 1900-01-01 DOI: 10.1109/ARITH.1993.378106
S. E. McQuillan, J. McCanny, R. Hamill
Radix two algorithms for SRT division and square-rooting are developed. For these schemes, the result digits and the residuals are computed concurrently and the computations in adjacent rows are overlapped. Consequently, their performance should exceed that of the radix 2 SRT methods. VLSI array architectures for implementing the new division and square-rooting methods are also presented.<>
提出了SRT除法和平方根的两种根号算法。对于这些方案,结果数和残差同时计算,相邻行的计算是重叠的。因此,它们的性能应该超过基数为2的SRT方法。本文还介绍了用于实现新的除法和平方根方法的VLSI阵列结构。
{"title":"New algorithms and VLSI architectures for SRT division and square root","authors":"S. E. McQuillan, J. McCanny, R. Hamill","doi":"10.1109/ARITH.1993.378106","DOIUrl":"https://doi.org/10.1109/ARITH.1993.378106","url":null,"abstract":"Radix two algorithms for SRT division and square-rooting are developed. For these schemes, the result digits and the residuals are computed concurrently and the computations in adjacent rows are overlapped. Consequently, their performance should exceed that of the radix 2 SRT methods. VLSI array architectures for implementing the new division and square-rooting methods are also presented.<<ETX>>","PeriodicalId":414758,"journal":{"name":"Proceedings of IEEE 11th Symposium on Computer Arithmetic","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130379410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
期刊
Proceedings of IEEE 11th Symposium on Computer Arithmetic
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1