Revisions to the IEEE 754 standard for floating-point arithmetic

E. Schwarz
{"title":"Revisions to the IEEE 754 standard for floating-point arithmetic","authors":"E. Schwarz","doi":"10.1109/ARITH.2003.1207667","DOIUrl":null,"url":null,"abstract":"Almost twenty years ago the IEEE 754 binary floating-point standard was adopted. Since then almost every microprocessor as well as many programming languages have defined the floating-point arithmetic to be IEEE 754 compliant. From the many years experience in implementing the standard in hardware and writing floating-point programs, there have been numerous suggestions for revisions. All IEEE standards must undergo a review process every 5 years or be dropped as an active standard. For past reviews this standard was extended without much discussion. But finally in January 2001 an in-depth review was started. A committee was formed and over the past two years many revisions have been evaluated. The most extensive change to the standard is to adopt formats for decimal floating-point arithmetic. This proposal creates decimal floating-point data formats for 32, 64, and 128 bits. Decimal floating-point arithmetic provides an exact representation of displayed numbers and provides a precise round at decimal radix point. This type of arithmetic is required in financial calculations. Some experts argue that decimal will replace binary due to its ability to represent decimal numbers exactly, while others think that binary will remain the key floating-point format due to its speed of execution and its more regular spacing of intervals. Another once controversial proposal is the addition of fused multiply-add. This operation only causes one rounding error, while in most implementations, provides twice the performance of separate operations. Other additions to the standard include a quadword format and many predicate functions such as comparison operators like greater than. Also operators for maximum and minimum have been accepted that after hours of arguing now favor a numeric result over a NaN. There are also deletions such as the single extended and double extended formats. And there are some items that are deleted in one meeting and resurrected in the following meeting such as signaling NaNs. Over the past two years of committee review there has been many proposals discussed. This panel discussion will enlighten the audience to the additions, deletions, and some of the current controversial proposals. The panel will consist of : • David Hough, Sun Microsystems, Editor of the Standard – Overview • Mike Cowlishaw, IBM Corp., Decimal Floating-Point Software Advocate • David Bailey, Lawrence Berkeley Lab., Quadword Precision Advocate • David Matula, Southern Methodist University, Academics / Industry Consultant • Eric Schwarz, IBM Corp., Decimal Floating-Point Hardware – Panel Chair","PeriodicalId":399928,"journal":{"name":"Proceedings 2003 16th IEEE Symposium on Computer Arithmetic","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2003 16th IEEE Symposium on Computer Arithmetic","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ARITH.2003.1207667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Almost twenty years ago the IEEE 754 binary floating-point standard was adopted. Since then almost every microprocessor as well as many programming languages have defined the floating-point arithmetic to be IEEE 754 compliant. From the many years experience in implementing the standard in hardware and writing floating-point programs, there have been numerous suggestions for revisions. All IEEE standards must undergo a review process every 5 years or be dropped as an active standard. For past reviews this standard was extended without much discussion. But finally in January 2001 an in-depth review was started. A committee was formed and over the past two years many revisions have been evaluated. The most extensive change to the standard is to adopt formats for decimal floating-point arithmetic. This proposal creates decimal floating-point data formats for 32, 64, and 128 bits. Decimal floating-point arithmetic provides an exact representation of displayed numbers and provides a precise round at decimal radix point. This type of arithmetic is required in financial calculations. Some experts argue that decimal will replace binary due to its ability to represent decimal numbers exactly, while others think that binary will remain the key floating-point format due to its speed of execution and its more regular spacing of intervals. Another once controversial proposal is the addition of fused multiply-add. This operation only causes one rounding error, while in most implementations, provides twice the performance of separate operations. Other additions to the standard include a quadword format and many predicate functions such as comparison operators like greater than. Also operators for maximum and minimum have been accepted that after hours of arguing now favor a numeric result over a NaN. There are also deletions such as the single extended and double extended formats. And there are some items that are deleted in one meeting and resurrected in the following meeting such as signaling NaNs. Over the past two years of committee review there has been many proposals discussed. This panel discussion will enlighten the audience to the additions, deletions, and some of the current controversial proposals. The panel will consist of : • David Hough, Sun Microsystems, Editor of the Standard – Overview • Mike Cowlishaw, IBM Corp., Decimal Floating-Point Software Advocate • David Bailey, Lawrence Berkeley Lab., Quadword Precision Advocate • David Matula, Southern Methodist University, Academics / Industry Consultant • Eric Schwarz, IBM Corp., Decimal Floating-Point Hardware – Panel Chair
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
对浮点运算的IEEE 754标准的修订
大约20年前,IEEE 754二进制浮点标准被采用。从那时起,几乎所有的微处理器以及许多编程语言都定义了符合IEEE 754标准的浮点运算。根据多年来在硬件上实现该标准和编写浮点程序的经验,有许多修改建议。所有IEEE标准必须每5年进行一次审查,否则将不再是现行标准。对于过去的审查,这个标准在没有太多讨论的情况下进行了扩展。但最终在2001年1月,一项深入的审查开始了。成立了一个委员会,并在过去两年中对许多修订进行了评估。该标准最广泛的变化是采用了十进制浮点运算的格式。该建议创建32位、64位和128位的十进制浮点数据格式。十进制浮点运算提供了显示数字的精确表示,并提供了小数点的精确舍入。这种算式在财务计算中是必需的。一些专家认为,十进制将取代二进制,因为它能够精确地表示十进制数,而另一些专家则认为,由于其执行速度和更规则的间隔间隔,二进制仍将是关键的浮点格式。另一个曾经有争议的提议是加入融合乘加。此操作只会导致一次舍入错误,而在大多数实现中,提供的性能是单独操作的两倍。标准中增加的其他内容包括四字格式和许多谓词函数,例如比较运算符,如大于。此外,经过数小时的争论,最大值和最小值的运算符现在更倾向于数字结果而不是NaN。也有删除,如单扩展和双扩展格式。还有一些项目在一次会议中被删除,而在下一次会议中重新出现,例如信令nan。在过去两年的委员会审查中,讨论了许多提案。这个小组讨论将启发观众对增加,删除,和一些目前有争议的建议。该小组将包括:•David Hough, Sun微系统公司,标准概述编辑•Mike Cowlishaw, IBM公司,十进制浮点软件倡导者•David Bailey,劳伦斯伯克利实验室。•David Matula,南卫理公会大学,学术/行业顾问•Eric Schwarz, IBM公司,十进制浮点硬件小组主席
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Worst cases and lattice reduction Representable correcting terms for possibly underflowing floating point operations The case for a redundant format in floating point arithmetic A less recursive variant of Karatsuba-Ofman algorithm for multiplying operands of size a power of two Revisiting SRT quotient digit selection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1