2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)最新文献

英文中文

Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs 硬浮点和dsp fpga的单精度自然对数体系结构

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)

Pub Date : 2016-07-01 DOI: 10.1109/ARITH.2016.20

M. Langhammer, B. Pasca

In this paper we will present a novel method for implementing floating point (FP) elementary functions using the new FP single precision addition and multiplication features of the Altera Arria~10 DSP Block architecture. Our application example will use log(x), one of the most commonly required functions for emerging datacenter and computing FPGA targets. We will explain why the combination of new FPGA technology, and at the same time, a massive increase in computing performance requirement, fuels the need for this work. We show a comprehensive error analysis, both for the overall function, and each subsection of the architecture, demonstrating that the hard FP (HFP) Blocks, in conjunction with the traditional flexibility and connectivity of the FPGA, can provide a robust and high performance solution. These methods create a highly accurate single precision IEEE754 function, which is OpenCL conformant. Our methods map directly to almost exclusively embedded structures, and therefore result in significant reduction in logic resources and routing stress compared to current methods, and demonstrate that newly introduced FPGA routing architectures can be leveraged to use almost no soft resources. We also show that the latency of the log(x) function can be changed independently of the architecture and function, allowing the performance of the function to be adjusted directly to the system clock rate.

本文将提出一种利用Altera Arria~10 DSP Block架构的浮点单精度加法和乘法特性来实现浮点(FP)初等函数的新方法。我们的应用程序示例将使用log(x)，这是新兴数据中心和计算FPGA目标最常用的函数之一。我们将解释为什么新的FPGA技术的结合，同时，计算性能要求的大量增加，推动了这项工作的需要。我们展示了对整体功能和架构的每个小节的全面误差分析，表明硬FP (HFP)块与FPGA的传统灵活性和连接性相结合，可以提供鲁棒性和高性能的解决方案。这些方法创建了一个高度精确的单精度IEEE754函数，它符合OpenCL。我们的方法直接映射到几乎完全嵌入的结构，因此与目前的方法相比，可以显著减少逻辑资源和路由压力，并证明新引入的FPGA路由架构可以利用几乎不使用软资源。我们还表明，log(x)函数的延迟可以独立于体系结构和功能而改变，从而允许将函数的性能直接调整为系统时钟速率。

{"title":"Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs","authors":"M. Langhammer, B. Pasca","doi":"10.1109/ARITH.2016.20","DOIUrl":"https://doi.org/10.1109/ARITH.2016.20","url":null,"abstract":"In this paper we will present a novel method for implementing floating point (FP) elementary functions using the new FP single precision addition and multiplication features of the Altera Arria~10 DSP Block architecture. Our application example will use log(x), one of the most commonly required functions for emerging datacenter and computing FPGA targets. We will explain why the combination of new FPGA technology, and at the same time, a massive increase in computing performance requirement, fuels the need for this work. We show a comprehensive error analysis, both for the overall function, and each subsection of the architecture, demonstrating that the hard FP (HFP) Blocks, in conjunction with the traditional flexibility and connectivity of the FPGA, can provide a robust and high performance solution. These methods create a highly accurate single precision IEEE754 function, which is OpenCL conformant. Our methods map directly to almost exclusively embedded structures, and therefore result in significant reduction in logic resources and routing stress compared to current methods, and demonstrate that newly introduced FPGA routing architectures can be leveraged to use almost no soft resources. We also show that the latency of the log(x) function can be changed independently of the architecture and function, allowing the performance of the function to be adjusted directly to the system clock rate.","PeriodicalId":145448,"journal":{"name":"2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124490580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Verificarlo: Checking Floating Point Accuracy through Monte Carlo Arithmetic Verificarlo:通过蒙特卡罗算法检查浮点精度

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)

Pub Date : 2015-09-04 DOI: 10.1109/ARITH.2016.31

C. Denis, P. D. O. Castro, E. Petit

Numerical accuracy of floating point computation is a well studied topic which has not made its way to the end-user in scientific computing. Yet, it has become a critical issue with the recent requirements for code modernization to harness new highly parallel hardware and perform higher resolution computation. To democratize numerical accuracy analysis, it is important to propose tools and methodologies to study large use cases in a reliable and automatic way. In this paper, we propose verificarlo, an extension to the LLVM compiler to automatically use Monte Carlo Arithmetic in a transparent way for the end-user. It supports all the major languages including C, C++, and Fortran. Unlike source-to-source approaches, our implementation captures the influence of compiler optimizations on the numerical accuracy. We illustrate how Monte Carlo Arithmetic using the verificarlo tool outperforms the existing approaches on various use cases and is a step toward automatic numerical analysis.

浮点计算的数值精度是一个研究得很好的课题，但在科学计算中尚未得到最终应用。然而，利用新的高度并行硬件和执行更高分辨率计算的代码现代化需求已经成为一个关键问题。为了使数值精度分析大众化，重要的是提出以可靠和自动的方式研究大型用例的工具和方法。在本文中，我们提出了verificarlo，它是LLVM编译器的一个扩展，可以为最终用户以透明的方式自动使用蒙特卡洛算法。它支持所有主要语言，包括C、c++和Fortran。与源到源的方法不同，我们的实现捕获了编译器优化对数值精度的影响。我们说明了使用verificarlo工具的蒙特卡罗算法如何在各种用例中优于现有方法，并且是迈向自动数值分析的一步。

引用次数: 56

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀