Matrix engine for signal processing applications using the logarithmic number system

Proceedings IEEE International Conference on Application- Specific Systems, Architectures, and Processors Pub Date : 2002-07-17 DOI:10.1109/ASAP.2002.1030730

E. Chester, J. N. Coleman

引用次数: 19

Abstract

An architecture design is presented for a device based upon the logarithmic number system (LNS) that is capable of performing general matrix and complex arithmetic, with features useful for DSP system-on-chip applications. A modified LNS addition/subtraction unit is employed in multiple execution units to achieve a maximum single-precision floating-point (FP) equivalent throughput of 3.2 Gflop/s at a clock frequency of 200 MHz. Each execution unit is capable of computing functions of the form (ab + cd)/sup e/ for e /spl isin/ {/spl plusmn/0.5, /spl plusmn/1, /spl plusmn/2} in a 5-stage arithmetic pipeline and returning a result every cycle, yielding a considerable per-cycle improvement over both floating- and fixed-point systems. Comparisons with existing devices and a single floating-point unit are given.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

矩阵引擎用于信号处理应用，使用对数数制

提出了一种基于对数系统(LNS)的器件体系结构设计，该器件能够执行一般矩阵和复杂运算，并具有DSP片上系统应用的特点。在多个执行单元中采用改进的LNS加减单元，在时钟频率为200mhz时，最大单精度浮点吞吐量可达3.2 Gflop/s。每个执行单元都能够在一个5阶段的算术管道中计算形式为(ab + cd)/sup /的函数(对于e/ spl isin/ {/spl plusmn/0.5， /spl plusmn/1， /spl plusmn/2}的函数，并在每个周期返回一个结果，与浮点和浮点系统相比，每个周期都有相当大的改进。给出了与现有器件和单个浮点单元的比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊