首页 > 最新文献

[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines最新文献

英文 中文
WASMII: a data driven computer on a virtual hardware 基于虚拟硬件的数据驱动计算机
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279481
X. Ling, H. Amano
Virtual hardware is a technique to realize a large digital circuit with a small real hardware by using an extended Field Programmable Gate Array (FPGA) technology. Several configuration RAM modules are provided inside the FPGA chip, and the configuration of the gate array can be rapidly changed by replacing the active module. Data for configuration are transferred from an off-chip backup RAM to an unused configuration RAM module. A novel computation mechanism called the WASMII, which executes a target data flow graph directly, is proposed on the basis of the virtual hardware. A WASMII chip consists of the FPGA for virtual hardware and the additional mechanism to replace configuration RAM modules in the data driven manner. Configuration data are preloaded by the order which is assigned in advance with a static scheduling preprocessor. By connecting a number of WASMII chips, a highly parallel system can be easily constructed.<>
虚拟硬件是利用扩展的现场可编程门阵列(FPGA)技术,用小的真实硬件实现大的数字电路的一种技术。FPGA芯片内部提供了多个配置RAM模块,通过更换有源模块可以快速改变门阵列的配置。配置数据从片外备份RAM传输到未使用的配置RAM模块。在虚拟硬件的基础上,提出了一种新的直接执行目标数据流图的计算机制WASMII。WASMII芯片由用于虚拟硬件的FPGA和以数据驱动方式替换配置RAM模块的附加机制组成。配置数据按静态调度预处理器预先分配的顺序预加载。通过连接多个WASMII芯片,可以很容易地构建一个高度并行的系统。
{"title":"WASMII: a data driven computer on a virtual hardware","authors":"X. Ling, H. Amano","doi":"10.1109/FPGA.1993.279481","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279481","url":null,"abstract":"Virtual hardware is a technique to realize a large digital circuit with a small real hardware by using an extended Field Programmable Gate Array (FPGA) technology. Several configuration RAM modules are provided inside the FPGA chip, and the configuration of the gate array can be rapidly changed by replacing the active module. Data for configuration are transferred from an off-chip backup RAM to an unused configuration RAM module. A novel computation mechanism called the WASMII, which executes a target data flow graph directly, is proposed on the basis of the virtual hardware. A WASMII chip consists of the FPGA for virtual hardware and the additional mechanism to replace configuration RAM modules in the data driven manner. Configuration data are preloaded by the order which is assigned in advance with a static scheduling preprocessor. By connecting a number of WASMII chips, a highly parallel system can be easily constructed.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120911204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 111
Realising massively concurrent systems on the SPACE machine 在SPACE机器上实现大规模并发系统
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279482
George Milne, Paul Cockshott, George McCaskill, P. Barrie, HardLab
Highly concurrent systems occur frequently in the physical world. This paper focuses on a class of systems characterised as being highly concurrent and which are composed out of many simple parts which interact with other parts in their locality. It discusses how to describe these systems and introduces a cellular automata type of architecture which is used to simulate these systems directly in hardware, with physical concurrency being realised by true hardware concurrency. The architecture of the SPACE machine (Scalable Parallel Architecture for Concurrency Experiments), which is constructed from reconfigurable FPGA logic, is introduced and it is demonstrated how to simulate road traffic systems using it.<>
高度并发的系统在物理世界中经常出现。本文主要研究一类具有高度并发性的系统,这些系统由许多简单的部分组成,这些部分与局部的其他部分相互作用。它讨论了如何描述这些系统,并介绍了一种元胞自动机类型的体系结构,该体系结构用于直接在硬件中模拟这些系统,通过真正的硬件并发实现物理并发。介绍了由可重构FPGA逻辑构成的SPACE机器(Scalable Parallel architecture for Concurrency Experiments)的体系结构,并演示了如何使用它来模拟道路交通系统。
{"title":"Realising massively concurrent systems on the SPACE machine","authors":"George Milne, Paul Cockshott, George McCaskill, P. Barrie, HardLab","doi":"10.1109/FPGA.1993.279482","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279482","url":null,"abstract":"Highly concurrent systems occur frequently in the physical world. This paper focuses on a class of systems characterised as being highly concurrent and which are composed out of many simple parts which interact with other parts in their locality. It discusses how to describe these systems and introduces a cellular automata type of architecture which is used to simulate these systems directly in hardware, with physical concurrency being realised by true hardware concurrency. The architecture of the SPACE machine (Scalable Parallel Architecture for Concurrency Experiments), which is constructed from reconfigurable FPGA logic, is introduced and it is demonstrated how to simulate road traffic systems using it.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125187361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A reconfigurable computer for embedded control applications 用于嵌入式控制应用的可重构计算机
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279472
H. Herpel, N. Wehn, M. Gasteier, Manfred Glesner
The authors present a custom computer together with a software environment for rapid implementation of algorithms on reprogrammable hardware. The custom computer is based on FPGA boards embedded in a programmable interconnection network. The transformation of an algorithmic system specification into a configuration file for the FPGAs is supported through a set of high-level and structural synthesis tools. Hardware and software are tuned, but not limited to the application domain of real-time control. The presented approach is not a push-button approach, it takes advantage of the experience of the designer at several stages to reach a near optimal solution.<>
作者提出了一个自定义计算机和一个软件环境,用于在可编程硬件上快速实现算法。定制计算机是基于FPGA板嵌入可编程互连网络。将算法系统规范转换为fpga的配置文件是通过一组高级和结构合成工具来支持的。硬件和软件进行了优化,但不限于实时控制的应用领域。所呈现的方法并不是一种按钮式方法,而是利用设计师在几个阶段的经验来达到接近最优的解决方案
{"title":"A reconfigurable computer for embedded control applications","authors":"H. Herpel, N. Wehn, M. Gasteier, Manfred Glesner","doi":"10.1109/FPGA.1993.279472","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279472","url":null,"abstract":"The authors present a custom computer together with a software environment for rapid implementation of algorithms on reprogrammable hardware. The custom computer is based on FPGA boards embedded in a programmable interconnection network. The transformation of an algorithmic system specification into a configuration file for the FPGAs is supported through a set of high-level and structural synthesis tools. Hardware and software are tuned, but not limited to the application domain of real-time control. The presented approach is not a push-button approach, it takes advantage of the experience of the designer at several stages to reach a near optimal solution.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128827201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Text searching on Splash 2 文字搜索的飞溅2
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279466
V. Daniel, Pryor, Thistle, Nabeel Shirazi
The paper proposes a flexible, reprogrammable hardware solution to the acceleration of text-based keyword search problems. In these problems, a stream of input text is checked against a known list of keywords (a dictionary) for occurrences of those keywords in the text. The authors' solution employs an attached processor called Splash 2, which exploits the speed and reconfigurability of field programmable gate array technology. The Splash 2 system was designed and built at the SRC for a wide variety of applications. A Splash 2 system is comprised of an interface board to a Sun Sparc-2 host and up to 16 Splash boards, each of which contains 16 Xilinx 4010 FPGAs interconnected in a linear array and also through a 16-way full crossbar switch. Each Xilinx chip is coupled with a 4 Mbit static RAM through a dedicated interface. The text searching program implemented on a one-board Splash 2 system is capable of processing text at an estimated rate of 50 million characters per second.<>
本文提出了一种灵活的、可重新编程的硬件解决方案来加速基于文本的关键字搜索问题。在这些问题中,输入文本流将根据已知关键字列表(字典)检查这些关键字在文本中的出现情况。作者的解决方案采用了一个名为Splash 2的附加处理器,它利用了现场可编程门阵列技术的速度和可重构性。Splash 2系统是在SRC设计和制造的,适用于各种应用。一个Splash 2系统由一个连接Sun Sparc-2主机的接口板和多达16个Splash板组成,每个Splash板包含16个Xilinx 4010 fpga,以线性阵列连接,也通过16路全交叉开关连接。每个赛灵思芯片通过专用接口与一个4mbit静态RAM耦合。在单板Splash 2系统上实现的文本搜索程序能够以每秒5000万个字符的估计速率处理文本。
{"title":"Text searching on Splash 2","authors":"V. Daniel, Pryor, Thistle, Nabeel Shirazi","doi":"10.1109/FPGA.1993.279466","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279466","url":null,"abstract":"The paper proposes a flexible, reprogrammable hardware solution to the acceleration of text-based keyword search problems. In these problems, a stream of input text is checked against a known list of keywords (a dictionary) for occurrences of those keywords in the text. The authors' solution employs an attached processor called Splash 2, which exploits the speed and reconfigurability of field programmable gate array technology. The Splash 2 system was designed and built at the SRC for a wide variety of applications. A Splash 2 system is comprised of an interface board to a Sun Sparc-2 host and up to 16 Splash boards, each of which contains 16 Xilinx 4010 FPGAs interconnected in a linear array and also through a 16-way full crossbar switch. Each Xilinx chip is coupled with a 4 Mbit static RAM through a dedicated interface. The text searching program implemented on a one-board Splash 2 system is capable of processing text at an estimated rate of 50 million characters per second.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132982862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Searching genetic databases on Splash 2 在Splash 2上搜索基因数据库
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279464
Dzung T. Hoang
The author describes two systolic arrays for computing the edit distance between two genetic sequences using a well-known dynamic programming algorithm. The systolic arrays have been implemented for the Splash 2 programmable logic array and are intended to be used for database searching. Simulations indicate that the faster Splash 2 implementation can search a database at a rate of 12 million characters per second, several orders of magnitude faster than implementations of the dynamic programming algorithm on conventional computers.<>
作者描述了两个收缩阵列,用于计算两个基因序列之间的编辑距离,使用一个著名的动态规划算法。收缩阵列已经为Splash 2可编程逻辑阵列实现,并打算用于数据库搜索。模拟表明,更快的Splash 2实现可以以每秒1200万个字符的速度搜索数据库,比传统计算机上的动态规划算法实现快几个数量级。
{"title":"Searching genetic databases on Splash 2","authors":"Dzung T. Hoang","doi":"10.1109/FPGA.1993.279464","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279464","url":null,"abstract":"The author describes two systolic arrays for computing the edit distance between two genetic sequences using a well-known dynamic programming algorithm. The systolic arrays have been implemented for the Splash 2 programmable logic array and are intended to be used for database searching. Simulations indicate that the faster Splash 2 implementation can search a database at a rate of 12 million characters per second, several orders of magnitude faster than implementations of the dynamic programming algorithm on conventional computers.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"32 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131956731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 222
A self-reconfigurable processor 一个自重构的处理器
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279479
P. French, R. Taylor
Recent developments in the design and fabrication of field programmable logic devices (FPGAs) may well change the way in which one designs and fabricates conventional microprocessors. The use of uncommitted logic whose function may be modified at run time makes the prospect of dynamic application specific integrated circuits closer to reality than ever before. Much of the work to date on reconfigurable logic has focussed on its application in co-processor and 'glue' roles. This paper discusses how complete processors might be fabricated with a minimum of 'fixed' or static logic. It is shown that in order to exploit FPGAs, a processor that is radically different from conventional architectures is required. The paper concludes by considering what evolutions of current logic families would favour this type of application.<>
现场可编程逻辑器件(fpga)的设计和制造的最新发展可能会改变人们设计和制造传统微处理器的方式。可在运行时修改其功能的未提交逻辑的使用使得动态应用专用集成电路的前景比以往任何时候都更接近现实。迄今为止,关于可重构逻辑的大部分工作都集中在其在协处理器和“胶水”角色中的应用上。本文讨论了如何用最小的“固定”或静态逻辑构造完整的处理器。结果表明,为了利用fpga,需要一种与传统架构完全不同的处理器。本文最后考虑了当前逻辑族的哪些演进将有利于这种类型的应用。
{"title":"A self-reconfigurable processor","authors":"P. French, R. Taylor","doi":"10.1109/FPGA.1993.279479","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279479","url":null,"abstract":"Recent developments in the design and fabrication of field programmable logic devices (FPGAs) may well change the way in which one designs and fabricates conventional microprocessors. The use of uncommitted logic whose function may be modified at run time makes the prospect of dynamic application specific integrated circuits closer to reality than ever before. Much of the work to date on reconfigurable logic has focussed on its application in co-processor and 'glue' roles. This paper discusses how complete processors might be fabricated with a minimum of 'fixed' or static logic. It is shown that in order to exploit FPGAs, a processor that is radically different from conventional architectures is required. The paper concludes by considering what evolutions of current logic families would favour this type of application.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122512020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A data-parallel programming model for reconfigurable architectures 面向可重构体系结构的数据并行编程模型
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279476
S. Guccione, Mario J. Gonzalez
Recently, several machines have been built using field programmable gate array (FPGA) technology. These reconfigurable architectures have demonstrated very high performance for a variety of problems. The configuration of these machines typically rely on some form of hardware specification. The authors demonstrate that a more traditional software approach may be used. A vector based data-parallel model and its mapping to a reconfigurable architecture are introduced. Included in the model are parallel prefix or scan operators. The language supporting this model is a subset of the C programming language.<>
最近,一些机器已经使用现场可编程门阵列(FPGA)技术建造。这些可重构的体系结构已经在各种问题上证明了非常高的性能。这些机器的配置通常依赖于某种形式的硬件规范。作者论证了一种更传统的软件方法。介绍了一种基于矢量的数据并行模型及其到可重构体系结构的映射。模型中包括并行前缀或扫描操作符。支持该模型的语言是C编程语言的一个子集。
{"title":"A data-parallel programming model for reconfigurable architectures","authors":"S. Guccione, Mario J. Gonzalez","doi":"10.1109/FPGA.1993.279476","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279476","url":null,"abstract":"Recently, several machines have been built using field programmable gate array (FPGA) technology. These reconfigurable architectures have demonstrated very high performance for a variety of problems. The configuration of these machines typically rely on some form of hardware specification. The authors demonstrate that a more traditional software approach may be used. A vector based data-parallel model and its mapping to a reconfigurable architecture are introduced. Included in the model are parallel prefix or scan operators. The language supporting this model is a subset of the C programming language.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128489243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
A digit-recurrence square root implementation for field programmable gate arrays 现场可编程门阵列的数字递归平方根实现
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279465
M. E. Louie, M. Ercegovac
Creating efficient arithmetic processors requires a pairing of high speed arithmetic algorithms with optimal mapping strategies for a given technology. The authors propose bit reduction as key to an efficient pairing process for lookup table based field programmable gate arrays (FPGAs). Bit reduction simplifies the functions defining the original algorithm, thus permitting a mapping to fewer blocks and reducing the overall throughput delay. A mapping of a digit-recurrence square root algorithm to the Xilinx XC4010 FPGA illustrates the bit reduction process.<>
创建高效的算术处理器需要将高速算术算法与给定技术的最优映射策略配对。作者提出比特减少是基于查找表的现场可编程门阵列(fpga)的有效配对过程的关键。位减少简化了定义原始算法的函数,从而允许映射到更少的块并减少总体吞吐量延迟。一个数字递归平方根算法到Xilinx XC4010 FPGA的映射说明了比特缩减过程
{"title":"A digit-recurrence square root implementation for field programmable gate arrays","authors":"M. E. Louie, M. Ercegovac","doi":"10.1109/FPGA.1993.279465","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279465","url":null,"abstract":"Creating efficient arithmetic processors requires a pairing of high speed arithmetic algorithms with optimal mapping strategies for a given technology. The authors propose bit reduction as key to an efficient pairing process for lookup table based field programmable gate arrays (FPGAs). Bit reduction simplifies the functions defining the original algorithm, thus permitting a mapping to fewer blocks and reducing the overall throughput delay. A mapping of a digit-recurrence square root algorithm to the Xilinx XC4010 FPGA illustrates the bit reduction process.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132055085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Reconfigurable multi-bit processor for DSP applications in statistical physics 用于统计物理中DSP应用的可重构多比特处理器
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279473
S. Monaghan, C. Cowen
A PC-AT hosted DSP processor architecture implemented in SRAM-based field programmable gate arrays (FPGA) and static memories is described. Despite its simplicity, the processor circuits can be reconfigured under software control to tackle a class of multi-bit 'pixel' processing problems of current interest in the statistical physics of disordered materials, thereby offering some of the problem flexibility of a general purpose processor and the performance of custom hardware. The flexibility offered by the FPGA implementation is discussed in detail as is a particular application of the processor (to disordered superconductors). The performance of the processor is shown to compare well with similarly costing commercial DSP hardware. The low cost of the processor means it can be replicated to obtain dedicated supercomputer performance.<>
描述了一种基于sram的现场可编程门阵列(FPGA)和静态存储器的PC-AT托管DSP处理器架构。尽管它很简单,但处理器电路可以在软件控制下重新配置,以解决当前在无序材料的统计物理中感兴趣的一类多比特“像素”处理问题,从而提供通用处理器的一些问题灵活性和定制硬件的性能。详细讨论了FPGA实现所提供的灵活性,以及处理器(无序超导体)的特定应用。该处理器的性能与类似成本的商用DSP硬件相比较。处理器的低成本意味着它可以被复制以获得专用的超级计算机性能。
{"title":"Reconfigurable multi-bit processor for DSP applications in statistical physics","authors":"S. Monaghan, C. Cowen","doi":"10.1109/FPGA.1993.279473","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279473","url":null,"abstract":"A PC-AT hosted DSP processor architecture implemented in SRAM-based field programmable gate arrays (FPGA) and static memories is described. Despite its simplicity, the processor circuits can be reconfigured under software control to tackle a class of multi-bit 'pixel' processing problems of current interest in the statistical physics of disordered materials, thereby offering some of the problem flexibility of a general purpose processor and the performance of custom hardware. The flexibility offered by the FPGA implementation is discussed in detail as is a particular application of the processor (to disordered superconductors). The performance of the processor is shown to compare well with similarly costing commercial DSP hardware. The low cost of the processor means it can be replicated to obtain dedicated supercomputer performance.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132092487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A stochastic neural architecture that exploits dynamically reconfigurable FPGAs 利用动态可重构fpga的随机神经结构
Pub Date : 1993-04-05 DOI: 10.1109/FPGA.1993.279462
M. V. Daalen, P. Jeavons, J. Shawe-Taylor
The authors present an expandable digital architecture that provides an efficient real time implementation platform for large neural networks. The architecture makes heavy use of the techniques of bit serial stochastic computing to carry out the large number of required parallel synaptic calculations. In this design all real valued quantities are encoded on to stochastic bit streams in which the '1' density is proportional to the given quantity. The actual digital circuitry is simple and highly regular thus allowing very efficient space usage of fine grained FPGAs. Another feature of the design is that the large number of weights required by a neural network are generated by circuitry tailored to each of their specific values, thus saving valuable cells. Whenever one of these values is required to change, the appropriate circuitry must be dynamically reconfigured. This may always be achieved in a fixed and minimum number of cells for a given bit stream resolution.<>
作者提出了一个可扩展的数字架构,为大型神经网络提供了一个高效的实时实现平台。该体系结构大量使用比特串行随机计算技术来进行大量所需的并行突触计算。在这个设计中,所有实数值都被编码到随机比特流中,其中“1”的密度与给定的数量成正比。实际的数字电路是简单和高度规则,从而允许非常有效的空间使用细粒度fpga。该设计的另一个特点是,神经网络所需的大量权重是由根据每个特定值定制的电路产生的,从而节省了宝贵的细胞。每当需要改变这些值中的一个时,必须动态地重新配置适当的电路。对于给定的位流分辨率,这可以通过固定的最小单元数来实现。
{"title":"A stochastic neural architecture that exploits dynamically reconfigurable FPGAs","authors":"M. V. Daalen, P. Jeavons, J. Shawe-Taylor","doi":"10.1109/FPGA.1993.279462","DOIUrl":"https://doi.org/10.1109/FPGA.1993.279462","url":null,"abstract":"The authors present an expandable digital architecture that provides an efficient real time implementation platform for large neural networks. The architecture makes heavy use of the techniques of bit serial stochastic computing to carry out the large number of required parallel synaptic calculations. In this design all real valued quantities are encoded on to stochastic bit streams in which the '1' density is proportional to the given quantity. The actual digital circuitry is simple and highly regular thus allowing very efficient space usage of fine grained FPGAs. Another feature of the design is that the large number of weights required by a neural network are generated by circuitry tailored to each of their specific values, thus saving valuable cells. Whenever one of these values is required to change, the appropriate circuitry must be dynamically reconfigured. This may always be achieved in a fixed and minimum number of cells for a given bit stream resolution.<<ETX>>","PeriodicalId":104383,"journal":{"name":"[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132992818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
期刊
[1993] Proceedings IEEE Workshop on FPGAs for Custom Computing Machines
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1