Virtual wires: overcoming pin limitations in FPGA-based logic emulators
J. Babb, R. Tessier, A. Agarwal
Pub Date: 1993-11-01 | DOI: 10.1109/FPGA.1993.279469
Existing FPGA-based logic emulators use only a fraction of the potential communication bandwidth because they dedicate each FPGA pin (physical wire) to a single emulated signal (logical wire). Virtual wires overcome pin limitations by intelligently multiplexing each physical wire among multiple logical wires and pipelining these connections at the maximum clocking frequency of the FPGA. A virtual wire represents a connection from a logical output on one FPGA to a logical input on another FPGA. Virtual wires not only increase usable bandwidth but also relax the absolute limits imposed on gate utilization. The resulting improvement in bandwidth reduces the need for global interconnect, allowing effective use of low-dimension inter-chip connections (such as nearest-neighbor). Nearest-neighbor topologies, coupled with the ability of virtual wires to overlap communication with computation, can even improve emulation speed. The authors present the concept of virtual wires and describe their first implementation, a 'softwire' compiler that uses static routing and relies on minimal hardware support. Results from compiling netlists for the 18K-gate Sparcle microprocessor and the 86K-gate Alewife Communications and Cache Controller indicate that virtual wires can increase FPGA gate utilization beyond 80 percent without a significant slowdown in emulation speed.
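The multiplexing idea in this abstract can be pictured in software. The sketch below (illustrative only, not from the paper; all names are hypothetical) serializes several logical wires onto one physical pin over a sequence of fast clock phases, then recovers them on the receiving side:

```python
# Illustrative sketch of virtual wires: several logical wires share one
# physical pin by time-multiplexing over fast "virtual" clock phases.

def multiplex(logical_values, phases):
    """Serialize the logical wires onto one pin over `phases` fast
    clock phases (phases >= number of wires)."""
    assert phases >= len(logical_values)
    pin_stream = []
    for t in range(phases):
        # In phase t the pin carries logical wire t, or idles (None).
        pin_stream.append(logical_values[t] if t < len(logical_values) else None)
    return pin_stream

def demultiplex(pin_stream, n_wires):
    """Recover the logical wires on the receiving FPGA."""
    return [pin_stream[t] for t in range(n_wires)]

signals = [1, 0, 1, 1]            # four logical wires
stream = multiplex(signals, 4)    # one physical pin, four phases
assert demultiplex(stream, 4) == signals
```

In hardware the same effect is achieved with shift registers clocked faster than the emulation clock, which is why pipelining at the FPGA's maximum frequency matters.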
Virtual computing and the Virtual Computer
Steven Casselman
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279480
Virtual computing is an entirely new form of supercomputing that allows an algorithm to be implemented directly in hardware. Based on the Xilinx FPGA and ICube's FPID, the Virtual Computer is completely reconfigurable in every respect. Computing machines based on reconfigurable logic are hyper-scalable, meaning they scale up better than one-to-one.
Fine grain parallelism on a MIMD machine using FPGAs
Frédéric Raimbault, Dominique Lavenier, Stéphane Rubini, Bernard Pottier
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279485
The article presents the use of an FPGA chip (Xilinx 3090) to set up a fast systolic communication agent on a linear asynchronous network of transputer processors; the machine is called ArMen. The authors' work relies on the systolic programming environment ReLaCS, a close cousin to the C programming language. ReLaCS provides synchronous communication operators to simplify the programming of the data transfers that occur in systolic algorithms. The ReLaCS compiler generates C programs that perform the computation process and the data-management process of a systolic network.
Hardware acceleration of divide-and-conquer paradigms: a case study
W. Luk, V. Lok, I. Page
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279463
The authors describe a method for speeding up divide-and-conquer algorithms with a hardware coprocessor, using sorting as an example. The method employs a conventional processor for the 'divide' and 'merge' phases, while the 'conquer' phase is handled by a purpose-built coprocessor. It is shown how transformation techniques from the Ruby language can be adopted in developing a family of systolic sorters, and how one of the resulting designs is prototyped in eight FPGAs on a PC coprocessor board known as CHS2*4 from Algotronix. The execution of the hardware unit is embedded in a sorting program, with the PC host merging the sorted sequences from the hardware sorter. The performance of this implementation is compared against various sorting algorithms on a number of PC systems.
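The division of labour described in this abstract can be sketched in a few lines. In the sketch below (an illustration, not the paper's implementation; the chunk size and function names are assumptions) the host splits the input into fixed-size chunks, a stand-in for the systolic sorter handles each chunk, and the host merges the results:

```python
import heapq

CHUNK = 8  # assumed capacity of the hardware sorter

def systolic_sort(chunk):
    # Stand-in for the FPGA systolic sorter; software sort for illustration.
    return sorted(chunk)

def hardware_assisted_sort(data):
    # 'Divide' on the host, 'conquer' on the coprocessor, 'merge' on the host.
    chunks = [data[i:i + CHUNK] for i in range(0, len(data), CHUNK)]
    sorted_chunks = [systolic_sort(c) for c in chunks]
    return list(heapq.merge(*sorted_chunks))

assert hardware_assisted_sort([5, 3, 9, 1, 7, 2]) == [1, 2, 3, 5, 7, 9]
```

The appeal of the split is that the 'conquer' phase dominates the work for small chunks and maps naturally onto a fixed-size systolic array, while merging is cheap and sequential on the host.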
High performance analysis and control of complex systems using dynamically reconfigurable silicon and optical fiber memory
L. Wood
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279470
M is a highly parallel asynchronous computer for the analysis and control of complex systems. A complex system is a system with many interacting components; examples include applications in molecular biology, economics, and signal processing. M's asynchronous computations reproduce the structural dynamics of a system using high-fidelity behavioral modeling. Programs are composed of an application model, an environment model, and a distributed subsumption operating system. Processes are implemented using position-independent instructions (broadcast automata) that operate in parallel on strings of binary data. All of M's FPGA-based fine-grained parallel processing nodes are double-buffered, asynchronous, and highly pipelined. The fiber system memory is optically multiplexed and asynchronous. The technology will extend new gigabit ATM optical networks with integrated high performance computing services.
FPGA computing in a data parallel C
Maya Gokhale, Ron Minnich
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279474
The authors demonstrate a new technique for automatically synthesizing digital logic from a high-level algorithmic description in a data parallel language. The methodology has been implemented using the Splash 2 reconfigurable logic arrays for programs written in Data-parallel Bit-serial C (dbC). The translator generates a VHDL description of a SIMD processor array with one or more processors per Xilinx 4010 FPGA. The instruction set of each processor is customized to the dbC program being processed. In addition to the usual arithmetic operations, nearest-neighbor communication, host-to-processor communication, and global reductions are supported.
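The SIMD operations listed in this abstract can be modeled compactly. The sketch below is an illustrative software model (class and method names are hypothetical, not dbC syntax): each processing element holds one value, and the array supports a nearest-neighbor shift and a global reduction:

```python
# Illustrative model of a SIMD processor array: one value per PE,
# with nearest-neighbour communication and a global reduction.

class SIMDArray:
    def __init__(self, values):
        self.values = list(values)        # one value per processor

    def shift_right(self, fill=0):
        # Nearest-neighbour communication: each PE receives its left
        # neighbour's value; the leftmost PE receives `fill`.
        self.values = [fill] + self.values[:-1]
        return self

    def reduce_add(self):
        # Global reduction across all processors.
        return sum(self.values)

pe = SIMDArray([1, 2, 3, 4])
assert pe.reduce_add() == 10
assert pe.shift_right().values == [0, 1, 2, 3]
```

In the hardware realization these primitives become wires between adjacent FPGAs (the shift) and a reduction tree back to the host (the global sum), which is why the translator can specialize each processor's instruction set to exactly the operations the dbC program uses.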
Data-folding in SRAM configurable FPGAs
P. Foulk
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279467
FPGAs which are configured by static RAM can be rapidly changed from one logic configuration to another. This raises the possibility of configuring the logic to implement a function for a specific set of values, i.e. folding the inputs into the logic design. The paper discusses data folding with respect to Algotronix FPGAs, presenting a text searching circuit as an example. This folded circuit saves at least half the logic over a conventional circuit, and very much more if data folding is taken as far as possible. The paper also presents performance figures for the folded circuit, discusses other applications, and suggests features that are desirable if data folding is to be practicable, most of which are possessed by the Algotronix CAL array.
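Data folding is essentially partial evaluation in hardware. The software analogue below (illustrative; the names are hypothetical) specializes a text matcher for one fixed pattern at 'configuration' time, the way a folded circuit bakes the search value into its logic instead of reading it from registers:

```python
# Software analogue of data folding: specialize the matcher for a fixed
# pattern once, rather than comparing against a pattern held as data.

def make_folded_matcher(pattern):
    # Specialization happens once, at 'configuration' time.
    n = len(pattern)
    def match_at(text, i):
        return text[i:i + n] == pattern
    return match_at

find_cat = make_folded_matcher("cat")
text = "concatenate"
hits = [i for i in range(len(text)) if find_cat(text, i)]
assert hits == [3]
```

The hardware saving comes from the same source as the software one: a comparison against a constant needs no storage for the constant and lets constant-driven logic be simplified away.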
PRISM-II compiler and architecture
M. Wazlowski, L. Agarwal, T. Lee, A. Smith, E. Lam, P. Athanas, H. Silverman, S. Ghosh
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279484
This paper discusses the architecture and compiler for a general-purpose metamorphic computing platform called PRISM-II. PRISM-II improves the performance of many computationally intensive tasks by augmenting the functionality of the core processor with new instructions that match the characteristics of targeted applications. In essence, PRISM (processor reconfiguration through instruction set metamorphosis) is a general-purpose hardware platform that behaves like an application-specific platform. Two methods for hardware synthesis, one using VHDL Designer and the other using X-BLOX, are presented and synthesis results are compared.
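The instruction-set augmentation described here can be pictured as plugging an application-specific operation into a small core. The sketch below is a loose software illustration (all names are hypothetical, and the 'custom instruction' runs in software rather than on an FPGA): a base instruction set is extended with a population-count operation synthesized for one hot spot of a program:

```python
# Loose illustration of instruction-set augmentation: a core instruction
# set extended with one application-specific operation.

BASE_OPS = {
    "add": lambda a, b: a + b,
    "xor": lambda a, b: a ^ b,
}

def popcount(a, _b=None):
    # The 'new instruction' that PRISM would map onto the FPGA.
    return bin(a).count("1")

def make_cpu(custom_ops):
    ops = dict(BASE_OPS, **custom_ops)
    def execute(op, a, b=0):
        return ops[op](a, b)
    return execute

cpu = make_cpu({"popc": popcount})
assert cpu("add", 2, 3) == 5
assert cpu("popc", 0b1011) == 3
```

The point of the metamorphosis is that the set of custom operations is derived per application by the compiler, not fixed at processor design time.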
The CM-2X: a hybrid CM-2/Xilinx prototype
Steven Cuccaro, Craig E. Reese
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279471
This paper describes the CM-2X prototype. This one-of-a-kind machine is the result of a Supercomputing Research Center/Thinking Machines Corporation joint effort to examine the suitability of a hybrid combination of the CM-2 architecture and Xilinx programmable gate array technology. In addition to a description of the CM-2X and Xilinx architecture, a simple application example is provided that illustrates many of the issues involved in programming the machine.
A field programmable accelerator for compiled-code applications
D. Lewis, M.H. van Ierssel, D. H. Wong
Pub Date: 1993-04-05 | DOI: 10.1109/FPGA.1993.279478
The paper describes a special purpose application accelerator using field programmable gate arrays to accelerate a range of applications. The accelerator is designed to support applications by allowing the user to implement a processor with an instruction set designed for the specific application being accelerated, using specialized instructions to implement critical fragments of the application. A compiled-code software organization is used to reduce overhead operations. A prototype has been built, and the first application to be ported to it, logic simulation, is underway.