2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.最新文献

英文中文

Multi-hop routing of multi-terminal nets for evaluation of hybrid multi-FPGA boards 多终端网络的多跳路由，用于混合多fpga板的评估

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188696

S. Jain, Anshul Kumar, Shashi Kumar

In rapid prototyping system application, any large digital circuit can be implemented onto Multi-FPGA Board(MFB). Key MFB architectural feature is its inter-FPGA connections consisting of fixed connections(FC) i.e. FPGA-FPGA connections and programmable connections(PC) i.e. FPGA-programmable switch like FPID-FPGA. MFBs consisting of both the types of connections are known as hybrid MFBs. Since, PC requires two wires as against one wire in FC, MFB must have minimum number of PCs to keep fabrication easy. In partitioned circuit, multi-terminal nets (MTNs) are distributed over one or more circuit parts. When each circuit part is implemented over one FPGA, the MTNs between circuit parts will be routed over PCs and FCs between corresponding FPGAs. Multi-hop routers are used to minimize the use of PCs, but they increase source to sink delay with increasing number of hops. A generic multi-hop router to route two-terminal nets, which obeys the given limit on hops, was presented in our previous work [2002]. In this paper, we extend the same to route multi-terminal nets.

在快速成型系统应用中，任何大型数字电路都可以在多fpga板(MFB)上实现。MFB的主要架构特征是它的fpga间连接，包括固定连接(FC)，即FPGA-FPGA连接和可编程连接(PC)，即fpga -可编程开关，如FPGA-FPGA。由这两种类型的连接组成的mfb被称为混合mfb。由于PC需要两根电线而不是FC中的一根电线，因此MFB必须具有最少数量的PC以保持制造容易。在分区电路中，多终端网(mtn)分布在一个或多个电路部件上。当每个电路部分在一个FPGA上实现时，电路部分之间的mtn将通过pc和相应FPGA之间的fc路由。多跳路由器是为了尽量减少pc的使用，但是随着跳数的增加，它们会增加源到接收的延迟。我们在之前的工作[2002]中提出了一种通用的多跳路由器，它遵循给定的跳数限制来路由双终端网络。在本文中，我们将其扩展到路由多终端网络。

引用次数: 1

FPGA-based computation of free-form deformations 基于fpga的自由变形计算

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188722

Jun Jiang, W. Luk, D. Rueckert

This paper describes techniques for producing FPGA-based designs that support free-form deformation in medical image processing. The free-form deformation method is based on a B-spline algorithm for modelling three-dimensional deformable objects. Our design includes four optimisations. First, we store the values of a third-order B-spline model in lookup tables. Second, we adopt a customised number representation format in our implementation. Third, we transform a nested loop so that conditionals are moved outside the loop. Fourth, we pipeline the design to increase its throughput, and we also deploy multiple pipelines such that each covers a different image. Our design description, captured in the Handel-C language, is parameterisable at compile time to support a range of image resolutions and computational precisions. An implementation on a Xilinx XC2V6000 device would be capable of processing images of resolution up to 256 by 256 pixels in real time.

本文描述了在医学图像处理中支持自由变形的基于fpga的设计的技术。自由变形法是一种基于b样条算法的三维可变形物体建模方法。我们的设计包括四个优化。首先，我们将三阶b样条模型的值存储在查找表中。其次，我们在实现中采用了定制的数字表示格式。第三，转换嵌套循环，将条件移出循环。第四，我们对设计进行流水线处理以提高其吞吐量，并且我们还部署了多个流水线，以便每个流水线覆盖不同的图像。我们的设计描述是用Handel-C语言捕获的，在编译时是可参数化的，以支持一系列图像分辨率和计算精度。Xilinx XC2V6000设备上的实现将能够实时处理分辨率高达256 × 256像素的图像。

引用次数: 14

Debug methods for hybrid CPU/FPGA systems 混合CPU/FPGA系统的调试方法

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188688

Eric Roesler, B. Nelson

The combining of one or more CPU's and an FPGA fabric on the same die is growing in popularity. Such programmable system-on-chip (PSOC) systems promise performance and development time advantages over conventional technology. In a PSOC design, design errors may occur in many different places - the design of the CPU, the embedded software, the FPGA-based parts of the design, or the interfaces between these various parts. This paper presents a flexible tool that allows the user to dynamically adapt the CAD tool's behavior to the level of detail needed to track down design errors in PSOC designs. It provides for the creation of and coexistence of software source debuggers and gate level debug tools, all incorporated into the same debugging environment. A prototype PSOC debugging system based on a derivative of the JHDL CAD tool is presented which illustrates the range of debug support in both simulation and hardware execution modes that can be provided for PSOC debug. Extensions to the tool to support a wider range of embedded processors and GNU compiler tools are discussed along with conclusions and future work.

在同一个芯片上结合一个或多个CPU和一个FPGA结构正变得越来越流行。这种可编程片上系统(PSOC)系统比传统技术具有性能和开发时间优势。在PSOC设计中，设计错误可能发生在许多不同的地方- CPU的设计，嵌入式软件，基于fpga的设计部分，或这些不同部分之间的接口。本文提出了一个灵活的工具，允许用户动态地调整CAD工具的行为，以跟踪PSOC设计中的设计错误所需的细节水平。它提供了软件源代码调试器和门级调试工具的创建和共存，所有这些都合并到同一个调试环境中。提出了一个基于JHDL CAD工具的原型PSOC调试系统，该系统说明了可以为PSOC调试提供仿真和硬件执行模式的调试支持范围。讨论了该工具的扩展，以支持更广泛的嵌入式处理器和GNU编译器工具，以及结论和未来的工作。

引用次数: 16

Population based ant colony optimization on FPGA 基于FPGA的种群蚁群优化

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188673

Michael Guntsch, M. Middendorf, B. Scheuermann, O. Diessel, H. ElGindy, H. Schmeck, K. So

We propose to modify a type of ant algorithm called Population based Ant Colony Optimization (P-ACO) to allow implementation on an FPGA architecture. Ant algorithms are adapted from the natural behavior of ants and used to find good solutions to combinatorial optimization problems. General layout on the FPGA and algorithmic description are covered The most notable achievements featured in this paper are a runtime reduction and including the approximation of the heuristic function by a small set of favored decisions which changes over time.

我们建议修改一种称为基于种群的蚁群优化(P-ACO)的蚂蚁算法，以允许在FPGA架构上实现。蚁群算法是根据蚂蚁的自然行为来适应的，用于寻找组合优化问题的最佳解。本文中最显著的成就是运行时间的减少，包括通过一小组随时间变化的有利决策来逼近启发式函数。

引用次数: 18

Hardware Join Java: a high level language for reconfigurable hardware development 硬件连接Java:用于可重构硬件开发的高级语言

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188707

John Hopf, Stewart ltzstein, D. Kearney

Development of high level Hardware Description Languages (HDLs) is an integral area of research in Reconfigurable Computing (RC). There is an apparent need to enhance the development tools available and achieve more abstraction in languages to make hardware development easier for software programmers. The lack of a unified hardware/software language and difficulties in system verification are also other issues currently being faced. To overcome these issues, we propose a Hardware Join Java language that uses the high level syntax and semantics of Java with additions to support reconfigurable hardware description. The language adopts Join Java semantics to allow specification of concurrency without the inherent complexity of Java's standard thread and monitor mechanisms. From a specification, hardware classes will be compiled and linked with VHDL source code. Standard Java classes can be used for the software part of an application and will serve as an interface.

高级硬件描述语言(hdl)的开发是可重构计算(RC)研究的一个重要领域。显然需要增强可用的开发工具，并在语言中实现更多的抽象，以使软件程序员更容易进行硬件开发。缺乏统一的硬件/软件语言和系统核查方面的困难也是目前面临的其他问题。为了克服这些问题，我们提出了一种硬件连接Java语言，它使用Java的高级语法和语义，并添加了支持可重构硬件描述的功能。该语言采用Join Java语义，允许对并发性进行规范，而不需要Java标准线程和监视机制固有的复杂性。从一个规范，硬件类将编译和链接的VHDL源代码。标准Java类可以用于应用程序的软件部分，并充当接口。

引用次数: 12

Resource-aware run-time elaboration of behavioural FPGA specifications 资源感知的运行时细化行为FPGA规范

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188666

U. Malik, K. So, O. Diessel

The Circal process algebra is being used to explore the behavioural specification of systems that are mapped to field programmable logic circuits. In this paper we report on the implementation and performance of an interpreter for system specifications given in the Circal language. In contrast to the typical design flow for field programmable technology in which designs are statically partitioned, synthesised, and mapped to pre-allocated resources, in this system the specified circuits are extracted from behavioural specifications that are partitioned, elaborated, mapped, and configured at run time as control passes through them. We report on the details of a design that targets the Celoxica RC1000 co-processor and assess preliminary performance results for this implementation. The results clearly demonstrate our method is a practical approach to overcome resource constraints, particularly in applications where these change at run time. The results also establish a benchmark against which to measure future improvements and alternative methods.

循环过程代数被用于探索映射到现场可编程逻辑电路的系统的行为规范。在本文中，我们报告了一个用循环语言给出的系统规范的解释器的实现和性能。在现场可编程技术的典型设计流程中，设计是静态划分、综合和映射到预分配资源的，与此相反，在这个系统中，指定的电路是从行为规范中提取出来的，当控制通过它们时，这些行为规范被划分、详细说明、映射和配置。我们报告了针对Celoxica RC1000协处理器的设计细节，并评估了该实现的初步性能结果。结果清楚地表明，我们的方法是克服资源约束的实用方法，特别是在这些约束在运行时发生变化的应用程序中。结果还建立了一个基准，以衡量未来的改进和替代方法。

引用次数: 5

Clustered programmable-reconfigurable processors 集群可编程可重构处理器

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188674

Derek B. Gottlieb, Jeffrey J. Cook, Joshua D. Walstrom, Steve Ferrera, Chi-Wei Wang, N. Carter

In order to pose a successful challenge to conventional processor architectures, reconfigurable computing systems must achieve significantly better performance than conventional programmable processors by both greatly reducing the number of clock cycles required to execute a wide range of applications and achieving high clock rates when implemented in deep-submicron fabrication technologies. In this paper, we describe the architecture of Amalgam, a clustered programmable-reconfigurable processor that integrates multiple conventional processors and blocks of reconfigurable logic onto a single chip. Amalgam's distributed architecture allows implementation at high clock rates by limiting the impact of wire delay on cycle time and delivers an average of 13.7/spl times/ speedup on our benchmark applications when compared to an equivalent architecture that contains only a single programmable processor.

为了对传统处理器架构提出成功的挑战，可重构计算系统必须通过大大减少执行广泛应用所需的时钟周期数量和在深亚微米制造技术中实现高时钟速率来实现比传统可编程处理器更好的性能。在本文中，我们描述了Amalgam的体系结构，这是一种集群可编程可重构处理器，它将多个常规处理器和可重构逻辑块集成到单个芯片上。Amalgam的分布式架构通过限制线延迟对周期时间的影响，允许在高时钟速率下实现，与仅包含单个可编程处理器的等效架构相比，我们的基准应用程序平均提供13.7/spl次/加速。

引用次数: 18

Compiling run-time parametrisable designs 编译运行时参数化设计

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188663

A. Derbyshire, W. Luk

This paper explores representations and compilation of run-time parametrisable FPGA designs. We develop methods to produce designs with many run-time parameters, which would otherwise require an impractical number of bitstreams to be generated at compile time. Run-time parametrisation facilitates specialisation, which can be used to remove logic to produce a smaller and faster design. Our approach involves a source description based on Structural VHDL that allows designers to specify what parameters are available at compile time and at run time. Using this approach, converting a compile-time parameter into a run-time parameter or vice versa is straightforward. The source description does not contain explicit information on how to modify the design at run time. We describe a compilation scheme that can be used to extract this information, generate a run-time representation of the design and rapidly instantiate this representation at run time. We present techniques that allow a parametrised design to be incrementally modified in order to minimise the reconfiguration overhead Our compiler implementation generates a Java program that uses the JBits AN to implement the runtime representation and functions to incrementally modify the design. DES and AES encryption designs are used to illustrate our approach.

本文探讨了运行时可参数化FPGA设计的表示和编译。我们开发方法来产生具有许多运行时参数的设计，否则将需要在编译时生成不切实际的比特流数量。运行时参数化促进了专门化，可用于删除逻辑以产生更小、更快的设计。我们的方法包括基于结构化VHDL的源代码描述，允许设计人员指定在编译时和运行时可用的参数。使用这种方法，可以直接将编译时参数转换为运行时参数，反之亦然。源代码描述不包含关于如何在运行时修改设计的显式信息。我们描述了一个编译方案，该方案可用于提取该信息，生成设计的运行时表示，并在运行时快速实例化该表示。我们提出了一种技术，允许对参数化设计进行增量修改，以尽量减少重新配置的开销。我们的编译器实现生成一个Java程序，该程序使用JBits实现运行时表示和函数来增量修改设计。使用DES和AES加密设计来说明我们的方法。

{"title":"Compiling run-time parametrisable designs","authors":"A. Derbyshire, W. Luk","doi":"10.1109/FPT.2002.1188663","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188663","url":null,"abstract":"This paper explores representations and compilation of run-time parametrisable FPGA designs. We develop methods to produce designs with many run-time parameters, which would otherwise require an impractical number of bitstreams to be generated at compile time. Run-time parametrisation facilitates specialisation, which can be used to remove logic to produce a smaller and faster design. Our approach involves a source description based on Structural VHDL that allows designers to specify what parameters are available at compile time and at run time. Using this approach, converting a compile-time parameter into a run-time parameter or vice versa is straightforward. The source description does not contain explicit information on how to modify the design at run time. We describe a compilation scheme that can be used to extract this information, generate a run-time representation of the design and rapidly instantiate this representation at run time. We present techniques that allow a parametrised design to be incrementally modified in order to minimise the reconfiguration overhead Our compiler implementation generates a Java program that uses the JBits AN to implement the runtime representation and functions to incrementally modify the design. DES and AES encryption designs are used to illustrate our approach.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127366662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Efficient 4-input LUTs FPGA implementation of combinatorial multiplier over canonical base GF(16) 基于正则基GF组合乘法器的高效4输入LUTs FPGA实现(16)

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188701

V. Tomashau

Finite field arithmetic is the basis of some cryptographic and error correction algorithms. The performance of a corresponding hardware depends on the efficiency of the finite field arithmetic implementation. A good-quality finite field multiplier is needed first, since multiplication is an often-used and time consuming operation. Because FPGAs differ considerably in structure from other integrated circuits, the finite field multiplier designs, which are optimized for VLSI implementation, do not perform well on FPGAs. In this paper, some structures of a completely combinatorial GF(16) multiplier, based on 4-input LUTs and some other resources of the Xilinx FPGA, have been proposed. As a result, some improvement in area and time has been achieved by comparison with previous designs.

有限域算法是一些密码和纠错算法的基础。相应硬件的性能取决于有限域算法实现的效率。首先需要一个高质量的有限域乘法器，因为乘法是一种经常使用且耗时的操作。由于fpga在结构上与其他集成电路有很大的不同，针对VLSI实现进行优化的有限场乘法器设计在fpga上表现不佳。本文提出了基于4输入lut和Xilinx FPGA的其他资源的完全组合GF(16)乘法器的一些结构。因此，与以前的设计相比，在面积和时间上都有了一定的改进。

引用次数: 2

A technology mapping algorithm for CPLD architectures CPLD体系结构的技术映射算法

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

Pub Date : 2002-12-16 DOI: 10.1109/FPT.2002.1188683

Shih-Liang Chen, TingTing Hwang, C. Liu

In this paper, we propose a technology mapping algorithm for CPLD architectures. Our algorithm proceeds in two phases: mapping for single-output PLAs and packing for multiple-output PLAs. In the mapping phase we propose a look-up-table (LUT) based mapping algorithm. We will take advantage of existing LUT mapping algorithms for area and depth minimization. Benchmark results show that our algorithm produce better results in terms of area and depth as compared to TEMPLA.

本文提出了一种适用于CPLD体系结构的技术映射算法。我们的算法分两个阶段进行:单输出PLAs的映射和多输出PLAs的封装。在映射阶段，我们提出了一种基于查找表的映射算法。我们将利用现有的LUT映射算法进行面积和深度最小化。基准测试结果表明，与TEMPLA相比，我们的算法在面积和深度方面都取得了更好的结果。

引用次数: 25

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀