首页 > 最新文献

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.最新文献

英文 中文
Population based ant colony optimization on FPGA 基于FPGA的种群蚁群优化
Michael Guntsch, M. Middendorf, B. Scheuermann, O. Diessel, H. ElGindy, H. Schmeck, K. So
We propose to modify a type of ant algorithm called Population based Ant Colony Optimization (P-ACO) to allow implementation on an FPGA architecture. Ant algorithms are adapted from the natural behavior of ants and used to find good solutions to combinatorial optimization problems. General layout on the FPGA and algorithmic description are covered The most notable achievements featured in this paper are a runtime reduction and including the approximation of the heuristic function by a small set of favored decisions which changes over time.
我们建议修改一种称为基于种群的蚁群优化(P-ACO)的蚂蚁算法,以允许在FPGA架构上实现。蚁群算法是根据蚂蚁的自然行为来适应的,用于寻找组合优化问题的最佳解。本文中最显著的成就是运行时间的减少,包括通过一小组随时间变化的有利决策来逼近启发式函数。
{"title":"Population based ant colony optimization on FPGA","authors":"Michael Guntsch, M. Middendorf, B. Scheuermann, O. Diessel, H. ElGindy, H. Schmeck, K. So","doi":"10.1109/FPT.2002.1188673","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188673","url":null,"abstract":"We propose to modify a type of ant algorithm called Population based Ant Colony Optimization (P-ACO) to allow implementation on an FPGA architecture. Ant algorithms are adapted from the natural behavior of ants and used to find good solutions to combinatorial optimization problems. General layout on the FPGA and algorithmic description are covered The most notable achievements featured in this paper are a runtime reduction and including the approximation of the heuristic function by a small set of favored decisions which changes over time.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116779018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Hardware Join Java: a high level language for reconfigurable hardware development 硬件连接Java:用于可重构硬件开发的高级语言
John Hopf, Stewart ltzstein, D. Kearney
Development of high level Hardware Description Languages (HDLs) is an integral area of research in Reconfigurable Computing (RC). There is an apparent need to enhance the development tools available and achieve more abstraction in languages to make hardware development easier for software programmers. The lack of a unified hardware/software language and difficulties in system verification are also other issues currently being faced. To overcome these issues, we propose a Hardware Join Java language that uses the high level syntax and semantics of Java with additions to support reconfigurable hardware description. The language adopts Join Java semantics to allow specification of concurrency without the inherent complexity of Java's standard thread and monitor mechanisms. From a specification, hardware classes will be compiled and linked with VHDL source code. Standard Java classes can be used for the software part of an application and will serve as an interface.
高级硬件描述语言(hdl)的开发是可重构计算(RC)研究的一个重要领域。显然需要增强可用的开发工具,并在语言中实现更多的抽象,以使软件程序员更容易进行硬件开发。缺乏统一的硬件/软件语言和系统核查方面的困难也是目前面临的其他问题。为了克服这些问题,我们提出了一种硬件连接Java语言,它使用Java的高级语法和语义,并添加了支持可重构硬件描述的功能。该语言采用Join Java语义,允许对并发性进行规范,而不需要Java标准线程和监视机制固有的复杂性。从一个规范,硬件类将编译和链接的VHDL源代码。标准Java类可以用于应用程序的软件部分,并充当接口。
{"title":"Hardware Join Java: a high level language for reconfigurable hardware development","authors":"John Hopf, Stewart ltzstein, D. Kearney","doi":"10.1109/FPT.2002.1188707","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188707","url":null,"abstract":"Development of high level Hardware Description Languages (HDLs) is an integral area of research in Reconfigurable Computing (RC). There is an apparent need to enhance the development tools available and achieve more abstraction in languages to make hardware development easier for software programmers. The lack of a unified hardware/software language and difficulties in system verification are also other issues currently being faced. To overcome these issues, we propose a Hardware Join Java language that uses the high level syntax and semantics of Java with additions to support reconfigurable hardware description. The language adopts Join Java semantics to allow specification of concurrency without the inherent complexity of Java's standard thread and monitor mechanisms. From a specification, hardware classes will be compiled and linked with VHDL source code. Standard Java classes can be used for the software part of an application and will serve as an interface.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128128934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Diagnosis of open defects in FPGA interconnect FPGA互连开放性缺陷的诊断
M. Tahoori
In this paper, we present coarse-grain and fine-grain diagnosis techniques to identify a faulty element in FPGA interconnects. The fault model we use is stuck-open and resistive-open for interconnects. The presented technique requires only a small number of configurations while offering high resolution diagnosis. We implemented this technique on real FPGA chips and verified it using fault emulation.
在本文中,我们提出了粗粒度和细粒度诊断技术来识别FPGA互连中的故障元件。我们采用的故障模型是互连线的卡断和阻断。该技术在提供高分辨率诊断的同时,只需要少量的配置。我们在实际的FPGA芯片上实现了该技术,并通过故障仿真对其进行了验证。
{"title":"Diagnosis of open defects in FPGA interconnect","authors":"M. Tahoori","doi":"10.1109/FPT.2002.1188703","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188703","url":null,"abstract":"In this paper, we present coarse-grain and fine-grain diagnosis techniques to identify a faulty element in FPGA interconnects. The fault model we use is stuck-open and resistive-open for interconnects. The presented technique requires only a small number of configurations while offering high resolution diagnosis. We implemented this technique on real FPGA chips and verified it using fault emulation.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116764958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A novel three phase parallel genetic approach to routing for field programmable gate arrays 现场可编程门阵列路由的一种新型三相并行遗传方法
A. Muthukaruppan, S. Suresh, V. Kamakoti
This paper establishes a handshake between the fields of "parallel genetic algorithms" and reconfigurable systems, to provide a solution for the routing problem for FPGAs, that attempts to enhance the performance of the circuit implemented by the FPGA. We propose to solve the problem of routing for FPGAs in three phases, out of which the first two utilize the concept of genetic algorithms to transform an initial population of random suggested routings to a population that contains solutions approximating the optimal one.
本文在“并行遗传算法”和可重构系统领域之间建立了一个握手,为FPGA的路由问题提供了一个解决方案,试图提高FPGA实现电路的性能。我们建议分三个阶段解决fpga的路由问题,其中前两个阶段利用遗传算法的概念将随机建议路由的初始种群转换为包含近似最优解的种群。
{"title":"A novel three phase parallel genetic approach to routing for field programmable gate arrays","authors":"A. Muthukaruppan, S. Suresh, V. Kamakoti","doi":"10.1109/FPT.2002.1188705","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188705","url":null,"abstract":"This paper establishes a handshake between the fields of \"parallel genetic algorithms\" and reconfigurable systems, to provide a solution for the routing problem for FPGAs, that attempts to enhance the performance of the circuit implemented by the FPGA. We propose to solve the problem of routing for FPGAs in three phases, out of which the first two utilize the concept of genetic algorithms to transform an initial population of random suggested routings to a population that contains solutions approximating the optimal one.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126505732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Delivering error detection capabilities into a field programmable device: the HORUS processor case study 向现场可编程设备提供错误检测功能:HORUS处理器案例研究
F. Rodríguez, J. Campelo, J. J. Serrano
Designing a complete SoC or reuse SoC components to create a complete system is a common task nowadays. The flexibility offered by current design flows offers the designer an unprecedented capability to incorporate more and more demanded features like error detection and correction mechanisms to increase the system dependability. This is especially true for programmable devices, were rapid design and implementation methodologies are coupled with testing environments that are easily generated and used. This paper describes the design of the HORUS processor, a RISC processor augmented with a concurrent error mechanism, the architectural modifications needed on the original design to minimize the resulting performance penalty.
设计一个完整的SoC或重用SoC组件来创建一个完整的系统是一个常见的任务。当前设计流程提供的灵活性为设计人员提供了前所未有的能力,可以将越来越多的需求功能(如错误检测和纠正机制)纳入其中,以提高系统的可靠性。对于可编程设备来说尤其如此,因为快速设计和实现方法与易于生成和使用的测试环境相结合。本文描述了HORUS处理器的设计,这是一种增强了并发错误机制的RISC处理器,它需要在原始设计的基础上进行架构修改,以尽量减少由此带来的性能损失。
{"title":"Delivering error detection capabilities into a field programmable device: the HORUS processor case study","authors":"F. Rodríguez, J. Campelo, J. J. Serrano","doi":"10.1109/FPT.2002.1188724","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188724","url":null,"abstract":"Designing a complete SoC or reuse SoC components to create a complete system is a common task nowadays. The flexibility offered by current design flows offers the designer an unprecedented capability to incorporate more and more demanded features like error detection and correction mechanisms to increase the system dependability. This is especially true for programmable devices, were rapid design and implementation methodologies are coupled with testing environments that are easily generated and used. This paper describes the design of the HORUS processor, a RISC processor augmented with a concurrent error mechanism, the architectural modifications needed on the original design to minimize the resulting performance penalty.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130029620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A system level implementation of Rijndael on a memory-slot based FPGA card Rijndael在基于内存插槽的FPGA卡上的系统级实现
Dennis K. Y. Tong, Pui Sze Lo, Kin-Hong Lee, P. Leong
This paper describes system level issues encountered in a high performance implementation of a Rijndael encryption core on a memory-slot based reconfigurable computing platform called Pilchard. The Rijndael algorithm was adopted in 2000 by the US National Institute of Standards and Technology (NIST) as the Advanced Encryption Standard (AES). In the implementation of Rijndael, changing the number of unrolled rounds in the encryption core can affect the performance of the system. It is shown that for the design presented, the highest performance of 755 Mbit/sec was achieved by implementing a core with a single round. Although it is relatively easy to implement a high performance core on an FPGA, due to I/O bottlenecks, achieving high system level performance is more difficult. In order to optimize the performance of the host/FPGA interface, special instructions from the Intel Pentium III streaming SIMD extensions (SSE) along with write-combining memory operations were used. These features enabled the measured throughput of the AES core to reach 445 Mbit/sec which, although still slower than the AES core, was double that of an unoptimized interface.
本文描述了在基于内存槽的可重构计算平台Pilchard上实现Rijndael加密核时遇到的系统级问题。Rijndael算法于2000年被美国国家标准与技术研究所(NIST)采用为高级加密标准(AES)。在Rijndael的实现中,更改加密核心中展开的轮数可能会影响系统的性能。结果表明,在本设计中,单轮核心实现的最高性能为755 Mbit/sec。虽然在FPGA上实现高性能核心相对容易,但由于I/O瓶颈,实现高系统级性能更加困难。为了优化主机/FPGA接口的性能,使用了来自Intel Pentium III流SIMD扩展(SSE)的特殊指令以及写组合内存操作。这些特性使AES核心的测量吞吐量达到445 Mbit/sec,虽然仍然比AES核心慢,但是未优化接口的两倍。
{"title":"A system level implementation of Rijndael on a memory-slot based FPGA card","authors":"Dennis K. Y. Tong, Pui Sze Lo, Kin-Hong Lee, P. Leong","doi":"10.1109/FPT.2002.1188670","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188670","url":null,"abstract":"This paper describes system level issues encountered in a high performance implementation of a Rijndael encryption core on a memory-slot based reconfigurable computing platform called Pilchard. The Rijndael algorithm was adopted in 2000 by the US National Institute of Standards and Technology (NIST) as the Advanced Encryption Standard (AES). In the implementation of Rijndael, changing the number of unrolled rounds in the encryption core can affect the performance of the system. It is shown that for the design presented, the highest performance of 755 Mbit/sec was achieved by implementing a core with a single round. Although it is relatively easy to implement a high performance core on an FPGA, due to I/O bottlenecks, achieving high system level performance is more difficult. In order to optimize the performance of the host/FPGA interface, special instructions from the Intel Pentium III streaming SIMD extensions (SSE) along with write-combining memory operations were used. These features enabled the measured throughput of the AES core to reach 445 Mbit/sec which, although still slower than the AES core, was double that of an unoptimized interface.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130149943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Multiplier-less FIR digital filters using programmable sum-of-power-of-two (SOPOT) coefficients 使用可编程的2次幂和(SOPOT)系数的无乘法器FIR数字滤波器
K. Yeung, S. Chan
This paper proposes a new architecture for the implementation of multiplier-less FIR digital filters with programmable sum-of-powers-of-two (SOPOT) or canonical signed digit (CSD) coefficient representations. The multiplier-less FIR filter is implemented as the direct form structure with the filter coefficients represented as SOPOT representation, which can be realized as limited number of shifts and additions. Traditional VLSI implementations of multiplier-less FIR filters are usually hardwired and the filter coefficients cannot be programmed online. The proposed architecture is very modular in the structure and it can be connected to implement the multiplier-less FIR filter with arbitrary filter order and SOPOT terms using programmable SOPOT coefficients. The structure is also pipelined to achieve a high data throughput rate at low hardware cost. The proposed architecture was implemented and tested using the Altera FLEX 10K Field Programmable Gate Arrays (FPGA). The finite wordlength effect such as signal roundoff and overflow errors are also taken into account. A design example is given to demonstrate the feasibility of the proposed architecture.
本文提出了一种实现无乘法器FIR数字滤波器的新结构,该滤波器具有可编程的2次幂和(SOPOT)或正则符号数(CSD)系数表示。无乘法器FIR滤波器实现为直接形式结构,滤波器系数表示为SOPOT表示,可以通过有限次数的移位和加法来实现。传统的无乘法器FIR滤波器的VLSI实现通常是硬连线的,滤波器系数不能在线编程。所提出的结构在结构上是非常模块化的,它可以连接来实现具有任意滤波器阶数和使用可编程SOPOT系数的SOPOT项的无乘法器FIR滤波器。该结构还采用流水线方式,以低硬件成本实现高数据吞吐率。所提出的架构使用Altera FLEX 10K现场可编程门阵列(FPGA)进行了实现和测试。有限字长效应如信号舍入和溢出错误也被考虑在内。最后给出了一个设计实例,验证了该体系结构的可行性。
{"title":"Multiplier-less FIR digital filters using programmable sum-of-power-of-two (SOPOT) coefficients","authors":"K. Yeung, S. Chan","doi":"10.1109/FPT.2002.1188667","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188667","url":null,"abstract":"This paper proposes a new architecture for the implementation of multiplier-less FIR digital filters with programmable sum-of-powers-of-two (SOPOT) or canonical signed digit (CSD) coefficient representations. The multiplier-less FIR filter is implemented as the direct form structure with the filter coefficients represented as SOPOT representation, which can be realized as limited number of shifts and additions. Traditional VLSI implementations of multiplier-less FIR filters are usually hardwired and the filter coefficients cannot be programmed online. The proposed architecture is very modular in the structure and it can be connected to implement the multiplier-less FIR filter with arbitrary filter order and SOPOT terms using programmable SOPOT coefficients. The structure is also pipelined to achieve a high data throughput rate at low hardware cost. The proposed architecture was implemented and tested using the Altera FLEX 10K Field Programmable Gate Arrays (FPGA). The finite wordlength effect such as signal roundoff and overflow errors are also taken into account. A design example is given to demonstrate the feasibility of the proposed architecture.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127365337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Resource-aware run-time elaboration of behavioural FPGA specifications 资源感知的运行时细化行为FPGA规范
U. Malik, K. So, O. Diessel
The Circal process algebra is being used to explore the behavioural specification of systems that are mapped to field programmable logic circuits. In this paper we report on the implementation and performance of an interpreter for system specifications given in the Circal language. In contrast to the typical design flow for field programmable technology in which designs are statically partitioned, synthesised, and mapped to pre-allocated resources, in this system the specified circuits are extracted from behavioural specifications that are partitioned, elaborated, mapped, and configured at run time as control passes through them. We report on the details of a design that targets the Celoxica RC1000 co-processor and assess preliminary performance results for this implementation. The results clearly demonstrate our method is a practical approach to overcome resource constraints, particularly in applications where these change at run time. The results also establish a benchmark against which to measure future improvements and alternative methods.
循环过程代数被用于探索映射到现场可编程逻辑电路的系统的行为规范。在本文中,我们报告了一个用循环语言给出的系统规范的解释器的实现和性能。在现场可编程技术的典型设计流程中,设计是静态划分、综合和映射到预分配资源的,与此相反,在这个系统中,指定的电路是从行为规范中提取出来的,当控制通过它们时,这些行为规范被划分、详细说明、映射和配置。我们报告了针对Celoxica RC1000协处理器的设计细节,并评估了该实现的初步性能结果。结果清楚地表明,我们的方法是克服资源约束的实用方法,特别是在这些约束在运行时发生变化的应用程序中。结果还建立了一个基准,以衡量未来的改进和替代方法。
{"title":"Resource-aware run-time elaboration of behavioural FPGA specifications","authors":"U. Malik, K. So, O. Diessel","doi":"10.1109/FPT.2002.1188666","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188666","url":null,"abstract":"The Circal process algebra is being used to explore the behavioural specification of systems that are mapped to field programmable logic circuits. In this paper we report on the implementation and performance of an interpreter for system specifications given in the Circal language. In contrast to the typical design flow for field programmable technology in which designs are statically partitioned, synthesised, and mapped to pre-allocated resources, in this system the specified circuits are extracted from behavioural specifications that are partitioned, elaborated, mapped, and configured at run time as control passes through them. We report on the details of a design that targets the Celoxica RC1000 co-processor and assess preliminary performance results for this implementation. The results clearly demonstrate our method is a practical approach to overcome resource constraints, particularly in applications where these change at run time. The results also establish a benchmark against which to measure future improvements and alternative methods.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116725357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Clustered programmable-reconfigurable processors 集群可编程可重构处理器
Derek B. Gottlieb, Jeffrey J. Cook, Joshua D. Walstrom, Steve Ferrera, Chi-Wei Wang, N. Carter
In order to pose a successful challenge to conventional processor architectures, reconfigurable computing systems must achieve significantly better performance than conventional programmable processors by both greatly reducing the number of clock cycles required to execute a wide range of applications and achieving high clock rates when implemented in deep-submicron fabrication technologies. In this paper, we describe the architecture of Amalgam, a clustered programmable-reconfigurable processor that integrates multiple conventional processors and blocks of reconfigurable logic onto a single chip. Amalgam's distributed architecture allows implementation at high clock rates by limiting the impact of wire delay on cycle time and delivers an average of 13.7/spl times/ speedup on our benchmark applications when compared to an equivalent architecture that contains only a single programmable processor.
为了对传统处理器架构提出成功的挑战,可重构计算系统必须通过大大减少执行广泛应用所需的时钟周期数量和在深亚微米制造技术中实现高时钟速率来实现比传统可编程处理器更好的性能。在本文中,我们描述了Amalgam的体系结构,这是一种集群可编程可重构处理器,它将多个常规处理器和可重构逻辑块集成到单个芯片上。Amalgam的分布式架构通过限制线延迟对周期时间的影响,允许在高时钟速率下实现,与仅包含单个可编程处理器的等效架构相比,我们的基准应用程序平均提供13.7/spl次/加速。
{"title":"Clustered programmable-reconfigurable processors","authors":"Derek B. Gottlieb, Jeffrey J. Cook, Joshua D. Walstrom, Steve Ferrera, Chi-Wei Wang, N. Carter","doi":"10.1109/FPT.2002.1188674","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188674","url":null,"abstract":"In order to pose a successful challenge to conventional processor architectures, reconfigurable computing systems must achieve significantly better performance than conventional programmable processors by both greatly reducing the number of clock cycles required to execute a wide range of applications and achieving high clock rates when implemented in deep-submicron fabrication technologies. In this paper, we describe the architecture of Amalgam, a clustered programmable-reconfigurable processor that integrates multiple conventional processors and blocks of reconfigurable logic onto a single chip. Amalgam's distributed architecture allows implementation at high clock rates by limiting the impact of wire delay on cycle time and delivers an average of 13.7/spl times/ speedup on our benchmark applications when compared to an equivalent architecture that contains only a single programmable processor.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114186834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Compiling run-time parametrisable designs 编译运行时参数化设计
A. Derbyshire, W. Luk
This paper explores representations and compilation of run-time parametrisable FPGA designs. We develop methods to produce designs with many run-time parameters, which would otherwise require an impractical number of bitstreams to be generated at compile time. Run-time parametrisation facilitates specialisation, which can be used to remove logic to produce a smaller and faster design. Our approach involves a source description based on Structural VHDL that allows designers to specify what parameters are available at compile time and at run time. Using this approach, converting a compile-time parameter into a run-time parameter or vice versa is straightforward. The source description does not contain explicit information on how to modify the design at run time. We describe a compilation scheme that can be used to extract this information, generate a run-time representation of the design and rapidly instantiate this representation at run time. We present techniques that allow a parametrised design to be incrementally modified in order to minimise the reconfiguration overhead Our compiler implementation generates a Java program that uses the JBits AN to implement the runtime representation and functions to incrementally modify the design. DES and AES encryption designs are used to illustrate our approach.
本文探讨了运行时可参数化FPGA设计的表示和编译。我们开发方法来产生具有许多运行时参数的设计,否则将需要在编译时生成不切实际的比特流数量。运行时参数化促进了专门化,可用于删除逻辑以产生更小、更快的设计。我们的方法包括基于结构化VHDL的源代码描述,允许设计人员指定在编译时和运行时可用的参数。使用这种方法,可以直接将编译时参数转换为运行时参数,反之亦然。源代码描述不包含关于如何在运行时修改设计的显式信息。我们描述了一个编译方案,该方案可用于提取该信息,生成设计的运行时表示,并在运行时快速实例化该表示。我们提出了一种技术,允许对参数化设计进行增量修改,以尽量减少重新配置的开销。我们的编译器实现生成一个Java程序,该程序使用JBits实现运行时表示和函数来增量修改设计。使用DES和AES加密设计来说明我们的方法。
{"title":"Compiling run-time parametrisable designs","authors":"A. Derbyshire, W. Luk","doi":"10.1109/FPT.2002.1188663","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188663","url":null,"abstract":"This paper explores representations and compilation of run-time parametrisable FPGA designs. We develop methods to produce designs with many run-time parameters, which would otherwise require an impractical number of bitstreams to be generated at compile time. Run-time parametrisation facilitates specialisation, which can be used to remove logic to produce a smaller and faster design. Our approach involves a source description based on Structural VHDL that allows designers to specify what parameters are available at compile time and at run time. Using this approach, converting a compile-time parameter into a run-time parameter or vice versa is straightforward. The source description does not contain explicit information on how to modify the design at run time. We describe a compilation scheme that can be used to extract this information, generate a run-time representation of the design and rapidly instantiate this representation at run time. We present techniques that allow a parametrised design to be incrementally modified in order to minimise the reconfiguration overhead Our compiler implementation generates a Java program that uses the JBits AN to implement the runtime representation and functions to incrementally modify the design. DES and AES encryption designs are used to illustrate our approach.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127366662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1