首页 > 最新文献

2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.最新文献

英文 中文
The effect of cluster packing and node duplication control in delay driven clustering 延迟驱动聚类中簇填充和节点复制控制的影响
M. E. Dehkordi, S. Brown
Although delay driven clustering algorithms can optimize circuit delay, they usually result in huge area increase. We present a node duplication control strategy along with a simple packing algorithm that greatly reduce the area penalty with a very small degradation in performance. We use the Quartus Design System from Altera to test our algorithm for a set of MCNC benchmark circuits. The results show that while a 14.5% average delay decrease can be achieved with an average area increase of 240.8%, the algorithm reduces the average area penalty to 27.4% with an average delay decrease of 13.8%. Also the number of clusters and the fitting time reported by Quartus are reduced by more than 91% and 60%, respectively.
延迟驱动的聚类算法虽然可以优化电路延迟,但通常会导致大面积增加。我们提出了一种节点重复控制策略以及一种简单的打包算法,该算法在性能下降很小的情况下大大减少了面积损失。我们使用Altera的Quartus设计系统在一组MCNC基准电路上测试我们的算法。结果表明,在平均面积增加240.8%的情况下,平均延迟减少14.5%;在平均延迟减少13.8%的情况下,平均面积损失减少27.4%。此外,Quartus报告的聚类数量和拟合时间分别减少了91%和60%以上。
{"title":"The effect of cluster packing and node duplication control in delay driven clustering","authors":"M. E. Dehkordi, S. Brown","doi":"10.1109/FPT.2002.1188686","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188686","url":null,"abstract":"Although delay driven clustering algorithms can optimize circuit delay, they usually result in huge area increase. We present a node duplication control strategy along with a simple packing algorithm that greatly reduce the area penalty with a very small degradation in performance. We use the Quartus Design System from Altera to test our algorithm for a set of MCNC benchmark circuits. The results show that while a 14.5% average delay decrease can be achieved with an average area increase of 240.8%, the algorithm reduces the average area penalty to 27.4% with an average delay decrease of 13.8%. Also the number of clusters and the fitting time reported by Quartus are reduced by more than 91% and 60%, respectively.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128725887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
FPGA-based system-level design framework based on the IRIS synthesis tool and System Generator 基于IRIS综合工具和System Generator的fpga系统级设计框架
Y. Yi, Roger Francis Woods
A system level design framework for FPGA-based DSP design is presented. The design flow utilizes System Generator, a system level tool developed by Xilinx, and links it to an "in-house" architectural synthesis tool, IRIS. Whilst System Generator allows FPGA-based Intellectual Property (IP) cores to be incorporated into the design flow, it does not address the timing and latency problems introduced by the cores which can be considerable, particularly when the cores are pipelined. These problems are addressed by the IRIS synthesis tool. The paper describes the tools, their interaction and illustrates the flow using an 8-tap Transpose-Form Retimed Delayed LMS (TF-RDLMS) adaptive filter.
提出了基于fpga的DSP设计的系统级设计框架。设计流程利用了System Generator,一个由Xilinx开发的系统级工具,并将其链接到一个“内部”架构综合工具IRIS。虽然System Generator允许将基于fpga的知识产权(IP)内核整合到设计流程中,但它不能解决内核引入的时间和延迟问题,这可能是相当大的,特别是当内核是流水线的时候。IRIS综合工具解决了这些问题。本文描述了这些工具,它们之间的相互作用,并说明了使用8分接转置形式重新定时延迟LMS (TF-RDLMS)自适应滤波器的流程。
{"title":"FPGA-based system-level design framework based on the IRIS synthesis tool and System Generator","authors":"Y. Yi, Roger Francis Woods","doi":"10.1109/FPT.2002.1188668","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188668","url":null,"abstract":"A system level design framework for FPGA-based DSP design is presented. The design flow utilizes System Generator, a system level tool developed by Xilinx, and links it to an \"in-house\" architectural synthesis tool, IRIS. Whilst System Generator allows FPGA-based Intellectual Property (IP) cores to be incorporated into the design flow, it does not address the timing and latency problems introduced by the cores which can be considerable, particularly when the cores are pipelined. These problems are addressed by the IRIS synthesis tool. The paper describes the tools, their interaction and illustrates the flow using an 8-tap Transpose-Form Retimed Delayed LMS (TF-RDLMS) adaptive filter.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125915452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An FPGA-based processor for shogi mating problems 基于fpga的shogi配对问题处理器
Y. Hori, M. Sonoyama, T. Maruyama
After the success of DEEP BLUE in computer chess, shogi, or Japanese chess is a next challenging target in artificial intelligence for game playing. The complexity and huge search space of shogi have been motivating researchers to make shogi programs, but none of them is competent enough to play against human experts. To improve the competence of shogi programs, it is a promising approach to develop dedicated hardware systems. However inflexible architecture and lack of hardware resource have been the significant problems in hardware development. The flexibility and recent progress in the gate size of FPGAs are expected to give solutions to the problems. As a first step to shogi hardware, we implemented modules to generate check and defense moves in tsume-shogi, or mating problems in shogi. With the latest FPGA, we successfully implemented all modules on a single chip and eliminated the bottleneck of memory bandwidth. In this paper we describe a procedure for parallel move generation in tsume-shogi hardware and architecture of the modules implemented on an FPGA. A discussion about the performance of the hardware is also included in the paper. The hardware is roughly estimated to work 10-50 times faster than software.
继“深蓝”在计算机国际象棋中取得成功之后,“将棋”将成为人工智能领域下一个具有挑战性的目标。将棋的复杂性和巨大的搜索空间一直激励着研究人员制作将棋程序,但它们都没有能力与人类专家对抗。为了提高将棋程序的能力,开发专用硬件系统是一种很有前途的方法。然而,硬件架构的不灵活和硬件资源的缺乏一直是硬件开发中的突出问题。fpga的灵活性和栅极尺寸的最新进展有望解决这些问题。作为shogi硬件的第一步,我们实现了模块来生成棋棋中的检查和防御动作,或者说是棋棋中的配对问题。利用最新的FPGA,我们成功地在单芯片上实现了所有模块,并消除了内存带宽的瓶颈。本文描述了在tsume-shogi硬件中并行移动生成的过程,以及在FPGA上实现的模块结构。本文还对硬件的性能进行了讨论。据粗略估计,硬件的工作速度比软件快10-50倍。
{"title":"An FPGA-based processor for shogi mating problems","authors":"Y. Hori, M. Sonoyama, T. Maruyama","doi":"10.1109/FPT.2002.1188672","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188672","url":null,"abstract":"After the success of DEEP BLUE in computer chess, shogi, or Japanese chess is a next challenging target in artificial intelligence for game playing. The complexity and huge search space of shogi have been motivating researchers to make shogi programs, but none of them is competent enough to play against human experts. To improve the competence of shogi programs, it is a promising approach to develop dedicated hardware systems. However inflexible architecture and lack of hardware resource have been the significant problems in hardware development. The flexibility and recent progress in the gate size of FPGAs are expected to give solutions to the problems. As a first step to shogi hardware, we implemented modules to generate check and defense moves in tsume-shogi, or mating problems in shogi. With the latest FPGA, we successfully implemented all modules on a single chip and eliminated the bottleneck of memory bandwidth. In this paper we describe a procedure for parallel move generation in tsume-shogi hardware and architecture of the modules implemented on an FPGA. A discussion about the performance of the hardware is also included in the paper. The hardware is roughly estimated to work 10-50 times faster than software.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114788541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Real-time packet editing using reconfigurable hardware for active networking 实时包编辑使用可重构硬件的主动网络
T. Miyazaki, T. Murooka, N. Takahashi, M. Hashimoto
An active network node architecture, called Active Packet Editing (APE), is proposed. The main concept of APE is to accelerate functions essential to active network operation, such as packet classification and NAT (Network Address Translation). The twofold architecture of APE combines a software active packet processor with a high-speed hardware packet editor. Based on preset rules (pattern, action), the packet editor classifies and modifies packets passing through the node. Upon the receipt of active packets, the software active packet processor dynamically configures the packet editor. This paper focuses on introducing the high-speed packet editing mechanism which utilizes FPGAs (Field Programmable Gate Arrays) and CAMs (Content Addressable Memories). A prototype network node based on the architecture performs Gigabit-class throughput with packet editing.
提出了一种主动网络节点结构,称为主动分组编辑(APE)。APE的主要概念是加速当前网络运行所必需的功能,如数据包分类和NAT(网络地址转换)。APE的双重体系结构结合了软件主动数据包处理器和高速硬件数据包编辑器。报文编辑器根据预先设定的规则(模式、动作),对通过节点的报文进行分类和修改。收到活动包后,软件活动包处理器动态配置包编辑器。本文重点介绍了利用fpga(现场可编程门阵列)和CAMs(内容可寻址存储器)的高速分组编辑机制。基于该架构的原型网络节点通过分组编辑实现千兆级吞吐量。
{"title":"Real-time packet editing using reconfigurable hardware for active networking","authors":"T. Miyazaki, T. Murooka, N. Takahashi, M. Hashimoto","doi":"10.1109/FPT.2002.1188661","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188661","url":null,"abstract":"An active network node architecture, called Active Packet Editing (APE), is proposed. The main concept of APE is to accelerate functions essential to active network operation, such as packet classification and NAT (Network Address Translation). The twofold architecture of APE combines a software active packet processor with a high-speed hardware packet editor. Based on preset rules (pattern, action), the packet editor classifies and modifies packets passing through the node. Upon the receipt of active packets, the software active packet processor dynamically configures the packet editor. This paper focuses on introducing the high-speed packet editing mechanism which utilizes FPGAs (Field Programmable Gate Arrays) and CAMs (Content Addressable Memories). A prototype network node based on the architecture performs Gigabit-class throughput with packet editing.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131361066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Scalable acceleration of inductive logic programs 归纳逻辑程序的可伸缩加速
A. Fidjeland, W. Luk, S. Muggleton
Inductive logic programming systems are an emerging and powerful paradigm for machine learning which can make use of background knowledge to produce theories expressed in logic. They have been applied successfully to a wide range of problem domains, from protein structure prediction to satellite fault diagnosis. However, their execution can be computationally demanding. We introduce a scalable FPGA-based architecture for executing inductive logic programs, such that the execution speed largely increases linearly with respect to the number of processors. The architecture contains multiple processors derived from Warren's Abstract Machine, which has been optimised for hardware implementation using techniques such as instruction grouping and speculative assignment. The effectiveness of the architecture is demonstrated using the mutagenesis data set containing 12000 facts of chemical compounds.
归纳逻辑编程系统是一种新兴的、强大的机器学习范式,它可以利用背景知识来产生用逻辑表达的理论。它们已经成功地应用于从蛋白质结构预测到卫星故障诊断等广泛的问题领域。然而,它们的执行可能需要大量的计算。我们引入了一个可扩展的基于fpga的架构来执行归纳逻辑程序,这样执行速度就会随着处理器数量的增加而线性增加。该架构包含来自Warren抽象机的多个处理器,该处理器使用指令分组和推测分配等技术对硬件实现进行了优化。使用包含12000个化合物事实的诱变数据集证明了该架构的有效性。
{"title":"Scalable acceleration of inductive logic programs","authors":"A. Fidjeland, W. Luk, S. Muggleton","doi":"10.1109/FPT.2002.1188689","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188689","url":null,"abstract":"Inductive logic programming systems are an emerging and powerful paradigm for machine learning which can make use of background knowledge to produce theories expressed in logic. They have been applied successfully to a wide range of problem domains, from protein structure prediction to satellite fault diagnosis. However, their execution can be computationally demanding. We introduce a scalable FPGA-based architecture for executing inductive logic programs, such that the execution speed largely increases linearly with respect to the number of processors. The architecture contains multiple processors derived from Warren's Abstract Machine, which has been optimised for hardware implementation using techniques such as instruction grouping and speculative assignment. The effectiveness of the architecture is demonstrated using the mutagenesis data set containing 12000 facts of chemical compounds.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131615083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Area and time efficient implementations of matrix multiplication on FPGAs 矩阵乘法在fpga上的面积和时间效率实现
Ju-wook Jang, S. Choi, V. Prasanna
We develop new algorithms and architectures for matrix multiplication on configurable hardware. These designs significantly reduce the latency as well as the area. Our designs improve the previous designs in terms of the area/speed metric where the speed denotes the maximum achievable running frequency. The area/speed metrics for the previous designs and our design are 14.45, 4.93, and 2.35, respectively, for 4 /spl times/ 4 matrix multiplication. The latency of one of the previous design is 0.57 /spl mu/s, while our design takes 0.15 /spl mu/s using 18% less area. The area of our designs is smaller by 11% - 46% compared with the best known systolic designs with the same latency for the matrices of sizes 3 /spl times/ 3 - 12 /spl times/ 12. The performance improvements tend to grow with the problem size.
我们为可配置硬件上的矩阵乘法开发了新的算法和架构。这些设计大大减少了延迟和面积。我们的设计在面积/速度度量方面改进了以前的设计,其中速度表示最大可实现的运行频率。先前设计和我们设计的面积/速度指标分别为14.45,4.93和2.35,用于4 / sp1乘以/ 4矩阵乘法。之前的一种设计延迟为0.57 /spl mu/s,而我们的设计延迟为0.15 /spl mu/s,占地面积减少了18%。我们设计的面积比最著名的收缩设计小11% - 46%,具有相同延迟的大小为3 /spl倍/ 3 - 12 /spl倍/ 12的矩阵。性能改进往往随着问题的大小而增长。
{"title":"Area and time efficient implementations of matrix multiplication on FPGAs","authors":"Ju-wook Jang, S. Choi, V. Prasanna","doi":"10.1109/FPT.2002.1188669","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188669","url":null,"abstract":"We develop new algorithms and architectures for matrix multiplication on configurable hardware. These designs significantly reduce the latency as well as the area. Our designs improve the previous designs in terms of the area/speed metric where the speed denotes the maximum achievable running frequency. The area/speed metrics for the previous designs and our design are 14.45, 4.93, and 2.35, respectively, for 4 /spl times/ 4 matrix multiplication. The latency of one of the previous design is 0.57 /spl mu/s, while our design takes 0.15 /spl mu/s using 18% less area. The area of our designs is smaller by 11% - 46% compared with the best known systolic designs with the same latency for the matrices of sizes 3 /spl times/ 3 - 12 /spl times/ 12. The performance improvements tend to grow with the problem size.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125501851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 65
Dynamic reconfiguration for the common key encryption using FPGA 基于FPGA的通用密钥加密动态重构
Teruyoshi Yamaguchi, T. Hashiyama, S. Okuma
This paper introduces a new encryption method for a block cipher. To keep the ciphertext safe, traditional methods usually use secret keys in simple fixed logic operations. To break the cipher, cryptanalysis has been developed. They use stochastic characteristics such as differential and linear relationships between the input and output data. Encryption algorithms must be secure against these analyses. The proposed method uses the secret key to modify substitution tables called S-Boxes in the encryption algorithm. They are reconfigured on demand using FPGA. Our method is hard enough to prevent the encrypted data from analyzing the secret key using statistical cryptanalysis.
本文介绍了一种新的分组密码加密方法。为了保证密文的安全,传统的方法通常使用密钥进行简单的固定逻辑操作。为了破解密码,人们发明了密码分析。他们使用随机特征,如输入和输出数据之间的微分和线性关系。加密算法必须对这些分析是安全的。该方法使用密钥修改加密算法中的s - box替换表。它们可以使用FPGA按需重新配置。我们的方法足够困难,可以防止加密数据使用统计密码分析来分析密钥。
{"title":"Dynamic reconfiguration for the common key encryption using FPGA","authors":"Teruyoshi Yamaguchi, T. Hashiyama, S. Okuma","doi":"10.1109/FPT.2002.1188715","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188715","url":null,"abstract":"This paper introduces a new encryption method for a block cipher. To keep the ciphertext safe, traditional methods usually use secret keys in simple fixed logic operations. To break the cipher, cryptanalysis has been developed. They use stochastic characteristics such as differential and linear relationships between the input and output data. Encryption algorithms must be secure against these analyses. The proposed method uses the secret key to modify substitution tables called S-Boxes in the encryption algorithm. They are reconfigured on demand using FPGA. Our method is hard enough to prevent the encrypted data from analyzing the secret key using statistical cryptanalysis.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"188 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117285442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Optimising and adapting high-level hardware designs 优化和适应高级硬件设计
J. Coutinho, W. Luk
This paper presents a novel approach that focuses on rapid development and maintenance of optimised hardware designs using a high-level parallel language. We use an existing timing model that states, for instance, that every assignment executes in one clock cycle. This strict timing model gives users control over design scheduling, such as managing the number of cycles and cycle time. Our main contribution is the introduction of a flexible timing model that abstracts optimisation details by supporting high-level transformations and automatic scheduling. Furthermore, we provide techniques that unschedule parallel designs, so that they can be rescheduled to meet new performance and hardware constraints, making designs as implementation independent as possible. With both models, manual development and computerised optimisation can be interleaved to achieve the best effect. Our approach is illustrated by a case study where we port a pipelined convolver to another platform, and achieve either a 300% speedup or a 50% reduction in resource usage.
本文提出了一种新颖的方法,着重于使用高级并行语言快速开发和维护优化的硬件设计。我们使用现有的计时模型,例如,每个任务在一个时钟周期内执行。这种严格的时序模型使用户可以控制设计调度,例如管理周期数和周期时间。我们的主要贡献是引入了一个灵活的定时模型,该模型通过支持高级转换和自动调度来抽象优化细节。此外,我们提供了取消并行设计计划的技术,以便它们可以重新安排以满足新的性能和硬件约束,使设计尽可能独立于实现。有了这两种模型,人工开发和计算机化优化可以交错进行,以达到最佳效果。我们的方法通过一个案例研究来说明,我们将一个流水线卷积器移植到另一个平台上,并实现了300%的加速或50%的资源使用减少。
{"title":"Optimising and adapting high-level hardware designs","authors":"J. Coutinho, W. Luk","doi":"10.1109/FPT.2002.1188676","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188676","url":null,"abstract":"This paper presents a novel approach that focuses on rapid development and maintenance of optimised hardware designs using a high-level parallel language. We use an existing timing model that states, for instance, that every assignment executes in one clock cycle. This strict timing model gives users control over design scheduling, such as managing the number of cycles and cycle time. Our main contribution is the introduction of a flexible timing model that abstracts optimisation details by supporting high-level transformations and automatic scheduling. Furthermore, we provide techniques that unschedule parallel designs, so that they can be rescheduled to meet new performance and hardware constraints, making designs as implementation independent as possible. With both models, manual development and computerised optimisation can be interleaved to achieve the best effect. Our approach is illustrated by a case study where we port a pipelined convolver to another platform, and achieve either a 300% speedup or a 50% reduction in resource usage.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"183 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115063229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
HIDE: a logic based hardware intelligent description environment HIDE:基于逻辑的硬件智能描述环境
S. Belkacemi, K. Benkrid, D. Crookes
This paper presents a high-level hardware description environment based on the logic programming language Prolog, called HIDE. The latter has been designed in an attempt to address the problem of abstract hardware design and hardware efficiency. HIDE provides more abstract hardware descriptions and compositions than are possible in traditional hardware description languages such as VHDL or Verilog. It enables highly scaleable and parameterised composition of blocks using a small set of constructors (e.g. horizontal, vertical composition), and generates pre-placed configurations in EDIF format for Xilinx Virtex FPGAs. The paper presents the syntax and semantics of HIDE and illustrates its use in the construction of a bit parallel multiplier core for Xilinx Virtex FPGAs.
本文提出了一种基于逻辑编程语言Prolog的高级硬件描述环境,称为HIDE。后者是为了解决抽象硬件设计和硬件效率问题而设计的。与传统的硬件描述语言(如VHDL或Verilog)相比,HIDE提供了更多抽象的硬件描述和组合。它使用一小组构造函数(例如水平、垂直组合)实现高度可扩展和参数化的块组合,并为Xilinx Virtex fpga生成EDIF格式的预放置配置。本文介绍了HIDE的语法和语义,并举例说明了它在构建Xilinx Virtex fpga位并行乘法器核中的应用。
{"title":"HIDE: a logic based hardware intelligent description environment","authors":"S. Belkacemi, K. Benkrid, D. Crookes","doi":"10.1109/FPT.2002.1188679","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188679","url":null,"abstract":"This paper presents a high-level hardware description environment based on the logic programming language Prolog, called HIDE. The latter has been designed in an attempt to address the problem of abstract hardware design and hardware efficiency. HIDE provides more abstract hardware descriptions and compositions than are possible in traditional hardware description languages such as VHDL or Verilog. It enables highly scaleable and parameterised composition of blocks using a small set of constructors (e.g. horizontal, vertical composition), and generates pre-placed configurations in EDIF format for Xilinx Virtex FPGAs. The paper presents the syntax and semantics of HIDE and illustrates its use in the construction of a bit parallel multiplier core for Xilinx Virtex FPGAs.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126365425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Implementation of an FPGA based accelerator for virtual private networks 基于FPGA的虚拟专用网加速器的实现
O. Cheung, P. Leong
Virtual Private Networks (VPN) are becoming increasingly popular network architectures for corporate networks. As VPNs are built on the Internet infrastructure, the data exchange among different local area networks will be passed through the Internet and thus can be easily eavesdropped, masqueraded, etc. Therefore, certain security measures must be used to deal with these privacy issues. The Internet Protocol Security (IPSec) by the Internet Engineering Task Force (IETF) addresses the abovementioned security issues and the Free Secure Wide Area Network (FreeS/WAN) is an open source software implementation of IPSec for Linux which uses triple-DES as the default encryption mode. As shown in this paper, the performance of FreeS/WAN with IPSec is 50% of that without encryption. In order to improve its performance, a field programmable gate array (FPGA) based triple-DES accelerator was built on a reconfigurable computing development platform called Pilchard and achieved a throughput of more than 120 Mb/sec for triple-DES in cipher-block chaining mode, a speedup of 3 over a software implementation, Measurements show that an FPGA-accelerated FreeS/WAN offers a 30% speedup for the TCP protocol over the original software library.
虚拟专用网(VPN)是企业网络中日益流行的网络体系结构。由于vpn建立在互联网基础设施上,不同局域网之间的数据交换将通过互联网进行,因此很容易被窃听、伪装等。因此,必须使用一定的安全措施来处理这些隐私问题。互联网工程任务组(IETF)的互联网协议安全(IPSec)解决了上述安全问题,自由安全广域网(FreeS/WAN)是Linux上使用三des作为默认加密模式的IPSec的开源软件实现。如本文所示,使用IPSec的free /WAN的性能是不使用加密的50%。为了提高其性能,在可重构计算开发平台Pilchard上建立了基于现场可编程门阵列(FPGA)的三des加速器,并在密码块链模式下实现了超过120 Mb/秒的吞吐量,比软件实现的速度提高了3倍。测量表明,FPGA加速的FreeS/WAN比原始软件库为TCP协议提供了30%的加速。
{"title":"Implementation of an FPGA based accelerator for virtual private networks","authors":"O. Cheung, P. Leong","doi":"10.1109/FPT.2002.1188662","DOIUrl":"https://doi.org/10.1109/FPT.2002.1188662","url":null,"abstract":"Virtual Private Networks (VPN) are becoming increasingly popular network architectures for corporate networks. As VPNs are built on the Internet infrastructure, the data exchange among different local area networks will be passed through the Internet and thus can be easily eavesdropped, masqueraded, etc. Therefore, certain security measures must be used to deal with these privacy issues. The Internet Protocol Security (IPSec) by the Internet Engineering Task Force (IETF) addresses the abovementioned security issues and the Free Secure Wide Area Network (FreeS/WAN) is an open source software implementation of IPSec for Linux which uses triple-DES as the default encryption mode. As shown in this paper, the performance of FreeS/WAN with IPSec is 50% of that without encryption. In order to improve its performance, a field programmable gate array (FPGA) based triple-DES accelerator was built on a reconfigurable computing development platform called Pilchard and achieved a throughput of more than 120 Mb/sec for triple-DES in cipher-block chaining mode, a speedup of 3 over a software implementation, Measurements show that an FPGA-accelerated FreeS/WAN offers a 30% speedup for the TCP protocol over the original software library.","PeriodicalId":355740,"journal":{"name":"2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127899517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
期刊
2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1