首页 > 最新文献

2007 3rd Southern Conference on Programmable Logic最新文献

英文 中文
A Buffered Crossbar-Based Chip Interconnection Architecture Supporting Quality of Service 一种支持服务质量的基于缓冲交叉条的芯片互连体系结构
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371723
Georgios Kornaros, Y. Papaefstathiou
As systems-on-a-chip (SoCs) become larger, the problem of interconnecting the various subsystems becomes more complicated. In this framework, certain alternatives to the standard busses, based on network technologies, have emerged as an innovative approach for future SoC interconnect. One of the main advantages of such an alternative, is that it can offer certain quality of service (QoS) over the internal cross-connects while at the same time it supports higher transfer rates than the existing on-chip busses. This paper presents a chip interconnection architecture, which is based on a buffered crossbar switch. The main advantage of the proposed system is that it efficiently supports different priority levels; it also provides several gigabits per second of aggregate bandwidth, while it introduces very low latency. Moreover, its hardware complexity is minimal. All those facts make this framework ideal for SoCs that contain IP cores with diverse speed/throughput requirements.
随着片上系统(soc)越来越大,各个子系统的互连问题变得越来越复杂。在这个框架中,基于网络技术的标准总线的某些替代方案已经成为未来SoC互连的创新方法。这种替代方案的主要优点之一是,它可以通过内部交叉连接提供一定的服务质量(QoS),同时它支持比现有片上总线更高的传输速率。本文提出了一种基于缓冲交叉开关的芯片互连结构。该系统的主要优点是能够有效地支持不同的优先级;它还提供了每秒几千兆字节的总带宽,同时引入了非常低的延迟。此外,它的硬件复杂性很小。所有这些事实使该框架非常适合包含具有不同速度/吞吐量要求的IP内核的soc。
{"title":"A Buffered Crossbar-Based Chip Interconnection Architecture Supporting Quality of Service","authors":"Georgios Kornaros, Y. Papaefstathiou","doi":"10.1109/SPL.2007.371723","DOIUrl":"https://doi.org/10.1109/SPL.2007.371723","url":null,"abstract":"As systems-on-a-chip (SoCs) become larger, the problem of interconnecting the various subsystems becomes more complicated. In this framework, certain alternatives to the standard busses, based on network technologies, have emerged as an innovative approach for future SoC interconnect. One of the main advantages of such an alternative, is that it can offer certain quality of service (QoS) over the internal cross-connects while at the same time it supports higher transfer rates than the existing on-chip busses. This paper presents a chip interconnection architecture, which is based on a buffered crossbar switch. The main advantage of the proposed system is that it efficiently supports different priority levels; it also provides several gigabits per second of aggregate bandwidth, while it introduces very low latency. Moreover, its hardware complexity is minimal. All those facts make this framework ideal for SoCs that contain IP cores with diverse speed/throughput requirements.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"297 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122538351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient Hardware Implementations for the Gaussian Normal Basis Multiplication Over GF(2163) GF(2163)上高斯正态基乘法的高效硬件实现
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371722
V. Trujillo-Olaya, Jaime Velasco-Medina, J. C. López-Hernández
This article presents efficient hardware implementations for the Gaussian normal basis multiplication over GF(2163). Hardware implementations of GF(2m) multiplication algorithms are suitable to design elliptic curve cryptoprocessors, which allow that elliptic curve based cryptosystems implemented in hardware provide more physical security and higher performance than software implementations. In this case, the multipliers were designed using conventional, modified and fast- parallel algorithms for the Gaussian normal basis multiplication, the synthesis and simulation were carried out using Quartus II of Altera, and the designs were synthesized on the device EP2A15B724C7. The simulation results show that the multipliers designed present a very good performance using small area.
本文给出了GF(2163)上高斯正态基乘法的有效硬件实现。GF(2m)乘法算法的硬件实现适合于椭圆曲线密码处理器的设计,这使得基于椭圆曲线的密码系统在硬件上实现比软件实现具有更高的物理安全性和更高的性能。在这种情况下,采用常规、改进和快速并行算法设计了高斯正态基乘法乘法器,使用Altera公司的Quartus II进行了合成和仿真,并在器件EP2A15B724C7上进行了合成。仿真结果表明,所设计的乘法器在小面积下具有很好的性能。
{"title":"Efficient Hardware Implementations for the Gaussian Normal Basis Multiplication Over GF(2163)","authors":"V. Trujillo-Olaya, Jaime Velasco-Medina, J. C. López-Hernández","doi":"10.1109/SPL.2007.371722","DOIUrl":"https://doi.org/10.1109/SPL.2007.371722","url":null,"abstract":"This article presents efficient hardware implementations for the Gaussian normal basis multiplication over GF(2163). Hardware implementations of GF(2m) multiplication algorithms are suitable to design elliptic curve cryptoprocessors, which allow that elliptic curve based cryptosystems implemented in hardware provide more physical security and higher performance than software implementations. In this case, the multipliers were designed using conventional, modified and fast- parallel algorithms for the Gaussian normal basis multiplication, the synthesis and simulation were carried out using Quartus II of Altera, and the designs were synthesized on the device EP2A15B724C7. The simulation results show that the multipliers designed present a very good performance using small area.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123844526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of Block-Cipher Library for Reconfigurable Computers 可重构计算机分组密码库的开发
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371747
Miaoqing Huang, T. El-Ghazawi, B. Larson, K. Gaj
Reconfigurable computing is gaining rising attention as an alternative to traditional processing for many applications. Data encryption and decryption is one of these applications, which can get tremendous speedup running on FPGAs instead of microprocessors. We have developed a block-cipher library that covers 15 most popular encryption algorithms, and generated 35 bitstreams running on the SGI's latest version of a reconfigurable computer, RASCRC-100. The end- to-end throughput of 1.136 GB/s have been demonstrated for almost all ciphers, and was limited only by the input/output interface, rather than the FPGA processing time. The library is written in Verilog-HDL, and can be easily ported to other reconfigurable computing platforms. It provides means for cryptographers and computer scientists to program reconfigurable computers without the need for detailed knowledge of hardware design.
在许多应用中,可重构计算作为传统处理的一种替代方案正获得越来越多的关注。数据加密和解密就是其中一种应用,它可以在fpga上而不是微处理器上获得巨大的速度提升。我们开发了一个分组密码库,涵盖了15种最流行的加密算法,并在SGI最新版本的可重构计算机RASCRC-100上生成了35个比特流。对于几乎所有的密码,已经证明了1.136 GB/s的端到端吞吐量,并且仅受输入/输出接口的限制,而不受FPGA处理时间的限制。该库是用Verilog-HDL编写的,可以很容易地移植到其他可重构的计算平台上。它为密码学家和计算机科学家提供了在不需要详细的硬件设计知识的情况下编写可重构计算机的方法。
{"title":"Development of Block-Cipher Library for Reconfigurable Computers","authors":"Miaoqing Huang, T. El-Ghazawi, B. Larson, K. Gaj","doi":"10.1109/SPL.2007.371747","DOIUrl":"https://doi.org/10.1109/SPL.2007.371747","url":null,"abstract":"Reconfigurable computing is gaining rising attention as an alternative to traditional processing for many applications. Data encryption and decryption is one of these applications, which can get tremendous speedup running on FPGAs instead of microprocessors. We have developed a block-cipher library that covers 15 most popular encryption algorithms, and generated 35 bitstreams running on the SGI's latest version of a reconfigurable computer, RASCRC-100. The end- to-end throughput of 1.136 GB/s have been demonstrated for almost all ciphers, and was limited only by the input/output interface, rather than the FPGA processing time. The library is written in Verilog-HDL, and can be easily ported to other reconfigurable computing platforms. It provides means for cryptographers and computer scientists to program reconfigurable computers without the need for detailed knowledge of hardware design.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132475922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FPGA Modulator for Matrix Converter 矩阵变换器的FPGA调制器
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371751
M. Gonzalez, M. Funes, R. Petrocelli, M. Benedetti
This work presents a modulator implemented in a FPGA for power matrix converters. Its function is to operate as a peripheral unit of a digitally-controlled system in order to generate duty cycles for each of the converter switches and provide a safe commutation of the switching devices. The correct generation of duty cycles and sequences was verified, as well as the safe commutation of bi-directional switches. The performance of the modulator in conjunction with the power stage and a DSP, which performs the high level control layer, was also analyzed.
本文提出了一种用FPGA实现的功率矩阵变换器调制器。它的功能是作为数字控制系统的外围单元,以便为每个转换开关产生占空比,并提供开关设备的安全换相。验证了占空比和序列的正确生成,以及双向开关的安全换相。分析了调制器在功率级和高级控制层DSP的配合下的性能。
{"title":"FPGA Modulator for Matrix Converter","authors":"M. Gonzalez, M. Funes, R. Petrocelli, M. Benedetti","doi":"10.1109/SPL.2007.371751","DOIUrl":"https://doi.org/10.1109/SPL.2007.371751","url":null,"abstract":"This work presents a modulator implemented in a FPGA for power matrix converters. Its function is to operate as a peripheral unit of a digitally-controlled system in order to generate duty cycles for each of the converter switches and provide a safe commutation of the switching devices. The correct generation of duty cycles and sequences was verified, as well as the safe commutation of bi-directional switches. The performance of the modulator in conjunction with the power stage and a DSP, which performs the high level control layer, was also analyzed.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134633820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Genetic Algorithm Based Solution for Dynamically Reconfigurable Modules Allocation 一种基于遗传算法的动态可重构模块分配方法
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371745
V. Rana, C. Sandionigi, M. Santambrogio
The advances in the programmable hardware have lead to new architectures, where the hardware can be dynamically adapted to the application to gain better performance. One of the problems in realizing dynamically reconfigurable systems is the allocation of dynamically reconfigurable modules. In this scenario, when a new module has to be reconfigured in the system, there is the need to find a suitable free place where it can be configured. In this work a genetic algorithm has been developed to solve the problem of dynamically reconfigurable modules allocation. The search task has been modeled with a genetic algorithm in which each chromosome represents a configuration status of the programmable devices and both crossover and mutation processes try to change the previously found location for the new module in order to achieve a better fitness, that stands for the goodness of the final solution.
可编程硬件的进步带来了新的体系结构,其中硬件可以动态地适应应用程序以获得更好的性能。实现动态可重构系统的问题之一是动态可重构模块的分配问题。在这种情况下,当必须在系统中重新配置新模块时,需要找到一个合适的空闲位置来对其进行配置。本文提出了一种遗传算法来解决动态可重构模块分配问题。搜索任务用遗传算法建模,其中每条染色体代表可编程设备的一个配置状态,交叉和突变过程都试图改变先前发现的新模块的位置,以获得更好的适合度,这代表了最终解决方案的优劣。
{"title":"A Genetic Algorithm Based Solution for Dynamically Reconfigurable Modules Allocation","authors":"V. Rana, C. Sandionigi, M. Santambrogio","doi":"10.1109/SPL.2007.371745","DOIUrl":"https://doi.org/10.1109/SPL.2007.371745","url":null,"abstract":"The advances in the programmable hardware have lead to new architectures, where the hardware can be dynamically adapted to the application to gain better performance. One of the problems in realizing dynamically reconfigurable systems is the allocation of dynamically reconfigurable modules. In this scenario, when a new module has to be reconfigured in the system, there is the need to find a suitable free place where it can be configured. In this work a genetic algorithm has been developed to solve the problem of dynamically reconfigurable modules allocation. The search task has been modeled with a genetic algorithm in which each chromosome represents a configuration status of the programmable devices and both crossover and mutation processes try to change the previously found location for the new module in order to achieve a better fitness, that stands for the goodness of the final solution.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115202735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Fast Architecture for Radix 10 Coordinates Rotation 基数10坐标旋转的快速架构
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371721
A. J. Morenilla, H. Mora, J. Romero, F.P. Lopez
Although radix-10 based arithmetic has been gaining renewed importance over the last few years, decimal systems are not efficient enough and techniques are still under development. In this paper, a modification of the CORDIC method for decimal arithmetic is proposed and applied to produce fast rotations. The algorithm uses BCD operands as inputs, combining the advantages both decimal and binary systems. The result is an important number of iterations reduction compared with the original decimal CORDIC method. Finally, a FPGA-based radix-10 architecture that can be used to produce rotations with more precision and speed is presented and different experiments showing the advantages of the new method are shown.
虽然基于基数10的算术在过去几年中获得了新的重要性,但十进制系统不够有效,技术仍在发展中。本文提出了对十进制算法CORDIC方法的一种改进,并将其应用于快速旋转。该算法使用BCD操作数作为输入,结合了十进制和二进制的优点。与原始的十进制CORDIC方法相比,该方法大大减少了迭代次数。最后,提出了一种基于fpga的基数-10结构,该结构能够以更高的精度和速度产生旋转,并通过不同的实验显示了新方法的优点。
{"title":"A Fast Architecture for Radix 10 Coordinates Rotation","authors":"A. J. Morenilla, H. Mora, J. Romero, F.P. Lopez","doi":"10.1109/SPL.2007.371721","DOIUrl":"https://doi.org/10.1109/SPL.2007.371721","url":null,"abstract":"Although radix-10 based arithmetic has been gaining renewed importance over the last few years, decimal systems are not efficient enough and techniques are still under development. In this paper, a modification of the CORDIC method for decimal arithmetic is proposed and applied to produce fast rotations. The algorithm uses BCD operands as inputs, combining the advantages both decimal and binary systems. The result is an important number of iterations reduction compared with the original decimal CORDIC method. Finally, a FPGA-based radix-10 architecture that can be used to produce rotations with more precision and speed is presented and different experiments showing the advantages of the new method are shown.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131963402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Comparative Analysis of Multitask Scheduling Algorithms for Reconfigurable Computing Regarding Context Switches and Configuration Cache Usage 基于上下文切换和配置缓存使用的可重构计算多任务调度算法比较分析
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371758
Christopher Spies, L. Indrusiak, M. Glesner
In this paper, we investigate the usability of several well-known real-time scheduling algorithms for a system consisting of a single processor core and multiple dynamically reconfigurable functional units, running a number of processes in parallel. A SystemC simulation model of a wireless sensor network node serves as a case study for assessing the performance of the different algorithms. Specific emphasis is given to the analysis of the configuration cache miss ratio and the number of context switches, which are indicators of costly operations of the reconfigurable units, respectively reconfiguration and saving internal state.
在本文中,我们研究了几种著名的实时调度算法在一个由单个处理器核心和多个动态可重构功能单元组成的系统中的可用性,该系统并行运行多个进程。一个无线传感器网络节点的SystemC仿真模型作为评估不同算法性能的案例研究。重点分析了配置缓存缺失率和上下文切换次数,这是可重构单元重新配置和保存内部状态的昂贵操作的指标。
{"title":"Comparative Analysis of Multitask Scheduling Algorithms for Reconfigurable Computing Regarding Context Switches and Configuration Cache Usage","authors":"Christopher Spies, L. Indrusiak, M. Glesner","doi":"10.1109/SPL.2007.371758","DOIUrl":"https://doi.org/10.1109/SPL.2007.371758","url":null,"abstract":"In this paper, we investigate the usability of several well-known real-time scheduling algorithms for a system consisting of a single processor core and multiple dynamically reconfigurable functional units, running a number of processes in parallel. A SystemC simulation model of a wireless sensor network node serves as a case study for assessing the performance of the different algorithms. Specific emphasis is given to the analysis of the configuration cache miss ratio and the number of context switches, which are indicators of costly operations of the reconfigurable units, respectively reconfiguration and saving internal state.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"138 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123243286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Extending Embedded Computing Scheduling Algorithms for Reconfigurable Computing Systems 可重构计算系统的扩展嵌入式计算调度算法
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371729
P. Saha, T. El-Ghazawi
Current work on automatic task partitioning and scheduling for reconfigurable computing (RC) systems strictly addresses the FPGA hardware, and does not take advantage of the synergy between the microprocessor and the FPGA. Efforts on partitioning between muP and the FPGA are a manual and laborious effort, as a formal methodology for automatic hardware-software partitioning has not been established. Related fields such as heterogeneous computing (HC) and embedded computing (EC) have an extensive body of work for scheduling for heterogeneous processors. Unlike the HC scheduling algorithms, the EC algorithms take into account the differences in computational capabilities of each processing element. In this work, we adapt EC scheduling algorithms for RC systems, and show how simply adapting the algorithms alone is not sufficient to take advantage of the reconfigurable hardware. We introduce new heuristic algorithms based on EC scheduling algorithms and show that they provide up to an order of magnitude improvement in scheduling and execution times.
目前针对可重构计算(RC)系统的自动任务划分和调度工作严格地针对FPGA硬件,并且没有利用微处理器和FPGA之间的协同作用。在muP和FPGA之间进行分区是一项手工且费力的工作,因为尚未建立起用于自动硬件软件分区的正式方法。相关领域,如异构计算(HC)和嵌入式计算(EC)有大量的异构处理器调度工作。与HC调度算法不同,EC算法考虑到每个处理元素的计算能力差异。在这项工作中,我们将EC调度算法应用于RC系统,并展示了简单地调整算法是如何不足以利用可重构硬件的。我们介绍了基于EC调度算法的新启发式算法,并表明它们在调度和执行时间方面提供了高达数量级的改进。
{"title":"Extending Embedded Computing Scheduling Algorithms for Reconfigurable Computing Systems","authors":"P. Saha, T. El-Ghazawi","doi":"10.1109/SPL.2007.371729","DOIUrl":"https://doi.org/10.1109/SPL.2007.371729","url":null,"abstract":"Current work on automatic task partitioning and scheduling for reconfigurable computing (RC) systems strictly addresses the FPGA hardware, and does not take advantage of the synergy between the microprocessor and the FPGA. Efforts on partitioning between muP and the FPGA are a manual and laborious effort, as a formal methodology for automatic hardware-software partitioning has not been established. Related fields such as heterogeneous computing (HC) and embedded computing (EC) have an extensive body of work for scheduling for heterogeneous processors. Unlike the HC scheduling algorithms, the EC algorithms take into account the differences in computational capabilities of each processing element. In this work, we adapt EC scheduling algorithms for RC systems, and show how simply adapting the algorithms alone is not sufficient to take advantage of the reconfigurable hardware. We introduce new heuristic algorithms based on EC scheduling algorithms and show that they provide up to an order of magnitude improvement in scheduling and execution times.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120948107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
FPGA Neural Networks Implementation for Nuclear Pulses Parameters Estimation 核脉冲参数估计的FPGA神经网络实现
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371716
D. Estryk, G. E. Ríos, C. Verrastro
Nuclear pulses parameters estimation is needed in many nuclear applications. Its precision and performance requirements are very demanding, especially in PET applications. Quality of PET images depends on the energy and time resolution of gamma pulses detection. Neural networks estimators were analyzed in contrast with common methods. Two-layer feed-forward networks with three neurons in the hidden layer reached precision goal. The chosen estimators allowed the use of 40 MHz free running ADC obtaining precision of 1ns in the timestamp determination, exceeding coincidence detection requirements. An efficient VHDL implementation on an inexpensive Xilinx Spartan-3 FPGA was achieved that fulfill performance requirements, adding no dead time due to digital processing. The estimators and its FPGA implementations were verified on hardware and characterization were done using nuclear shaped pulses synthesized with an arbitrary function generator.
在许多核应用中都需要核脉冲参数估计。其精度和性能要求非常高,特别是在PET应用中。PET图像的质量取决于伽马脉冲检测的能量和时间分辨率。对神经网络估计方法与常用方法进行了对比分析。隐藏层包含三个神经元的两层前馈网络达到了精度目标。所选择的估计器允许使用40 MHz自由运行的ADC在时间戳确定中获得1ns的精度,超过了一致性检测要求。在价格低廉的Xilinx Spartan-3 FPGA上实现了高效的VHDL实现,满足了性能要求,并且没有由于数字处理而增加死区时间。在硬件上验证了该估计器及其FPGA实现,并利用任意函数发生器合成的核形脉冲进行了表征。
{"title":"FPGA Neural Networks Implementation for Nuclear Pulses Parameters Estimation","authors":"D. Estryk, G. E. Ríos, C. Verrastro","doi":"10.1109/SPL.2007.371716","DOIUrl":"https://doi.org/10.1109/SPL.2007.371716","url":null,"abstract":"Nuclear pulses parameters estimation is needed in many nuclear applications. Its precision and performance requirements are very demanding, especially in PET applications. Quality of PET images depends on the energy and time resolution of gamma pulses detection. Neural networks estimators were analyzed in contrast with common methods. Two-layer feed-forward networks with three neurons in the hidden layer reached precision goal. The chosen estimators allowed the use of 40 MHz free running ADC obtaining precision of 1ns in the timestamp determination, exceeding coincidence detection requirements. An efficient VHDL implementation on an inexpensive Xilinx Spartan-3 FPGA was achieved that fulfill performance requirements, adding no dead time due to digital processing. The estimators and its FPGA implementations were verified on hardware and characterization were done using nuclear shaped pulses synthesized with an arbitrary function generator.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127723683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
FPGA Based Design of CAVLC and Exp-Golomb Coders for H.264/AVC Baseline Entropy Coding 基于FPGA的H.264/AVC基线熵编码CAVLC和ex - golomb编码器设计
Pub Date : 2007-06-18 DOI: 10.1109/SPL.2007.371741
T. Silva, J. Vortmann, Luciano Agostini, S. Bampi, A. Susin
This paper presents the design of a hardware architecture for the entropy coder of H.264/AVC video compression standard, considering the baseline profile. The baseline entropy coder is composed of two main blocks: Exp-Golomb coder and CAVLC coder. This paper presents the architectural design of these two blocks. These architectures were described in VHDL and synthesized to an Altera Stratix-II FPGA. From the synthesis results it was possible to verify that the Exp-Golomb and CAVLC coders reached a throughput of 15.9 million of samples per second for the Exp-Golomb coder and of 103.8 million of samples per second for CAVLC coder. The H.264/AVC baseline entropy coder is being designed through the integration of these two coders and preliminary results indicate that this solution will be able to process HDTV frames in real time.
本文提出了H.264/AVC视频压缩标准的熵编码器的硬件结构设计,并考虑了基线配置。基线熵编码器由两个主要模块组成:Exp-Golomb编码器和CAVLC编码器。本文介绍了这两个街区的建筑设计。这些架构用VHDL描述,并合成到Altera Stratix-II FPGA上。从合成结果可以验证Exp-Golomb编码器和CAVLC编码器达到每秒1590万样本的吞吐量,而CAVLC编码器达到每秒1.038亿样本的吞吐量。H.264/AVC基线熵编码器是通过这两种编码器的集成而设计的,初步结果表明,该方案能够实时处理高清电视帧。
{"title":"FPGA Based Design of CAVLC and Exp-Golomb Coders for H.264/AVC Baseline Entropy Coding","authors":"T. Silva, J. Vortmann, Luciano Agostini, S. Bampi, A. Susin","doi":"10.1109/SPL.2007.371741","DOIUrl":"https://doi.org/10.1109/SPL.2007.371741","url":null,"abstract":"This paper presents the design of a hardware architecture for the entropy coder of H.264/AVC video compression standard, considering the baseline profile. The baseline entropy coder is composed of two main blocks: Exp-Golomb coder and CAVLC coder. This paper presents the architectural design of these two blocks. These architectures were described in VHDL and synthesized to an Altera Stratix-II FPGA. From the synthesis results it was possible to verify that the Exp-Golomb and CAVLC coders reached a throughput of 15.9 million of samples per second for the Exp-Golomb coder and of 103.8 million of samples per second for CAVLC coder. The H.264/AVC baseline entropy coder is being designed through the integration of these two coders and preliminary results indicate that this solution will be able to process HDTV frames in real time.","PeriodicalId":419253,"journal":{"name":"2007 3rd Southern Conference on Programmable Logic","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130827080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
期刊
2007 3rd Southern Conference on Programmable Logic
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1