首页 > 最新文献

2009 International Conference on Reconfigurable Computing and FPGAs最新文献

英文 中文
Decimal Adders/Subtractors in FPGA: Efficient 6-input LUT Implementations FPGA中的十进制加/减法器:高效的6输入LUT实现
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.29
M. Vazquez, G. Sutter, G. Bioul, J. Deschamps
This paper presents FPGA implementations of add/subtract algorithms for 10´s complement BCD numbers. Carry-chain type circuits have been designed on 6-input LUT´s Xilinx Virtex-5 FPGA technologies. Some new concepts are reviewed to compute the P and G functions for carry-chain optimization purposes. Designs are presented with the corresponding time performances and area consumption figures. Results have been compared with 2´s complement binary implementations carried out on the same platform. Better time delays have been registered for decimal number within same range of operands.
本文介绍了10 ' s补码BCD数的加减算法的FPGA实现。在6输入LUT的Xilinx Virtex-5 FPGA技术上设计了携带链型电路。本文回顾了计算携带链优化的P和G函数的一些新概念。设计方案给出了相应的时间性能和面积消耗数据。结果与在同一平台上进行的2 ' s补码二进制实现进行了比较。在相同的操作数范围内,为十进制数注册了更好的时间延迟。
{"title":"Decimal Adders/Subtractors in FPGA: Efficient 6-input LUT Implementations","authors":"M. Vazquez, G. Sutter, G. Bioul, J. Deschamps","doi":"10.1109/ReConFig.2009.29","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.29","url":null,"abstract":"This paper presents FPGA implementations of add/subtract algorithms for 10´s complement BCD numbers. Carry-chain type circuits have been designed on 6-input LUT´s Xilinx Virtex-5 FPGA technologies. Some new concepts are reviewed to compute the P and G functions for carry-chain optimization purposes. Designs are presented with the corresponding time performances and area consumption figures. Results have been compared with 2´s complement binary implementations carried out on the same platform. Better time delays have been registered for decimal number within same range of operands.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131074898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Enhancing the Productivity of Radio Designers with RapidRadio 提高生产力的无线电设计师与RapidRadio
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.84
J. Surís, A. Recio, P. Athanas
In this paper the RapidRadio framework for signal classification and receiver deployment is discussed. The frame- work is a productivity enhancing tool that reduces the required knowledge-base for implementing a receiver on an FPGA-based SDR platform. The ultimate objective of this framework is to identify unknown signals and to build FPGA-based receivers capable of receiving them. The framework’s capacity to classify a signal and deploy a functional receiver is validated with over- the-air experiments.
本文讨论了用于信号分类和接收机部署的RapidRadio框架。该框架是一种提高生产率的工具,减少了在基于fpga的SDR平台上实现接收机所需的知识基础。该框架的最终目标是识别未知信号并构建能够接收它们的基于fpga的接收器。通过空中实验验证了该框架对信号进行分类和部署功能接收器的能力。
{"title":"Enhancing the Productivity of Radio Designers with RapidRadio","authors":"J. Surís, A. Recio, P. Athanas","doi":"10.1109/ReConFig.2009.84","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.84","url":null,"abstract":"In this paper the RapidRadio framework for signal classification and receiver deployment is discussed. The frame- work is a productivity enhancing tool that reduces the required knowledge-base for implementing a receiver on an FPGA-based SDR platform. The ultimate objective of this framework is to identify unknown signals and to build FPGA-based receivers capable of receiving them. The framework’s capacity to classify a signal and deploy a functional receiver is validated with over- the-air experiments.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116009085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
FPGA Implementation of Izhikevich Spiking Neural Networks for Character Recognition 用于字符识别的Izhikevich脉冲神经网络的FPGA实现
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.77
Kenneth L. Rice, M. Bhuiyan, T. Taha, Christopher N. Vutsinas, M. C. Smith
There has been a strong push recently to examine biological scale simulations of neuromorphic algorithms to achieve stronger inference capabilities than current computing algorithms. The recent Izhikevich spiking neuron model is ideally suited for such large scale cortical simulations due to its efficiency and biological accuracy. In this paper we explore the feasibility of using FPGAs for large scale simulations of the Izhikevich model. We developed a modularized processing element to evaluate a large number of Izhikevich spiking neurons in a pipelined manner. This approach allows for easy scalability of the model to larger FPGAs. We utilized a character recognition algorithm based on the Izhikevich model for this study and scaled up the algorithm to use over 9000 neurons. The FPGA implementation of the algorithm on a Xilinx Virtex 4 provided a speedup of approximately 8.5 times an equivalent software implementation on a 2.2 GHz AMD Opteron core. Our results indicate that FPGAs are suitable for large scale cortical simulations utilizing the Izhikevich spiking neuron model.
最近,研究神经形态算法的生物尺度模拟以获得比当前计算算法更强的推理能力的呼声很高。最近的Izhikevich脉冲神经元模型由于其效率和生物学准确性而非常适合这种大规模的皮层模拟。在本文中,我们探讨了使用fpga对Izhikevich模型进行大规模模拟的可行性。我们开发了一个模块化的处理元素,以流水线的方式评估大量的Izhikevich尖峰神经元。这种方法允许将模型轻松扩展到更大的fpga。在本研究中,我们使用了一种基于Izhikevich模型的字符识别算法,并将算法扩展到使用超过9000个神经元。该算法在Xilinx Virtex 4上的FPGA实现提供了在2.2 GHz AMD Opteron核心上等效软件实现的大约8.5倍的加速。我们的研究结果表明fpga适用于利用Izhikevich峰值神经元模型进行大规模皮层模拟。
{"title":"FPGA Implementation of Izhikevich Spiking Neural Networks for Character Recognition","authors":"Kenneth L. Rice, M. Bhuiyan, T. Taha, Christopher N. Vutsinas, M. C. Smith","doi":"10.1109/ReConFig.2009.77","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.77","url":null,"abstract":"There has been a strong push recently to examine biological scale simulations of neuromorphic algorithms to achieve stronger inference capabilities than current computing algorithms. The recent Izhikevich spiking neuron model is ideally suited for such large scale cortical simulations due to its efficiency and biological accuracy. In this paper we explore the feasibility of using FPGAs for large scale simulations of the Izhikevich model. We developed a modularized processing element to evaluate a large number of Izhikevich spiking neurons in a pipelined manner. This approach allows for easy scalability of the model to larger FPGAs. We utilized a character recognition algorithm based on the Izhikevich model for this study and scaled up the algorithm to use over 9000 neurons. The FPGA implementation of the algorithm on a Xilinx Virtex 4 provided a speedup of approximately 8.5 times an equivalent software implementation on a 2.2 GHz AMD Opteron core. Our results indicate that FPGAs are suitable for large scale cortical simulations utilizing the Izhikevich spiking neuron model.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"446 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116329447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 96
Effects of Simplistic Online Synthesis for AMIDAR Processors 简化在线合成对AMIDAR处理器的影响
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.21
Stefan Döbrich, C. Hochberger
Future chip technologies will change the way we deal with hardware design. (1) logic resources will be available in vast amount and (2) engineering specialized designs for particular applications will no longer be the general approach as the non recurring expenses will grow tremendously. Thus, we believe that online synthesis that takes place during the execution of an application is one way to overcome these problems. In this paper we show that even a relative simplistic synthesis approach can have a strong impact on the performance of compute intensive applications.
未来的芯片技术将改变我们处理硬件设计的方式。(1)逻辑资源将大量可用,(2)针对特定应用的工程专门设计将不再是通用方法,因为非经常性费用将大幅增长。因此,我们认为在应用程序执行期间进行的在线合成是克服这些问题的一种方法。在本文中,我们表明,即使是相对简单的综合方法也会对计算密集型应用程序的性能产生强烈的影响。
{"title":"Effects of Simplistic Online Synthesis for AMIDAR Processors","authors":"Stefan Döbrich, C. Hochberger","doi":"10.1109/ReConFig.2009.21","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.21","url":null,"abstract":"Future chip technologies will change the way we deal with hardware design. (1) logic resources will be available in vast amount and (2) engineering specialized designs for particular applications will no longer be the general approach as the non recurring expenses will grow tremendously. Thus, we believe that online synthesis that takes place during the execution of an application is one way to overcome these problems. In this paper we show that even a relative simplistic synthesis approach can have a strong impact on the performance of compute intensive applications.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122083287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
PCIREX: A Fast Prototyping Platform for TMR Dynamically Reconfigurable Systems PCIREX: TMR动态可重构系统的快速原型平台
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.33
A. Astarloa, Jesús Lázaro, U. Bidarte, A. Zuloaga, J. Jiménez
This paper presents a PCI-Express based platform for the analysis and evaluation of designs that combines Triple Modular Redundancy and Dynamic Reconfiguration to provide Fault Tolerance and Self-repairing capabilities. The paper presents the general architecture of the platform and exemplifies its functionality with the implementation of a Self-Repairing CAN Gateway.
本文提出了一个基于PCI-Express的设计分析与评估平台,该平台结合了三模冗余和动态重构,提供了容错和自修复能力。本文介绍了该平台的总体结构,并通过一个自修复CAN网关的实现举例说明了其功能。
{"title":"PCIREX: A Fast Prototyping Platform for TMR Dynamically Reconfigurable Systems","authors":"A. Astarloa, Jesús Lázaro, U. Bidarte, A. Zuloaga, J. Jiménez","doi":"10.1109/ReConFig.2009.33","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.33","url":null,"abstract":"This paper presents a PCI-Express based platform for the analysis and evaluation of designs that combines Triple Modular Redundancy and Dynamic Reconfiguration to provide Fault Tolerance and Self-repairing capabilities. The paper presents the general architecture of the platform and exemplifies its functionality with the implementation of a Self-Repairing CAN Gateway.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":" 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113952320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Triple Line-Based Playout for Go - An Accelerator for Monte Carlo Go 基于三重线的围棋播放-蒙特卡洛围棋加速器
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.75
Kenichi Koizumi, M. Inaba, K. Hiraki, Y. Ishii, T. Miyoshi, Kazuki Yoshizoe
After a computer named “Deep Blue” defeated the world chess champion Garry Kasparov in 1997, researchers studying computer board games focused their attention on the game “Go.” Go is known to be more difficult for computers to play than chess or shogi because (1) the search space for Go is much larger, (2) it is difficult to define an appropriate evaluation function of position, and (3) a position sometimes changes globally in just one move. Recently, a new method called Monte Carlo Go has been developed, which involves performing Monte Carlo simulations to evaluate a position. Monte Carlo Go increases the strength of the Computer-Go program. For Monte Carlo Go, the strength fully depends on the number of simulations. Several attempts were made to accelerate simulations, e.g., by the use of cluster systems and FPGAs. The cluster system yields good results, but it is a very expensive system. On the other hand, acceleration using an FPGA was not so easy because the usage of FPGA resources tends to be high. Previously, FPGA acceleration was feasible for smaller board such as a board with a 9 × 9 grid, while it was not feasible for the standard board with a 19 × 19 grid. In this paper, we propose triple line-based playout for Go (TLPG), a hardware algorithm for generating simulations using an FPGA. By reproducing global information redundantly, TLPG enables the generation of simulations only using local operations; this helps realize compact implementations of hardware logic, and thus, TLPG can handle both 9 × 9 and 19 × 19 grid Go boards. We implement TLPG on Xilinx Virtex-5 (XC5VFX70T-1FF1136) and evaluate it. TLPG can perform 40,649 playouts per second for a 9 × 9 grid Go board and 4,668 playouts per second for a 19 × 19 grid Go board.
1997年,一台名为“深蓝”的计算机击败了国际象棋世界冠军加里·卡斯帕罗夫(Garry Kasparov)后,研究计算机棋盘游戏的研究人员将注意力集中在了“围棋”上。众所周知,对于计算机来说,围棋比国际象棋或幕府棋更难下,因为(1)围棋的搜索空间要大得多,(2)很难定义一个适当的位置评估函数,(3)一个位置有时会在一次移动中全局改变。最近,一种叫做蒙特卡罗围棋的新方法被开发出来,它涉及到进行蒙特卡罗模拟来评估一个位置。蒙特卡罗围棋增加了计算机围棋程序的强度。对于蒙特卡罗围棋,其强度完全取决于模拟次数。为了加速仿真,我们做了一些尝试,例如使用集群系统和fpga。集群系统产生良好的结果,但它是一个非常昂贵的系统。另一方面,使用FPGA加速不是那么容易,因为FPGA资源的使用往往很高。以前,FPGA加速对于较小的板是可行的,例如具有9 × 9网格的板,而对于具有19 × 19网格的标准板则不可行的。在本文中,我们提出了基于三线的围棋游戏(TLPG),这是一种使用FPGA生成模拟的硬件算法。通过冗余复制全局信息,TLPG可以仅使用局部操作生成模拟;这有助于实现硬件逻辑的紧凑实现,因此,TLPG可以处理9 × 9和19 × 19网格围棋棋盘。我们在Xilinx Virtex-5 (XC5VFX70T-1FF1136)上实现了TLPG并对其进行了评估。对于一个9 × 9格的围棋棋盘,TLPG每秒可以执行40,649次下棋,对于一个19 × 19格的围棋棋盘,每秒可以执行4,668次下棋。
{"title":"Triple Line-Based Playout for Go - An Accelerator for Monte Carlo Go","authors":"Kenichi Koizumi, M. Inaba, K. Hiraki, Y. Ishii, T. Miyoshi, Kazuki Yoshizoe","doi":"10.1109/ReConFig.2009.75","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.75","url":null,"abstract":"After a computer named “Deep Blue” defeated the world chess champion Garry Kasparov in 1997, researchers studying computer board games focused their attention on the game “Go.” Go is known to be more difficult for computers to play than chess or shogi because (1) the search space for Go is much larger, (2) it is difficult to define an appropriate evaluation function of position, and (3) a position sometimes changes globally in just one move. Recently, a new method called Monte Carlo Go has been developed, which involves performing Monte Carlo simulations to evaluate a position. Monte Carlo Go increases the strength of the Computer-Go program. For Monte Carlo Go, the strength fully depends on the number of simulations. Several attempts were made to accelerate simulations, e.g., by the use of cluster systems and FPGAs. The cluster system yields good results, but it is a very expensive system. On the other hand, acceleration using an FPGA was not so easy because the usage of FPGA resources tends to be high. Previously, FPGA acceleration was feasible for smaller board such as a board with a 9 × 9 grid, while it was not feasible for the standard board with a 19 × 19 grid. In this paper, we propose triple line-based playout for Go (TLPG), a hardware algorithm for generating simulations using an FPGA. By reproducing global information redundantly, TLPG enables the generation of simulations only using local operations; this helps realize compact implementations of hardware logic, and thus, TLPG can handle both 9 × 9 and 19 × 19 grid Go boards. We implement TLPG on Xilinx Virtex-5 (XC5VFX70T-1FF1136) and evaluate it. TLPG can perform 40,649 playouts per second for a 9 × 9 grid Go board and 4,668 playouts per second for a 19 × 19 grid Go board.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116775266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Runtime Temporal Partitioning Assembly to Reduce FPGA Reconfiguration Time Runtime Temporal Partitioning Assembly减少FPGA重构时间
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.61
Abelardo Jara-Berrocal, A. Gordon-Ross
Large applications that exceed available FPGA resources must time-multiplex these resources using smaller hardware modules. In order to orchestrate this time-multiplexing, temporal partitioning partitions these hardware modules into multiple subsets, each of which fit within the available resources. During a temporal partition transition, the FPGA is reconfigured to the subsequent temporal partition. However, FPGA reconfiguration time can impose significant performance overhead as the entire FPGA fabric must be reconfigured even if only a small portion has changed. Partially reconfigurable (PR) FPGAs can decrease reconfiguration time by only reconfiguring the portions of the FPGA fabric that differ. In this paper, we present a design methodology using a simulated annealing-based module placement optimization engine to minimize FPGA reconfiguration overhead by exploiting module overlap across successive temporal partitions. Experimental results show that our methodology reduces FPGA reconfiguration time by 44% on average.
超过可用FPGA资源的大型应用程序必须使用较小的硬件模块对这些资源进行时间复用。为了编排这种时间多路复用,临时分区将这些硬件模块划分为多个子集,每个子集都适合可用资源。在时间分区转换期间,FPGA被重新配置为后续的时间分区。然而,FPGA重新配置时间会带来显著的性能开销,因为即使只有一小部分发生了变化,也必须重新配置整个FPGA结构。部分可重构(PR) FPGA可以通过仅重新配置FPGA结构的不同部分来减少重新配置时间。在本文中,我们提出了一种设计方法,使用模拟的基于退火的模块放置优化引擎,通过利用连续时间分区之间的模块重叠来最小化FPGA重构开销。实验结果表明,该方法可将FPGA重构时间平均缩短44%。
{"title":"Runtime Temporal Partitioning Assembly to Reduce FPGA Reconfiguration Time","authors":"Abelardo Jara-Berrocal, A. Gordon-Ross","doi":"10.1109/ReConFig.2009.61","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.61","url":null,"abstract":"Large applications that exceed available FPGA resources must time-multiplex these resources using smaller hardware modules. In order to orchestrate this time-multiplexing, temporal partitioning partitions these hardware modules into multiple subsets, each of which fit within the available resources. During a temporal partition transition, the FPGA is reconfigured to the subsequent temporal partition. However, FPGA reconfiguration time can impose significant performance overhead as the entire FPGA fabric must be reconfigured even if only a small portion has changed. Partially reconfigurable (PR) FPGAs can decrease reconfiguration time by only reconfiguring the portions of the FPGA fabric that differ. In this paper, we present a design methodology using a simulated annealing-based module placement optimization engine to minimize FPGA reconfiguration overhead by exploiting module overlap across successive temporal partitions. Experimental results show that our methodology reduces FPGA reconfiguration time by 44% on average.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"40 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114042349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Fuzzy Control for Cyclist Robot Stability Using FPGAs 基于fpga的自行车机器人稳定性模糊控制
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.53
C. Castro, C. Llanos, Walter de Britto Vidal Filho, L. Coelho
This paper presents a fuzzy controller implementation in FPGA (Field Programmable Gate Array) for a robot that rides a bicycle using the well-known Acrobot model. The overall system presents a hardware/software codesign approach and it was achieved by means of a Microblaze FPGA embedded processor and a fuzzy controller, which was implemented directly in hardware. Both the microprocessor and the controller are connected via the Fast Simplex Link - FSL bus. The proposed design methodology involves firstly the fuzzy controller design in software for simulation and testing issues, taking into account the mathematical model of the plant. Afterwards, the controller was synthesized to the hardware description language VHDL using the Xfuzzy 2.0 tool. The fuzzy controller has 2 modules, each one producing a torque control variable. The first module receives both the position and angular speed of the first link of the Acrobot system whereas the second module receives the position and angular speed of the second link. The final torque variable is calculated in the Microblaze taking into account two gains. Each gain represents a priority that is applied to each fuzzy module. These gains were experimentally calculated through several simulation executed in the Matlab computational environment.
本文介绍了一种基于现场可编程门阵列(FPGA)的自行车机器人模糊控制器的实现,该控制器采用著名的Acrobot模型。整个系统采用硬件/软件协同设计的方式,采用Microblaze FPGA嵌入式处理器和模糊控制器直接在硬件上实现。微处理器和控制器都通过Fast Simplex Link - FSL总线连接。所提出的设计方法首先涉及到模糊控制器在软件仿真和测试中的设计问题,同时考虑到系统的数学模型。然后,利用Xfuzzy 2.0工具将控制器合成为硬件描述语言VHDL。模糊控制器有2个模块,每个模块产生一个转矩控制变量。第一模块接收Acrobot系统第一链路的位置和角速度,第二模块接收第二链路的位置和角速度。最终的扭矩变量在Microblaze中计算,考虑了两个增益。每个增益表示应用于每个模糊模块的优先级。这些增益是通过在Matlab计算环境中进行的多次模拟实验计算出来的。
{"title":"Fuzzy Control for Cyclist Robot Stability Using FPGAs","authors":"C. Castro, C. Llanos, Walter de Britto Vidal Filho, L. Coelho","doi":"10.1109/ReConFig.2009.53","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.53","url":null,"abstract":"This paper presents a fuzzy controller implementation in FPGA (Field Programmable Gate Array) for a robot that rides a bicycle using the well-known Acrobot model. The overall system presents a hardware/software codesign approach and it was achieved by means of a Microblaze FPGA embedded processor and a fuzzy controller, which was implemented directly in hardware. Both the microprocessor and the controller are connected via the Fast Simplex Link - FSL bus. The proposed design methodology involves firstly the fuzzy controller design in software for simulation and testing issues, taking into account the mathematical model of the plant. Afterwards, the controller was synthesized to the hardware description language VHDL using the Xfuzzy 2.0 tool. The fuzzy controller has 2 modules, each one producing a torque control variable. The first module receives both the position and angular speed of the first link of the Acrobot system whereas the second module receives the position and angular speed of the second link. The final torque variable is calculated in the Microblaze taking into account two gains. Each gain represents a priority that is applied to each fuzzy module. These gains were experimentally calculated through several simulation executed in the Matlab computational environment.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122610547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Design and Implementation of a Configurable Interleaver/Deinterleaver for Turbo Codes in 3GPP Standard 3GPP标准中Turbo码可配置交织/去交织器的设计与实现
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.16
Hector Borrayo-Sandoval, R. Parra-Michel, L. F. Gonzalez-Perez, Fernando Landeros Printzen, C. F. Uribe
During the last decade, Turbo codes have been taking an increasing importance in channel coding due to its good performance in error correction. One key component in Turbo codes is the interleaver/deinterleaver pair, often designed as reconfigurable coprocessors able to deal with requirements of large data length variability found in the newest communication standards. In this work we introduce a configurable interleaver architecture for the turbo decoder in 3rd Generation Partnership Project (3GPP) standard. It is implemented under the idea of “iterative modulo computation”. Additionally, the presented solution not only generates the interleaved addresses, but also deals with the flow of data streams through the interleaver. The architecture and FPGA implementation results are also presented.
近十年来,Turbo码以其良好的纠错性能在信道编码中占有越来越重要的地位。Turbo码中的一个关键组件是交织/去交织对,通常被设计为可重构的协处理器,能够处理最新通信标准中发现的大数据长度可变性的要求。在这项工作中,我们为第三代合作伙伴计划(3GPP)标准中的涡轮解码器引入了一种可配置的交织器架构。它是在“迭代模计算”思想下实现的。此外,该方案不仅可以生成交错地址,还可以处理通过交错器的数据流。最后给出了系统的结构和FPGA实现结果。
{"title":"Design and Implementation of a Configurable Interleaver/Deinterleaver for Turbo Codes in 3GPP Standard","authors":"Hector Borrayo-Sandoval, R. Parra-Michel, L. F. Gonzalez-Perez, Fernando Landeros Printzen, C. F. Uribe","doi":"10.1109/ReConFig.2009.16","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.16","url":null,"abstract":"During the last decade, Turbo codes have been taking an increasing importance in channel coding due to its good performance in error correction. One key component in Turbo codes is the interleaver/deinterleaver pair, often designed as reconfigurable coprocessors able to deal with requirements of large data length variability found in the newest communication standards. In this work we introduce a configurable interleaver architecture for the turbo decoder in 3rd Generation Partnership Project (3GPP) standard. It is implemented under the idea of “iterative modulo computation”. Additionally, the presented solution not only generates the interleaved addresses, but also deals with the flow of data streams through the interleaver. The architecture and FPGA implementation results are also presented.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133768268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
FPGA Implementation of a Decimal Floating-Point Accurate Scalar Product Unit with a Parallel Fixed-Point Multiplier 带并行定点乘法器的十进制浮点精确标量积单元的FPGA实现
Pub Date : 2009-12-09 DOI: 10.1109/ReConFig.2009.17
Malte Baesler, T. Teufel
Decimal Floating Point (DFP) operations are very important for applications, that cannot tolerate errors from conversions between binary and decimal formats, for instance scientific, commercial, financial and internet-based applications. In this paper we present a parallel decimal fixed-point multiplier, designed to exploit the features of FPGAs. Our multiplier is based on BCD recoding schemes, fast partial product generation and a BCD-4221 Carry Save Adder reduction tree. Furthermore, we extend the multiplier with an accurate scalar product unit in order to provide an important operation with smallest possible rounding error as proposed in. Finally the design is implemented and tested on a Xilinx Virtex-II Pro FPGA platform.
浮点(DFP)操作对于不能容忍二进制和十进制格式转换错误的应用程序非常重要,例如科学、商业、金融和基于互联网的应用程序。本文提出了一种利用fpga的特点设计的并行十进制定点乘法器。我们的乘法器基于BCD编码方案,快速部分积生成和BCD-4221进位保存加法器约简树。此外,我们用精确的标量积单位扩展了乘法器,以便提供一个重要的操作,其舍入误差尽可能小。最后在Xilinx Virtex-II Pro FPGA平台上对该设计进行了实现和测试。
{"title":"FPGA Implementation of a Decimal Floating-Point Accurate Scalar Product Unit with a Parallel Fixed-Point Multiplier","authors":"Malte Baesler, T. Teufel","doi":"10.1109/ReConFig.2009.17","DOIUrl":"https://doi.org/10.1109/ReConFig.2009.17","url":null,"abstract":"Decimal Floating Point (DFP) operations are very important for applications, that cannot tolerate errors from conversions between binary and decimal formats, for instance scientific, commercial, financial and internet-based applications. In this paper we present a parallel decimal fixed-point multiplier, designed to exploit the features of FPGAs. Our multiplier is based on BCD recoding schemes, fast partial product generation and a BCD-4221 Carry Save Adder reduction tree. Furthermore, we extend the multiplier with an accurate scalar product unit in order to provide an important operation with smallest possible rounding error as proposed in. Finally the design is implemented and tested on a Xilinx Virtex-II Pro FPGA platform.","PeriodicalId":325631,"journal":{"name":"2009 International Conference on Reconfigurable Computing and FPGAs","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134492257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2009 International Conference on Reconfigurable Computing and FPGAs
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1