Implementation of a threaded dataflow multiprocessor using FPGAs

K. Tatas, C. Kyriacou
{"title":"Implementation of a threaded dataflow multiprocessor using FPGAs","authors":"K. Tatas, C. Kyriacou","doi":"10.1109/DTIS.2011.5941444","DOIUrl":null,"url":null,"abstract":"This paper presents the FPGA implementation and evaluation of the prototype for a Data-Driven Multithreading Chip-Multiprocessor. In particular, we study the implementation of a Thread Synchronization Unit (TSU) on FPGA, a hardware unit that enables thread execution using dataflow rules on a chip multiprocessor. Threads are scheduled for execution based on data availability, i.e. a thread is fired only if its input data is available. This model of execution is called the non-blocking Data-Driven Multithreading (DDM) model of execution. Due to its dataflow characteristics, this model exploits parallelism and tolerates latency. The DDM model has been evaluated using an execution driven simulator and showed and average speedup of 26 on a 32-node system. For evaluation purposes, implementation on Xilinx Virtex-5 FPGA using the Microblaze processors as execution cores has been performed. Experimental results show that the TSU can be implemented with a moderate hardware budget, and that delays incurred by the operation of the TSU can be tolerated. Furthermore, hardware complexity evaluation shows that the TSU size scales very well with the number of processors in the MPSoC.","PeriodicalId":409387,"journal":{"name":"2011 6th International Conference on Design & Technology of Integrated Systems in Nanoscale Era (DTIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 6th International Conference on Design & Technology of Integrated Systems in Nanoscale Era (DTIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DTIS.2011.5941444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents the FPGA implementation and evaluation of the prototype for a Data-Driven Multithreading Chip-Multiprocessor. In particular, we study the implementation of a Thread Synchronization Unit (TSU) on FPGA, a hardware unit that enables thread execution using dataflow rules on a chip multiprocessor. Threads are scheduled for execution based on data availability, i.e. a thread is fired only if its input data is available. This model of execution is called the non-blocking Data-Driven Multithreading (DDM) model of execution. Due to its dataflow characteristics, this model exploits parallelism and tolerates latency. The DDM model has been evaluated using an execution driven simulator and showed and average speedup of 26 on a 32-node system. For evaluation purposes, implementation on Xilinx Virtex-5 FPGA using the Microblaze processors as execution cores has been performed. Experimental results show that the TSU can be implemented with a moderate hardware budget, and that delays incurred by the operation of the TSU can be tolerated. Furthermore, hardware complexity evaluation shows that the TSU size scales very well with the number of processors in the MPSoC.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用fpga实现多线程数据流多处理器
本文介绍了一种数据驱动多线程芯片-多处理器原型的FPGA实现和评估。特别是,我们研究了FPGA上线程同步单元(TSU)的实现,这是一种硬件单元,可以在芯片多处理器上使用数据流规则执行线程。线程是根据数据可用性来安排执行的,也就是说,一个线程只有在它的输入数据可用时才会被触发。这种执行模型称为非阻塞数据驱动多线程(DDM)执行模型。由于其数据流特性,该模型利用并行性并容忍延迟。使用执行驱动模拟器对DDM模型进行了评估,并在32节点系统上显示了平均26的加速。为了评估目的,在Xilinx Virtex-5 FPGA上使用Microblaze处理器作为执行核心进行了实现。实验结果表明,该系统可以在适度的硬件预算下实现,并且可以容忍由TSU运行引起的延迟。此外,硬件复杂性评估表明,TSU尺寸与MPSoC中的处理器数量可以很好地扩展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A test solution for oxide thickness variations in the ATMEL TSTAC™ eFlash technology Low-overhead two-dimensional test pattern generation The design of logic gates using Single Electron Box (SEB) Nano-Devices A mixed signal multiplier using A2 binary representation dedicated to neural networks applications New transient detection circuit to detect ESD-induced disturbance for automatic recovery design in display panels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1