Dynamic SMP clusters with communication on the fly in NoC technology for very fine grain computations

骈文研究 Pub Date : 2004-07-05 DOI:10.1109/ISPDC.2004.20

M. Tudruj, L. Masko

{"title":"Dynamic SMP clusters with communication on the fly in NoC technology for very fine grain computations","authors":"M. Tudruj, L. Masko","doi":"10.1109/ISPDC.2004.20","DOIUrl":null,"url":null,"abstract":"The paper presents a new architecture for systems based on run-time reconfigured shared memory processor clusters meant for implementation using network on chip technology. Clusters constitute local data exchange sub-networks, which dynamically connect processors with shared memory modules. The sub-networks enable exposure of data from one processor's data cache for reading by other processors to their data caches. This inter-processor data exchange paradigm, called \"communication on the fly\", enables direct communication between processor data caches. Dual-ported data caches are assumed to enable parallel reading and writing data between the caches and memory modules. In the proposed architecture, programs are executed according to a cache-controlled macro data flow execution model. Computational tasks are so defined, as to eliminate re-loading of data caches during task execution. A special program macro-data flow graph representation enables modeling of program behaviour for different architectural and program structure assumptions. Simulation results of symbolic execution of program graphs of matrix multiplication are presented in the paper. They show suitability of the proposed architecture for very fine grain parallel computations.","PeriodicalId":62714,"journal":{"name":"骈文研究","volume":"13 1","pages":"97-104"},"PeriodicalIF":0.0000,"publicationDate":"2004-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"骈文研究","FirstCategoryId":"1092","ListUrlMain":"https://doi.org/10.1109/ISPDC.2004.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

Abstract

The paper presents a new architecture for systems based on run-time reconfigured shared memory processor clusters meant for implementation using network on chip technology. Clusters constitute local data exchange sub-networks, which dynamically connect processors with shared memory modules. The sub-networks enable exposure of data from one processor's data cache for reading by other processors to their data caches. This inter-processor data exchange paradigm, called "communication on the fly", enables direct communication between processor data caches. Dual-ported data caches are assumed to enable parallel reading and writing data between the caches and memory modules. In the proposed architecture, programs are executed according to a cache-controlled macro data flow execution model. Computational tasks are so defined, as to eliminate re-loading of data caches during task execution. A special program macro-data flow graph representation enables modeling of program behaviour for different architectural and program structure assumptions. Simulation results of symbolic execution of program graphs of matrix multiplication are presented in the paper. They show suitability of the proposed architecture for very fine grain parallel computations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有动态通信的动态SMP簇在NoC技术中用于非常细粒度计算

本文提出了一种基于运行时重构共享内存处理器集群的系统新架构，旨在利用片上网络技术实现。集群构成本地数据交换子网，动态连接处理器和共享内存模块。子网允许从一个处理器的数据缓存中公开数据，供其他处理器读取到它们的数据缓存中。这种处理器间数据交换范例，称为“动态通信”，支持处理器数据缓存之间的直接通信。假定双端口数据缓存能够在缓存和内存模块之间并行读写数据。在提出的体系结构中，程序根据缓存控制的宏数据流执行模型执行。计算任务是这样定义的，以便在任务执行期间消除数据缓存的重新加载。一种特殊的程序宏数据流图表示可以为不同的体系结构和程序结构假设对程序行为进行建模。给出了矩阵乘法程序图符号执行的仿真结果。它们显示了所提出的架构对非常细粒度并行计算的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊