首页 > 最新文献

Proceedings of the 8th International Workshop on Network on Chip Architectures最新文献

英文 中文
Design of TSV-Sharing Topologies for Cost-Effective 3D Networks-on-Chip 具有成本效益的三维片上网络tsv共享拓扑设计
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835514
Poona Bahrebar, D. Stroobandt
The Through-Silicon Via (TSV) technology has led to major breakthroughs in 3D stacking by providing higher speed and bandwidth, as well as lower power dissipation for the inter-layer communication. However, the current TSV fabrication suffers from a considerable area footprint and yield loss. Thus, it is necessary to restrict the number of TSVs in order to design cost-effective 3D on-chip networks. This critical issue can be addressed by clustering the network such that all of the routers within each cluster share a single TSV pillar for the vertical packet transmission. In some of the existing topologies, additional cluster routers are augmented into the mesh structure to handle the shared TSVs. However, they impose either performance degradation or power/area overhead to the system. Furthermore, the resulting architecture is no longer a mesh. In this paper, we redefine the clusters by replacing some routers in the mesh with the cluster routers, such that the mesh structure is preserved. The simulation results demonstrate a better equilibrium between performance and cost, using the proposed models.
通过硅通孔(TSV)技术为层间通信提供更高的速度和带宽以及更低的功耗,从而在3D堆叠方面取得了重大突破。然而,目前的TSV制造受到相当大的面积占用和产量损失的影响。因此,为了设计具有成本效益的三维片上网络,有必要限制tsv的数量。这个关键问题可以通过将网络集群化来解决,这样每个集群中的所有路由器都共享一个TSV支柱,用于垂直数据包传输。在一些现有的拓扑结构中,额外的集群路由器被添加到网格结构中来处理共享的tsv。然而,它们给系统带来性能下降或功率/面积开销。此外,生成的体系结构不再是网格。在本文中,我们通过用集群路由器替换网格中的一些路由器来重新定义集群,从而保持了网格结构。仿真结果表明,所提出的模型在性能和成本之间取得了较好的平衡。
{"title":"Design of TSV-Sharing Topologies for Cost-Effective 3D Networks-on-Chip","authors":"Poona Bahrebar, D. Stroobandt","doi":"10.1145/2835512.2835514","DOIUrl":"https://doi.org/10.1145/2835512.2835514","url":null,"abstract":"The Through-Silicon Via (TSV) technology has led to major breakthroughs in 3D stacking by providing higher speed and bandwidth, as well as lower power dissipation for the inter-layer communication. However, the current TSV fabrication suffers from a considerable area footprint and yield loss. Thus, it is necessary to restrict the number of TSVs in order to design cost-effective 3D on-chip networks. This critical issue can be addressed by clustering the network such that all of the routers within each cluster share a single TSV pillar for the vertical packet transmission. In some of the existing topologies, additional cluster routers are augmented into the mesh structure to handle the shared TSVs. However, they impose either performance degradation or power/area overhead to the system. Furthermore, the resulting architecture is no longer a mesh. In this paper, we redefine the clusters by replacing some routers in the mesh with the cluster routers, such that the mesh structure is preserved. The simulation results demonstrate a better equilibrium between performance and cost, using the proposed models.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127778426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Power and Latency Management in Heterogeneous 3D NoCs 异构3D noc中的自动化电源和延迟管理
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835517
Awet Yemane Weldezion, M. Ebrahimi, M. Daneshtalab, H. Tenhunen
Beside different core sizes in many-core Systems-on-Chip, the cost and reliability issues of TSVs move 3D NoCs toward heterogonous designs. Such heterogeneity introduces design complexity and new challenges for obtaining a high performance, low power, low area, and a reliable design. By taking all these factors into account, we propose a design as a combination of Q-Learning and deflection routing in a heterogeneous 3D NoCs. This design enables the routing algorithm to dynamically adjust itself to the underlying traffic condition and topology arrangement at run time. Thereby, the network can reach its optimal performance and minimum power consumption shortly after a reconfiguration either because of an occurred fault in the network or a traffic change.
除了多核片上系统的不同内核尺寸外,tsv的成本和可靠性问题使3D noc转向异质设计。这种异构性为获得高性能、低功耗、低面积和可靠的设计带来了设计复杂性和新的挑战。考虑到所有这些因素,我们提出了一种在异构3D noc中结合q -学习和偏转路由的设计。该设计使路由算法能够在运行时根据底层交通状况和拓扑安排动态调整自身。这样,在网络发生故障或流量变化后重新配置后,网络可以在短时间内达到最佳性能和最小功耗。
{"title":"Automated Power and Latency Management in Heterogeneous 3D NoCs","authors":"Awet Yemane Weldezion, M. Ebrahimi, M. Daneshtalab, H. Tenhunen","doi":"10.1145/2835512.2835517","DOIUrl":"https://doi.org/10.1145/2835512.2835517","url":null,"abstract":"Beside different core sizes in many-core Systems-on-Chip, the cost and reliability issues of TSVs move 3D NoCs toward heterogonous designs. Such heterogeneity introduces design complexity and new challenges for obtaining a high performance, low power, low area, and a reliable design. By taking all these factors into account, we propose a design as a combination of Q-Learning and deflection routing in a heterogeneous 3D NoCs. This design enables the routing algorithm to dynamically adjust itself to the underlying traffic condition and topology arrangement at run time. Thereby, the network can reach its optimal performance and minimum power consumption shortly after a reconfiguration either because of an occurred fault in the network or a traffic change.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"312 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124440368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Rethinking Memory System Design (along with Interconnects) 重新思考内存系统设计(以及互连)
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835520
O. Mutlu
The memory system is a fundamental performance and energy bottleneck in almost all computing systems. Recent system design, application, and technology trends that require more capacity, bandwidth, efficiency, and predictability out of the memory system make it an even more important system bottleneck [27, 28]. At the same time, DRAM technology is experiencing difficult circuit and device scaling challenges that make the maintenance and enhancement of its capacity, energy-efficiency, and reliability significantly more costly with conventional techniques (see, for example [7, 8, 11, 12, 15, 17, 18, 22, 23, 32]). In this talk, we examine some promising research and design directions to overcome challenges posed by memory scaling. Specifically, we discuss three key solution directions: 1) enabling new memory architectures, functions, interfaces, and better integration of the memory and the rest of the system, including interconnects (e.g., [1, 2, 19, 20, 34-36]), 2) designing a memory system that intelligently employs multiple memory technologies and coordinates memory and storage management using non-volatile memory technologies (e.g., [16-18, 24, 25, 32, 33, 40-42]), 3) providing predictable performance and QoS to applications sharing the memory system (e.g., [3, 9, 10, 13, 14, 26, 29, 37-39]). As we discuss challenges and solution directions in memory, we will point out research opportunities in interconnects and memory-interconnect co-design (e.g., [2, 4-6, 19, 21, 30, 31]).
存储系统是几乎所有计算系统的基本性能和能量瓶颈。最近的系统设计、应用和技术趋势要求内存系统提供更多的容量、带宽、效率和可预测性,这使其成为更重要的系统瓶颈[27,28]。与此同时,DRAM技术正面临着艰难的电路和器件缩放挑战,这使得维护和增强其容量、能效和可靠性的成本明显高于传统技术(参见[7,8,11,12,15,17,18,22,23,32])。在这次演讲中,我们将探讨一些有前途的研究和设计方向,以克服内存缩放带来的挑战。具体来说,我们讨论了三个关键的解决方向:1)实现新的存储器架构、功能、接口,并更好地集成存储器和系统的其余部分,包括互连(例如,[1,2,19,20,34-36]),2)设计一种存储器系统,智能地采用多种存储器技术,并使用非易失性存储器技术(例如,[16-18,24,25,32,33,40-42])来协调存储器和存储管理,3)为共享存储器系统的应用程序提供可预测的性能和QoS(例如,[3,9,10,13,1])。[14,26,29,37 -39])。当我们讨论存储器中的挑战和解决方案方向时,我们将指出互连和存储器-互连协同设计的研究机会(例如[2,4 - 6,19,21,30,31])。
{"title":"Rethinking Memory System Design (along with Interconnects)","authors":"O. Mutlu","doi":"10.1145/2835512.2835520","DOIUrl":"https://doi.org/10.1145/2835512.2835520","url":null,"abstract":"The memory system is a fundamental performance and energy bottleneck in almost all computing systems. Recent system design, application, and technology trends that require more capacity, bandwidth, efficiency, and predictability out of the memory system make it an even more important system bottleneck [27, 28]. At the same time, DRAM technology is experiencing difficult circuit and device scaling challenges that make the maintenance and enhancement of its capacity, energy-efficiency, and reliability significantly more costly with conventional techniques (see, for example [7, 8, 11, 12, 15, 17, 18, 22, 23, 32]). In this talk, we examine some promising research and design directions to overcome challenges posed by memory scaling. Specifically, we discuss three key solution directions: 1) enabling new memory architectures, functions, interfaces, and better integration of the memory and the rest of the system, including interconnects (e.g., [1, 2, 19, 20, 34-36]), 2) designing a memory system that intelligently employs multiple memory technologies and coordinates memory and storage management using non-volatile memory technologies (e.g., [16-18, 24, 25, 32, 33, 40-42]), 3) providing predictable performance and QoS to applications sharing the memory system (e.g., [3, 9, 10, 13, 14, 26, 29, 37-39]). As we discuss challenges and solution directions in memory, we will point out research opportunities in interconnects and memory-interconnect co-design (e.g., [2, 4-6, 19, 21, 30, 31]).","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121234261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Task mapping and communication routing model for minimizing power consumption in multi-cores 多核环境下最小化功耗的任务映射和通信路由模型
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835515
Sergiu Carpov
In this paper we introduce a novel MILP formulation for the problem of mapping tasks and routing communications on multi-core systems with power minimization objective. The cores have several power consumption modes. Dynamic and static power consumptions are modeled independently and the dynamic power consumption depends on core load rate. Three types of communication routing are examined: single-path, multi-path and fractional multi-path. Initially a mathematical model is introduced and afterwards a linearized mixed-integer program formulation is proposed. We conclude the paper by presenting computational results on task graph instances obtained from StreamIt applications.
针对多核系统中以功率最小化为目标的任务映射和路由通信问题,提出了一种新的MILP公式。核心有几种功耗模式。动态功耗和静态功耗独立建模,动态功耗依赖于核心负荷率。研究了三种类型的通信路由:单路径、多路径和分式多路径。首先建立了一个数学模型,然后提出了一个线性化的混合整数规划公式。最后,我们给出了从StreamIt应用程序中获得的任务图实例的计算结果。
{"title":"Task mapping and communication routing model for minimizing power consumption in multi-cores","authors":"Sergiu Carpov","doi":"10.1145/2835512.2835515","DOIUrl":"https://doi.org/10.1145/2835512.2835515","url":null,"abstract":"In this paper we introduce a novel MILP formulation for the problem of mapping tasks and routing communications on multi-core systems with power minimization objective. The cores have several power consumption modes. Dynamic and static power consumptions are modeled independently and the dynamic power consumption depends on core load rate. Three types of communication routing are examined: single-path, multi-path and fractional multi-path. Initially a mathematical model is introduced and afterwards a linearized mixed-integer program formulation is proposed. We conclude the paper by presenting computational results on task graph instances obtained from StreamIt applications.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122149785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Methodology to verify, debug and evaluate performances of NoC based interconnects 验证、调试和评估基于NoC互连性能的方法
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835521
Patrick Oury, N. Heaton, Stewart Penman
Latest developments in electronic systems lead hardware architects to completely rethink the traffic exchanges inside Systems on Chip. Constraints on more parallelism, less power, more coherency, less order while designing system fabrics and interconnects infers a bunch of new design features. All of them need to be verified, all of them need to be evaluated with respect to their impact on system performance, power consumption and gate count. This paper discusses some of the most challenging features in NoCs and the way verification engineers and architects are tackling correctness and performance checking of them.
电子系统的最新发展导致硬件架构师彻底重新思考片上系统内部的流量交换。在设计系统结构和互连时,更多的并行性、更少的功耗、更多的一致性和更少的顺序的约束推断出了一系列新的设计特征。所有这些都需要验证,所有这些都需要根据它们对系统性能、功耗和门数的影响进行评估。本文讨论了noc中一些最具挑战性的特性,以及验证工程师和架构师处理它们的正确性和性能检查的方法。
{"title":"Methodology to verify, debug and evaluate performances of NoC based interconnects","authors":"Patrick Oury, N. Heaton, Stewart Penman","doi":"10.1145/2835512.2835521","DOIUrl":"https://doi.org/10.1145/2835512.2835521","url":null,"abstract":"Latest developments in electronic systems lead hardware architects to completely rethink the traffic exchanges inside Systems on Chip. Constraints on more parallelism, less power, more coherency, less order while designing system fabrics and interconnects infers a bunch of new design features. All of them need to be verified, all of them need to be evaluated with respect to their impact on system performance, power consumption and gate count. This paper discusses some of the most challenging features in NoCs and the way verification engineers and architects are tackling correctness and performance checking of them.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130538152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Low-Latency and High-Throughput Multiple-Level Arbitration Scheme Supporting Quality-of-Service in Optical On-chip Network 一种支持片上光网络服务质量的低延迟、高吞吐量多级仲裁方案
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835519
Jian Jie, Lai Mingche, Xiao Liquan
As a key technology in optical NoC design, the arbitration scheme should provide differential arbitration service with high throughput and low latency for various types and priorities of traffic in CMPs. In this work, we propose a fast hierarchical arbitration supporting Quality-of-Service. With a multi-priority data buffer queue, arbiters provide differential transmissions and guarantee service for all queues. Our arbiter also presents the transmit bound resource reservation scheme to reserve time slots for all nodes fairly. We propose fast arbitration with a layout of fast optical arbitration channels to decrease the arbitration period, thereby reducing packet transmitting delay. The simulation results show that with our hierarchical arbitration scheme, all nodes are allocated with almost equal service under various patterns; thus, the min-communication-bandwidth and max-transmit-delay is guaranteed to be 5% and 80 cycles under the overload demands. This scheme improves throughput by 17% compared to FeatherWeight under a self-similar traffic pattern and decreases arbitration delay by 15% to 2-pass arbitration.
仲裁方案作为光NoC设计的关键技术,需要为cmp中不同类型、不同优先级的流量提供高吞吐量、低时延的差分仲裁服务。在这项工作中,我们提出了一种支持服务质量的快速分层仲裁。在多优先级数据缓冲队列中,仲裁器为所有队列提供差分传输并保证服务。我们的仲裁器还提出了传输绑定资源预留方案,公平地为所有节点预留时隙。为了缩短仲裁周期,减少数据包传输延迟,我们提出了一种快速光仲裁通道布局的快速仲裁方案。仿真结果表明,采用分层仲裁方案,在各种模式下,所有节点的服务分配几乎相等;因此,在过载需求下,最小通信带宽和最大传输延迟分别保证为5%和80个周期。与自相似流量模式下的FeatherWeight相比,该方案将吞吐量提高了17%,并将仲裁延迟减少了15%至2次仲裁。
{"title":"A Low-Latency and High-Throughput Multiple-Level Arbitration Scheme Supporting Quality-of-Service in Optical On-chip Network","authors":"Jian Jie, Lai Mingche, Xiao Liquan","doi":"10.1145/2835512.2835519","DOIUrl":"https://doi.org/10.1145/2835512.2835519","url":null,"abstract":"As a key technology in optical NoC design, the arbitration scheme should provide differential arbitration service with high throughput and low latency for various types and priorities of traffic in CMPs. In this work, we propose a fast hierarchical arbitration supporting Quality-of-Service. With a multi-priority data buffer queue, arbiters provide differential transmissions and guarantee service for all queues. Our arbiter also presents the transmit bound resource reservation scheme to reserve time slots for all nodes fairly. We propose fast arbitration with a layout of fast optical arbitration channels to decrease the arbitration period, thereby reducing packet transmitting delay. The simulation results show that with our hierarchical arbitration scheme, all nodes are allocated with almost equal service under various patterns; thus, the min-communication-bandwidth and max-transmit-delay is guaranteed to be 5% and 80 cycles under the overload demands. This scheme improves throughput by 17% compared to FeatherWeight under a self-similar traffic pattern and decreases arbitration delay by 15% to 2-pass arbitration.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115314870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NoCVision: A Network-on-Chip Dynamic Visualization Solution 一种片上网络动态可视化解决方案
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835518
V. Gogte, Doowon Lee, Ritesh Parikh, V. Bertacco
Networks-on-chip (NoCs) are the communication infrastructure of choice for integrating the many components of modern silicon systems, deployed anywhere from systems-on-chip, to chip multi-processors, and to heterogeneous systems. The growing design complexity of these systems, coupled with shrinking times-to-market, requires efficient analysis of complex applications mapped onto the network in a short span of time. In this work, we propose NoCVision, a novel platform for the analysis of NoC characteristics and traffic flows. NoCVision enables design-space exploration, performance tuning, and validation of the NoC subsystem. It allows to consolidate and summarize the network's simulation data and visualize it through intuitive diagrams and plots, either in a static form or animating it to depict changes occurring over time during an application's execution. To showcase the features and benefits of NoCVision, we present several case studies developed on a 64-node CMP organized in a 8x8 mesh NoC and running multi-programmed workloads.
片上网络(noc)是集成现代硅系统的许多组件的首选通信基础设施,可以部署在从片上系统到片上多处理器以及异构系统的任何地方。这些系统日益增长的设计复杂性,加上上市时间的缩短,需要在短时间内对映射到网络上的复杂应用程序进行有效的分析。在这项工作中,我们提出了一个新的平台novision,用于分析NoC特征和交通流。novision实现了NoC子系统的设计空间探索、性能调优和验证。它允许整合和总结网络的模拟数据,并通过直观的图表和绘图将其可视化,无论是静态形式还是动画形式,都可以描述应用程序执行期间随时间发生的变化。为了展示NoCVision的功能和优势,我们介绍了在8x8 mesh NoC中组织的64节点CMP上开发的几个案例研究,并运行多编程工作负载。
{"title":"NoCVision: A Network-on-Chip Dynamic Visualization Solution","authors":"V. Gogte, Doowon Lee, Ritesh Parikh, V. Bertacco","doi":"10.1145/2835512.2835518","DOIUrl":"https://doi.org/10.1145/2835512.2835518","url":null,"abstract":"Networks-on-chip (NoCs) are the communication infrastructure of choice for integrating the many components of modern silicon systems, deployed anywhere from systems-on-chip, to chip multi-processors, and to heterogeneous systems. The growing design complexity of these systems, coupled with shrinking times-to-market, requires efficient analysis of complex applications mapped onto the network in a short span of time. In this work, we propose NoCVision, a novel platform for the analysis of NoC characteristics and traffic flows. NoCVision enables design-space exploration, performance tuning, and validation of the NoC subsystem. It allows to consolidate and summarize the network's simulation data and visualize it through intuitive diagrams and plots, either in a static form or animating it to depict changes occurring over time during an application's execution. To showcase the features and benefits of NoCVision, we present several case studies developed on a 64-node CMP organized in a 8x8 mesh NoC and running multi-programmed workloads.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128527503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
System-Level Analysis of Network Interfaces for Hierarchical MPSoCs 分层mpsoc网络接口的系统级分析
Pub Date : 2015-12-05 DOI: 10.1145/2835512.2835513
J. Ax, Gregor Sievers, Martin Flasskamp, W. Kelly, T. Jungeblut, Mario Porrmann
Network Interfaces (NIs) are used in Multiprocessor System-on-Chips (MPSoCs) to connect CPUs to a packet switched Network-on-Chip. In this work we introduce a new NI architecture for our hierarchical CoreVA-MPSoC. The CoreVA-MPSoC targets streaming applications in embedded systems. The main contribution of this paper is a system-level analysis of different NI configurations, considering both software and hardware costs for NoC communication. Different configurations of the NI are compared using a benchmark suite of 10 streaming applications. The best performing NI configuration shows an average speedup of 20 for a CoreVA-MPSoC with 32 CPUs compared to a single CPU. Furthermore, we present physical implementation results using a 28 nm FD-SOI standard cell technology. A hierarchical MPSoC with 8 CPU clusters and 4 CPUs in each cluster running at 800MHz requires an area of 4.56mm2.
NIs (Network interface)是mpsoc (multi - processor System-on-Chips)中的网络接口,用于将cpu连接到分组交换的片上网络。在这项工作中,我们为我们的分层CoreVA-MPSoC引入了一种新的NI架构。CoreVA-MPSoC针对嵌入式系统中的流应用。本文的主要贡献是对不同NI配置的系统级分析,同时考虑了NoC通信的软件和硬件成本。NI的不同配置使用10个流应用程序的基准套件进行比较。性能最佳的NI配置显示,与单个CPU相比,具有32个CPU的CoreVA-MPSoC的平均加速速度为20。此外,我们提出了使用28纳米FD-SOI标准电池技术的物理实现结果。具有8个CPU集群的分层MPSoC,每个集群中有4个CPU运行在800MHz,需要4.56mm2的面积。
{"title":"System-Level Analysis of Network Interfaces for Hierarchical MPSoCs","authors":"J. Ax, Gregor Sievers, Martin Flasskamp, W. Kelly, T. Jungeblut, Mario Porrmann","doi":"10.1145/2835512.2835513","DOIUrl":"https://doi.org/10.1145/2835512.2835513","url":null,"abstract":"Network Interfaces (NIs) are used in Multiprocessor System-on-Chips (MPSoCs) to connect CPUs to a packet switched Network-on-Chip. In this work we introduce a new NI architecture for our hierarchical CoreVA-MPSoC. The CoreVA-MPSoC targets streaming applications in embedded systems. The main contribution of this paper is a system-level analysis of different NI configurations, considering both software and hardware costs for NoC communication. Different configurations of the NI are compared using a benchmark suite of 10 streaming applications. The best performing NI configuration shows an average speedup of 20 for a CoreVA-MPSoC with 32 CPUs compared to a single CPU. Furthermore, we present physical implementation results using a 28 nm FD-SOI standard cell technology. A hierarchical MPSoC with 8 CPU clusters and 4 CPUs in each cluster running at 800MHz requires an area of 4.56mm2.","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128883411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Proceedings of the 8th International Workshop on Network on Chip Architectures 第八届芯片架构网络国际研讨会论文集
{"title":"Proceedings of the 8th International Workshop on Network on Chip Architectures","authors":"","doi":"10.1145/2835512","DOIUrl":"https://doi.org/10.1145/2835512","url":null,"abstract":"","PeriodicalId":424680,"journal":{"name":"Proceedings of the 8th International Workshop on Network on Chip Architectures","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127720579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 8th International Workshop on Network on Chip Architectures
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1