首页 > 最新文献

Proceedings of the 9th International Symposium on Networks-on-Chip最新文献

英文 中文
Fault-Tolerant 3D-NoC Architecture and Design: Recent Advances and Challenges 容错3D-NoC架构与设计:最新进展与挑战
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2788709
Li Jiang, Q. Xu
In this paper, we survey recent research work in the design of fault-tolerant three-dimensional (3D) network-on-chip (NoC), which has drawn lots of research attention from both academia and industry. To be specific, we discuss the emerging defects introduced in 3D integration, the state-of-the-art fault-tolerant 3D router designs, various fault-tolerant routing algorithms in three-dimension, as well as the architecture and design methodologies to tolerate defective TSVs in 3D-NoC. Finally, we highlight open challenges and future research directions in this domain.
本文综述了近年来在容错三维片上网络(NoC)设计方面的研究进展,该技术受到了学术界和工业界的广泛关注。具体而言,我们讨论了3D集成中引入的新缺陷,最先进的容错3D路由器设计,三维中的各种容错路由算法,以及3D- noc中容错tsv的架构和设计方法。最后,我们强调了该领域的开放挑战和未来的研究方向。
{"title":"Fault-Tolerant 3D-NoC Architecture and Design: Recent Advances and Challenges","authors":"Li Jiang, Q. Xu","doi":"10.1145/2786572.2788709","DOIUrl":"https://doi.org/10.1145/2786572.2788709","url":null,"abstract":"In this paper, we survey recent research work in the design of fault-tolerant three-dimensional (3D) network-on-chip (NoC), which has drawn lots of research attention from both academia and industry. To be specific, we discuss the emerging defects introduced in 3D integration, the state-of-the-art fault-tolerant 3D router designs, various fault-tolerant routing algorithms in three-dimension, as well as the architecture and design methodologies to tolerate defective TSVs in 3D-NoC. Finally, we highlight open challenges and future research directions in this domain.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131864246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
On-Chip Decentralized Routers with Balanced Pipelines for Avoiding Interconnect Bottleneck 基于平衡管道的片上分散路由器避免互连瓶颈
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786583
Ryota Yasudo, Hiroki Matsutani, M. Koibuchi, H. Amano, Tadao Nakamura
Technology scaling makes designers face difficulties dealing with wire delay of long global interconnects, especially for high-radix networks. In this context, we propose decentralization of on-chip packet routers. A decentralized router consists of submodules, each of which has particular functionality and they are scattered on a link, thereby long wires are segmented. Our starting point is from a conventional router architecture, and we illustrate four case studies to generalize our proposal. We also propose a new buffer design and how to balance pipelines of a router. A proof-of-concept is shown in 28-nm process technology. Our results demonstrate that the decentralization of an on-chip router enables Link Traversal (LT) stages to be eliminated, and the critical path delay is improved by up to 45% with the reduced area compared with a conventional router. As technology advances, the benefit of the decentralized routers become more substantial in the nano-scale era.
技术扩展使得设计人员在处理长全局互连的线延迟方面面临困难,特别是对于高基数网络。在这种情况下,我们建议分散化片上分组路由器。分散式路由器由子模块组成,每个子模块都有特定的功能,它们分散在一条链路上,从而将长线路分段。我们从传统的路由器架构开始,并举例说明了四个案例研究来概括我们的建议。我们还提出了一种新的缓冲设计以及如何平衡路由器的管道。28纳米制程技术的概念验证。我们的研究结果表明,片上路由器的分散化可以消除链路遍历(LT)阶段,并且与传统路由器相比,在减少面积的情况下,关键路径延迟提高了45%。随着技术的进步,在纳米级时代,去中心化路由器的优势变得更加明显。
{"title":"On-Chip Decentralized Routers with Balanced Pipelines for Avoiding Interconnect Bottleneck","authors":"Ryota Yasudo, Hiroki Matsutani, M. Koibuchi, H. Amano, Tadao Nakamura","doi":"10.1145/2786572.2786583","DOIUrl":"https://doi.org/10.1145/2786572.2786583","url":null,"abstract":"Technology scaling makes designers face difficulties dealing with wire delay of long global interconnects, especially for high-radix networks. In this context, we propose decentralization of on-chip packet routers. A decentralized router consists of submodules, each of which has particular functionality and they are scattered on a link, thereby long wires are segmented. Our starting point is from a conventional router architecture, and we illustrate four case studies to generalize our proposal. We also propose a new buffer design and how to balance pipelines of a router. A proof-of-concept is shown in 28-nm process technology. Our results demonstrate that the decentralization of an on-chip router enables Link Traversal (LT) stages to be eliminated, and the critical path delay is improved by up to 45% with the reduced area compared with a conventional router. As technology advances, the benefit of the decentralized routers become more substantial in the nano-scale era.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115065975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MapPro: Proactive Runtime Mapping for Dynamic Workloads by Quantifying Ripple Effect of Applications on Networks-on-Chip MapPro:通过量化片上网络应用程序的涟漪效应来实现动态工作负载的主动运行时映射
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786589
M. Haghbayan, A. Kanduri, A. Rahmani, P. Liljeberg, A. Jantsch, H. Tenhunen
Increasing dynamic workloads running on NoC-based many-core systems necessitates efficient runtime mapping strategies. With an unpredictable nature of application profiles, selecting a rational region to map an incoming application is an NP-hard problem in view of minimizing congestion and maximizing performance. In this paper, we propose a proactive region selection strategy which prioritizes nodes that offer lower congestion and dispersion. Our proposed strategy, MapPro, quantitatively represents the propagated impact of spatial availability and dispersion on the network with every new mapped application. This allows us to identify a suitable region to accommodate an incoming application that results in minimal congestion and dispersion. We cluster the network into squares of different radii to suit applications of different sizes and proactively select a suitable square for a new application, eliminating the overhead caused with typical reactive mapping approaches. We evaluated our proposed strategy over different traffic patterns and observed gains of up to 41% in energy efficiency, 28% in congestion and 21% dispersion when compared to the state-of-the-art region selection methods.
增加在基于noc的多核系统上运行的动态工作负载需要高效的运行时映射策略。由于应用程序概要具有不可预测的性质,因此从最小化拥塞和最大化性能的角度出发,选择一个合理的区域来映射传入的应用程序是一个np难题。在本文中,我们提出了一种主动区域选择策略,该策略优先考虑具有较低拥塞和分散的节点。我们提出的策略,MapPro,定量地表示空间可用性和网络上的分散对每个新映射应用程序的传播影响。这使我们能够确定一个合适的区域来容纳传入的应用程序,从而减少拥塞和分散。我们将网络聚类成不同半径的正方形,以适应不同规模的应用程序,并主动为新应用程序选择合适的正方形,从而消除了典型的响应式映射方法带来的开销。我们在不同的交通模式下评估了我们提出的策略,并观察到与最先进的区域选择方法相比,能源效率提高了41%,拥堵率提高了28%,分散率提高了21%。
{"title":"MapPro: Proactive Runtime Mapping for Dynamic Workloads by Quantifying Ripple Effect of Applications on Networks-on-Chip","authors":"M. Haghbayan, A. Kanduri, A. Rahmani, P. Liljeberg, A. Jantsch, H. Tenhunen","doi":"10.1145/2786572.2786589","DOIUrl":"https://doi.org/10.1145/2786572.2786589","url":null,"abstract":"Increasing dynamic workloads running on NoC-based many-core systems necessitates efficient runtime mapping strategies. With an unpredictable nature of application profiles, selecting a rational region to map an incoming application is an NP-hard problem in view of minimizing congestion and maximizing performance. In this paper, we propose a proactive region selection strategy which prioritizes nodes that offer lower congestion and dispersion. Our proposed strategy, MapPro, quantitatively represents the propagated impact of spatial availability and dispersion on the network with every new mapped application. This allows us to identify a suitable region to accommodate an incoming application that results in minimal congestion and dispersion. We cluster the network into squares of different radii to suit applications of different sizes and proactively select a suitable square for a new application, eliminating the overhead caused with typical reactive mapping approaches. We evaluated our proposed strategy over different traffic patterns and observed gains of up to 41% in energy efficiency, 28% in congestion and 21% dispersion when compared to the state-of-the-art region selection methods.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"36 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116852939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Mathematical Modeling and Control of Multifractal Workloads for Data-Center-on-a-Chip Optimization 片上数据中心优化中多重分形工作负载的数学建模与控制
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786592
P. Bogdan
Building autonomous data-centers-on-chip (DCoC) for exascale computing requires mathematical frameworks that account and exploit the non-stationary and multi-fractal characteristics of computation and communication workloads. Towards this end, relying on DCoC (Intel's SCC) measurements, we propose a complex dynamical modeling approach that captures the observed multi-fractal characteristics of inter-event times between successive workload changes and the magnitude of the increments in DCoC workloads. Our novel mathematical framework allows for the analysis of higher order moments and enables the formulation of more accurate model predictive control strategies for multi-fractal dynamics. We investigate the impact of the multi-fractal spectrum richness on the performance of the control algorithm. Our mathematical formalism can further be used to model, analyze and solve DCoC design problems (e.g., topology reconfiguration, buffer sizing, mapping, scheduling, resource management, congestion control).
为百亿亿次计算构建自主的片上数据中心(DCoC)需要考虑和利用计算和通信工作负载的非平稳和多重分形特征的数学框架。为此,依靠DCoC(英特尔的SCC)测量,我们提出了一种复杂的动态建模方法,该方法可以捕捉到连续工作负载变化之间事件间时间的多重分形特征和DCoC工作负载增量的大小。我们新颖的数学框架允许对高阶矩进行分析,并能够为多重分形动力学制定更准确的模型预测控制策略。研究了多重分形谱丰富度对控制算法性能的影响。我们的数学形式化可以进一步用于建模、分析和解决DCoC设计问题(例如,拓扑重构、缓冲区大小、映射、调度、资源管理、拥塞控制)。
{"title":"Mathematical Modeling and Control of Multifractal Workloads for Data-Center-on-a-Chip Optimization","authors":"P. Bogdan","doi":"10.1145/2786572.2786592","DOIUrl":"https://doi.org/10.1145/2786572.2786592","url":null,"abstract":"Building autonomous data-centers-on-chip (DCoC) for exascale computing requires mathematical frameworks that account and exploit the non-stationary and multi-fractal characteristics of computation and communication workloads. Towards this end, relying on DCoC (Intel's SCC) measurements, we propose a complex dynamical modeling approach that captures the observed multi-fractal characteristics of inter-event times between successive workload changes and the magnitude of the increments in DCoC workloads. Our novel mathematical framework allows for the analysis of higher order moments and enables the formulation of more accurate model predictive control strategies for multi-fractal dynamics. We investigate the impact of the multi-fractal spectrum richness on the performance of the control algorithm. Our mathematical formalism can further be used to model, analyze and solve DCoC design problems (e.g., topology reconfiguration, buffer sizing, mapping, scheduling, resource management, congestion control).","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"107 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117293117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
Data Criticality in Network-On-Chip Design 片上网络设计中的数据临界性
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786593
Joshua San Miguel, Natalie D. Enright Jerger
Many network-on-chip (NoC) designs focus on maximizing performance, delivering data to each core no later than needed by the application. Yet to achieve greater energy efficiency, we argue that it is just as important that data is delivered no earlier than needed. To address this, we explore data criticality in CMPs. Caches fetch data in bulk (blocks of multiple words). Depending on the application's memory access patterns, some words are needed right away (critical) while other data are fetched too soon (non-critical). On a wide range of applications, we perform a limit study of the impact of data criticality in NoC design. Criticality-oblivious designs can waste up to 37.5% energy, compared to an idealized NoC that fetches each word both no later and no earlier than needed. Furthermore, 62.3% of energy is wasted fetching data that is not used by the application. We present NoCNoC, a practical, criticality-aware NoC design that achieves up to 60.5% energy savings with no loss in performance. Our work moves towards an ideally-efficient NoC, delivering data both no later and no earlier than needed.
许多片上网络(NoC)设计的重点是最大限度地提高性能,在不晚于应用程序所需的时间向每个核心传输数据。然而,为了实现更高的能源效率,我们认为数据不早于所需的时间交付同样重要。为了解决这个问题,我们探讨了cmp中的数据临界性。高速缓存批量获取数据(多个单词的块)。根据应用程序的内存访问模式,有些字是立即需要的(关键),而其他数据则是很快获取的(非关键)。在广泛的应用中,我们对NoC设计中数据临界性的影响进行了限制研究。与不晚于或不早于需要获取每个单词的理想NoC相比,临界无关设计可能浪费高达37.5%的能源。此外,62.3%的能量被浪费在获取应用程序不使用的数据上。我们提出NoCNoC,一种实用的,临界感知的NoC设计,在不损失性能的情况下实现高达60.5%的节能。我们的工作朝着理想高效的NoC方向发展,既不晚也不早地提供数据。
{"title":"Data Criticality in Network-On-Chip Design","authors":"Joshua San Miguel, Natalie D. Enright Jerger","doi":"10.1145/2786572.2786593","DOIUrl":"https://doi.org/10.1145/2786572.2786593","url":null,"abstract":"Many network-on-chip (NoC) designs focus on maximizing performance, delivering data to each core no later than needed by the application. Yet to achieve greater energy efficiency, we argue that it is just as important that data is delivered no earlier than needed. To address this, we explore data criticality in CMPs. Caches fetch data in bulk (blocks of multiple words). Depending on the application's memory access patterns, some words are needed right away (critical) while other data are fetched too soon (non-critical). On a wide range of applications, we perform a limit study of the impact of data criticality in NoC design. Criticality-oblivious designs can waste up to 37.5% energy, compared to an idealized NoC that fetches each word both no later and no earlier than needed. Furthermore, 62.3% of energy is wasted fetching data that is not used by the application. We present NoCNoC, a practical, criticality-aware NoC design that achieves up to 60.5% energy savings with no loss in performance. Our work moves towards an ideally-efficient NoC, delivering data both no later and no earlier than needed.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117257137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Asymmetric NoC Architectures for GPU Systems GPU系统的非对称NoC架构
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786596
Amir Kavyan Ziabari, José L. Abellán, Yenai Ma, A. Joshi, D. Kaeli
While both Chip MultiProcessors (CMPs) and Graphics Processing Units (GPUs) are many-core systems, they exhibit different memory access patterns. CMPs execute threads in parallel, where threads communicate and synchronize through the memory hierarchy (without any coalescing). GPUs on the other hand execute a large number of independent thread blocks and their accesses to memory are frequent and coalesced, resulting in a completely different access pattern. NoC designs for GPUs have not been extensively explored. In this paper, we first evaluate several NoC designs for GPUs to determine the most power/performance efficient NoCs. To improve NoC energy efficiency, we explore an asymmetric NoC design tailored for a GPU's memory access pattern, providing one network for L1-to-L2 communication and a second for L2-to-L1 traffic. Our analysis shows that an asymmetric multi-network Cmesh provides the most energy-efficient communication fabric for our target GPU system.
虽然芯片多处理器(cmp)和图形处理单元(gpu)都是多核系统,但它们表现出不同的内存访问模式。cmp并行执行线程,其中线程通过内存层次结构进行通信和同步(没有任何合并)。另一方面,gpu执行大量独立的线程块,它们对内存的访问是频繁和合并的,导致了完全不同的访问模式。gpu的NoC设计还没有得到广泛的探索。在本文中,我们首先评估了几种gpu的NoC设计,以确定最具功耗/性能效率的NoC。为了提高NoC能源效率,我们探索了针对GPU内存访问模式量身定制的非对称NoC设计,为l1到l2通信提供一个网络,为l2到l1流量提供另一个网络。我们的分析表明,非对称多网络Cmesh为我们的目标GPU系统提供了最节能的通信结构。
{"title":"Asymmetric NoC Architectures for GPU Systems","authors":"Amir Kavyan Ziabari, José L. Abellán, Yenai Ma, A. Joshi, D. Kaeli","doi":"10.1145/2786572.2786596","DOIUrl":"https://doi.org/10.1145/2786572.2786596","url":null,"abstract":"While both Chip MultiProcessors (CMPs) and Graphics Processing Units (GPUs) are many-core systems, they exhibit different memory access patterns. CMPs execute threads in parallel, where threads communicate and synchronize through the memory hierarchy (without any coalescing). GPUs on the other hand execute a large number of independent thread blocks and their accesses to memory are frequent and coalesced, resulting in a completely different access pattern. NoC designs for GPUs have not been extensively explored. In this paper, we first evaluate several NoC designs for GPUs to determine the most power/performance efficient NoCs. To improve NoC energy efficiency, we explore an asymmetric NoC design tailored for a GPU's memory access pattern, providing one network for L1-to-L2 communication and a second for L2-to-L1 traffic. Our analysis shows that an asymmetric multi-network Cmesh provides the most energy-efficient communication fabric for our target GPU system.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129746519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
NoC Architectures as Enablers of Biological Discovery for Personalized and Precision Medicine NoC架构作为个性化和精准医学生物发现的推动者
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2788706
P. Bogdan, Turbo Majumder, A. Ramanathan, Yuankun Xue
This paper overviews the main computational issues in personalized and precision medicine (PPM), and present a cogent case for network-on-chip (NoC)-based multicore platforms as enablers in the process. We identify a series of challenges for the design and optimization of NoC-based solutions for PPM. To capture the characteristics of the cyber-physical sensing and processing, we propose a new computational model built on a dynamical heterogeneous hyper-graph description of application-to-architecture interactions. Starting from these premises, we summarize a few implications on NoC design methodologies, present some NoC-based solutions that deal with some of the challenges, and outline a few open problems.
本文概述了个性化和精准医疗(PPM)中的主要计算问题,并提出了基于片上网络(NoC)的多核平台在这一过程中的推动者的令人信服的案例。我们确定了设计和优化基于noc的PPM解决方案所面临的一系列挑战。为了捕捉信息物理感知和处理的特征,我们提出了一种新的基于应用-架构交互的动态异构超图描述的计算模型。从这些前提出发,我们总结了NoC设计方法的一些含义,提出了一些基于NoC的解决方案,以应对一些挑战,并概述了一些悬而未决的问题。
{"title":"NoC Architectures as Enablers of Biological Discovery for Personalized and Precision Medicine","authors":"P. Bogdan, Turbo Majumder, A. Ramanathan, Yuankun Xue","doi":"10.1145/2786572.2788706","DOIUrl":"https://doi.org/10.1145/2786572.2788706","url":null,"abstract":"This paper overviews the main computational issues in personalized and precision medicine (PPM), and present a cogent case for network-on-chip (NoC)-based multicore platforms as enablers in the process. We identify a series of challenges for the design and optimization of NoC-based solutions for PPM. To capture the characteristics of the cyber-physical sensing and processing, we propose a new computational model built on a dynamical heterogeneous hyper-graph description of application-to-architecture interactions. Starting from these premises, we summarize a few implications on NoC design methodologies, present some NoC-based solutions that deal with some of the challenges, and outline a few open problems.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127346688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fault-tolerant Network-on-Chip based on Fault-aware Flits and Deflection Routing 基于故障感知Flits和偏转路由的片上容错网络
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786585
Armin Runge
Deflection routing is a promising approach for energy and hardware efficient NoCs. Future VLSI designs will have an increasing susceptibility to failures and breakdowns. The inherent redundancy of NoCs can be used to tolerate such failures. We extended the non-fault-tolerant CHIPPER router architecture to enable fault-tolerance. This architecture is based on deflection routing and utilizes a permutation network instead of a crossbar. The permutation network eliminates the sequential dependence of the priority based port allocation. Compared to a crossbar based design, a permutation network allows a faster and smaller router design. Simulations of an 8 × 8 network and more than 30.000 it injections show, that our router architecture is competitive with existing crossbar based fault-tolerant router architectures.
偏转路由是一种很有前途的节能和硬件高效noc方法。未来的超大规模集成电路设计将越来越容易出现故障和故障。noc固有的冗余可以用来容忍这种故障。我们扩展了非容错的CHIPPER路由器架构,使容错成为可能。该架构基于偏转路由,并利用排列网络代替交叉排。排列网络消除了基于优先级的端口分配的顺序依赖性。与基于交叉条的设计相比,排列网络允许更快和更小的路由器设计。对一个8 × 8网络和3万多个网络注入的仿真表明,我们的路由器架构与现有的基于crossbar的容错路由器架构相比具有一定的竞争力。
{"title":"Fault-tolerant Network-on-Chip based on Fault-aware Flits and Deflection Routing","authors":"Armin Runge","doi":"10.1145/2786572.2786585","DOIUrl":"https://doi.org/10.1145/2786572.2786585","url":null,"abstract":"Deflection routing is a promising approach for energy and hardware efficient NoCs. Future VLSI designs will have an increasing susceptibility to failures and breakdowns. The inherent redundancy of NoCs can be used to tolerate such failures. We extended the non-fault-tolerant CHIPPER router architecture to enable fault-tolerance. This architecture is based on deflection routing and utilizes a permutation network instead of a crossbar. The permutation network eliminates the sequential dependence of the priority based port allocation. Compared to a crossbar based design, a permutation network allows a faster and smaller router design. Simulations of an 8 × 8 network and more than 30.000 it injections show, that our router architecture is competitive with existing crossbar based fault-tolerant router architectures.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123942717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Parka: Thermally Insulated Nanophotonic Interconnects 热绝缘纳米光子互连
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2786597
Y. Demir, N. Hardavellas
Silicon-photonics are emerging as the prime candidate technology for energy-efficient on-chip interconnects at future process nodes. However, current designs are primarily based on microrings, which are highly sensitive to temperature. As a result, current silicon-photonic interconnect designs expend a significant amount of energy heating the microrings to a designated narrow temperature range, only to have the majority of the thermal energy waste away and dissipate through the heat sink, and in the process of doing so heat up the logic layer, causing significant performance degradation to the cores and inducing thermal emergencies. We propose Parka, a nanophotonic interconnect that encases the photonic die in a thermal insulator that keeps its temperature stable with low energy expenditure, while minimizing the spatial and temporal thermal coupling between logic and silicon-photonic components. Parka reduces the microring energy by 3.8--5.4x and achieves 11--23% speedup on average (34% max) depending on the cooling solution used.
硅光子学正在成为未来工艺节点上节能片上互连的主要候选技术。然而,目前的设计主要基于对温度高度敏感的微环。因此,目前的硅-光子互连设计花费了大量的能量将微环加热到指定的狭窄温度范围,只是为了让大部分热能浪费掉并通过散热器消散,并且在此过程中加热逻辑层,导致核心性能显著下降并引起热紧急情况。我们提出了Parka,一种纳米光子互连,它将光子芯片封装在热绝缘体中,以低能量消耗保持其温度稳定,同时最大限度地减少逻辑和硅光子组件之间的空间和时间热耦合。根据所使用的冷却方案,Parka将微环能量降低了3.8- 5.4倍,平均加速率达到11- 23%(最高34%)。
{"title":"Parka: Thermally Insulated Nanophotonic Interconnects","authors":"Y. Demir, N. Hardavellas","doi":"10.1145/2786572.2786597","DOIUrl":"https://doi.org/10.1145/2786572.2786597","url":null,"abstract":"Silicon-photonics are emerging as the prime candidate technology for energy-efficient on-chip interconnects at future process nodes. However, current designs are primarily based on microrings, which are highly sensitive to temperature. As a result, current silicon-photonic interconnect designs expend a significant amount of energy heating the microrings to a designated narrow temperature range, only to have the majority of the thermal energy waste away and dissipate through the heat sink, and in the process of doing so heat up the logic layer, causing significant performance degradation to the cores and inducing thermal emergencies. We propose Parka, a nanophotonic interconnect that encases the photonic die in a thermal insulator that keeps its temperature stable with low energy expenditure, while minimizing the spatial and temporal thermal coupling between logic and silicon-photonic components. Parka reduces the microring energy by 3.8--5.4x and achieves 11--23% speedup on average (34% max) depending on the cooling solution used.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124545428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Fabrics on Die: Where Function, Debug and Test Meet 模具上的织物:功能、调试和测试的交汇处
Pub Date : 2015-09-28 DOI: 10.1145/2786572.2788712
Priyadarsan Patra, C. Prudvi
In this paper, we briefly present how packet-based networks or fabrics, have found their way into diverse usages on high-end industrial designs today. We outline the salient features, use models and challenges involved in implementation and application of these fabrics, not only in functional communication but also in power-management, silicon debug and high-volume-manufacturing test. Both debug and test hooks in SOC/NOC and some test/debug scenarios are discussed. We touch on some recent advances in functional networks and their implications to debug & test.
在本文中,我们简要介绍了基于分组的网络或结构如何在当今高端工业设计中找到各种用途。我们概述了这些结构的显著特征,使用模型和涉及的实现和应用挑战,不仅在功能通信,而且在电源管理,硅调试和大批量生产测试。讨论了SOC/NOC中的调试和测试钩子,以及一些测试/调试场景。我们讨论了功能网络的一些最新进展及其对调试和测试的影响。
{"title":"Fabrics on Die: Where Function, Debug and Test Meet","authors":"Priyadarsan Patra, C. Prudvi","doi":"10.1145/2786572.2788712","DOIUrl":"https://doi.org/10.1145/2786572.2788712","url":null,"abstract":"In this paper, we briefly present how packet-based networks or fabrics, have found their way into diverse usages on high-end industrial designs today. We outline the salient features, use models and challenges involved in implementation and application of these fabrics, not only in functional communication but also in power-management, silicon debug and high-volume-manufacturing test. Both debug and test hooks in SOC/NOC and some test/debug scenarios are discussed. We touch on some recent advances in functional networks and their implications to debug & test.","PeriodicalId":228605,"journal":{"name":"Proceedings of the 9th International Symposium on Networks-on-Chip","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122215928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 9th International Symposium on Networks-on-Chip
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1