2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip最新文献

英文中文

HARAQ: Congestion-Aware Learning Model for Highly Adaptive Routing Algorithm in On-Chip Networks 片上网络中高度自适应路由算法的拥塞感知学习模型

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.10

M. Ebrahimi, M. Daneshtalab, F. Farahnakian, J. Plosila, P. Liljeberg, M. Palesi, H. Tenhunen

The occurrence of congestion in on-chip networks can severely degrade the performance due to increased message latency. In mesh topology, minimal methods can propagate messages over two directions at each switch. When shortest paths are congested, sending more messages through them can deteriorate the congestion condition considerably. In this paper, we present an adaptive routing algorithm for on-chip networks that provide a wide range of alternative paths between each pair of source and destination switches. Initially, the algorithm determines all permitted turns in the network including 180-degree turns on a single channel without creating cycles. The implementation of the algorithm provides the best usage of all allowable turns to route messages more adaptively in the network. On top of that, for selecting a less congested path, an optimized and scalable learning method is utilized. The learning method is based on local and global congestion information and can estimate the latency from each output channel to the destination region.

由于消息延迟增加，片上网络中出现拥塞会严重降低性能。在网格拓扑结构中，最小方法可以在每个交换机的两个方向上传播消息。当最短路径拥塞时，通过最短路径发送更多的消息会严重恶化拥塞状况。在本文中，我们提出了一种自适应路由算法，用于片上网络，该网络在每对源和目标交换机之间提供广泛的可选路径。最初，该算法确定网络中所有允许的转弯，包括单个通道上的180度转弯，而不会产生周期。该算法的实现提供了所有允许匝数的最佳使用，使网络中的消息路由更自适应。在此基础上，采用一种优化的、可扩展的学习方法来选择较少拥塞的路径。该学习方法基于局部和全局拥塞信息，可以估计从每个输出通道到目的区域的延迟。

引用次数: 87

Modeling and Power Evaluation of On-Chip Router Components in Spintronics 自旋电子学中片上路由器元件的建模与功率评估

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.13

Pierre Schamberger, Zhonghai Lu, Xianyang Jiang, Meikang Qiu

On-chip routers are power hungry components. Besides exploiting current CMOS-based power-saving techniques, it is also desirable to investigate the power saving potential enabled by new technologies and devices. This paper investigates the potential of exploiting the emerging spin-electronics based MTJ (Magnetic Tunnel Junction) devices with application to on-chip router modules, in particular, buffers and crossbars. To this end, we build MTJ models, design circuits based on mixed MTJ-CMOS devices, and evaluate their switching power consumption, using their pure CMOS counterparts as the baseline. Our study shows that the new technology can significantly improve power efficiency for buffers but the gain for crossbars is less clear.

片上路由器是耗电的组件。除了利用当前基于cmos的节能技术外，还需要研究新技术和新器件所带来的节能潜力。本文研究了利用新兴的基于自旋电子学的MTJ(磁隧道结)器件应用于片上路由器模块的潜力，特别是缓冲器和交叉棒。为此，我们建立MTJ模型，设计基于混合MTJ-CMOS器件的电路，并以纯CMOS器件为基准，评估其开关功耗。我们的研究表明，新技术可以显著提高缓冲器的功率效率，但横梁的增益不太清楚。

引用次数: 4

A Statically Scheduled Time-Division-Multiplexed Network-on-Chip for Real-Time Systems 一种用于实时系统的静态调度时分多路片上网络

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.25

Martin Schoeberl, F. Brandner, J. Sparsø, Evangelia Kasapaki

This paper explores the design of a circuit-switched network-on-chip (NoC) based on time-division-multiplexing (TDM) for use in hard real-time systems. Previous work has primarily considered application-specific systems. The work presented here targets general-purpose hardware platforms. We consider a system with IP-cores, where the TDM-NoC must provide directed virtual circuits - all with the same bandwidth - between all nodes. This may not be a frequent scenario, but a general platform should provide this capability, and it is an interesting point in the design space to study. The paper presents an FPGA-friendly hardware design, which is simple, fast, and consumes minimal resources. Furthermore, an algorithm to find minimum-period schedules for all-to-all virtual circuits on top of typical physical NoC topologies like 2D-mesh, torus, bidirectional torus, tree, and fat-tree is presented. The static schedule makes the NoC time-predictable and enables worst-case execution time analysis of communicating real-time tasks.

本文探讨了一种用于硬实时系统的基于时分复用(TDM)的电路交换片上网络(NoC)设计。以前的工作主要考虑特定于应用程序的系统。这里介绍的工作针对通用硬件平台。我们考虑一个具有ip核的系统，其中TDM-NoC必须在所有节点之间提供具有相同带宽的定向虚拟电路。这可能不是一个常见的场景，但是一般的平台应该提供这种功能，这是设计领域中值得研究的一个有趣的点。本文提出了一种简单、快速、资源消耗少的fpga友好型硬件设计方案。此外，提出了一种在典型物理NoC拓扑(如二维网格、环面、双向环面、树和胖树)上寻找全对全虚拟电路最小周期调度的算法。静态调度使NoC时间可预测，并支持通信实时任务的最坏情况执行时间分析。

引用次数: 126

Engineering a Bandwidth-Scalable Optical Layer for a 3D Multi-core Processor with Awareness of Layout Constraints 具有布局约束意识的3D多核处理器的带宽可扩展光学层设计

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.29

L. Ramini, D. Bertozzi, L. Carloni

The performance of future chip multi-processors will only scale with the number of integrated cores if there is a corresponding increase in memory access efficiency. The focus of this paper on a 3D-stacked wavelength-routed optical layer for high bandwidth and low latency processor-memory communication goes in this direction and complements ongoing efforts on photonically integrated bandwidth-rich DRAM devices. This target environment dictates layout constraints that make the difference in discriminating between alternative design choices of the optical layer. This paper assesses network partitioning options and bandwidth scalability techniques with deep technology and layout awareness, the main contribution lying in the characterization and precise quantification of such interaction effects between the technology platform, the layout constraints and the network-level quality metrics of a passive optical NoC.

未来芯片多处理器的性能只有在内存访问效率相应提高的情况下，才会随着集成内核数量的增加而扩大。本文的重点是用于高带宽和低延迟处理器-存储器通信的3d堆叠波长路由光层，并补充了正在进行的光子集成带宽丰富的DRAM设备的工作。这种目标环境决定了布局约束，使得区分不同的光学层设计选择有所不同。本文评估了具有深度技术和布局意识的网络划分选项和带宽可扩展性技术，主要贡献在于表征和精确量化无源光学NoC的技术平台、布局约束和网络级质量指标之间的相互作用效应。

引用次数: 32

Synthesis of NoC Interconnects for Custom MPSoC Architectures 用于定制MPSoC架构的NoC互连的合成

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.16

G. Khan, A. Tino

As technology continues to demand high performance, low power, and integration density, NoC system designers consider multiple aspects during the design phase. This paper addresses these issues and presents an NoC design methodology for generating high quality interconnects for custom Multiprocessor System-on-Chip (MPSoC) architectures. Our design methodology incorporates the main objectives of power and performance during topology synthesis while employing both analytical and simulation based automated techniques. A rendezvous interaction performance analysis method is presented where Layered Queuing Network models are invoked to observe the asynchronous interactions between NoC components and identify possible performance degradation in the on-chip network. Several experiments are conducted using various SoC benchmark applications to compare the power and performance outcomes of our proposed technique.

随着技术不断要求高性能、低功耗和集成密度，NoC系统设计人员在设计阶段要考虑多个方面。本文解决了这些问题，并提出了一种NoC设计方法，用于为定制的多处理器片上系统(MPSoC)架构生成高质量的互连。我们的设计方法在拓扑合成过程中结合了功率和性能的主要目标，同时采用基于分析和仿真的自动化技术。提出了一种集合交互性能分析方法，该方法通过调用分层排队网络模型来观察片上网络中NoC组件之间的异步交互，并识别可能出现的性能下降。使用各种SoC基准应用程序进行了几个实验，以比较我们提出的技术的功率和性能结果。

引用次数: 11

Dynamic Flow Regulation for IP Integration on Network-on-Chip 片上网络IP集成的动态流量调节

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.21

Zhonghai Lu, Yi Wang

Flow regulation is a traffic shaping technique, which can be used to achieve communication performance guarantees with low buffering cost when integrating IPs to network-on-chip architectures. This paper presents dynamic flow regulation, which overcomes the rigidity of static flow regulation that pre-configures regulation parameters statically and only once. The dynamic regulation is made possible by employing a sliding window based online flow ($sigma$, $rho$) characterization technique, where $sigma$ bounds traffic burstiness and $rho$ reflects the average rate. The characterization method is effective and can be implemented in hardware with small area and high speed. The resulting dynamic regulation can adaptively adjust the traffic regulation strength in response to real traffic workload scenarios. As such, it makes more efficient use of the system interconnect resources, leading to significant improvement in network performance.

流量调节是一种流量整形技术，可以在将ip集成到片上网络架构时，以较低的缓冲成本实现通信性能的保证。本文提出了一种动态流量调节方法，克服了静态流量调节的刚性，即静态预配置一次调节参数。通过采用基于滑动窗口的在线流量($sigma$, $rho$)表征技术，动态调节成为可能，其中$sigma$限制流量突发，$rho$反映平均速率。该表征方法是有效的，可以在小面积、高速度的硬件上实现。所得到的动态调节可以根据实际的交通负载场景自适应调整交通调节强度。因此，它可以更有效地利用系统互连资源，从而显著提高网络性能。

引用次数: 10

Efficient Timing Channel Protection for On-Chip Networks 片上网络的有效时序通道保护

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.24

Yu Wang, Danfeng Zhang, Yao Wang, Ed Suh, A. Myers, Andrew Ferraiuolo, G. Suh, Nithin Michael, A. Tang, Jiang Xu, Huazhong Yang

On-chip network is often dynamically shared among applications that are concurrently running on a chip-multiprocessor (CMP). In general, such shared resources imply that applications can affect each other's timing characteristics through interference in shared resources. For example, in on-chip networks, multiple flows can compete for links and buffers. We show that this interference is an attack vector through which a malicious application may be able to infer data-dependent information about other applications (side channel attacks), or two applications can exchange information covertly when direct communications are prohibited (covert channel attacks). To prevent these timing channel attacks, we propose an efficient scheme which uses priority-based arbitration and a static limit mechanism to provide one-way information-leak protection. The proposed technique requires minimal changes to the router hardware. The simulation results show that the protection scheme effectively eliminates a timing channel from high-security to low-security domains with minimal performance overheads for realistic traffic patterns.

片上网络通常在并发运行在芯片多处理器(CMP)上的应用程序之间动态共享。通常，这种共享资源意味着应用程序可以通过对共享资源的干扰来影响彼此的时序特征。例如，在片上网络中，多个流可以竞争链路和缓冲区。我们表明，这种干扰是一种攻击向量，恶意应用程序可以通过它推断有关其他应用程序的数据依赖信息(侧信道攻击)，或者两个应用程序可以在禁止直接通信时秘密交换信息(隐蔽信道攻击)。为了防止这些定时信道攻击，我们提出了一种有效的方案，该方案使用基于优先级的仲裁和静态限制机制来提供单向信息泄漏保护。所提出的技术需要对路由器硬件进行最小的更改。仿真结果表明，该保护方案以最小的性能开销有效地消除了从高安全域到低安全域的时序信道。

{"title":"Efficient Timing Channel Protection for On-Chip Networks","authors":"Yu Wang, Danfeng Zhang, Yao Wang, Ed Suh, A. Myers, Andrew Ferraiuolo, G. Suh, Nithin Michael, A. Tang, Jiang Xu, Huazhong Yang","doi":"10.1109/NOCS.2012.24","DOIUrl":"https://doi.org/10.1109/NOCS.2012.24","url":null,"abstract":"On-chip network is often dynamically shared among applications that are concurrently running on a chip-multiprocessor (CMP). In general, such shared resources imply that applications can affect each other's timing characteristics through interference in shared resources. For example, in on-chip networks, multiple flows can compete for links and buffers. We show that this interference is an attack vector through which a malicious application may be able to infer data-dependent information about other applications (side channel attacks), or two applications can exchange information covertly when direct communications are prohibited (covert channel attacks). To prevent these timing channel attacks, we propose an efficient scheme which uses priority-based arbitration and a static limit mechanism to provide one-way information-leak protection. The proposed technique requires minimal changes to the router hardware. The simulation results show that the protection scheme effectively eliminates a timing channel from high-security to low-security domains with minimal performance overheads for realistic traffic patterns.","PeriodicalId":6333,"journal":{"name":"2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip","volume":"1 1","pages":"142-151"},"PeriodicalIF":0.0,"publicationDate":"2012-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83061601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 117

DSENT - A Tool Connecting Emerging Photonics with Electronics for Opto-Electronic Networks-on-Chip Modeling DSENT -连接新兴光子学与电子学的工具，用于光电片上网络建模

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.31

Chen Sun, C. Chen, George Kurian, Lan Wei, Jason E. Miller, A. Agarwal, L. Peh, V. Stojanović

With the rise of many-core chips that require substantial bandwidth from the network on chip (NoC), integrated photonic links have been investigated as a promising alternative to traditional electrical interconnects. While numerous opto-electronic NoCs have been proposed, evaluations of photonic architectures have thus-far had to use a number of simplifications, reflecting the need for a modeling tool that accurately captures the tradeoffs for the emerging technology and its impacts on the overall network. In this paper, we present DSENT, a NoC modeling tool for rapid design space exploration of electrical and opto-electrical networks. We explain our modeling framework and perform an energy-driven case study, focusing on electrical technology scaling, photonic parameters, and thermal tuning. Our results show the implications of different technology scenarios and, in particular, the need to reduce laser and thermal tuning power in a photonic network due to their non-data-dependent nature.

随着需要大量带宽的多核芯片的兴起，集成光子链路作为传统电互连的一种有前途的替代方案被研究。虽然已经提出了许多光电noc，但到目前为止，对光子体系结构的评估不得不使用许多简化方法，这反映出需要一种建模工具来准确地捕捉新兴技术及其对整个网络的影响的权衡。在本文中，我们提出了DSENT，一种用于快速设计空间探索的电气和光电网络的NoC建模工具。我们解释了我们的建模框架，并执行了一个能量驱动的案例研究，重点是电气技术缩放，光子参数和热调谐。我们的研究结果显示了不同技术场景的含义，特别是由于光子网络的非数据依赖性，需要降低激光和热调谐功率。

引用次数: 523

Overlaid Mesh Topology Design and Deadlock Free Routing in Wireless Network-on-Chip 无线片上网络的覆盖网格拓扑设计与无死锁路由

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1109/NOCS.2012.11

Dan Zhao, Rui-Qing Wu

To bridge the widening gap between computation requirements of terascale application and communication efficiency faced by many-core processor chips, wireless Network-on-Chip (WiNoC) has been proposed by using the recently developed CMOS ultra wideband interconnection. In this research, we propose an unequal RF nodes overlaid mesh topology design to improve the on-chip communication performance. A network capacity model is developed for fast searching of optimal topology configuration. A high-efficient, low-cost zone-aided routing scheme is designed to facilitate deadlock freedom. The simulation study demonstrates topology modeling effectiveness, routing efficiency, and promising network performance of the overlaid mesh WiNoC over a regular 2D mesh baseline.

为了解决太万亿级应用的计算需求与多核处理器芯片所面临的通信效率之间日益扩大的差距，利用最新发展的CMOS超宽带互连技术提出了无线片上网络(WiNoC)。在这项研究中，我们提出了一个不相等的射频节点覆盖网格拓扑设计，以提高片上通信性能。为了快速搜索最优拓扑结构，建立了网络容量模型。为了避免死锁，设计了一种高效、低成本的区域辅助路由方案。仿真研究证明了在常规二维网格基线上叠加网格WiNoC的拓扑建模有效性、路由效率和良好的网络性能。

引用次数: 18

In-network Monitoring and Control Policy for DVFS of CMP Networks-on-Chip and Last Level Caches CMP片上网络和末级缓存DVFS的网内监控策略

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

Pub Date : 2012-05-09 DOI: 10.1145/2504905

X. Chen, Zheng Xu, Hyungjun Kim, Paul V. Gratz, Jiang Hu, M. Kishinevsky, Ümit Y. Ogras

In chip design today and for a foreseeable future, on-chip communication is not only a performance bottleneck but also a substantial power consumer. This work focuses on employing dynamic voltage and frequency scaling (DVFS) policies for networks-on-chip (NoC) and shared, distributed last-level caches (LLC). In particular, we consider a practical system architecture where the distributed LLC and the NoC share a voltage/frequency domain which is separate from the core domain. This architecture enables controlling the relative speed between the cores and memory hierarchy without introducing synchronization delays within the NoC. DVFS for this architecture is more difficult than individual link/core-based DVFS since it involves spatially distributed monitoring and control. We propose an average memory access time (AMAT)-based monitoring technique and integrate it with DVFS based on PID control theory. Simulations on PARSEC benchmarks yield a 33% dynamic energy savings with a negligible impact on system performance.

在当今和可预见的未来，芯片上的通信不仅是性能瓶颈，也是一个巨大的功耗消耗。这项工作的重点是在片上网络(NoC)和共享、分布式最后一级缓存(LLC)中采用动态电压和频率缩放(DVFS)策略。特别是，我们考虑了一种实用的系统架构，其中分布式LLC和NoC共享与核心域分离的电压/频率域。这种体系结构可以控制内核和内存层次之间的相对速度，而不会在NoC中引入同步延迟。这种体系结构的DVFS比基于单个链接/核心的DVFS更困难，因为它涉及空间分布的监视和控制。提出了一种基于平均存储器访问时间(AMAT)的监控技术，并将其与基于PID控制理论的DVFS相结合。在PARSEC基准测试上的模拟产生了33%的动态节能，对系统性能的影响可以忽略不计。

引用次数: 61

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀