2013 International Symposium on System on Chip (SoC)最新文献

英文中文

A family of modular area- and energy-efficient QRD-accelerator architectures 一系列模块化面积和节能qrd加速器架构

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675277

U. Vishnoi, T. Noll

QR-decomposition accelerators are attractive SoC components for many applications with a wide range of specifications. A new family of highly area- and energy-efficient, modular two-way linear-array QRD architectures based on the Givens algorithm and CORDIC rotations is proposed. The template architecture allows for implementations of real-/complex-valued and integer/floating-point QRDs. An accurate algebraic cost model enables cross-level optimization over architecture, micro-architecture and circuit level using a rich set of parameters. Quantitative results for exemplary applications are presented for implementations in 40-nm CMOS, proving the significant improvement of efficiency.

qr分解加速器是具有广泛规格的许多应用的有吸引力的SoC组件。提出了一种新的基于Givens算法和CORDIC旋转的高面积和高能效模块化双向线性阵列QRD架构。模板体系结构允许实值/复数值和整数/浮点qrd的实现。精确的代数成本模型可以使用丰富的参数集对架构，微架构和电路级别进行跨级别优化。在40纳米CMOS中实现的示例性应用的定量结果证明了效率的显着提高。

引用次数: 9

Split-cost communication model for improved MPSoC application mapping 改进MPSoC应用映射的分成本通信模型

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675280

M. Odendahl, J. Castrillón, Vitaliy Volevach, R. Leupers, G. Ascheid

Automated mapping of dataflow applications to state-of-the-art, heterogeneous Multiprocessor Systems on Chip (MPSoCs) with complex interconnects and communication means is an ongoing research endeavor. We implement, measure and analyze three different communication libraries for a representative, off-the-shelf platform of this kind. The results of the analysis are used to show the need of a new cost model to properly characterize inter-task communication. Afterwards, this paper presents an algorithm to solve the mapping problem jointly for computation and communication using this cost model. A case study with four real streaming applications shows that the obtained mapping is able to reduce the execution time. Compared to a mapping decision where all channels are mapped to shared memory, the makespan fell down up to 10% due to an automated selection of a more appropriate communication library.

将数据流应用自动映射到具有复杂互连和通信手段的最先进的异构多处理器片上系统(mpsoc)是一项正在进行的研究工作。我们为这种典型的现成平台实现、测量和分析了三种不同的通信库。分析结果表明，需要一种新的成本模型来正确表征任务间通信。在此基础上，提出了一种利用该代价模型共同解决计算和通信映射问题的算法。对四个实际流应用程序的案例研究表明，获得的映射能够减少执行时间。与所有通道都映射到共享内存的映射决策相比，由于自动选择了更合适的通信库，makespan减少了10%。

引用次数: 11

TNODE: A low power sensor node processor for secure wireless networks TNODE:用于安全无线网络的低功耗传感器节点处理器

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675259

G. Panic, O. Schrape, T. Basmer, F. Vater, K. Tittelbach-Helmrich

In this paper we describe a sensor node crypto processor designed for use in wireless sensor networks with strong security demands. The presented system-on-chip is a mixed-signal processor-based design containing the hardware crypto accelerators (AES, ECC, SHA-1) that provide the means for secure communication in the network. The unique system architecture combines an asynchronous processor core with synchronous peripherals resulting in a low-power system operation. The designed chip integrates an embedded Flash memory and a 12-bit ADC making it a suitable solution for small-size sensor node devices. The paper describes the chip architecture and discusses the most important implementation and verification issues. Finally, the results of the chip measurement have been presented.

本文设计了一种传感器节点加密处理器，用于对安全性要求较高的无线传感器网络。所提出的片上系统是一种基于混合信号处理器的设计，包含硬件加密加速器(AES, ECC, SHA-1)，为网络中的安全通信提供了手段。独特的系统架构将异步处理器核心与同步外设相结合，从而实现低功耗系统操作。设计的芯片集成了嵌入式闪存和12位ADC，使其成为小型传感器节点设备的合适解决方案。本文描述了芯片的结构，并讨论了最重要的实现和验证问题。最后给出了芯片测量的结果。

引用次数: 5

Dependency analysis and visualization tool for Kactus2 IP-XACT design framework 依赖性分析和可视化工具Kactus2 IP-XACT设计框架

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675261

J. Määttä, Mikko Honkonen, Tommi Korhonen, E. Salminen, T. Hämäläinen

Large-scale HW and SW projects contain thousands of source files, which requires proper file management in order to keep track of changes and keep the code in compilable state. Different parts of the system depend on each other, and even a small change in a certain part of the code may break the other parts. Dependency analysis can be used to prevent such problems by visualizing the SW structure so that dependencies are easily seen by the developer. This paper presents a novel tool for file dependency and change analysis and visualization that was implemented into our IP-XACT based Kactus2 design environment (GPL2). The tool is capable of sorting source files into IP-XACT file sets, extracting and visualizing file dependencies, and keeping track of changed files. It also offers the ability to create manual dependencies, e.g., between source code and documentation. The dependency and change analysis of 1k source code files containing 140k lines of code is performed in less than two minutes.

大型硬件和软件项目包含数千个源文件，这需要适当的文件管理，以便跟踪更改并保持代码处于可编译状态。系统的不同部分相互依赖，即使是代码某一部分的一个小变化也可能破坏其他部分。依赖性分析可以通过可视化软件结构来防止此类问题，这样开发人员就可以很容易地看到依赖性。本文提出了一种新的文件依赖关系、变更分析和可视化工具，并在基于IP-XACT的Kactus2设计环境(GPL2)中实现。该工具能够将源文件分类为IP-XACT文件集，提取和可视化文件依赖项，并跟踪更改的文件。它还提供了创建手动依赖的能力，例如，在源代码和文档之间。对包含140k行代码的1k个源代码文件的依赖性和变更分析在不到两分钟的时间内完成。

引用次数: 4

Implementation and evaluation of configuration scrubbing on CGRAs: A case study 在CGRAs上实现和评估配置洗涤:一个案例研究

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675262

Syed M. A. H. Jafri, S. Piestrak, A. Hemani, K. Paul, J. Plosila, H. Tenhunen

This paper investigates the overhead imposed by various configuration scrubbing techniques used in fault-tolerant Coarse Grained Reconfigurable Arrays (CGRAs). Today, reconfigurable architectures host large configuration memories. As we progress further in the nanometer regime, these configuration memories have become increasingly susceptible to single event upsets caused e.g. by cosmic radiation. Configuration scrubbing is a frequently used technique to protect these configuration memories against single event upsets. Existing works on configuration scrubbing deal only with FPGA without any reference to the CGRAs (in which configuration memories consume up to 50% of silicon area). Moreover, in the known literature lacks a comprehensive comparison of various configuration scrubbing techniques to guide system designers about the merits/demerits of different scrubbing methods which could be applied to CGRAs. To address these problems, in this paper we classify various configuration scrubbing techniques and quantify their trade-offs when implemented on a CGRA. Synthesis results reveal that scrubbing logic incurs negligible silicon overhead (up to 3% of the area of computational units). Simulation results obtained for a few algorithms/applications (FFT, FIR, matrix multiplication, and WLAN) show that the choice of the configuration scrubbing scheme (external vs. internal) has significant impact on both the size of configuration memory and the number of reconfiguration cycles (respectively 20-80% more and up to 38 times more for the former).

本文研究了容错粗粒度可重构阵列(CGRAs)中使用的各种配置清洗技术所带来的开销。如今，可重构架构承载了大量配置内存。随着我们在纳米领域的进一步发展，这些构型记忆越来越容易受到单一事件的干扰，例如由宇宙辐射引起的干扰。配置清理是一种常用的技术，用于保护这些配置内存不受单个事件干扰。现有的配置清洗工作只处理FPGA，没有任何参考CGRAs(其中配置存储器消耗高达50%的硅面积)。此外，在已知的文献中，缺乏对各种构型洗涤技术的全面比较，以指导系统设计人员了解可应用于CGRAs的不同洗涤方法的优缺点。为了解决这些问题，在本文中，我们对各种配置清洗技术进行了分类，并量化了它们在CGRA上实现时的权衡。合成结果表明，擦洗逻辑产生的硅开销可以忽略不计(最多占计算单元面积的3%)。对一些算法/应用程序(FFT、FIR、矩阵乘法和WLAN)获得的仿真结果表明，配置擦洗方案的选择(外部vs内部)对配置内存的大小和重新配置周期的数量都有显著影响(前者分别多20-80%和38倍)。

{"title":"Implementation and evaluation of configuration scrubbing on CGRAs: A case study","authors":"Syed M. A. H. Jafri, S. Piestrak, A. Hemani, K. Paul, J. Plosila, H. Tenhunen","doi":"10.1109/ISSoC.2013.6675262","DOIUrl":"https://doi.org/10.1109/ISSoC.2013.6675262","url":null,"abstract":"This paper investigates the overhead imposed by various configuration scrubbing techniques used in fault-tolerant Coarse Grained Reconfigurable Arrays (CGRAs). Today, reconfigurable architectures host large configuration memories. As we progress further in the nanometer regime, these configuration memories have become increasingly susceptible to single event upsets caused e.g. by cosmic radiation. Configuration scrubbing is a frequently used technique to protect these configuration memories against single event upsets. Existing works on configuration scrubbing deal only with FPGA without any reference to the CGRAs (in which configuration memories consume up to 50% of silicon area). Moreover, in the known literature lacks a comprehensive comparison of various configuration scrubbing techniques to guide system designers about the merits/demerits of different scrubbing methods which could be applied to CGRAs. To address these problems, in this paper we classify various configuration scrubbing techniques and quantify their trade-offs when implemented on a CGRA. Synthesis results reveal that scrubbing logic incurs negligible silicon overhead (up to 3% of the area of computational units). Simulation results obtained for a few algorithms/applications (FFT, FIR, matrix multiplication, and WLAN) show that the choice of the configuration scrubbing scheme (external vs. internal) has significant impact on both the size of configuration memory and the number of reconfiguration cycles (respectively 20-80% more and up to 38 times more for the former).","PeriodicalId":228272,"journal":{"name":"2013 International Symposium on System on Chip (SoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131233224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Extending IP-XACT to embedded system HW/SW integration 将IP-XACT扩展到嵌入式系统软硬件集成

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-12-02 DOI: 10.1109/ISSoC.2013.6675264

Antti Kamppi, Lauri Matilainen, J. Määttä, E. Salminen, T. Hämäläinen

Typical MPSoC FPGA product design is a rigid waterfall process proceeding one-way from HW to SW design. Any changes to HW trigger the SW project re-creation from the beginning. When several product variations or speculative development time exploration is required, the disk bloats easily with hundreds of Board Support Package (BSP), configuration and SW project files. In this paper, we present an IP-XACT based design flow that solves the problems by agile re-use of HW and SW components, automation and single golden reference source for information. We also present new extensions to IP-XACT since the standard lacks SW related features. Three use cases demonstrate how the BSP is changed, an application is moved to another processor and a function is moved from SW implementation to a HW accelerator. Our flow reduces the design time to one third compared to the conventional FPGA flow, the number of automated design phases is doubled and any manual error prone data transfer between HW and SW tools is completely avoided.

典型的MPSoC FPGA产品设计是一个严格的瀑布式过程，从硬件设计到软件设计是单向的。对硬件的任何更改都会从一开始触发软件项目的重新创建。当需要几个产品变体或推测开发时间探索时，磁盘很容易膨胀，因为有数百个Board Support Package (BSP)、配置和SW项目文件。在本文中，我们提出了一个基于IP-XACT的设计流程，该流程通过灵活重用硬件和软件组件、自动化和单一黄金信息源来解决这些问题。我们还为IP-XACT提供了新的扩展，因为该标准缺乏与软件相关的特性。三个用例演示了如何更改BSP，如何将应用程序移动到另一个处理器，以及如何将功能从软件实现移动到硬件加速器。与传统的FPGA流程相比，我们的流程将设计时间减少了三分之一，自动化设计阶段的数量增加了一倍，并且完全避免了硬件和软件工具之间容易出错的人工数据传输。

引用次数: 13

Proactive Thermal-Budget-Based Beltway Routing algorithm for thermal-aware 3D NoC systems 基于热感知3D NoC系统的主动热预算环城公路路由算法

2013 International Symposium on System on Chip (SoC)

Pub Date : 2013-10-01 DOI: 10.1109/ISSoC.2013.6675281

Che-Chuan Kuo, Kun-Chih Chen, En-Jui Chang, A. Wu

The thermal problems of three-dimensional Network-on-Chip (3D NoC) systems become more serious because of die stacking. Besides, for high-performance requirement, the minimal adaptive routing algorithms result in unbalanced traffic load and worse temperature distribution in the system. On the other hand, the conventional selection strategies determine the routing path based on the traffic information, which leads to unawareness of the potential thermal hotspot and huge performance impact. To solve the problems, in this paper, we first define a novel thermal-aware routing index, Mean Time To Throttle (MTTT), which represents the remaining active time of the node before the temperature achieves the alarming level. Based on the information of MTTT, we propose a Proactive Thermal-Budget-Based Beltway Routing (PTB3R) to balance the temperature distribution of the NoC system. The experimental results show that the proposed PTB3R can help to reduce the number of throttled nodes by 25.56%~86.95% and improve network throughput by around 15.04%~19.87%.

由于芯片的堆叠，三维片上网络系统的热问题变得越来越严重。此外，由于对高性能的要求，最小自适应路由算法会导致系统的流量负载不均衡和温度分布变差。另一方面，传统的路由选择策略根据流量信息确定路由路径，导致无法意识到潜在的热热点，对性能影响很大。为了解决这些问题，本文首先定义了一种新的热感知路由指标——平均节流时间(Mean Time To Throttle, MTTT)，它表示节点在温度达到告警水平之前的剩余活动时间。基于MTTT的信息，我们提出了一种基于主动热预算的环城公路路由(PTB3R)来平衡NoC系统的温度分布。实验结果表明，所提出的PTB3R可将受限节点数量减少25.56%~86.95%，将网络吞吐量提高15.04%~19.87%左右。

引用次数: 17

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2013 International Symposium on System on Chip (SoC)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀