首页 > 最新文献

2009 IEEE Computer Society Annual Symposium on VLSI最新文献

英文 中文
Overview of the Scalable Communications Core: A Reconfigurable Wireless Baseband in 65nm CMOS 可扩展通信核心概述:65nm CMOS可重构无线基带
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.15
A. Chun, Kyle McCanta, E. Sandoval, Kapil Gulati
The Scalable Communications Core (SCC) is a  flexible baseband processor that consists of a heterogeneous set of coarse-grained, programmable accelerators connected via a packet-based 3-ary 2-cube Network-on-Chip (NoC).  SCC supports multiple wireless protocols to meet the demand for ubiquitous communications and computing with low power and area.We have recently completed a prototype test chip in a 65nm process and validated it for WiFi and WiMAX protocols. The area and energy efficiency of our test chip is comparable to other basebands found in the literature.  To demonstrate its flexibility, additional protocols have been mapped to the architecture.
可扩展通信核心(SCC)是一种灵活的基带处理器,它由一组异构的粗粒度可编程加速器组成,通过基于分组的3-ary 2-cube片上网络(NoC)连接。SCC支持多种无线协议,以满足无处不在的通信和低功耗、低面积计算的需求。我们最近完成了一个65纳米工艺的原型测试芯片,并对WiFi和WiMAX协议进行了验证。我们的测试芯片的面积和能量效率与文献中发现的其他基带相当。为了演示其灵活性,将附加协议映射到该体系结构。
{"title":"Overview of the Scalable Communications Core: A Reconfigurable Wireless Baseband in 65nm CMOS","authors":"A. Chun, Kyle McCanta, E. Sandoval, Kapil Gulati","doi":"10.1109/ISVLSI.2009.15","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.15","url":null,"abstract":"The Scalable Communications Core (SCC) is a  flexible baseband processor that consists of a heterogeneous set of coarse-grained, programmable accelerators connected via a packet-based 3-ary 2-cube Network-on-Chip (NoC).  SCC supports multiple wireless protocols to meet the demand for ubiquitous communications and computing with low power and area.We have recently completed a prototype test chip in a 65nm process and validated it for WiFi and WiMAX protocols. The area and energy efficiency of our test chip is comparable to other basebands found in the literature.  To demonstrate its flexibility, additional protocols have been mapped to the architecture.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"315 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116686821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Floorplan Driven High Level Synthesis for Crosstalk Noise Minimization in Macro-cell Based Designs 基于大单元设计的串扰噪声最小化的平面驱动高电平合成
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.59
Hariharan Sankaran, S. Katkoori
In DSM regime, due to higher interconnect densities, the coupling noise between adjacent signals is aggravated and can lead to many timing violations. In traditional high-level synthesis (HLS), due to lack detailed physical details, it is difficult to accurately estimate crosstalk. Crosstalk minimization is typically done during routing, which makes it computationally expensive to be used within an iterative design flow. In this paper, we propose a floorplan driven highlevel synthesis framework for minimizing crosstalk in a bus-based architecture. The proposed framework employs a Simulated Annealing engine to simultaneously explore HLS (scheduling, allocation, and binding) and floorplan (module swap, module move, and module rotate) subspaces. The effect of a high-level decision is evaluated by updating the floorplan and identifying crosstalk prone buses (i.e., those buses exceeding Lcrit). The primary goal is to minimize the number of crosstalk violations with minimum area and latency overheads. We have validated the approach by synthesizing netlists down to layout-level using Cadence-SOC encounter followed by detailed crosstalk noise analysis using Cadence Celtic. Experimental results for three DSP benchmarks (DCT, EWF, and FFT) demonstrate that the proposed approach can reduce crosstalk violations by as much as 96% (in 180 nm technology node) with an average reduction of 75% over the designs synthesized with traditional sequential flow.
在DSM体制下,由于较高的互连密度,相邻信号之间的耦合噪声会加剧,并可能导致许多时序违规。在传统的高阶合成(HLS)中,由于缺乏详细的物理细节,难以准确地估计串扰。串扰最小化通常在路由过程中完成,这使得在迭代设计流程中使用它的计算成本很高。在本文中,我们提出了一个平面图驱动的高级综合框架,以减少基于总线的体系结构中的串扰。该框架采用模拟退火引擎同时探索HLS(调度、分配和绑定)和floorplan(模块交换、模块移动和模块旋转)子空间。通过更新平面图和识别容易串扰的总线(即那些超过Lcrit的总线)来评估高层决策的效果。主要目标是以最小的面积和延迟开销最小化串扰违规的数量。我们通过使用Cadence- soc遭遇合成网络列表到布局级别,然后使用Cadence Celtic进行详细的串扰噪声分析,验证了该方法。三个DSP基准(DCT, EWF和FFT)的实验结果表明,该方法可以减少多达96%的串扰违反(在180 nm技术节点上),平均减少75%的设计与传统的顺序流合成。
{"title":"Floorplan Driven High Level Synthesis for Crosstalk Noise Minimization in Macro-cell Based Designs","authors":"Hariharan Sankaran, S. Katkoori","doi":"10.1109/ISVLSI.2009.59","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.59","url":null,"abstract":"In DSM regime, due to higher interconnect densities, the coupling noise between adjacent signals is aggravated and can lead to many timing violations. In traditional high-level synthesis (HLS), due to lack detailed physical details, it is difficult to accurately estimate crosstalk. Crosstalk minimization is typically done during routing, which makes it computationally expensive to be used within an iterative design flow. In this paper, we propose a floorplan driven highlevel synthesis framework for minimizing crosstalk in a bus-based architecture. The proposed framework employs a Simulated Annealing engine to simultaneously explore HLS (scheduling, allocation, and binding) and floorplan (module swap, module move, and module rotate) subspaces. The effect of a high-level decision is evaluated by updating the floorplan and identifying crosstalk prone buses (i.e., those buses exceeding Lcrit). The primary goal is to minimize the number of crosstalk violations with minimum area and latency overheads. We have validated the approach by synthesizing netlists down to layout-level using Cadence-SOC encounter followed by detailed crosstalk noise analysis using Cadence Celtic. Experimental results for three DSP benchmarks (DCT, EWF, and FFT) demonstrate that the proposed approach can reduce crosstalk violations by as much as 96% (in 180 nm technology node) with an average reduction of 75% over the designs synthesized with traditional sequential flow.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123548996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Efficient Rerouting Algorithms for Congestion Mitigation 缓解拥塞的有效重路由算法
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.38
M. Chaudhry, Z. Asad, A. Sprintson, J. Hu
Congestion mitigation and overflow avoidance are two of the major goals of the global routing stage. With a significant increase in the chip size and routing complexity,congestion and overflow have become critical issues in physical design automation. In this paper we present several routing algorithms for congestion reduction and overflow avoidance.Our methods are based on ripping up nets that go through the congested regions and replacing them with congestion-aware Steiner trees. We propose several efficient algorithms for finding congestion-aware Steiner trees and evaluate their performance using the ISPD routing benchmarks. We also show that the novel technique of network coding contributes to further improvements in routability and reduction of congestion. Accordingly, we propose an algorithm for identifying efficient congestion-aware network coding topologies and evaluate its performance.
缓解拥塞和避免溢出是全局路由阶段的两个主要目标。随着芯片尺寸和路由复杂性的显著增加,拥塞和溢出已成为物理设计自动化中的关键问题。本文提出了几种减少拥塞和避免溢出的路由算法。我们的方法是拆除穿过拥堵区域的网,代之以感知拥堵的斯坦纳树。我们提出了几种有效的算法来寻找拥塞感知的斯坦纳树,并使用ISPD路由基准评估它们的性能。我们还表明,新的网络编码技术有助于进一步提高可达性和减少拥塞。因此,我们提出了一种算法来识别有效的拥塞感知网络编码拓扑并评估其性能。
{"title":"Efficient Rerouting Algorithms for Congestion Mitigation","authors":"M. Chaudhry, Z. Asad, A. Sprintson, J. Hu","doi":"10.1109/ISVLSI.2009.38","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.38","url":null,"abstract":"Congestion mitigation and overflow avoidance are two of the major goals of the global routing stage. With a significant increase in the chip size and routing complexity,congestion and overflow have become critical issues in physical design automation. In this paper we present several routing algorithms for congestion reduction and overflow avoidance.Our methods are based on ripping up nets that go through the congested regions and replacing them with congestion-aware Steiner trees. We propose several efficient algorithms for finding congestion-aware Steiner trees and evaluate their performance using the ISPD routing benchmarks. We also show that the novel technique of network coding contributes to further improvements in routability and reduction of congestion. Accordingly, we propose an algorithm for identifying efficient congestion-aware network coding topologies and evaluate its performance.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129765485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An Efficient Hardware Architecture for Multimedia Encryption and Authentication Using the Discrete Wavelet Transform 基于离散小波变换的高效多媒体加密认证硬件架构
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.26
A. Pande, Joseph Zambreno
This paper introduces a zero-overhead encryption and authentication scheme for real-time embedded multimedia systems. The parametrized construction of the Discrete Wavelet Transform (DWT) compression block is used to introduce a free parameter in the design. It allows building a keyspace for lightweight multimedia encryption. The parametrization yields rational coefficients leading to an efficient fixed point hardware implementation. A clock speed of over 240 MHz was achieved on a Xilinx Virtex 5 FPGA. Comparison with existing approaches was performed to indicate the high throughput and low hardware overhead in adding the security feature to the DWT architecture.
介绍了一种用于实时嵌入式多媒体系统的零开销加密与认证方案。采用离散小波变换(DWT)压缩块的参数化构造,在设计中引入自由参数。它允许为轻量级多媒体加密构建一个密钥空间。参数化产生合理的系数,从而实现高效的定点硬件实现。在Xilinx Virtex 5 FPGA上实现了超过240 MHz的时钟速度。通过与现有方法的比较,表明在DWT体系结构中添加安全特性具有高吞吐量和低硬件开销。
{"title":"An Efficient Hardware Architecture for Multimedia Encryption and Authentication Using the Discrete Wavelet Transform","authors":"A. Pande, Joseph Zambreno","doi":"10.1109/ISVLSI.2009.26","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.26","url":null,"abstract":"This paper introduces a zero-overhead encryption and authentication scheme for real-time embedded multimedia systems. The parametrized construction of the Discrete Wavelet Transform (DWT) compression block is used to introduce a free parameter in the design. It allows building a keyspace for lightweight multimedia encryption. The parametrization yields rational coefficients leading to an efficient fixed point hardware implementation. A clock speed of over 240 MHz was achieved on a Xilinx Virtex 5 FPGA. Comparison with existing approaches was performed to indicate the high throughput and low hardware overhead in adding the security feature to the DWT architecture.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132108345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Lifetime Reliability Aware Design Flow Techniques for Dual-Vdd Based Platform FPGAs 基于双vdd平台fpga的寿命可靠性感知设计流程技术
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.42
P. Mangalagiri, N. Vijaykrishnan
Increasing on-chip power densities with aggressive technology scaling has led to a low-power FPGA fabric with dual supply voltages. Such low-power techniques coupled with the heterogeneity of components on a FPGA have led to non-uniform aging of components due to temperature and voltage dependent failure mechanisms. In this paper, we present techniques in placement and routing stages of the design flow that will increase the average life-time of components by ensuring uniform aging. We first study the impact of temperature and voltage variations on lifetime reliability of components. In the presence of such variations, we study the impact of aging in FPGA interconnects due to Electromigration (EM), and di-electric breakdown due to Time Dependent Dielectric Breakdown (TDDB). We also consider the performance degradation due to Hot Carrier Instability (HCI) in our design flow optimizations.The proposed reliability aware design flow techniques achieve anaverage of 65.8% and 75% improvement in lifetime of LUTs and interconnect wires respectively.
随着芯片上功率密度的不断增加和技术的不断扩展,双电源电压的低功耗FPGA结构应运而生。这种低功耗技术与FPGA上组件的异质性相结合,由于温度和电压相关的失效机制,导致组件的不均匀老化。在本文中,我们提出了在设计流程的放置和路由阶段的技术,这些技术将通过确保均匀老化来增加组件的平均寿命。我们首先研究了温度和电压变化对元件寿命可靠性的影响。在存在这些变化的情况下,我们研究了由于电迁移(EM)和由于时间相关介电击穿(TDDB)导致的双电击穿对FPGA互连老化的影响。在我们的设计流程优化中,我们还考虑了由于热载流子不稳定性(HCI)而导致的性能下降。所提出的可靠性感知设计流程技术分别使lut和互连线的寿命平均提高了65.8%和75%。
{"title":"Lifetime Reliability Aware Design Flow Techniques for Dual-Vdd Based Platform FPGAs","authors":"P. Mangalagiri, N. Vijaykrishnan","doi":"10.1109/ISVLSI.2009.42","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.42","url":null,"abstract":"Increasing on-chip power densities with aggressive technology scaling has led to a low-power FPGA fabric with dual supply voltages. Such low-power techniques coupled with the heterogeneity of components on a FPGA have led to non-uniform aging of components due to temperature and voltage dependent failure mechanisms. In this paper, we present techniques in placement and routing stages of the design flow that will increase the average life-time of components by ensuring uniform aging. We first study the impact of temperature and voltage variations on lifetime reliability of components. In the presence of such variations, we study the impact of aging in FPGA interconnects due to Electromigration (EM), and di-electric breakdown due to Time Dependent Dielectric Breakdown (TDDB). We also consider the performance degradation due to Hot Carrier Instability (HCI) in our design flow optimizations.The proposed reliability aware design flow techniques achieve anaverage of 65.8% and 75% improvement in lifetime of LUTs and interconnect wires respectively.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130812371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Scheduling for an Embedded Architecture with a Flexible Datapath 具有灵活数据路径的嵌入式架构调度
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.6
Thomas Schilling, Magnus Själander, P. Larsson-Edefors
Embedded systems put stringent demands on post-fabrication  flexibility as well as computing performance efficiency. The FlexSoC  scheme approaches the implementation of embedded systems from a  general-purpose processor point of view: The FlexCore processor has  a datapath whose configuration is under instruction control; in  its minimal configuration, the processor represents a simple 5-stage  pipeline. However, thanks to a flexible processor interconnect, the  FlexCore datapath configuration can be changed at run-time to boost  performance for the currently executed code. The consequence of this  flexibility is that pipelining is not hard-coded into the datapath,  but all instruction scheduling needs to be done by software at  compile time. We present a scheduling technique for the FlexCore  processor allowing for efficient use of datapath resources over a  flexible interconnect. The flexible interconnect indeed offers  plenty of opportunities for parallel operations, but it also makes  the analysis of instruction dependencies difficult. Thus, we propose  to use a SAT-solver to enable the scheduler to efficiently check  constraints on computing and communication resources. In an  evaluation on four different benchmarks, our scheduler is shown to  produce schedules that are as efficient as fine-tuned, manual  schedules.
嵌入式系统对加工后的灵活性和计算性能效率提出了严格的要求。FlexSoC方案从通用处理器的角度实现嵌入式系统:FlexCore处理器有一个数据路径,其配置在指令控制下;在其最小配置中,处理器表示一个简单的5阶段管道。然而,由于灵活的处理器互连,FlexCore数据路径配置可以在运行时更改,以提高当前执行代码的性能。这种灵活性的结果是,流水线没有硬编码到数据路径中,但所有指令调度都需要在编译时由软件完成。我们提出了一种FlexCore处理器的调度技术,允许在灵活的互连上有效地使用数据路径资源。灵活的互连确实为并行操作提供了大量的机会,但它也使指令依赖关系的分析变得困难。因此,我们建议使用sat求解器使调度程序能够有效地检查计算和通信资源的约束。在对四个不同基准的评估中,我们的调度器显示可以生成与经过微调的手动调度器一样高效的调度器。
{"title":"Scheduling for an Embedded Architecture with a Flexible Datapath","authors":"Thomas Schilling, Magnus Själander, P. Larsson-Edefors","doi":"10.1109/ISVLSI.2009.6","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.6","url":null,"abstract":"Embedded systems put stringent demands on post-fabrication  flexibility as well as computing performance efficiency. The FlexSoC  scheme approaches the implementation of embedded systems from a  general-purpose processor point of view: The FlexCore processor has  a datapath whose configuration is under instruction control; in  its minimal configuration, the processor represents a simple 5-stage  pipeline. However, thanks to a flexible processor interconnect, the  FlexCore datapath configuration can be changed at run-time to boost  performance for the currently executed code. The consequence of this  flexibility is that pipelining is not hard-coded into the datapath,  but all instruction scheduling needs to be done by software at  compile time. We present a scheduling technique for the FlexCore  processor allowing for efficient use of datapath resources over a  flexible interconnect. The flexible interconnect indeed offers  plenty of opportunities for parallel operations, but it also makes  the analysis of instruction dependencies difficult. Thus, we propose  to use a SAT-solver to enable the scheduler to efficiently check  constraints on computing and communication resources. In an  evaluation on four different benchmarks, our scheduler is shown to  produce schedules that are as efficient as fine-tuned, manual  schedules.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129727708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Variation Aware Routing for Three-Dimensional FPGAs 三维fpga的变化感知路由
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.44
Chen Dong, S. Chilstedt, Deming Chen
To maximize the potential of three-dimensional integrated circuit architectures, 3D CAD tools must be developed that are on-par with their 2D counterparts. In this paper, we present a statistical static timing analysis (SSTA) engine designed to deal with both the uncorrelated and correlated variations in 3D FPGAs. We consider the effects of intra-die and inter-die variation. Using the 3D physical design tool TPR as a base, we develop a new 3D routing algorithm which improves the average performance of two layer designs by over 22% and three layer designs by over 27%. To the best of our knowledge, this is the first physical design tool to consider variation in the routing and timing analysis of 3D FPGAs.
为了最大限度地发挥三维集成电路架构的潜力,必须开发与2D对等的3D CAD工具。在本文中,我们提出了一个统计静态时序分析(SSTA)引擎,旨在处理三维fpga中不相关和相关的变化。我们考虑了模内和模间变化的影响。以三维物理设计工具TPR为基础,我们开发了一种新的三维路由算法,使两层设计的平均性能提高22%以上,三层设计的平均性能提高27%以上。据我们所知,这是第一个考虑3D fpga路由和时序分析变化的物理设计工具。
{"title":"Variation Aware Routing for Three-Dimensional FPGAs","authors":"Chen Dong, S. Chilstedt, Deming Chen","doi":"10.1109/ISVLSI.2009.44","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.44","url":null,"abstract":"To maximize the potential of three-dimensional integrated circuit architectures, 3D CAD tools must be developed that are on-par with their 2D counterparts. In this paper, we present a statistical static timing analysis (SSTA) engine designed to deal with both the uncorrelated and correlated variations in 3D FPGAs. We consider the effects of intra-die and inter-die variation. Using the 3D physical design tool TPR as a base, we develop a new 3D routing algorithm which improves the average performance of two layer designs by over 22% and three layer designs by over 27%. To the best of our knowledge, this is the first physical design tool to consider variation in the routing and timing analysis of 3D FPGAs.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128970971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Leakage Power and Side Channel Security of Nanoscale Cryptosystem-on-Chip (CoC) 纳米级芯片上密码系统(CoC)的泄漏功率与侧信道安全性
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.46
Amir Khatib Zadeh, C. Gebotys
This paper investigates the viability of using leakage power consumption as a source of side channel information. The side channel effect is characterized in leakage power. It is shown that the increasing trend of leakage power is highly correlated with security vulnerability of cryptosystems. Addressing the severity of the side channel threat in nanoscale Cryptosystem-on-Chip (CoC), we examine the leakage reduction techniques for the side channel security application. The result shows among the circuit-based reduction techniques high Vth transistor assignment which significantly reduces both average and standard deviation of the leakage power can be exploited as a side channel aware leakage reduction in design and implementation of CoC in submicron era. The findings in this work which are presented for the first time are crucial for the development of side channel resistant cryptosystems in the upcoming CMOS technologies.
本文研究了使用泄漏功耗作为侧信道信息来源的可行性。侧通道效应表现为泄漏功率。结果表明,泄漏功率的增加趋势与密码系统的安全漏洞高度相关。为了解决纳米级芯片上密码系统(CoC)中侧信道威胁的严重性,我们研究了侧信道安全应用的泄漏减少技术。结果表明,在基于电路的减少技术中,高v晶体管分配可以显著降低泄漏功率的平均和标准差,可以作为亚微米时代CoC设计和实现的侧通道感知泄漏减少技术。本文首次提出的研究结果对于未来CMOS技术中抗侧信道密码系统的开发至关重要。
{"title":"Leakage Power and Side Channel Security of Nanoscale Cryptosystem-on-Chip (CoC)","authors":"Amir Khatib Zadeh, C. Gebotys","doi":"10.1109/ISVLSI.2009.46","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.46","url":null,"abstract":"This paper investigates the viability of using leakage power consumption as a source of side channel information. The side channel effect is characterized in leakage power. It is shown that the increasing trend of leakage power is highly correlated with security vulnerability of cryptosystems. Addressing the severity of the side channel threat in nanoscale Cryptosystem-on-Chip (CoC), we examine the leakage reduction techniques for the side channel security application. The result shows among the circuit-based reduction techniques high Vth transistor assignment which significantly reduces both average and standard deviation of the leakage power can be exploited as a side channel aware leakage reduction in design and implementation of CoC in submicron era. The findings in this work which are presented for the first time are crucial for the development of side channel resistant cryptosystems in the upcoming CMOS technologies.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123173803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
NoC Power Optimization Using a Reconfigurable Router 使用可重构路由器的NoC电源优化
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.7
C. Concatto, D. Matos, L. Carro, F. Kastensmidt, A. Susin, M. Kreutz
In real applications there are different communication needs among the cores. When NoCs are the means to interconnect the cores, the use of some techniques to optimize the communication are indispensable. From the performance point of view, large buffer sizes ensure performance during different applications execution, but unfortunately, these same buffers are the main responsible for the router total power dissipation. Another aspect is that by sizing buffers for the worst case latency incurs in extra dissipation for the mean case, which is much more frequent. To cope with this problem, in this paper we propose a dynamically reconfigurable router for a NoC. With the reconfigurable router it was possible to reduce the congestion in the network, while at the same time reducing power dissipation and improving energy.
在实际应用中,内核之间有不同的通信需求。当noc作为核心互连的手段时,使用一些技术来优化通信是必不可少的。从性能的角度来看,大的缓冲区大小确保了不同应用程序执行期间的性能,但不幸的是,这些相同的缓冲区是路由器总功耗的主要原因。另一个方面是,在最坏情况下调整缓冲区的大小会导致平均情况下的额外耗散,这种情况要频繁得多。为了解决这一问题,本文提出了一种NoC动态可重构路由器。使用可重构路由器,可以减少网络中的拥塞,同时降低功耗和提高能量。
{"title":"NoC Power Optimization Using a Reconfigurable Router","authors":"C. Concatto, D. Matos, L. Carro, F. Kastensmidt, A. Susin, M. Kreutz","doi":"10.1109/ISVLSI.2009.7","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.7","url":null,"abstract":"In real applications there are different communication needs among the cores. When NoCs are the means to interconnect the cores, the use of some techniques to optimize the communication are indispensable. From the performance point of view, large buffer sizes ensure performance during different applications execution, but unfortunately, these same buffers are the main responsible for the router total power dissipation. Another aspect is that by sizing buffers for the worst case latency incurs in extra dissipation for the mean case, which is much more frequent. To cope with this problem, in this paper we propose a dynamically reconfigurable router for a NoC. With the reconfigurable router it was possible to reduce the congestion in the network, while at the same time reducing power dissipation and improving energy.","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127620724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Dual-Sum Single-Carry Self-Timed Adder Designs 双和单进位自计时加法器设计
Pub Date : 2009-05-13 DOI: 10.1109/ISVLSI.2009.13
P. Balasubramanian, D. Edwards
This paper presents designs of self-timed dual-sum single-carry or dual-bit adder function blocks, constructed using commercially available synchronous library resources (standard cells) and validated using synchronous tools. Specifically, the proposed adder modules qualify as either quasi-delay-insensitive or speed-independent and satisfy Seitz’s weak-indication timing constraints. The delay-insensitive version of the ripple carry adder topology has been used to analyze the designs. The indication (completion) is either made implicit in the topology (local indication) or considerably isolated from the actual data path (a new variant of global indication). The proposed adders are found to exhibit improved power and performance parameters, whilst being competitive in terms of area, in comparison with those pertaining to other self-timed logic realizations
本文介绍了自定时双和单进位或双位加法器功能块的设计,使用商业上可获得的同步库资源(标准单元)构建并使用同步工具进行验证。具体来说,所提出的加法器模块是准延迟不敏感或速度无关的,并且满足Seitz的弱指示时序约束。采用延迟不敏感版本的纹波进位加法器拓扑来分析设计。指示(补全)要么隐式地出现在拓扑中(局部指示),要么与实际数据路径完全隔离(全局指示的新变体)。与其他自定时逻辑实现相比,所提出的加法器显示出改进的功率和性能参数,同时在面积方面具有竞争力
{"title":"Dual-Sum Single-Carry Self-Timed Adder Designs","authors":"P. Balasubramanian, D. Edwards","doi":"10.1109/ISVLSI.2009.13","DOIUrl":"https://doi.org/10.1109/ISVLSI.2009.13","url":null,"abstract":"This paper presents designs of self-timed dual-sum single-carry or dual-bit adder function blocks, constructed using commercially available synchronous library resources (standard cells) and validated using synchronous tools. Specifically, the proposed adder modules qualify as either quasi-delay-insensitive or speed-independent and satisfy Seitz’s weak-indication timing constraints. The delay-insensitive version of the ripple carry adder topology has been used to analyze the designs. The indication (completion) is either made implicit in the topology (local indication) or considerably isolated from the actual data path (a new variant of global indication). The proposed adders are found to exhibit improved power and performance parameters, whilst being competitive in terms of area, in comparison with those pertaining to other self-timed logic realizations","PeriodicalId":137508,"journal":{"name":"2009 IEEE Computer Society Annual Symposium on VLSI","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131885102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2009 IEEE Computer Society Annual Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1