2009 22nd International Conference on VLSI Design最新文献_第5页

Common Power Format: A User-driven Ecosystem For Proven Low Power Design Flows 通用电源格式:一个用户驱动的生态系统，证明低功耗设计流程

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.102

S. Dasgupta

Low power design has emerged as one of the urgent needs in IC design. The International Technology Roadmap for Semiconductors (ITRS) has identified the challenges surrounding low power design as one of the fundamental bottlenecks in exploiting the full capabilities of some of the advanced technology nodes. In fact, data from major chip design houses have underscored this need. Much attention has been focused world-wide on the three existing formats for expressing low power constraints and intent: Common Power Format (CPF) from Silicon Integration Initiative (Si2), UPF 1.0 from Accellera, and UPF 2.0/P1801 from IEEE. However, the real challenge lies in the development of design flows and tools that exploit the content expressed by designers in these formats to solve reallife, power-related issues in design. Therefore, it should come as no surprise that at Si2 the focus has been on both developing and standardizing CPF in a coalition of both users and EDA suppliers, and in creating an ecosystem that provides training and adoption aids for CPF to support its adoption by chip designers and tool developers alike and proliferation of CPF into design flows in IC companies around the world. This presentation begins with a brief introduction on Si2 and the Low Power Coalition (LPC) and the processes used in LPC to drive the development of CPF. There will be a discussion on the CPF roadmap with an introduction of the current standard CPF version 1.1 identifying the key enhancements over the previous version 1.0, and the roadmap leading to version 1.2 where interoperability with P1801 is one of the focus items. Next, we will describe some of the enablers provided by Si2 to support adoption, such as, training materials, a parser, a reference guide and a relational analyzer which can be used both to train in CPF as well as to analyze the contents of multiple CPF files used across the design. The talk will include examples of adoption by EDA companies and will conclude with results achieved to-date among IC design companies with references to some real-life success stories in low power design.

低功耗设计已成为集成电路设计的迫切需求之一。国际半导体技术路线图(ITRS)已经确定了围绕低功耗设计的挑战，这是开发一些先进技术节点全部功能的基本瓶颈之一。事实上，来自主要芯片设计公司的数据已经强调了这一需求。全世界的注意力都集中在表达低功耗约束和意图的三种现有格式上:来自Silicon Integration Initiative (Si2)的通用功率格式(CPF)，来自Accellera的UPF 1.0，以及来自IEEE的UPF 2.0/P1801。然而，真正的挑战在于开发设计流程和工具，利用设计师以这些格式表达的内容来解决设计中与现实生活、权力相关的问题。因此，毫不奇怪，Si2的重点是在用户和EDA供应商的联盟中开发和标准化CPF，并创建一个生态系统，为CPF提供培训和采用辅助，以支持芯片设计人员和工具开发人员对CPF的采用，并将CPF扩展到全球IC公司的设计流程中。本演讲首先简要介绍了Si2和低功耗联盟(LPC)，以及LPC中用于推动CPF发展的过程。将讨论CPF路线图，介绍当前标准的CPF 1.1版本，确定相对于前一个版本1.0的主要增强，以及通向1.2版本的路线图，其中与P1801的互操作性是重点项目之一。接下来，我们将描述Si2为支持采用而提供的一些支持，例如培训材料、解析器、参考指南和关系分析器，它们既可用于培训CPF，也可用于分析整个设计中使用的多个CPF文件的内容。讲座将包括EDA公司采用的例子，并将以IC设计公司迄今取得的成果作为总结，并参考一些现实生活中低功耗设计的成功案例。

{"title":"Common Power Format: A User-driven Ecosystem For Proven Low Power Design Flows","authors":"S. Dasgupta","doi":"10.1109/VLSI.Design.2009.102","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.102","url":null,"abstract":"Low power design has emerged as one of the urgent needs in IC design. The International Technology Roadmap for Semiconductors (ITRS) has identified the challenges surrounding low power design as one of the fundamental bottlenecks in exploiting the full capabilities of some of the advanced technology nodes. In fact, data from major chip design houses have underscored this need. Much attention has been focused world-wide on the three existing formats for expressing low power constraints and intent: Common Power Format (CPF) from Silicon Integration Initiative (Si2), UPF 1.0 from Accellera, and UPF 2.0/P1801 from IEEE. However, the real challenge lies in the development of design flows and tools that exploit the content expressed by designers in these formats to solve reallife, power-related issues in design. Therefore, it should come as no surprise that at Si2 the focus has been on both developing and standardizing CPF in a coalition of both users and EDA suppliers, and in creating an ecosystem that provides training and adoption aids for CPF to support its adoption by chip designers and tool developers alike and proliferation of CPF into design flows in IC companies around the world. This presentation begins with a brief introduction on Si2 and the Low Power Coalition (LPC) and the processes used in LPC to drive the development of CPF. There will be a discussion on the CPF roadmap with an introduction of the current standard CPF version 1.1 identifying the key enhancements over the previous version 1.0, and the roadmap leading to version 1.2 where interoperability with P1801 is one of the focus items. Next, we will describe some of the enablers provided by Si2 to support adoption, such as, training materials, a parser, a reference guide and a relational analyzer which can be used both to train in CPF as well as to analyze the contents of multiple CPF files used across the design. The talk will include examples of adoption by EDA companies and will conclude with results achieved to-date among IC design companies with references to some real-life success stories in low power design.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115431549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Efficient Synthesis of a Uniformly Spread Layout Aware Pareto Surface for Analog Circuits 模拟电路中均匀分布布局感知Pareto曲面的有效合成

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.67

Almitra Pradhan, R. Vemuri

Accurate and fast optimization of analog circuits is an important requirement of current synthesis methods. Obtaining the entire pareto optimal surface for conflicting performance objectives is essential for design space exploration as well as circuit sizing. Layout parasitics prevent the circuit from realizing the estimated optimal performance values but are not considered in most existing pareto-front generation methods. We develop a layout-aware circuit matrix modeling method along with an efficient multi-objective optimizer to synthesize the parasitic inclusive pareto-optimal performance surface. The algorithm achieves a pareto surface with points spread uniformly in all regions. The sensitivity of critical performance to candidate design points is used to select the best sizing solution during synthesis. Experiments on benchmark circuits show the effectiveness of the proposed method in obtaining a speedup of an order of 10$^{3}$ with negligible loss of accuracy as compared to SPICE.

准确、快速地优化模拟电路是当前综合方法的重要要求。获得冲突性能目标的整个帕累托最优曲面对于设计空间探索和电路尺寸确定至关重要。布局寄生使电路无法实现估计的最优性能值，但在大多数现有的pareto-front生成方法中没有考虑到这一点。提出了一种可感知布图的电路矩阵建模方法，并结合高效的多目标优化器来合成寄生包容性帕累托最优性能面。该算法得到了一个点均匀分布在所有区域的帕累托曲面。在综合过程中，利用临界性能对候选设计点的敏感性来选择最佳施胶方案。在基准电路上的实验表明，与SPICE相比，该方法可以有效地获得10$^{3}$数量级的加速，而精度损失可以忽略不计。

引用次数: 31

Conservative QCA Gate (CQCA) for Designing Concurrently Testable Molecular QCA Circuits 保守QCA门(CQCA)用于设计可并行测试的分子QCA电路

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.75

H. Thapliyal, N. Ranganathan

Nanocircuits based on molecular QCA are prone to high error rates. In this paper, we present a novel conservative logic gate termed 'CQCA' (conservative QCA) to design concurrently testable circuits for molecular QCA. In conservative logic gates, there would be an equal number of 1s in the output as there would be on the input. Thus, conservative logic gates are parity preserving, that is, the parity of the input vectors is equal to the output vectors. CQCA is proposed in this work as molecular QCA is based on majority voting. We analyzed the fault patterns in existing popular conservative Fredkin gate and proposed CQCA gate due to single missing/additional cell defect in molecular QCA. We found that if there is a fault in molecular QCA implementation of Fredkin and CQCA gates, there is a parity mismatch between the input and the output; otherwise the input parity is same as output parity. Thus, any permanent and transient fault in molecular QCA can be concurrently detected if implemented with conservative Fredkin and CQCA gates. We applied novel method of using majority and minority voting to detect the fault in conservative gates. We propose to use CQCA gate compared to existing popular Fredkin gate as CQCA excels Fredkin gate in parameters of complexity(number of majority voter), speed and area. The results are well supported by synthesizing standard benchmark combinational functions. The QCA design of 2 pair 2 rail checker is also presented for the first time ever in literature. The design of QCA layouts and the verification of the designs are performed using the QCADesigner and HDLQ tools.

基于分子QCA的纳米电路容易出现高错误率。本文提出了一种新的保守逻辑门CQCA (conservative QCA)，用于分子QCA的并发测试电路设计。在保守逻辑门中，输出中有相同数量的1和输入中有相同数量的1。因此，保守逻辑门是奇偶保持的，即输入向量的奇偶性等于输出向量。由于分子QCA是基于多数投票的，所以本文提出了CQCA。我们分析了现有流行的保守Fredkin门的故障模式，并提出了由于分子QCA中单个缺失/附加细胞缺陷而导致的CQCA门。我们发现，如果在Fredkin和CQCA门的分子QCA实现中存在故障，则输入和输出之间存在奇偶失配;否则输入奇偶校验与输出奇偶校验相同。因此，如果采用保守的Fredkin和CQCA门，可以同时检测分子QCA中的任何永久和瞬态故障。采用多数和少数投票的新方法检测保守门的故障。与现有流行的Fredkin门相比，我们建议使用CQCA门，因为CQCA在复杂性(多数投票人数量)、速度和面积等参数上优于Fredkin门。通过综合标准基准组合函数得到了较好的支持。在文献中首次提出了2对2轨校核器的QCA设计。利用qcaddesigner和HDLQ工具进行了QCA布局设计和设计验证。

{"title":"Conservative QCA Gate (CQCA) for Designing Concurrently Testable Molecular QCA Circuits","authors":"H. Thapliyal, N. Ranganathan","doi":"10.1109/VLSI.Design.2009.75","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.75","url":null,"abstract":"Nanocircuits based on molecular QCA are prone to high error rates. In this paper, we present a novel conservative logic gate termed 'CQCA' (conservative QCA) to design concurrently testable circuits for molecular QCA. In conservative logic gates, there would be an equal number of 1s in the output as there would be on the input. Thus, conservative logic gates are parity preserving, that is, the parity of the input vectors is equal to the output vectors. CQCA is proposed in this work as molecular QCA is based on majority voting. We analyzed the fault patterns in existing popular conservative Fredkin gate and proposed CQCA gate due to single missing/additional cell defect in molecular QCA. We found that if there is a fault in molecular QCA implementation of Fredkin and CQCA gates, there is a parity mismatch between the input and the output; otherwise the input parity is same as output parity. Thus, any permanent and transient fault in molecular QCA can be concurrently detected if implemented with conservative Fredkin and CQCA gates. We applied novel method of using majority and minority voting to detect the fault in conservative gates. We propose to use CQCA gate compared to existing popular Fredkin gate as CQCA excels Fredkin gate in parameters of complexity(number of majority voter), speed and area. The results are well supported by synthesizing standard benchmark combinational functions. The QCA design of 2 pair 2 rail checker is also presented for the first time ever in literature. The design of QCA layouts and the verification of the designs are performed using the QCADesigner and HDLQ tools.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"137 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128694344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 55

Design, Implementation and Validation of an Open Source IP-PBX/VoIP Gateway SoC 一个开源IP-PBX/VoIP网关SoC的设计、实现和验证

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.47

S. Apostolacos, G. Lykakis, A. Meliones, Vassilis Vlagoulis, Emmanuel Touloupis, G. Konstantoulakis

The telephony world is consistently moving to the transmission of voice through packet networks, so as to unify data and voice and to enable the provisioning of new services in a less costly manner. Service providers are offloading the task of converting analog voice to VoIP to the end-points. This allows the ISPs and ITSPs to reduce their costs and increase the uniformity of their interfaces with their clients. In this paper we present an IP-PBX/VoIP Gateway system based on a single SoC that performs all the required processing. This SoC includes a CPU for hosting a full-fledged operating system and user applications, as well as a DSP subsystem for voice processing. The system targets the low density market of home gateways and SME IP-PBXs, where cost is the main factor. We prove it is feasible to implement a 2-4 channel IP-PBX/VoIP gateway on a SoC based purely on both software and hardware provided by the open-source community, reducing both upfront and final product costs thus allowing new players into the market.

电话世界一直在向通过分组网络传输语音的方向发展，以便统一数据和语音，并以较低的成本提供新的服务。服务提供商正在将模拟语音转换为VoIP的任务转移到终端。这允许isp和itsp降低他们的成本，并增加他们与客户端接口的一致性。在本文中，我们提出了一个基于单个SoC的IP-PBX/VoIP网关系统，该系统执行所有所需的处理。该SoC包括一个用于托管成熟操作系统和用户应用程序的CPU，以及用于语音处理的DSP子系统。该系统针对以成本为主要因素的低密度家庭网关和中小企业ip - pbx市场。我们证明在SoC上实现2-4通道IP-PBX/VoIP网关是可行的，该网关完全基于开源社区提供的软件和硬件，从而降低了前期和最终产品成本，从而允许新参与者进入市场。

引用次数: 0

Switched-Capacitor Based Buck Converter Design Using Current Limiter for Better Efficiency and Output Ripple 利用限流器设计基于开关电容的降压变换器以提高效率和输出纹波

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.43

Tamal Das, P. Mandal

In this paper we are addressing power efficiency and output ripple of an embedded switched-capacitor based DC/DC Buck Converters. Here we propose to use current pump based switched-capacitor circuit in buck converter. The current pump circuit limits transition current of the switched-capacitors and hence, improves power efficiency and reduces output ripple. We have also proposed an equivalent macro model of this type of current pump based switched-capacitor converter which would help to get a better essence of the closed loop stability of the system and would reveal clearly trade-offs among load current, flying capacitance and clock frequency. A transistor level implementation of the proposed buck converter in 0.18µ technology is provided. For a load current of 8mA (maximum)the achieved power efficiency is 72.7% and the output ripple is 27mV. The flying capacitors in the converter are 2x108pF and the load capacitor is 125pF.

本文研究了一种基于开关电容的嵌入式DC/DC降压变换器的功率效率和输出纹波。本文提出在降压变换器中采用基于电流泵的开关电容电路。电流泵浦电路限制了开关电容器的转换电流，从而提高了功率效率并减少了输出纹波。我们还提出了这类基于电流泵的开关电容变换器的等效宏观模型，这将有助于更好地了解系统闭环稳定性的本质，并将清楚地揭示负载电流，飞行电容和时钟频率之间的权衡。给出了采用0.18µ技术的降压变换器的晶体管级实现。负载电流为8mA(最大值)时，实现的功率效率为72.7%，输出纹波为27mV。变换器中的飞行电容为2x108pF，负载电容为125pF。

引用次数: 11

Cone Resynthesis ECO Methodology for Multi-Million Gate Designs 数百万闸门设计的锥形再合成ECO方法

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.28

S. Raman, Mike Lubyanitsky

In this paper, we talk about techniques to incrementally resynthesize logic cones within a large design impacted by multiple RTL changes in order to accommodate a late functional ECO. In design methodologies where the RTL is hierarchical and the post route netlist is flat, mapping a change in the behavioral description to the post layout netlist is very complicated and may not even be feasible if the RTL is not written in a synthesis friendly manner. We try to attack this problem by introducing a technique that causes minimum perturbation to the gate level netlist, thereby retaining to a large degree, the goodness metrics of timing convergence, routability and layout cleanliness that were achieved during the various design milestones. This paper talks about the cone resynthesis ECO methodology in detail and highlights its usefulness during tight product deliverable schedules.

在本文中，我们讨论了在受多个RTL更改影响的大型设计中增量地重新合成逻辑锥的技术，以适应后期功能ECO。在RTL是分层的，而post route netlist是平面的设计方法中，将行为描述中的变化映射到post layout netlist是非常复杂的，如果RTL不是以综合友好的方式编写的，甚至可能是不可行的。我们试图通过引入一种技术来解决这个问题，该技术可以对门级网表产生最小的扰动，从而在很大程度上保留在各个设计里程碑期间实现的时间收敛、可达性和布局清洁度的优良指标。本文详细讨论了锥形再合成ECO方法，并强调了它在产品交付时间表紧张时的有用性。

引用次数: 3

Design for Manufacturability and Reliability in Nano Era 纳米时代可制造性与可靠性设计

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.115

Goutam Debnath, P. J. Thadikaran

The bottom line of any company is to maximize the profit from any given product. There are many factors influencing the product design resulting in a profitable business. One of the biggest factors is the manufacturability of the product. It is becoming more and more crucial to meet the 6+6 (6 months for the development and 6 months for qualifying the product to ship to customer) product life cycle to accommodate the rapid changing technology hungry market demand. Smooth, reliable, and efficient product ramp through manufacturing is the key of success for meeting TTM, capturing higher percentage of total available market (TAM). This tutorial is going to address the difficulties industries are facing today in designing manufacturing friendly highly complex giga-scale products in submicron technology. As we are heavily into deep submicron era, the error margin or the tolerance guard band is getting tighter and tighter with respect to the previous generation of fabrication process. On this note, it is important to pay attention to Design For Manufacturing (DFM) related issues early in the design cycle as oppose to later in the design. These include, however not limited to, all kinds debugging hooks in the design for easy debugging of billion of transistors in a given design, paying attention to manufacturing friendly physical design rules, making sure of adequate test coverage to toggle most of the design nodes, making sure optimal guard band is implemented for transistor degradation for the lifetime of the product, and last but not least, all reliability (ESD, EM/SH, LU, etc) related issues are resolved in pre-silicon design before Tape out. In the past, manufacturing issues were not given much attention; time has changed and designers must have to be more sensitive than ever before in addressing manufacturing related issues early in the design cycle. In a nut shell, this tutorial will capture the must have knowledge for design engineers (irrespective of front-end or back-end) who are involved in high performance VLSI design, as DFM features moving upstream in the design cycle. Audience will walk out with a good understanding on how to integrate specific manufacturing concerns into a product’s design to obtain a product that is easier to manufacture with excellent overall quality in a shortest development time.

任何公司的底线都是从任何给定的产品中获得最大的利润。有许多因素影响产品设计，从而导致盈利的业务。最大的因素之一是产品的可制造性。满足6+6(6个月用于开发，6个月用于确认产品交付给客户)产品生命周期以适应快速变化的技术需求变得越来越重要。顺利、可靠和高效的产品生产是成功满足TTM的关键，可以获得更高比例的总可用市场(TAM)。本教程将解决当今工业在亚微米技术中设计制造友好的高度复杂的千兆级产品时所面临的困难。随着我们进入深亚微米时代，相对于上一代制造工艺，误差范围或公差保护带越来越紧。在这一点上，在设计周期的早期关注面向制造的设计(DFM)相关问题是很重要的，而不是在设计的后期。这些包括，但不限于，设计中的各种调试挂钩，以便在给定的设计中轻松调试数十亿个晶体管，注意制造友好的物理设计规则，确保有足够的测试覆盖以切换大多数设计节点，确保在产品的生命周期内实现晶体管退化的最佳保护带，以及最后但并非最不重要的是，所有可靠性(ESD, EM/SH, LU, LU)。在带出前的预硅设计中解决了相关问题。过去，制造问题没有得到太多关注;时代变了，设计师必须比以往任何时候都更加敏感，在设计周期的早期解决与制造相关的问题。简而言之，本教程将为参与高性能VLSI设计的设计工程师(无论前端或后端)捕获必须具备的知识，因为DFM功能在设计周期中向上游移动。观众将对如何将特定的制造问题集成到产品设计中，以在最短的开发时间内获得更容易制造且整体质量优异的产品有很好的理解。

{"title":"Design for Manufacturability and Reliability in Nano Era","authors":"Goutam Debnath, P. J. Thadikaran","doi":"10.1109/VLSI.Design.2009.115","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.115","url":null,"abstract":"The bottom line of any company is to maximize the profit from any given product. There are many factors influencing the product design resulting in a profitable business. One of the biggest factors is the manufacturability of the product. It is becoming more and more crucial to meet the 6+6 (6 months for the development and 6 months for qualifying the product to ship to customer) product life cycle to accommodate the rapid changing technology hungry market demand. Smooth, reliable, and efficient product ramp through manufacturing is the key of success for meeting TTM, capturing higher percentage of total available market (TAM). This tutorial is going to address the difficulties industries are facing today in designing manufacturing friendly highly complex giga-scale products in submicron technology. As we are heavily into deep submicron era, the error margin or the tolerance guard band is getting tighter and tighter with respect to the previous generation of fabrication process. On this note, it is important to pay attention to Design For Manufacturing (DFM) related issues early in the design cycle as oppose to later in the design. These include, however not limited to, all kinds debugging hooks in the design for easy debugging of billion of transistors in a given design, paying attention to manufacturing friendly physical design rules, making sure of adequate test coverage to toggle most of the design nodes, making sure optimal guard band is implemented for transistor degradation for the lifetime of the product, and last but not least, all reliability (ESD, EM/SH, LU, etc) related issues are resolved in pre-silicon design before Tape out. In the past, manufacturing issues were not given much attention; time has changed and designers must have to be more sensitive than ever before in addressing manufacturing related issues early in the design cycle. In a nut shell, this tutorial will capture the must have knowledge for design engineers (irrespective of front-end or back-end) who are involved in high performance VLSI design, as DFM features moving upstream in the design cycle. Audience will walk out with a good understanding on how to integrate specific manufacturing concerns into a product’s design to obtain a product that is easier to manufacture with excellent overall quality in a shortest development time.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125358604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Power Management for Mobile Multimedia: From Audio to Video & Games 移动多媒体的电源管理:从音频到视频和游戏

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.118

S. Chakraborty, Ye Wang

Multimedia applications today constitute a sizeable workload that needs to be supported by a host of mobile devices ranging from cell phones, to PDAs and portable game consoles. Battery life is a major design concern for all of these devices. Whereas both – the complexity of multimedia applications and the hardware architecture of these devices – have progressed at a phenomenal rate over the last one decade, progress in the area of battery technology has been relatively stagnant. As a result, currently a lot of effort is being spent to develop high-level power management and application tuning techniques to minimize energy consumption and thereby prolong battery life. Such techniques include dynamically scaling the underlying processor’s voltage and clock frequency in response to a time-varying workload, powering down certain system components when not being frequently used, and backlight scaling in LCDs with controlled image-quality degradation. Some of the application tuning techniques include selectively ignoring certain perceptually-irrelevant computations during audio decoding, and injecting metadata with workload information into video clips which can then be used to accurately estimate the decoding workload at runtime for better power management. In this tutorial, we plan to give a comprehensive overview of this area and discuss power management schemes for a broad spectrum of multimedia applications. In particular, we will talk about several power management and application tuning techniques specifically directed towards audio decoding, video processing and interactive 3-D game applications. Starting from the basics of power management for portable devices, we will discuss the necessary mathematical techniques, give high-level overviews of relevant algorithms and also present the hardware setup that is necessary to perform research and development in this area. The main objective of this tutorial will be to cover various techniques for power management for audio, video and graphics-intensive game applications running on battery-operated portable devices. In particular, we would illustrate how power management techniques differ for audio, video and game applications and would present a number of techniques for each of these classes of applications. We would also give an overview of open research problems and the challenges facing this area. Finally, we would describe some of the hardware platforms that we have been using to conduct research in this domain and give demonstrations of selected power management techniques.

今天的多媒体应用程序构成了相当大的工作量，需要大量移动设备(从移动电话到pda和便携式游戏机)来支持。电池寿命是所有这些设备设计的主要关注点。尽管多媒体应用程序的复杂性和这些设备的硬件架构在过去十年中都取得了惊人的进展，但电池技术领域的进展却相对停滞不前。因此，目前正在花费大量精力开发高级电源管理和应用程序调优技术，以最大限度地减少能耗，从而延长电池寿命。这些技术包括根据时变工作负载动态缩放底层处理器的电压和时钟频率，在不经常使用时关闭某些系统组件的电源，以及在控制图像质量退化的情况下缩放lcd的背光。一些应用程序调优技术包括在音频解码期间选择性地忽略某些感知无关的计算，以及在视频剪辑中注入带有工作负载信息的元数据，这些元数据可用于在运行时准确估计解码工作负载，从而实现更好的电源管理。在本教程中，我们计划对该领域进行全面概述，并讨论广泛的多媒体应用程序的电源管理方案。特别是，我们将讨论针对音频解码、视频处理和交互式3d游戏应用的几种电源管理和应用程序调优技术。从便携式设备电源管理的基础开始，我们将讨论必要的数学技术，给出相关算法的高级概述，并介绍在该领域进行研究和开发所需的硬件设置。本教程的主要目的是介绍在电池供电的便携式设备上运行的音频、视频和图形密集型游戏应用程序的各种电源管理技术。特别是，我们将说明音频、视频和游戏应用程序的电源管理技术的不同之处，并将为这些应用程序中的每一类提供一些技术。我们还将概述开放的研究问题和该领域面临的挑战。最后，我们将描述一些硬件平台，我们一直在使用这些硬件平台进行该领域的研究，并演示所选的电源管理技术。

{"title":"Power Management for Mobile Multimedia: From Audio to Video & Games","authors":"S. Chakraborty, Ye Wang","doi":"10.1109/VLSI.Design.2009.118","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.118","url":null,"abstract":"Multimedia applications today constitute a sizeable workload that needs to be supported by a host of mobile devices ranging from cell phones, to PDAs and portable game consoles. Battery life is a major design concern for all of these devices. Whereas both – the complexity of multimedia applications and the hardware architecture of these devices – have progressed at a phenomenal rate over the last one decade, progress in the area of battery technology has been relatively stagnant. As a result, currently a lot of effort is being spent to develop high-level power management and application tuning techniques to minimize energy consumption and thereby prolong battery life. Such techniques include dynamically scaling the underlying processor’s voltage and clock frequency in response to a time-varying workload, powering down certain system components when not being frequently used, and backlight scaling in LCDs with controlled image-quality degradation. Some of the application tuning techniques include selectively ignoring certain perceptually-irrelevant computations during audio decoding, and injecting metadata with workload information into video clips which can then be used to accurately estimate the decoding workload at runtime for better power management. In this tutorial, we plan to give a comprehensive overview of this area and discuss power management schemes for a broad spectrum of multimedia applications. In particular, we will talk about several power management and application tuning techniques specifically directed towards audio decoding, video processing and interactive 3-D game applications. Starting from the basics of power management for portable devices, we will discuss the necessary mathematical techniques, give high-level overviews of relevant algorithms and also present the hardware setup that is necessary to perform research and development in this area. The main objective of this tutorial will be to cover various techniques for power management for audio, video and graphics-intensive game applications running on battery-operated portable devices. In particular, we would illustrate how power management techniques differ for audio, video and game applications and would present a number of techniques for each of these classes of applications. We would also give an overview of open research problems and the challenges facing this area. Finally, we would describe some of the hardware platforms that we have been using to conduct research in this domain and give demonstrations of selected power management techniques.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127626781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

The Future of Low Power Design is Here: IEEE P1801, aka, UPF 2.0 低功耗设计的未来就在这里:IEEE P1801，也就是UPF 2.0

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.103

Stephen Bailey

Industry adoption of Accellera’s Unified Power Format (UPF) has been broad and swift. And why shouldn’t it be? For the first time, UPF made it possible to specify the power design intent in combination with the HDL specification of the design for use throughout the design, verification and implementation flows. UPF’s portability and feature set opened the door for more efficient design of low power systems. Now, UPF 2.0 is just around the corner. The IEEE P1801 working group, by the time of this conference, will have completed the sequel and it will be well on its way to IEEE standardization. Rumors are that there are significant changes to UPF 2.0. Why has been UPF been enhanced? What value will the new capabilities deliver? Do the changes obsolete UPF 1.0? This presentation will provide an overview of the major changes in UPF 2.0, its relationship with UPF 1.0 and the value that everyone doing low power designs will want to know.

业界广泛而迅速地采用了Accellera的统一电源格式(UPF)。为什么不应该呢?UPF第一次将电源设计意图与设计的HDL规范结合在一起，使其在整个设计、验证和实现流程中使用成为可能。UPF的便携性和特性为更高效的低功耗系统设计打开了大门。现在，UPF 2.0即将问世。到本次会议召开时，IEEE P1801工作组将完成后续工作，并将顺利地走向IEEE标准化。有传言说UPF 2.0有重大变化。为什么UPF得到了增强?新功能将带来什么价值?改变过时的UPF 1.0吗?本演讲将概述UPF 2.0的主要变化，它与UPF 1.0的关系以及每个做低功耗设计的人都想知道的价值。

引用次数: 1

RADJAM: A Novel Approach for Reduction of Soft Errors in Logic Circuits RADJAM:一种减少逻辑电路软误差的新方法

2009 22nd International Conference on VLSI Design

Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.76

K. Bhattacharya, N. Ranganathan

The task of achieving reliability against transient faults poses a significant challenge due to technology scaling trends. Several optimization techniques have been proposed in the literature for preventing soft errors in logic circuits. However, most approaches for avoiding soft errors in logic circuits have significant overheads in terms of delay, area or power. In this work, we propose a circuit level technique called RADJAM (RADiation JAMmer) to prevent soft errors, occurring due to radiation strikes, in logic cells. The RADJAM circuit when inserted at the output of a logic can reduce the generation of transient glitches significantly. Further, we propose an algorithm to insert RADJAM cells on selective nodes in a logic circuit. The algorithm uses signal logic probabilities and circuit slack for insertion of RADJAM cells on circuit nodes, thus improving the reliability of the logic circuit with minimal impact on the overall circuit delay. The proposed algorithm has been implemented and validated on the ISCAS85 benchmarks. Experimental results indicate that RADJAM optimized logic circuits can reduce soft error rates by around 39% with marginal delay, area and power overheads.

由于技术的规模化趋势，实现瞬时故障可靠性的任务提出了重大挑战。为了防止逻辑电路中的软错误，文献中提出了几种优化技术。然而，大多数避免逻辑电路软错误的方法在延迟、面积或功率方面都有显著的开销。在这项工作中，我们提出了一种称为RADJAM(辐射干扰器)的电路级技术，以防止逻辑单元中由于辐射打击而发生的软错误。在逻辑输出端插入RADJAM电路可以显著减少瞬态故障的产生。此外，我们提出了一种将RADJAM单元插入逻辑电路中选择节点的算法。该算法利用信号逻辑概率和电路松弛在电路节点上插入RADJAM单元，从而在对整体电路延迟影响最小的情况下提高了逻辑电路的可靠性。该算法已在ISCAS85基准测试中得到了实现和验证。实验结果表明，RADJAM优化后的逻辑电路在最小延迟、最小面积和最小功耗的情况下，可将软错误率降低约39%。

{"title":"RADJAM: A Novel Approach for Reduction of Soft Errors in Logic Circuits","authors":"K. Bhattacharya, N. Ranganathan","doi":"10.1109/VLSI.Design.2009.76","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.76","url":null,"abstract":"The task of achieving reliability against transient faults poses a significant challenge due to technology scaling trends. Several optimization techniques have been proposed in the literature for preventing soft errors in logic circuits. However, most approaches for avoiding soft errors in logic circuits have significant overheads in terms of delay, area or power. In this work, we propose a circuit level technique called RADJAM (RADiation JAMmer) to prevent soft errors, occurring due to radiation strikes, in logic cells. The RADJAM circuit when inserted at the output of a logic can reduce the generation of transient glitches significantly. Further, we propose an algorithm to insert RADJAM cells on selective nodes in a logic circuit. The algorithm uses signal logic probabilities and circuit slack for insertion of RADJAM cells on circuit nodes, thus improving the reliability of the logic circuit with minimal impact on the overall circuit delay. The proposed algorithm has been implemented and validated on the ISCAS85 benchmarks. Experimental results indicate that RADJAM optimized logic circuits can reduce soft error rates by around 39% with marginal delay, area and power overheads.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124152534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13