首页 > 最新文献

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)最新文献

英文 中文
Reliability, thermal, and power modeling and optimization 可靠性,热,和功率建模和优化
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654140
R. Dick
This tutorial provides an overview of challenges to designing and implementing reliable integrated circuits and systems, and suggests areas for future study. It illustrates some concepts in detail, explaining the challenges of appropriately considering the impact of temperature on reliability in fault-tolerant systems. Finally, it points out considerations that may influence adoption of reliability modeling and optimization techniques and stresses the importance of considering the most relevant fault processes during reliability modeling and optimization.
本教程概述了设计和实现可靠集成电路和系统所面临的挑战,并提出了未来研究的领域。它详细说明了一些概念,解释了适当考虑温度对容错系统可靠性影响的挑战。最后,指出了可能影响可靠性建模和优化技术采用的考虑因素,并强调了在可靠性建模和优化过程中考虑最相关的故障过程的重要性。
{"title":"Reliability, thermal, and power modeling and optimization","authors":"R. Dick","doi":"10.1109/ICCAD.2010.5654140","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654140","url":null,"abstract":"This tutorial provides an overview of challenges to designing and implementing reliable integrated circuits and systems, and suggests areas for future study. It illustrates some concepts in detail, explaining the challenges of appropriately considering the impact of temperature on reliability in fault-tolerant systems. Finally, it points out considerations that may influence adoption of reliability modeling and optimization techniques and stresses the importance of considering the most relevant fault processes during reliability modeling and optimization.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133658603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimal algorithm for profile-based power gating: A compiler technique for reducing leakage on execution units in microprocessors 基于配置文件的功率门控优化算法:一种减少微处理器执行单元泄漏的编译器技术
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653652
Danbee Park, Jungseob Lee, N. Kim, Taewhan Kim
This paper proposes a compiler-based solution to the problem of inserting power gating instructions into code to control activation/deactivation (i.e., ON/OFF) of functional units in microprocessor during the code execution, so that the leakage power is maximally saved. Precisely, based on an execution profile of code containing conditional braches and/or loops, we propose a polynomial time optimal algorithm, called PG-instr, of inserting ON/OFF instructions into code with the objective of minimzing the expected total leakage power while considerig the power and delay overhead on power gating.
本文提出了一种基于编译器的方法,在代码中插入功率门控指令来控制代码执行过程中微处理器功能单元的激活/停用(即ON/OFF),从而最大限度地节省泄漏功率。准确地说,基于包含条件分支和/或循环的代码的执行概要,我们提出了一个多项式时间最优算法,称为PG-instr,在考虑功率门控的功率和延迟开销的同时,将on /OFF指令插入到代码中,目标是最小化期望的总泄漏功率。
{"title":"Optimal algorithm for profile-based power gating: A compiler technique for reducing leakage on execution units in microprocessors","authors":"Danbee Park, Jungseob Lee, N. Kim, Taewhan Kim","doi":"10.1109/ICCAD.2010.5653652","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653652","url":null,"abstract":"This paper proposes a compiler-based solution to the problem of inserting power gating instructions into code to control activation/deactivation (i.e., ON/OFF) of functional units in microprocessor during the code execution, so that the leakage power is maximally saved. Precisely, based on an execution profile of code containing conditional braches and/or loops, we propose a polynomial time optimal algorithm, called PG-instr, of inserting ON/OFF instructions into code with the objective of minimzing the expected total leakage power while considerig the power and delay overhead on power gating.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"448 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115614430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A simple implementation of determinant decision diagram 行列式决策图的一个简单实现
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654333
G. Shi
Determinant decision diagram (DDD) uses a Binary Decision Diagram (BDD) to represent the Laplace expansion of a determinant. It is used as the core computation engine in some modern symbolic circuit simulators. The traditional implementations rely on a BDD package for the common-data sharing operations in which symbol ordering plays an essential role. This paper proposes a simple implementation method which does not use any BDD package. Sharing is implemented by directly hashing minors, while the requirement on symbol ordering is weakened to an expansion ordering. The basic mechanism used is a natural formulation of layered expansion which is analogous to manual expansion of a determinant, hence it is easily understood. The simplified DDD construction method not only makes the DDD implementation straightforward, but also results in greater efficiency. A simulator developed based on this new method solves the μa725 op-amp circuit in a few seconds by flat expansion.
行列式决策图(DDD)使用二进制决策图(BDD)来表示行列式的拉普拉斯展开。在一些现代符号电路模拟器中,它被用作核心计算引擎。传统的实现依赖于BDD包进行公共数据共享操作,其中符号排序起着至关重要的作用。本文提出了一种不使用任何BDD包的简单实现方法。共享是通过直接对子节点进行哈希来实现的,而对符号排序的要求被削弱为扩展排序。使用的基本机制是分层展开的自然公式,类似于行列式的手动展开,因此很容易理解。简化的DDD构造方法不仅使DDD的实现更加直观,而且提高了工作效率。在此基础上开发的仿真器通过平展的方式在几秒内解决了μa725运放电路的问题。
{"title":"A simple implementation of determinant decision diagram","authors":"G. Shi","doi":"10.1109/ICCAD.2010.5654333","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654333","url":null,"abstract":"Determinant decision diagram (DDD) uses a Binary Decision Diagram (BDD) to represent the Laplace expansion of a determinant. It is used as the core computation engine in some modern symbolic circuit simulators. The traditional implementations rely on a BDD package for the common-data sharing operations in which symbol ordering plays an essential role. This paper proposes a simple implementation method which does not use any BDD package. Sharing is implemented by directly hashing minors, while the requirement on symbol ordering is weakened to an expansion ordering. The basic mechanism used is a natural formulation of layered expansion which is analogous to manual expansion of a determinant, hence it is easily understood. The simplified DDD construction method not only makes the DDD implementation straightforward, but also results in greater efficiency. A simulator developed based on this new method solves the μa725 op-amp circuit in a few seconds by flat expansion.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115620990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Design automation towards reliable analog integrated circuits 朝着可靠的模拟集成电路设计自动化
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654159
G. Gielen, Elie Maricau, P. D. Wit
Reliability is becoming one of the major concerns in designing integrated circuits in nanometer CMOS technologies. Problems related to degradation mechanisms like NBTI or soft breakdown, as well as increased external interference such as caused by crosstalk and EMI, cause time-dependent circuit performance degradation. Variability only makes these things more severe. This creates a need for innovative design techniques and design tools that help designers coping with these reliability and variability problems. This tutorial paper gives a brief description of design tools for the efficient analysis and identification of reliability problems in analog circuits, as a first step towards the automated design of guaranteed reliable analog circuits.
可靠性已成为纳米CMOS技术设计集成电路的主要关注点之一。与NBTI或软击穿等退化机制相关的问题,以及由串扰和EMI引起的外部干扰增加,都会导致随时间变化的电路性能退化。可变性只会让这些事情变得更加严重。这就需要创新的设计技术和设计工具来帮助设计师应对这些可靠性和可变性问题。本文简要介绍了用于有效分析和识别模拟电路可靠性问题的设计工具,这是实现保证可靠模拟电路自动化设计的第一步。
{"title":"Design automation towards reliable analog integrated circuits","authors":"G. Gielen, Elie Maricau, P. D. Wit","doi":"10.1109/ICCAD.2010.5654159","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654159","url":null,"abstract":"Reliability is becoming one of the major concerns in designing integrated circuits in nanometer CMOS technologies. Problems related to degradation mechanisms like NBTI or soft breakdown, as well as increased external interference such as caused by crosstalk and EMI, cause time-dependent circuit performance degradation. Variability only makes these things more severe. This creates a need for innovative design techniques and design tools that help designers coping with these reliability and variability problems. This tutorial paper gives a brief description of design tools for the efficient analysis and identification of reliability problems in analog circuits, as a first step towards the automated design of guaranteed reliable analog circuits.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115735695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Aging analysis at gate and macro cell level 门和宏细胞水平的老化分析
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654309
Dominik Lorenz, M. Barke, Ulf Schlichtmann
Aging, which can be regarded as a time-dependent variability, has until recently not received much attention in the field of electronic design automation. This is changing because increasing reliability costs threaten the continued scaling of ICs. We investigate the impact of aging effects on single combinatorial gates and present methods that help to reduce the reliability costs by accurately analyzing the performance degradation of aged circuits at gate and macro cell level.
老化是一种随时间变化的变量,直到最近才在电子设计自动化领域受到重视。这种情况正在改变,因为不断增加的可靠性成本威胁着集成电路的持续扩展。我们研究了老化效应对单个组合门的影响,并提出了通过准确分析老化电路在门和宏细胞水平上的性能退化来帮助降低可靠性成本的方法。
{"title":"Aging analysis at gate and macro cell level","authors":"Dominik Lorenz, M. Barke, Ulf Schlichtmann","doi":"10.1109/ICCAD.2010.5654309","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654309","url":null,"abstract":"Aging, which can be regarded as a time-dependent variability, has until recently not received much attention in the field of electronic design automation. This is changing because increasing reliability costs threaten the continued scaling of ICs. We investigate the impact of aging effects on single combinatorial gates and present methods that help to reduce the reliability costs by accurately analyzing the performance degradation of aged circuits at gate and macro cell level.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115697125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
GLADE: A modern global router considering layer directives GLADE:一个考虑层指令的现代全局路由器
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654094
Yen-Jung Chang, Tsung-Hsien Lee, Ting-Chi Wang
Global routing is a very crucial stage in a design cycle, because it physically plans the routes of nets on a chip. In order to boost the research and development of global routing techniques, ISPD held contests and released benchmarks in 2007 and 2008, respectively. However, the contests may lead researchers away from facing other real problems in practice. In this paper we study a new global routing problem that not only considers traditional routing objectives such as overflow and wirelength but also focuses on honoring layer directives that are usually specified for timing-critical nets to alleviate performance degrading. Based on novel extensions of an academic router, we present a new global router called GLADE for the addressed problem. The experimental results show that GLADE can effectively generate a high-quality solution, which balances the metrics under consideration, for each test case from the set of recently released ICCAD 2009 benchmarks.
全局路由是设计周期中非常关键的阶段,因为它在芯片上物理地规划网络的路由。为了促进全球路由技术的研究和发展,ISPD分别在2007年和2008年举行了竞赛并发布了基准。然而,这些竞赛可能会使研究人员在实践中远离面对其他实际问题。本文研究了一种新的全局路由问题,该问题不仅考虑了溢出和无线长度等传统路由目标,而且注重遵守通常为时间关键型网络指定的层指令,以减轻性能下降。针对所解决的问题,我们在对一种学术路由器进行新颖扩展的基础上,提出了一种新的全局路由器GLADE。实验结果表明,对于最近发布的ICCAD 2009基准测试集中的每个测试用例,GLADE可以有效地生成一个高质量的解决方案,该解决方案可以平衡所考虑的指标。
{"title":"GLADE: A modern global router considering layer directives","authors":"Yen-Jung Chang, Tsung-Hsien Lee, Ting-Chi Wang","doi":"10.1109/ICCAD.2010.5654094","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654094","url":null,"abstract":"Global routing is a very crucial stage in a design cycle, because it physically plans the routes of nets on a chip. In order to boost the research and development of global routing techniques, ISPD held contests and released benchmarks in 2007 and 2008, respectively. However, the contests may lead researchers away from facing other real problems in practice. In this paper we study a new global routing problem that not only considers traditional routing objectives such as overflow and wirelength but also focuses on honoring layer directives that are usually specified for timing-critical nets to alleviate performance degrading. Based on novel extensions of an academic router, we present a new global router called GLADE for the addressed problem. The experimental results show that GLADE can effectively generate a high-quality solution, which balances the metrics under consideration, for each test case from the set of recently released ICCAD 2009 benchmarks.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116184305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Resilient microprocessor design for improving performance and energy efficiency 弹性微处理器设计,提高性能和能源效率
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654317
K. Bowman, J. Tschanz
In this tutorial, a 45nm resilient microprocessor core with error-detection and recovery circuits demonstrates the opportunity for improving performance and energy efficiency by mitigating the impact of dynamic parameter variations. The design methodology describes the additional steps beyond a standard design flow for integrating error-detection and recovery circuits into a microprocessor core. Silicon measurements indicate that the resilient design enables a 41% throughput benefit at iso-energy or a 22% energy reduction at iso-throughput, as compared to a conventional design.
在本教程中,一个带有错误检测和恢复电路的45nm弹性微处理器内核演示了通过减轻动态参数变化的影响来提高性能和能源效率的机会。设计方法描述了将错误检测和恢复电路集成到微处理器核心的标准设计流程之外的附加步骤。硅测量表明,与传统设计相比,弹性设计在等能量下可以提高41%的吞吐量,在等能量下可以降低22%的能耗。
{"title":"Resilient microprocessor design for improving performance and energy efficiency","authors":"K. Bowman, J. Tschanz","doi":"10.1109/ICCAD.2010.5654317","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654317","url":null,"abstract":"In this tutorial, a 45nm resilient microprocessor core with error-detection and recovery circuits demonstrates the opportunity for improving performance and energy efficiency by mitigating the impact of dynamic parameter variations. The design methodology describes the additional steps beyond a standard design flow for integrating error-detection and recovery circuits into a microprocessor core. Silicon measurements indicate that the resilient design enables a 41% throughput benefit at iso-energy or a 22% energy reduction at iso-throughput, as compared to a conventional design.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123339362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Boolean matching of function vectors with strengthened learning 增强学习的函数向量布尔匹配
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654215
Chih-Fan Lai, J. H. Jiang, Kuo-Hua Wang
Boolean matching for multiple-output functions determines whether two given (in)completely-specified function vectors can be identical to each other under permutation and/or negation of their inputs and outputs. Despite its importance in design rectification, technology mapping, and other logic synthesis applications, there is no much direct study on this subject due to its generality and consequent computational complexity. This paper extends our prior Boolean matching decision procedure BooM to consider multiple-output functions. Through conflict-driven learning and partial assignment reduction, Boolean matching in the most general setting can still be accomplishable even when all other techniques lose their foundation and become unapplicable. Experiments demonstrate the indispensable power of strengthened learning for practical applications.
多输出函数的布尔匹配决定了两个给定的(in)完全指定的函数向量在其输入和输出的置换和/或负向下是否可以彼此相同。尽管它在设计校正、技术映射和其他逻辑综合应用中很重要,但由于它的普遍性和随之而来的计算复杂性,对这一主题的直接研究并不多。本文将先前的布尔匹配决策过程扩展到考虑多输出函数。通过冲突驱动的学习和部分赋值约简,即使所有其他技术都失去了基础而变得不适用,布尔匹配在最一般的设置中仍然可以实现。实验证明了强化学习对于实际应用的不可缺少的力量。
{"title":"Boolean matching of function vectors with strengthened learning","authors":"Chih-Fan Lai, J. H. Jiang, Kuo-Hua Wang","doi":"10.1109/ICCAD.2010.5654215","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654215","url":null,"abstract":"Boolean matching for multiple-output functions determines whether two given (in)completely-specified function vectors can be identical to each other under permutation and/or negation of their inputs and outputs. Despite its importance in design rectification, technology mapping, and other logic synthesis applications, there is no much direct study on this subject due to its generality and consequent computational complexity. This paper extends our prior Boolean matching decision procedure BooM to consider multiple-output functions. Through conflict-driven learning and partial assignment reduction, Boolean matching in the most general setting can still be accomplishable even when all other techniques lose their foundation and become unapplicable. Experiments demonstrate the indispensable power of strengthened learning for practical applications.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"13 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123648696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Efficient arithmetic sum-of-product (SOP) based Multiple Constant Multiplication (MCM) for FFT 基于高效算术乘积和(SOP)的FFT多重常数乘法(MCM)
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654269
V. Karkala, Joseph Wanstrath, Travis Lacour, S. Khatri
In this paper, we present an arithmetic sum-of-products (SOP) based realization of the general Multiple Constant Multiplication (MCM) algorithm. We also propose an enhanced SOP based algorithm, which uses Partial Max-SAT (PMSAT) to further optimize the SOP. The enhanced algorithm attempts to reduce the number of rows (partial products) of the SOP, by i) shifting coefficients to realize other coefficients when possible, ii) exploring multiple implementations of each coefficient using a Minimal Signed Digit (MSD) format and iii) exploiting the mutual exclusiveness within certain groups of partial products. Hardware implementations of the Fast Fourier Transform (FFT) algorithm require the incoming data to be multiplied by one of several constant coefficients. We test/validate it for FFT, which is an important problem. We compare our SOP-based architectures with the best existing implementation of MCM for FFT (which utilizes a cascade of adders), and show that our approaches show a significant improvement in area and delay. Our architecture was synthesized using 65nm technology libraries.
本文提出了一种基于算术积和(SOP)的通用多重常数乘法(MCM)算法的实现方法。我们还提出了一种改进的基于SOP的算法,该算法使用Partial Max-SAT (PMSAT)进一步优化SOP。增强的算法试图减少SOP的行数(部分乘积),通过i)在可能的情况下移动系数以实现其他系数,ii)使用最小符号数字(MSD)格式探索每个系数的多个实现,以及iii)利用某些部分乘积组内的互斥性。快速傅里叶变换(FFT)算法的硬件实现要求输入数据乘以几个常数系数之一。我们对FFT进行了测试/验证,这是一个重要的问题。我们将基于sop的架构与现有最佳的FFT MCM实现(利用级联加法器)进行了比较,并表明我们的方法在面积和延迟方面有显着改善。我们的架构是使用65nm技术库合成的。
{"title":"Efficient arithmetic sum-of-product (SOP) based Multiple Constant Multiplication (MCM) for FFT","authors":"V. Karkala, Joseph Wanstrath, Travis Lacour, S. Khatri","doi":"10.1109/ICCAD.2010.5654269","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654269","url":null,"abstract":"In this paper, we present an arithmetic sum-of-products (SOP) based realization of the general Multiple Constant Multiplication (MCM) algorithm. We also propose an enhanced SOP based algorithm, which uses Partial Max-SAT (PMSAT) to further optimize the SOP. The enhanced algorithm attempts to reduce the number of rows (partial products) of the SOP, by i) shifting coefficients to realize other coefficients when possible, ii) exploring multiple implementations of each coefficient using a Minimal Signed Digit (MSD) format and iii) exploiting the mutual exclusiveness within certain groups of partial products. Hardware implementations of the Fast Fourier Transform (FFT) algorithm require the incoming data to be multiplied by one of several constant coefficients. We test/validate it for FFT, which is an important problem. We compare our SOP-based architectures with the best existing implementation of MCM for FFT (which utilizes a cascade of adders), and show that our approaches show a significant improvement in area and delay. Our architecture was synthesized using 65nm technology libraries.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"11 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116800350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A lower bound computation method for evaluation of statistical design techniques 统计设计技术评价的下界计算方法
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654194
Vineeth Veetil, D. Sylvester, D. Blaauw
Increase in variability in the nanometer era has contributed to pessimistic guardbands for conventional circuit design techniques that optimize at worst-case process corners. Smart deterministic approaches have been proposed that employ statistical timing analysis to reduce pessimism in the guardbands while retaining the deterministic nature of the algorithms. Other statistical optimization techniques focus on algorithms to maximize robustness of design while being aware of variability. It is not clear how much improvement can be gained using the latter set of approaches over more simple deterministic approaches. This work presents a new lower bound to evaluate these statistical optimization techniques, drawing inspiration from recent advances in sampling based SSTA. We prove that the presented lower bound gives the minimum possible area that can be achieved for a design while meeting a particular timing yield, which is the percentage of die that meeting a specified timing constraint. We then compare several statistical design optimization approaches, including one proposed in this paper called SLOP, against the computed lower bound. We show that even the simplest statistical optimization approaches produce area results which are, on average, within 9.6% of the lower bound while the best ones performed only marginally better, reaching within 3.7% of the bound. This demonstrates that the proposed bound is a close bound. In addition, it also shows that the existing optimization methods have nearly exhausted the obtainable improvement from being statistically aware and mostly provide trade-offs in runtime speed.
纳米时代变异性的增加导致了传统电路设计技术在最坏情况下优化的悲观保护带。已经提出了智能确定性方法,采用统计时序分析来减少警戒线中的悲观情绪,同时保留算法的确定性性质。其他统计优化技术侧重于算法,以最大限度地提高设计的鲁棒性,同时意识到可变性。目前尚不清楚使用后一组方法比更简单的确定性方法能获得多少改进。这项工作提出了一个新的下限来评估这些统计优化技术,从基于采样的SSTA的最新进展中汲取灵感。我们证明了所提出的下界给出了在满足特定时序良率(即满足特定时序约束的模具的百分比)的情况下,设计可以实现的最小可能面积。然后,我们比较了几种统计设计优化方法,包括本文提出的一种称为SLOP的方法,与计算的下界进行比较。我们表明,即使是最简单的统计优化方法产生的面积结果,平均而言,在下限的9.6%以内,而最好的方法只稍微好一点,达到下限的3.7%以内。这证明了所提出的界是一个紧密界。此外,它还表明,现有的优化方法几乎耗尽了从统计感知中获得的改进,并且主要提供运行时速度的权衡。
{"title":"A lower bound computation method for evaluation of statistical design techniques","authors":"Vineeth Veetil, D. Sylvester, D. Blaauw","doi":"10.1109/ICCAD.2010.5654194","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654194","url":null,"abstract":"Increase in variability in the nanometer era has contributed to pessimistic guardbands for conventional circuit design techniques that optimize at worst-case process corners. Smart deterministic approaches have been proposed that employ statistical timing analysis to reduce pessimism in the guardbands while retaining the deterministic nature of the algorithms. Other statistical optimization techniques focus on algorithms to maximize robustness of design while being aware of variability. It is not clear how much improvement can be gained using the latter set of approaches over more simple deterministic approaches. This work presents a new lower bound to evaluate these statistical optimization techniques, drawing inspiration from recent advances in sampling based SSTA. We prove that the presented lower bound gives the minimum possible area that can be achieved for a design while meeting a particular timing yield, which is the percentage of die that meeting a specified timing constraint. We then compare several statistical design optimization approaches, including one proposed in this paper called SLOP, against the computed lower bound. We show that even the simplest statistical optimization approaches produce area results which are, on average, within 9.6% of the lower bound while the best ones performed only marginally better, reaching within 3.7% of the bound. This demonstrates that the proposed bound is a close bound. In addition, it also shows that the existing optimization methods have nearly exhausted the obtainable improvement from being statistically aware and mostly provide trade-offs in runtime speed.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121621124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1