2011 24th Internatioal Conference on VLSI Design最新文献

英文中文

Extraction of Aspect Ratio for Non-Manhattan CMOS Devices 非曼哈顿CMOS器件宽高比的提取

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.71

Shiv Kumar, V. Chandratre, S. Mohammed, C. K. Pithawa

Non-Manhattan CMOS devices are gaining attention because of their special properties. In this paper waffle and closed gate structures are discussed and issues related to their use in CAD tools are addressed. The waffle devices are used where large aspect ratio with low parasitic capacitances and lower silicon overhead is required. The closed-gate layout is used in rad-hard digital libraries due to their edgeless geometry. These devices are difficult to handle because Process Design Kit (PDK) is developed for Manhattan geometries. This paper discusses how the models for Manhattan devices can’t be extended to predict accurate I-V characteristics of the non-manhattan devices. An analytical model is developed to map non-Manhattan devices to equivalent Manhattan devices. A test chip in 0.7?m CMOS technology was developed to validate the concept. The PDK was modified to introduce these structures in normal analog design flow.

非曼哈顿CMOS器件因其特殊的性能而受到越来越多的关注。本文讨论了华夫结构和闭门结构，并讨论了它们在CAD工具中的应用问题。华夫格器件用于需要大宽高比、低寄生电容和低硅开销的地方。由于其无边缘的几何形状，闭门布局用于雷达硬数字图书馆。这些设备很难处理，因为过程设计套件(PDK)是为曼哈顿几何形状开发的。本文讨论了曼哈顿器件的模型不能推广到准确预测非曼哈顿器件的I-V特性的问题。建立了一个解析模型，将非曼哈顿装置映射到等效的曼哈顿装置。测试芯片在0.7?开发了m CMOS技术来验证该概念。对PDK进行了修改，以便在正常的模拟设计流程中引入这些结构。

引用次数: 2

Cell Library Characterization at Low Voltage Using Non-linear Operating Point Analysis of Local Variations 利用非线性工作点局部变化分析低电压下的电池库特性

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.43

R. Rithe, Sharon Chou, Jie Gu, Alice Wang, Satyendra R. Datla, G. Gammie, D. Buss, A. Chandrakasan

When CMOS is operated at a supply voltage of 0.5V and below, Random Do pant Fluctuations (RDFs) result in a stochastic component of logic delay that can be comparable to the nominal delay. Moreover, the Probability Density Function (PDF) of this stochastic delay can be highly non-Gaussian. The Non-Linear, Operating Point Analysis of Local Variations (NLOPALV) technique has been shown to be accurate and computationally efficient in simulating any point on the delay PDF of a logic Timing Path (TP). This paper applies the NLOPALV approach to characterizing the stochastic delay of logic cells. NLOPALV theory is presented, and NLOPALV is used to characterize a cell library designed in 28 nm CMOS. NLOPALV is accurate to within 5% compared to SPICE-based Monte Carlo analysis.

当CMOS在0.5V及以下的电源电压下工作时，随机波动(RDFs)会导致逻辑延迟的随机成分，可以与标称延迟相媲美。此外，这种随机延迟的概率密度函数(PDF)可以是非高斯的。非线性、局部变化工作点分析(NLOPALV)技术在模拟逻辑时序路径(TP)的延迟PDF上的任何点时都是准确的和计算效率高的。本文应用NLOPALV方法来表征逻辑单元的随机延迟。提出了NLOPALV理论，并利用NLOPALV对28nm CMOS设计的单元库进行了表征。与基于spice的蒙特卡罗分析相比，NLOPALV的准确度在5%以内。

引用次数: 5

A Statistical Learning Based Modeling Approach and Its Application in Leakage Library Characterization 基于统计学习的建模方法及其在泄漏库描述中的应用

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.23

Min Zhang, R. Häußler, M. Olbrich, H. Kinzelbach, E. Barke

In statistical analysis, modeling circuit performance for non-linear problems demands large computational effort. In semi-custom design, statistical leakage library characterization is a highly complex yet fundamental task. The log-linear model provides an unacceptable poor accuracy in modeling a large number of standard cells. To improve model quality, simply increasing model order is not practicable because it leads to an exponential increase in run time. Instead of assuming one model type for the entire library beforehand, we developed an approach generating a model for each cell individually. The key contribution is the use of a cross term matrix and an active sampling scheme, which significantly reduces model size and model generation time. The effectiveness of our approach is clearly shown by experiments on industrial standard cell libraries. As we regard the circuit block as a black box, our approach is suitable for modeling various circuit performances.

在统计分析中，非线性问题的电路性能建模需要大量的计算量。在半定制设计中，统计泄漏库表征是一项非常复杂而又基础的任务。对数线性模型在模拟大量标准细胞时精度低得令人无法接受。为了提高模型质量，简单地增加模型顺序是不可行的，因为这会导致运行时间呈指数增长。我们没有预先为整个库假设一种模型类型，而是开发了一种为每个单元单独生成模型的方法。关键的贡献是使用交叉项矩阵和主动采样方案，这大大减少了模型大小和模型生成时间。在工业标准细胞库上的实验清楚地表明了我们方法的有效性。由于我们将电路块视为一个黑盒，因此我们的方法适用于各种电路性能的建模。

引用次数: 0

Realizing Cycle Accurate Processor Memory Simulation via Interface Abstraction 通过接口抽象实现周期精确的处理器内存仿真

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.36

S. Min, Jorgen Peddersen, S. Parameswaran

SoC designers typically use a processor simulator to generate a memory trace and apply the generated trace to a memory simulator in order to collect the performance statistics of a complete system. This is an inaccurate process for most applications, making it difficult to optimize the processor and memory configurations. In this paper, we study the problems encountered in the typical simulation approach and propose a methodology which utilizes an interface layer component to link the processor simulator and memory simulator seamlessly. The interface layer component presented in this paper can be used as the connector between the processor module and memory module in building an execution-driven approach which can be applied to process run-time memory requests rather than the traditional trace driven simulation approaches. By applying the proposed interface layer component to link the processor simulator and memory simulator, the estimated performance statistics of the system and the average power consumption of the memory system can be collected with high accuracy. We prove the necessity of our approach by evaluating six benchmarks. Over these benchmarks, there is an 80% variation in the choice of memory latency to achieve the most accurate power consumption and a 16% variation in the choice of memory latency to achieve the most accurate execution time. The increase in accuracy comes at an average increase in simulation time of 13.5%.

SoC设计人员通常使用处理器模拟器来生成内存跟踪，并将生成的跟踪应用于内存模拟器，以收集完整系统的性能统计数据。对于大多数应用程序来说，这是一个不准确的过程，使得优化处理器和内存配置变得困难。在本文中，我们研究了典型仿真方法中遇到的问题，并提出了一种利用接口层组件无缝连接处理器模拟器和存储器模拟器的方法。本文提出的接口层组件可以作为处理器模块和内存模块之间的连接器，用于构建执行驱动方法，该方法可以应用于处理运行时内存请求，而不是传统的跟踪驱动仿真方法。利用所提出的接口层组件将处理器模拟器和存储器模拟器连接起来，可以高精度地获取系统的估计性能统计数据和存储器系统的平均功耗。我们通过评估六个基准来证明我们的方法的必要性。在这些基准测试中，为了获得最准确的功耗，在内存延迟的选择上有80%的变化，为了获得最准确的执行时间，在内存延迟的选择上有16%的变化。精度的提高是在模拟时间平均增加13.5%的基础上实现的。

{"title":"Realizing Cycle Accurate Processor Memory Simulation via Interface Abstraction","authors":"S. Min, Jorgen Peddersen, S. Parameswaran","doi":"10.1109/VLSID.2011.36","DOIUrl":"https://doi.org/10.1109/VLSID.2011.36","url":null,"abstract":"SoC designers typically use a processor simulator to generate a memory trace and apply the generated trace to a memory simulator in order to collect the performance statistics of a complete system. This is an inaccurate process for most applications, making it difficult to optimize the processor and memory configurations. In this paper, we study the problems encountered in the typical simulation approach and propose a methodology which utilizes an interface layer component to link the processor simulator and memory simulator seamlessly. The interface layer component presented in this paper can be used as the connector between the processor module and memory module in building an execution-driven approach which can be applied to process run-time memory requests rather than the traditional trace driven simulation approaches. By applying the proposed interface layer component to link the processor simulator and memory simulator, the estimated performance statistics of the system and the average power consumption of the memory system can be collected with high accuracy. We prove the necessity of our approach by evaluating six benchmarks. Over these benchmarks, there is an 80% variation in the choice of memory latency to achieve the most accurate power consumption and a 16% variation in the choice of memory latency to achieve the most accurate execution time. The increase in accuracy comes at an average increase in simulation time of 13.5%.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133894388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

A Library Development Framework for a Coarse Grain Reconfigurable Architecture 面向粗粒度可重构体系结构的图书馆开发框架

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.54

Omer Malik, A. Hemani, M. A. Shami

A framework for efficiently capturing the rich micro architectural space of a substantial Matlab like library of DSP functions for a regular Coarse Grain Reconfigurable Architecture (CGRA) fabric is proposed. A subset of C has been proposed to model the DSP functions and an automatic tool to generate the configware for the CGRA fabric developed. A method to estimate the average energy of such functions is reported with error margin of less than 3%. Such a framework is proposed as the basis for raising the abstraction to automate synthesis of the entire physical layers.

提出了一种基于规则粗粒可重构结构(CGRA)的框架，用于高效捕获大量类似Matlab的DSP函数库的丰富微结构空间。提出了一个C语言子集来对DSP功能进行建模，并提出了一个自动生成CGRA结构组态软件的工具。本文报道了一种估计这些函数平均能量的方法，误差小于3%。这种框架被提议作为提高抽象以自动合成整个物理层的基础。

引用次数: 17

A Robust and Reconfigurable Multi-mode Power Gating Architecture 一种鲁棒且可重构的多模电源门控架构

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.29

Zhaobo Zhang, X. Kavousianos, K. Chakrabarty, Y. Tsiatouhas

Multi-threshold CMOS is a very effective technique for reducing standby leakage power during long periods of inactivity. Recently, a power-gating scheme was presented to support multiple power-off modes and reduce the leakage power during short periods of inactivity. However, this scheme suffers from high sensitivity to process variations, which impedes manufacturability and also limits its applicability to at most two intermediate power-off modes. We propose a new power-gating technique that is tolerant to process variations and scalable to more than two intermediate power-off modes. In addition, the proposed design requires minimum design effort and offers greater power reduction and smaller area cost than the previous method. Analysis and extensive simulation results demonstrate the effectiveness of the proposed design.

多阈值CMOS技术是降低长时间不工作时待机漏功率的一种非常有效的技术。最近，提出了一种支持多种断电模式的电源门控方案，以减少短时间不活动时的泄漏功率。然而，该方案对工艺变化高度敏感，这阻碍了可制造性，也限制了其至多适用于两种中间断电模式。我们提出了一种新的功率门控技术，它可以容忍工艺变化，并可扩展到两种以上的中间断电模式。此外，与之前的方法相比，所提出的设计需要最小的设计工作量，并提供更大的功耗降低和更小的面积成本。分析和广泛的仿真结果证明了该设计的有效性。

引用次数: 14

Optimized Multitone Test Stimulus Driven Diagnosis of RF Transceivers Using Model Parameter Estimation 基于模型参数估计的射频收发器多音测试刺激优化诊断

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.65

A. Banerjee, Vishwanath Natarajan, Shreyas Sen, A. Chatterjee, G. Srinivasan, S. Bhattacharya

Test time and test complexity reduction has become a critical challenge in modern RF testing. Prior “alternative” test methods have achieved fast testing at the cost of using supervised learning algorithms that require “training”. In contrast, behavioral model parameter estimation based test methods require the use of accurate models but no “training” is necessary, reducing test deployment costs. In this work, a new test generation approach is proposed that allows behavioral model parameter estimation to be performed from a single optimized OFDM data frame. A genetic multi-tone test stimulus optimization algorithm is developed to maximize the accuracy with which a nonlinear solver can determine RF transceiver model parameters from raw downconverted test response data. The transceiver model proposed is the most comprehensive to date and includes AM-PM distortion and 5th order nonlinearity effects. Simulation results show that using the optimized multitone test stimulus, all the model parameters can be computed accurately using a single data acquisition (4X-5X faster than prior parameter estimation techniques and comparable to alternative test times). Data from an experiment performed on a hardware prototype validates the proposed concept.

降低测试时间和测试复杂度已成为现代射频测试面临的关键挑战。之前的“替代”测试方法已经以使用需要“训练”的监督学习算法为代价实现了快速测试。相比之下，基于行为模型参数估计的测试方法需要使用准确的模型，但不需要“训练”，从而降低了测试部署成本。在这项工作中，提出了一种新的测试生成方法，允许从单个优化的OFDM数据帧执行行为模型参数估计。为了使非线性求解器从原始下转换测试响应数据中确定射频收发器模型参数的精度最大化，提出了一种遗传多音测试刺激优化算法。提出的收发器模型是迄今为止最全面的，包括AM-PM失真和五阶非线性效应。仿真结果表明，利用优化后的多音测试刺激，单次数据采集即可准确计算出所有模型参数(比先前的参数估计技术快4 -5倍，与备选测试时间相当)。在硬件样机上进行的实验数据验证了所提出的概念。

{"title":"Optimized Multitone Test Stimulus Driven Diagnosis of RF Transceivers Using Model Parameter Estimation","authors":"A. Banerjee, Vishwanath Natarajan, Shreyas Sen, A. Chatterjee, G. Srinivasan, S. Bhattacharya","doi":"10.1109/VLSID.2011.65","DOIUrl":"https://doi.org/10.1109/VLSID.2011.65","url":null,"abstract":"Test time and test complexity reduction has become a critical challenge in modern RF testing. Prior “alternative” test methods have achieved fast testing at the cost of using supervised learning algorithms that require “training”. In contrast, behavioral model parameter estimation based test methods require the use of accurate models but no “training” is necessary, reducing test deployment costs. In this work, a new test generation approach is proposed that allows behavioral model parameter estimation to be performed from a single optimized OFDM data frame. A genetic multi-tone test stimulus optimization algorithm is developed to maximize the accuracy with which a nonlinear solver can determine RF transceiver model parameters from raw downconverted test response data. The transceiver model proposed is the most comprehensive to date and includes AM-PM distortion and 5th order nonlinearity effects. Simulation results show that using the optimized multitone test stimulus, all the model parameters can be computed accurately using a single data acquisition (4X-5X faster than prior parameter estimation techniques and comparable to alternative test times). Data from an experiment performed on a hardware prototype validates the proposed concept.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132614342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 25

Hazard-Aware Directed Transition Fault ATPG for Effective Critical Path Test 有效关键路径测试的危险感知定向过渡故障ATPG

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.42

V. Devanathan, Ishaan Santhosh Shah

Aggressive speed and voltage binning schemes are widely used in the industry to combat process variation. Generating structural tests that are effective for speed and voltage binning is very important to reduce cost and improve quality. We observe that hazards are common along critical paths of many industrial designs and conventional path-delay ATPG is ineffective for paths with static hazards. We propose a directed transition fault ATPG scheme that works with commercial ATPG tools to test the critical paths with hazards. The proposed scheme is implemented on industrial designs and silicon results are presented.

积极的速度和电压分组方案在工业中广泛用于对抗工艺变化。生成有效的速度和电压分闸结构试验对降低成本和提高质量具有重要意义。我们观察到，在许多工业设计的关键路径上，危险是常见的，传统的路径延迟ATPG对于具有静态危险的路径是无效的。我们提出了一种直接过渡故障ATPG方案，该方案与商用ATPG工具一起测试具有危险的关键路径。该方案已在工业设计中实现，并给出了硅实验结果。

引用次数: 0

A GPU Algorithm for IC Floorplanning: Specification, Analysis and Optimization 一种用于集成电路平面规划的GPU算法:规范、分析与优化

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.19

Yiding Han, Koushik Chakraborty, Sanghamitra Roy, Vilasita Kuntamukkala

In this paper, we propose a novel floor planning algorithm for GPUs. Floor planning is an inherently sequential algorithm, far from the typical programs suitable for Single Instruction Multiple Thread (SIMT) style concurrency in a GPU. We propose a fundamentally different approach of exploring the floor plan solution space, where we evaluate concurrent moves on a given floor plan. We illustrate several performance optimization techniques for this algorithm in GPUs. Compared to the sequential algorithm, our techniques achieve 4-30X speedup for a range of MCNC benchmarks, while delivering comparable or better solution quality.

在本文中，我们提出了一种新的gpu地板规划算法。楼层规划是一个固有的顺序算法，与典型的适合GPU中单指令多线程(SIMT)风格并发的程序相去甚远。我们提出了一种完全不同的探索平面图解决方案空间的方法，其中我们评估给定平面图上的并发移动。我们举例说明了该算法在gpu中的几种性能优化技术。与顺序算法相比，我们的技术在一系列MCNC基准测试中实现了4-30倍的加速，同时提供了相当或更好的解决方案质量。

引用次数: 3

MEMS-Based Power Gating for Highly Scalable Periodic and Event-Driven Processing 基于mems的高可扩展周期和事件驱动处理的功率门控

2011 24th Internatioal Conference on VLSI Design

Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.66

Michael B. Henry, Robert Lyerly, L. Nazhandali, A. Fruehling, D. Peroulis

For periodic and event-driven applications with long standby times, controlling leakage is essential. This paper investigates using MEMS switches for power gating processors, which eliminates standby leakage power and allows for highly scalable processing. We show that when power gating with a MEMS switch, low technology nodes and low threshold voltages, which offer low switching energy and high speeds, are optimal. We also compare a MEMS-gated processor to two recent low leakage processors and show that it is ideal for applications with 100+ ms standby times. With CMOS compatibility on the horizon, MEMS switches are an attractive option for low-leakage applications.

对于具有长待机时间的周期性和事件驱动的应用程序，控制泄漏是必不可少的。本文研究了将MEMS开关用于功率门控处理器，它消除了待机泄漏功率并允许高度可扩展的处理。我们表明，当使用MEMS开关进行功率门控时，低技术节点和低阈值电压，提供低开关能量和高速度，是最佳的。我们还将mems门控处理器与两种最新的低泄漏处理器进行了比较，并表明它非常适合待机时间为100+ ms的应用。随着CMOS兼容性的发展，MEMS开关是低泄漏应用的一个有吸引力的选择。

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2011 24th Internatioal Conference on VLSI Design

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀