首页 > 最新文献

2011 24th Internatioal Conference on VLSI Design最新文献

英文 中文
Power Scalable Digital Baseband Architecture for IEEE 802.15.4 IEEE 802.15.4的功率可扩展数字基带架构
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.64
S. Dwivedi, B. Amrutur, N. Bhat
We propose a power scalable digital base band for a low-IF receiver for IEEE 802.15.4-2006. The digital section’s sampling frequency and bit width are used as knobs to reduce the power under favorable signal and interference scenarios, thus recovering the design margins introduced to handle worst case conditions. We propose tuning of these knobs based on measurements of Signal and the interference levels. We show that in a 0.13u CMOS technology, for an adaptive digital base band section of the receiver designed to meet the 802.15.4 standard specification, power saving can be up to nearly 85% (0.49mW against 3.3mW) in favorable interference and signal conditions.
我们为IEEE 802.15.4-2006的低中频接收器提出了一种功率可扩展的数字基带。数字部分的采样频率和位宽用作旋钮,在有利的信号和干扰情况下降低功率,从而恢复处理最坏情况时引入的设计余量。我们建议根据信号和干扰水平的测量来调整这些旋钮。我们表明,在0.13u CMOS技术中,对于设计满足802.15.4标准规范的接收器的自适应数字基带部分,在良好的干扰和信号条件下,省电可达近85% (0.49mW对3.3mW)。
{"title":"Power Scalable Digital Baseband Architecture for IEEE 802.15.4","authors":"S. Dwivedi, B. Amrutur, N. Bhat","doi":"10.1109/VLSID.2011.64","DOIUrl":"https://doi.org/10.1109/VLSID.2011.64","url":null,"abstract":"We propose a power scalable digital base band for a low-IF receiver for IEEE 802.15.4-2006. The digital section’s sampling frequency and bit width are used as knobs to reduce the power under favorable signal and interference scenarios, thus recovering the design margins introduced to handle worst case conditions. We propose tuning of these knobs based on measurements of Signal and the interference levels. We show that in a 0.13u CMOS technology, for an adaptive digital base band section of the receiver designed to meet the 802.15.4 standard specification, power saving can be up to nearly 85% (0.49mW against 3.3mW) in favorable interference and signal conditions.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133003925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Evolution of Oscillation in a Quadrature Oscillator 正交振荡器振荡的演化
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.25
Diptendu Ghosh, R. Gharpurey
The growth of oscillation in a quadrature oscillator that employs phase shifters in the coupling-path between a pair of LC-loaded negative resistance cores, is analyzed. Such an oscillator is known to have two stable modes of oscillation. Under a noise-initiated startup from an unstable initial condition, quasiharmonic assumption and the Method of First Approximation are used to demonstrate that compression mechanisms lead to preferential enhancement of one mode, while attenuating the other. The magnitude of phase shift in the coupling-path is shown to directly affect the ratio between temporal rates of mode buildup and decay.
分析了在一对lc负载的负电阻磁芯之间的耦合路径中采用移相器的正交振荡器的振荡增长。这样的振荡器已知有两种稳定的振荡模式。在不稳定初始条件下的噪声启动下,利用拟调和假设和第一逼近方法证明了压缩机制导致一种模式优先增强,而另一种模式衰减。耦合路径中相移的大小直接影响模态累积和衰减的时间速率之比。
{"title":"Evolution of Oscillation in a Quadrature Oscillator","authors":"Diptendu Ghosh, R. Gharpurey","doi":"10.1109/VLSID.2011.25","DOIUrl":"https://doi.org/10.1109/VLSID.2011.25","url":null,"abstract":"The growth of oscillation in a quadrature oscillator that employs phase shifters in the coupling-path between a pair of LC-loaded negative resistance cores, is analyzed. Such an oscillator is known to have two stable modes of oscillation. Under a noise-initiated startup from an unstable initial condition, quasiharmonic assumption and the Method of First Approximation are used to demonstrate that compression mechanisms lead to preferential enhancement of one mode, while attenuating the other. The magnitude of phase shift in the coupling-path is shown to directly affect the ratio between temporal rates of mode buildup and decay.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127121274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Wakeup Time and Wakeup Energy Estimation in Power-Gated Logic Clusters 电源门控逻辑集群的唤醒时间和唤醒能量估计
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.18
Vivek D. Tovinakere, O. Sentieys, Steven Derrien
Run-time power gating for aggressive leakage reduction has brought into focus the cost of mode transition overheads due to frequent switching between sleep and active modes of circuit operation. In order to design circuits for effective power gating, logic circuits must be characterized for overheads they present during mode transitions. In this paper, we describe a method to determine steady-state virtual-supply voltage in active mode and hence present a model for virtual supply voltage in terms of basic circuit parameters. Further, we derive expressions for estimation of two mode transition overheads: wakeup time and wakeup energy for a power-gated logic cluster using the proposed model. Finally we demonstrate its application to four ISCAS benchmark circuits while also analyzing the accuracy of approximations used in the model.
运行时功率门控的积极减少泄漏已经引起了人们对模式转换开销的关注,这是由于在电路工作的睡眠模式和有源模式之间频繁切换造成的。为了设计有效的功率门控电路,必须对逻辑电路在模式转换期间出现的开销进行表征。本文描述了一种确定有源模式下稳态虚供电电压的方法,并由此建立了基于基本电路参数的虚供电电压模型。此外,我们推导了两种模式转换开销的估计表达式:使用所提出的模型的电源门控逻辑集群的唤醒时间和唤醒能量。最后,我们展示了它在四个ISCAS基准电路中的应用,同时也分析了模型中使用的近似精度。
{"title":"Wakeup Time and Wakeup Energy Estimation in Power-Gated Logic Clusters","authors":"Vivek D. Tovinakere, O. Sentieys, Steven Derrien","doi":"10.1109/VLSID.2011.18","DOIUrl":"https://doi.org/10.1109/VLSID.2011.18","url":null,"abstract":"Run-time power gating for aggressive leakage reduction has brought into focus the cost of mode transition overheads due to frequent switching between sleep and active modes of circuit operation. In order to design circuits for effective power gating, logic circuits must be characterized for overheads they present during mode transitions. In this paper, we describe a method to determine steady-state virtual-supply voltage in active mode and hence present a model for virtual supply voltage in terms of basic circuit parameters. Further, we derive expressions for estimation of two mode transition overheads: wakeup time and wakeup energy for a power-gated logic cluster using the proposed model. Finally we demonstrate its application to four ISCAS benchmark circuits while also analyzing the accuracy of approximations used in the model.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129567723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Variation-Aware TED-Based Approach for Nano-CMOS RTL Leakage Optimization 基于变化感知ted的纳米cmos RTL泄漏优化方法
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.40
Shibaji Banerjee, J. Mathew, D. Pradhan, S. Mohanty, M. Ciesielski
As technology scales down to nanometer regime the process variations have profound effect on circuit characteristics. Meeting timing and power constraints under such process variations in nano-CMOS circuit design is increasingly difficult. This causes a shifting from worst-case based analysis and optimization to statistical or probability based analysis and optimization at every level of circuit abstraction. This paper presents a TED (Taylor Expansion Diagram) based multi ? Tox techniques during high-level synthesis (HLS). A variation-aware simultaneous scheduling and resource binding algorithm is proposed which maximizes the power yield under timing yield and performance constraint. For this purpose, a multi ? Tox library is characterized under process variation. The delay and power distribution of different functional units are exhaustively studied. The proposed variation-aware algorithm uses those components for generating low power RTL under a given timing yield and performance constraint. The experimental results show significant improvement as high as 95% on leakage power yield under given constraints.
随着纳米技术的发展,工艺的变化对电路的特性有着深远的影响。在纳米cmos电路设计中,满足这种工艺变化下的时序和功率限制越来越困难。这导致从基于最坏情况的分析和优化转变为基于统计或概率的分析和优化,在每个层次的电路抽象。本文提出了一种基于泰勒展开图(TED)的多?高水平合成(HLS)中的Tox技术。提出了一种能感知变化的同步调度和资源绑定算法,该算法能在时间产出和性能约束下实现功率产出最大化。为了这个目的,一个多?Tox库在过程变化下进行表征。对不同功能单元的时延和功率分布进行了详尽的研究。提出的变化感知算法利用这些组件在给定的时序良率和性能约束下产生低功耗RTL。实验结果表明,在给定的约束条件下,泄漏功率的提高可达95%。
{"title":"Variation-Aware TED-Based Approach for Nano-CMOS RTL Leakage Optimization","authors":"Shibaji Banerjee, J. Mathew, D. Pradhan, S. Mohanty, M. Ciesielski","doi":"10.1109/VLSID.2011.40","DOIUrl":"https://doi.org/10.1109/VLSID.2011.40","url":null,"abstract":"As technology scales down to nanometer regime the process variations have profound effect on circuit characteristics. Meeting timing and power constraints under such process variations in nano-CMOS circuit design is increasingly difficult. This causes a shifting from worst-case based analysis and optimization to statistical or probability based analysis and optimization at every level of circuit abstraction. This paper presents a TED (Taylor Expansion Diagram) based multi ? Tox techniques during high-level synthesis (HLS). A variation-aware simultaneous scheduling and resource binding algorithm is proposed which maximizes the power yield under timing yield and performance constraint. For this purpose, a multi ? Tox library is characterized under process variation. The delay and power distribution of different functional units are exhaustively studied. The proposed variation-aware algorithm uses those components for generating low power RTL under a given timing yield and performance constraint. The experimental results show significant improvement as high as 95% on leakage power yield under given constraints.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128428605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient Trace Signal Selection for Post Silicon Validation and Debug 高效跟踪信号选择后硅验证和调试
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.14
K. Basu, P. Mishra
Post-silicon validation is an essential part of modern integrated circuit design to capture bugs and design errors that escape pre-silicon validation phase. A major problem governing post-silicon debug is the observability of internal signals since the chip has already been manufactured. Storage requirements limit the number of signals that can be traced, therefore, a major challenge is how to reconstruct the majority of the remaining signals based on traced values. Existing approaches focus on selecting signals with an emphasis on partial restorability, which does not guarantee a good signal restoration. We propose an approach that efficiently selects a set of signals based on total restorability criteria. Our experimental results demonstrate that our signal selection algorithm is both computationally more efficient and can restore up to three times more signals compared to existing methods.
后硅验证是现代集成电路设计的一个重要组成部分,用于捕获逃避前硅验证阶段的错误和设计错误。控制后硅调试的一个主要问题是内部信号的可观察性,因为芯片已经制造出来了。存储要求限制了可以跟踪的信号的数量,因此,一个主要的挑战是如何基于跟踪值重建大部分剩余的信号。现有的方法侧重于选择信号,强调部分可恢复性,这并不能保证良好的信号恢复。我们提出了一种基于完全可恢复性标准的有效选择信号集的方法。实验结果表明,我们的信号选择算法不仅计算效率更高,而且与现有方法相比,可以恢复多达三倍的信号。
{"title":"Efficient Trace Signal Selection for Post Silicon Validation and Debug","authors":"K. Basu, P. Mishra","doi":"10.1109/VLSID.2011.14","DOIUrl":"https://doi.org/10.1109/VLSID.2011.14","url":null,"abstract":"Post-silicon validation is an essential part of modern integrated circuit design to capture bugs and design errors that escape pre-silicon validation phase. A major problem governing post-silicon debug is the observability of internal signals since the chip has already been manufactured. Storage requirements limit the number of signals that can be traced, therefore, a major challenge is how to reconstruct the majority of the remaining signals based on traced values. Existing approaches focus on selecting signals with an emphasis on partial restorability, which does not guarantee a good signal restoration. We propose an approach that efficiently selects a set of signals based on total restorability criteria. Our experimental results demonstrate that our signal selection algorithm is both computationally more efficient and can restore up to three times more signals compared to existing methods.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"80 11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126930993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 58
Development of a Micro-mechanical Logic Inverter for Low Frequency MEMS Sensor Interfacing 用于低频MEMS传感器接口的微机械逻辑逆变器的研制
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.26
S. Chakraborty, T. K. Bhattacharyya
This paper presents the development of a micro mechanical inverter for signal processing applications in low frequency MEMS sensors. The inverter consists of two MEMS contact switches connected in complementary configuration. The working principle of the inverter has been thoroughly explained and the design and performance analysis of the inverter have been systematically worked out using a top-down approach. PolyMUMPs surface micro machining process has been utilized for implementing and fabricating the MEMS inverter. The mechanical response and the switching response of the cantilevers have been extensively investigated. Static functional characterization of the inverter has been successfully carried out.
本文介绍了一种用于低频MEMS传感器信号处理的微机械逆变器的研制。逆变器由两个MEMS触点开关组成,以互补的方式连接。详细阐述了逆变器的工作原理,采用自顶向下的方法系统地进行了逆变器的设计和性能分析。利用PolyMUMPs表面微加工工艺实现和制造MEMS逆变器。对悬臂梁的力学响应和开关响应进行了广泛的研究。成功地进行了逆变器的静态功能表征。
{"title":"Development of a Micro-mechanical Logic Inverter for Low Frequency MEMS Sensor Interfacing","authors":"S. Chakraborty, T. K. Bhattacharyya","doi":"10.1109/VLSID.2011.26","DOIUrl":"https://doi.org/10.1109/VLSID.2011.26","url":null,"abstract":"This paper presents the development of a micro mechanical inverter for signal processing applications in low frequency MEMS sensors. The inverter consists of two MEMS contact switches connected in complementary configuration. The working principle of the inverter has been thoroughly explained and the design and performance analysis of the inverter have been systematically worked out using a top-down approach. PolyMUMPs surface micro machining process has been utilized for implementing and fabricating the MEMS inverter. The mechanical response and the switching response of the cantilevers have been extensively investigated. Static functional characterization of the inverter has been successfully carried out.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128122480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low Power Asynchronous Data Acquisition Front End for Wireless Body Sensor Area Network 无线人体传感器区域网络低功耗异步数据采集前端
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.89
M. Trakimas, Sungkil Hwang, S. Sonkusale
Wireless body sensor area networks (WBAN) is one of the key technologies to solve the rising healthcare costs through early detection, and point-of-care diagnosis and health management. However there is a stringent power requirement on individual sensor nodes in such networks. Consequently traditional signal chain of amplify-digitize-transmit generates large amounts of data that cannot be sustained due to limited energy and bandwidth. In this paper we propose an asynchronous data acquisition platform that provides inherent digitization and compression at the source. The proposed implementation consists of low noise front-end amplifier (AFE) with tunable bandwidth and an asynchronous clockless analog-to-digital converter (ADC). Data compression is achieved by the inherent signal dependent sampling of the asynchronous architecture. The AFE and ADC were fabricated in a 0.18μm CMOS technology and consume a total of 79μW. Measured results for asynchronous ECG signal acquisition are presented.
无线身体传感器区域网络(WBAN)是通过早期检测、即时诊断和健康管理来解决医疗成本上升的关键技术之一。然而,在这种网络中,对单个传感器节点有严格的功率要求。传统的放大-数字化传输信号链由于能量和带宽的限制,产生的数据量很大,无法持续传输。在本文中,我们提出了一种异步数据采集平台,在源端提供固有的数字化和压缩。提出的实现由带宽可调的低噪声前端放大器(AFE)和异步无时钟模数转换器(ADC)组成。数据压缩是通过异步结构固有的信号依赖采样来实现的。AFE和ADC采用0.18μm CMOS工艺制作,功耗为79μW。给出了异步心电信号采集的测量结果。
{"title":"Low Power Asynchronous Data Acquisition Front End for Wireless Body Sensor Area Network","authors":"M. Trakimas, Sungkil Hwang, S. Sonkusale","doi":"10.1109/VLSID.2011.89","DOIUrl":"https://doi.org/10.1109/VLSID.2011.89","url":null,"abstract":"Wireless body sensor area networks (WBAN) is one of the key technologies to solve the rising healthcare costs through early detection, and point-of-care diagnosis and health management. However there is a stringent power requirement on individual sensor nodes in such networks. Consequently traditional signal chain of amplify-digitize-transmit generates large amounts of data that cannot be sustained due to limited energy and bandwidth. In this paper we propose an asynchronous data acquisition platform that provides inherent digitization and compression at the source. The proposed implementation consists of low noise front-end amplifier (AFE) with tunable bandwidth and an asynchronous clockless analog-to-digital converter (ADC). Data compression is achieved by the inherent signal dependent sampling of the asynchronous architecture. The AFE and ADC were fabricated in a 0.18μm CMOS technology and consume a total of 79μW. Measured results for asynchronous ECG signal acquisition are presented.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128396502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Self-Calibrating Equalizer for Optimal Jitter Performance Using On-chip Eye Monitoring 自校准均衡器的最佳抖动性能使用芯片上的眼睛监测
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.38
Srinivasaraman Chandrasekaran, K. Desai, A. Sendhil, William Ng
Magnitude of equalization applied by a receiver equalization circuit varies across silicon process and environmental conditions. We propose a novel method to auto calibrate a programmable receiver equalization circuit to a target gain equalization value without the use of any external test equipment or channel. This method is built upon on-chip eye monitoring and internal loop back capabilities, which are used to measure the gain equalization value. By executing this on-chip gain equalization measurement for various equalizer settings, the setting which produces equalization that is closest to the target value can be determined. This has been implemented in 45nm CMOS for a PCI Express 2.0 transceiver hardware running at 5Gbps. Lab results with test silicon demonstrate the on-chip eye height measurement capabilities.
接收器均衡电路应用的均衡幅度因硅工艺和环境条件而异。我们提出了一种新的方法来自动校准可编程接收机均衡电路到目标增益均衡值,而不使用任何外部测试设备或通道。该方法建立在片上眼监控和内部回环功能的基础上,用于测量增益均衡值。通过对各种均衡器设置执行片上增益均衡测量,可以确定产生最接近目标值的均衡的设置。这已经在45nm CMOS中实现,用于运行5Gbps的PCI Express 2.0收发器硬件。使用测试硅的实验室结果证明了片上眼高度测量能力。
{"title":"Self-Calibrating Equalizer for Optimal Jitter Performance Using On-chip Eye Monitoring","authors":"Srinivasaraman Chandrasekaran, K. Desai, A. Sendhil, William Ng","doi":"10.1109/VLSID.2011.38","DOIUrl":"https://doi.org/10.1109/VLSID.2011.38","url":null,"abstract":"Magnitude of equalization applied by a receiver equalization circuit varies across silicon process and environmental conditions. We propose a novel method to auto calibrate a programmable receiver equalization circuit to a target gain equalization value without the use of any external test equipment or channel. This method is built upon on-chip eye monitoring and internal loop back capabilities, which are used to measure the gain equalization value. By executing this on-chip gain equalization measurement for various equalizer settings, the setting which produces equalization that is closest to the target value can be determined. This has been implemented in 45nm CMOS for a PCI Express 2.0 transceiver hardware running at 5Gbps. Lab results with test silicon demonstrate the on-chip eye height measurement capabilities.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133374288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
LA-LRU: A Latency-Aware Replacement Policy for Variation Tolerant Caches LA-LRU:一种延迟感知的容差缓存替换策略
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.24
Aarul Jain, Aviral Shrivastava, C. Chakrabarti
Parameter variations in deep sub-micron integrated circuits cause chip characteristics to deviate during semiconductor fabrication process. These variations are dominant in memory systems such as caches and the delay spread due to process variation impacts the performance of a cache based system significantly. In this paper, we propose two schemes to reduce the performance impact of variations in caches: i) Latency-Aware Least Recently Used (LA-LRU) replacement policy which ensures that cache blocks that are affected by process variation are accessed less frequently, and ii) Block Rearrangement scheme that distributes cache blocks with high latencies to all sets uniformly. We implemented our schemes on the Wattch Simple Scalar toolset for Xscale, PowerPC and Alpha21264-like processor configurations. Our experiments on SPEC 2000 benchmarks show that our scheme improves the average memory access time of caches by 11% to 22%, almost eliminating any performance degradation due to variations. We also synthesized the LA-LRU logic, to find out that we can obtain this benefit at negligible increase in the power consumption of the cache.
深亚微米集成电路在半导体制造过程中参数的变化会导致芯片特性的偏离。这些变化在诸如缓存之类的内存系统中占主导地位,并且由于进程变化而导致的延迟扩展会显著影响基于缓存的系统的性能。在本文中,我们提出了两种方案来减少缓存变化对性能的影响:i)延迟感知的最近最少使用(LA-LRU)替换策略,该策略确保受进程变化影响的缓存块被较少访问;ii)块重排方案,该方案将具有高延迟的缓存块统一分配到所有集合。我们在watch Simple Scalar工具集上实现了我们的方案,该工具集适用于Xscale、PowerPC和类似alpha21264的处理器配置。我们在SPEC 2000基准测试上的实验表明,我们的方案将缓存的平均内存访问时间提高了11%到22%,几乎消除了由于变化而导致的任何性能下降。我们还合成了LA-LRU逻辑,以发现我们可以在缓存功耗增加可以忽略不计的情况下获得这种好处。
{"title":"LA-LRU: A Latency-Aware Replacement Policy for Variation Tolerant Caches","authors":"Aarul Jain, Aviral Shrivastava, C. Chakrabarti","doi":"10.1109/VLSID.2011.24","DOIUrl":"https://doi.org/10.1109/VLSID.2011.24","url":null,"abstract":"Parameter variations in deep sub-micron integrated circuits cause chip characteristics to deviate during semiconductor fabrication process. These variations are dominant in memory systems such as caches and the delay spread due to process variation impacts the performance of a cache based system significantly. In this paper, we propose two schemes to reduce the performance impact of variations in caches: i) Latency-Aware Least Recently Used (LA-LRU) replacement policy which ensures that cache blocks that are affected by process variation are accessed less frequently, and ii) Block Rearrangement scheme that distributes cache blocks with high latencies to all sets uniformly. We implemented our schemes on the Wattch Simple Scalar toolset for Xscale, PowerPC and Alpha21264-like processor configurations. Our experiments on SPEC 2000 benchmarks show that our scheme improves the average memory access time of caches by 11% to 22%, almost eliminating any performance degradation due to variations. We also synthesized the LA-LRU logic, to find out that we can obtain this benefit at negligible increase in the power consumption of the cache.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133656032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Path Delay Tuning for Performance Gain in the Face of Random Manufacturing Variations 面向随机制造变化的性能增益路径延迟调谐
Pub Date : 2011-01-02 DOI: 10.1109/VLSID.2011.35
Kautalya Mishra, Ahmed Faraz, A. Singh
One of the factors now beginning to seriously limit clock rates in large synchronous designs is manufacturing variations in device parameters. Moreover, such random process variations are increasing significantly with device scaling as technology approaches the end of the silicon roadmap. In a large design containing several millions of transistors, virtually every manufactured part will have a few hundreds of transistors that are significant performance outliers. Any one such device in a critical path can greatly limit the highest clock rate that can be achieved by the chip. In this paper we propose and analyze a new design approach that allows for the post manufacture tuning and speed-up of exceptionally slow circuit paths to recover much of the performance lost due to such outlier devices. We show that such tuning of exceptionally slow paths can result in a significant increase in the average clock speed attainable by the manufactured parts. We also show this method to be defect tolerant, implying an additional benefit of increasing the semiconductor yield.
现在在大型同步设计中开始严重限制时钟速率的因素之一是设备参数的制造变化。此外,随着技术接近硅路线图的终点,这种随机工艺变化随着器件的扩展而显著增加。在包含数百万个晶体管的大型设计中,实际上每个制造的部件都会有几百个晶体管,这些晶体管是显著的性能异常值。在关键路径上的任何一个这样的器件都可以极大地限制芯片所能达到的最高时钟速率。在本文中,我们提出并分析了一种新的设计方法,该方法允许对异常慢的电路路径进行后期调谐和加速,以恢复由于此类异常器件造成的大部分性能损失。我们表明,这种异常缓慢路径的调谐可以导致制造部件可达到的平均时钟速度的显着增加。我们还证明了这种方法具有缺陷容忍度,这意味着增加半导体产量的额外好处。
{"title":"Path Delay Tuning for Performance Gain in the Face of Random Manufacturing Variations","authors":"Kautalya Mishra, Ahmed Faraz, A. Singh","doi":"10.1109/VLSID.2011.35","DOIUrl":"https://doi.org/10.1109/VLSID.2011.35","url":null,"abstract":"One of the factors now beginning to seriously limit clock rates in large synchronous designs is manufacturing variations in device parameters. Moreover, such random process variations are increasing significantly with device scaling as technology approaches the end of the silicon roadmap. In a large design containing several millions of transistors, virtually every manufactured part will have a few hundreds of transistors that are significant performance outliers. Any one such device in a critical path can greatly limit the highest clock rate that can be achieved by the chip. In this paper we propose and analyze a new design approach that allows for the post manufacture tuning and speed-up of exceptionally slow circuit paths to recover much of the performance lost due to such outlier devices. We show that such tuning of exceptionally slow paths can result in a significant increase in the average clock speed attainable by the manufactured parts. We also show this method to be defect tolerant, implying an additional benefit of increasing the semiconductor yield.","PeriodicalId":371062,"journal":{"name":"2011 24th Internatioal Conference on VLSI Design","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122372139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2011 24th Internatioal Conference on VLSI Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1