首页 > 最新文献

2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)最新文献

英文 中文
Fully-integrated switched-capacitor voltage regulator with on-chip current-sensing and workload optimization in 32nm SOI CMOS 完全集成的开关电容电压调节器,具有片上电流传感和32nm SOI CMOS工作负载优化
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273504
Xiaoyang Mi, Debashis Mandal, V. Sathe, B. Bakkaloglu, Jae-sun Seo
Efficient, stable, and fast power delivery against fluctuating workloads have become a critical concern for applications from battery-powered devices to high-performance servers. With high density on-chip capacitors, fully-integrated switched-capacitor (SC) voltage converters provide high efficiency down-conversion from a battery or off-chip voltage regulation modules. However, maintaining such efficiency with minimal supply noise across a wide range of fluctuating load currents remains challenging. In this paper, we propose an on-chip current sensing technique to dynamically modulate both switching frequency and switch widths of SC voltage converters, enhancing fast transient response and higher efficiency across a wide range of load currents. In conjunction with SC converters, we employ a low-dropout regulator (LDO) driven by a push-pull operational transconductance amplifier (OTA), whose current is mirrored and sensed with minimal power and efficiency overhead. The sensed load current directly controls the frequency and width of SC converters through a voltage-controlled oscillator (VCO) and a time-to-digital converter, respectively. In 32nm SOI CMOS, the proposed voltage regulator maintains 77-82% efficiency at 0.95V output voltage with less than 20mV steady-state ripple across 10X load current range of 100mA-1A and 33mV droop voltage for a 80mA/ns load transition, while providing a projected current density of 6W/mm2.
针对波动的工作负载,高效、稳定和快速的供电已经成为从电池供电设备到高性能服务器等应用程序的关键问题。采用高密度片上电容器,完全集成的开关电容器(SC)电压变换器从电池或片外电压调节模块提供高效率的下转换。然而,在大范围的波动负载电流下保持这样的效率和最小的电源噪声仍然是一个挑战。在本文中,我们提出了一种片上电流传感技术来动态调制SC电压变换器的开关频率和开关宽度,从而在大范围的负载电流范围内增强快速瞬态响应和更高的效率。与SC转换器一起,我们采用了由推挽式跨导放大器(OTA)驱动的低差稳压器(LDO),其电流以最小的功率和效率开销进行镜像和检测。检测到的负载电流分别通过压控振荡器(VCO)和时间数字转换器直接控制SC变换器的频率和宽度。在32nm SOI CMOS中,该稳压器在0.95V输出电压下保持77-82%的效率,在100mA-1A的10倍负载电流范围内保持小于20mV的稳态纹波,在80mA/ns负载转换时提供33mV的下垂电压,同时提供6W/mm2的投射电流密度。
{"title":"Fully-integrated switched-capacitor voltage regulator with on-chip current-sensing and workload optimization in 32nm SOI CMOS","authors":"Xiaoyang Mi, Debashis Mandal, V. Sathe, B. Bakkaloglu, Jae-sun Seo","doi":"10.1109/ISLPED.2015.7273504","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273504","url":null,"abstract":"Efficient, stable, and fast power delivery against fluctuating workloads have become a critical concern for applications from battery-powered devices to high-performance servers. With high density on-chip capacitors, fully-integrated switched-capacitor (SC) voltage converters provide high efficiency down-conversion from a battery or off-chip voltage regulation modules. However, maintaining such efficiency with minimal supply noise across a wide range of fluctuating load currents remains challenging. In this paper, we propose an on-chip current sensing technique to dynamically modulate both switching frequency and switch widths of SC voltage converters, enhancing fast transient response and higher efficiency across a wide range of load currents. In conjunction with SC converters, we employ a low-dropout regulator (LDO) driven by a push-pull operational transconductance amplifier (OTA), whose current is mirrored and sensed with minimal power and efficiency overhead. The sensed load current directly controls the frequency and width of SC converters through a voltage-controlled oscillator (VCO) and a time-to-digital converter, respectively. In 32nm SOI CMOS, the proposed voltage regulator maintains 77-82% efficiency at 0.95V output voltage with less than 20mV steady-state ripple across 10X load current range of 100mA-1A and 33mV droop voltage for a 80mA/ns load transition, while providing a projected current density of 6W/mm2.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116625737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A novel slope detection technique for robust STTRAM sensing 一种鲁棒stream传感斜坡检测新技术
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273482
Seyedhamidreza Motaman, Swaroop Ghosh, J. Kulkarni
Spin-Torque-Transfer RAM (STTRAM) is a promising technology for high density on-chip cache due to low standby power and high speed. However, the process variation of magnetic tunnel junction (MTJ) and access transistor poses serious challenge to sensing. Nondestructive sensing suffers from reference resistance variation whereas destructive sensing suffers from failures due to unoptimized selection of data and reference currents. We propose a novel slope detection technique to exploit MTJ resistance switching from high to low state using low-overhead sample-and-hold circuit. The proposed sensing technique is destructive in nature and can be combined with double sampling for improved robustness. Simulation results reveal <;0.12% failure under process variation using single sampling (at 0.2% area overhead) and <;0.08% failures with double sampling (at 0.6% area overhead). The overall sense time is found to be 6.8ns.
由于低待机功率和高速度,自旋转矩传输RAM (strtram)是一种很有前途的高密度片上高速缓存技术。然而,磁隧道结(MTJ)和接入晶体管的工艺变化给传感带来了严峻的挑战。无损检测受参考电阻变化的影响,而破坏性检测因数据和参考电流选择不优化而失败。我们提出了一种利用低开销采样保持电路实现MTJ电阻从高状态切换到低状态的斜率检测技术。所提出的传感技术本质上是破坏性的,并且可以与双重采样相结合以提高鲁棒性。模拟结果显示,在工艺变化下,使用单次采样(面积开销为0.2%)的失败率< 0.12%,而使用两次采样(面积开销为0.6%)的失败率< 0.08%。整体感知时间为6.8ns。
{"title":"A novel slope detection technique for robust STTRAM sensing","authors":"Seyedhamidreza Motaman, Swaroop Ghosh, J. Kulkarni","doi":"10.1109/ISLPED.2015.7273482","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273482","url":null,"abstract":"Spin-Torque-Transfer RAM (STTRAM) is a promising technology for high density on-chip cache due to low standby power and high speed. However, the process variation of magnetic tunnel junction (MTJ) and access transistor poses serious challenge to sensing. Nondestructive sensing suffers from reference resistance variation whereas destructive sensing suffers from failures due to unoptimized selection of data and reference currents. We propose a novel slope detection technique to exploit MTJ resistance switching from high to low state using low-overhead sample-and-hold circuit. The proposed sensing technique is destructive in nature and can be combined with double sampling for improved robustness. Simulation results reveal <;0.12% failure under process variation using single sampling (at 0.2% area overhead) and <;0.08% failures with double sampling (at 0.6% area overhead). The overall sense time is found to be 6.8ns.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114043123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A micropower energy harvesting circuit with piezoelectric transformer-based ultra-low voltage start-up 一种基于压电变压器的超低电压启动微功率能量收集电路
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273527
A. Romani, Antonio Camarda, A. Baldazzi, M. Tartagni
This paper introduces the use of piezoelectric transformers (PTs) as key elements for ultra-low voltage start-up circuits for battery-less energy harvesting applications. Firstly, a step-up oscillator topology based on a PT and a JFET is presented. The circuit is able to start from voltages as low as 16 mV, and to boost the output voltage up to 1.32 V in a no load condition. In order to validate the proposed approach, a surrounding power management and conversion circuit is developed. This circuit is able to automatically enable a boost DC/DC converter once the start-up circuit has generated a sufficient voltage. The whole circuit self-starts with an input voltage of 30 mV, and the maximum conversion efficiency referred to the maximum power point (MPP) is higher than 40%, with an intrinsic current consumption as low as 1.3 μA.
本文介绍了压电变压器(PTs)作为无电池能量收集应用的超低电压启动电路的关键元件。首先提出了一种基于PT和JFET的升压振荡器拓扑结构。该电路能够从低至16 mV的电压启动,并在空载条件下将输出电压提升至1.32 V。为了验证所提出的方法,开发了一个周边的电源管理和转换电路。一旦启动电路产生足够的电压,该电路能够自动启用升压DC/DC转换器。整个电路在输入电压为30 mV时自动启动,以最大功率点(MPP)为基准的最大转换效率高于40%,固有电流消耗低至1.3 μA。
{"title":"A micropower energy harvesting circuit with piezoelectric transformer-based ultra-low voltage start-up","authors":"A. Romani, Antonio Camarda, A. Baldazzi, M. Tartagni","doi":"10.1109/ISLPED.2015.7273527","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273527","url":null,"abstract":"This paper introduces the use of piezoelectric transformers (PTs) as key elements for ultra-low voltage start-up circuits for battery-less energy harvesting applications. Firstly, a step-up oscillator topology based on a PT and a JFET is presented. The circuit is able to start from voltages as low as 16 mV, and to boost the output voltage up to 1.32 V in a no load condition. In order to validate the proposed approach, a surrounding power management and conversion circuit is developed. This circuit is able to automatically enable a boost DC/DC converter once the start-up circuit has generated a sufficient voltage. The whole circuit self-starts with an input voltage of 30 mV, and the maximum conversion efficiency referred to the maximum power point (MPP) is higher than 40%, with an intrinsic current consumption as low as 1.3 μA.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117304305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Power benefit study of monolithic 3D IC at the 7nm technology node 7nm工艺节点单片3D集成电路的功耗效益研究
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273514
Kyungwook Chang, Kartik Acharya, S. Sinha, B. Cline, G. Yeric, S. Lim
Monolithic 3D IC (M3D) is one potential technology to help with the challenges of continued circuit power and performance scaling. In this paper, for the first time, the power benefits of monolithic 3D IC (M3D) using a 7nm FinFET technology are investigated. The predictive 7nm Process Design Kit (PDK) and standard cell library for both high performance (HP) and low standby power (LSTP) device technologies are built based on NanGate 45nm PDK using accurate dimensional, material, and electrical parameters from publications and a commercial-grade tool flow. In addition, we implement full-chip M3D GDS layouts using both 7nm HP and LSTP cells and industry-standard physical design tools, and evaluate the resulting full-chip power, performance, and area metrics. Our study first shows that 7nm HP M3D designs outperform 7nm HP 2D designs by 16.8% in terms of iso-performance total power reduction. Moreover, 7nm LSTP M3D designs reduce the total power consumption by 14.3% compared to their 2D counterparts. This convincingly demonstrates the power benefits of M3D technologies in both high performance as well as low power future generation devices.
单片3D集成电路(M3D)是一种潜在的技术,可以帮助解决持续电路功率和性能扩展的挑战。本文首次对采用7nm FinFET技术的单片3D集成电路(M3D)的功耗效益进行了研究。用于高性能(HP)和低待机功率(LSTP)器件技术的预测性7nm工艺设计套件(PDK)和标准单元库是基于NanGate 45nm PDK构建的,使用出版物和商业级工具流程中的精确尺寸,材料和电气参数。此外,我们还使用7nm HP和LSTP单元以及行业标准物理设计工具实现了全芯片M3D GDS布局,并评估了由此产生的全芯片功耗、性能和面积指标。我们的研究首先表明,在同等性能的总功耗降低方面,7nm HP M3D设计比7nm HP 2D设计高出16.8%。此外,7nm LSTP M3D设计与2D设计相比,总功耗降低了14.3%。这令人信服地证明了M3D技术在高性能和低功耗未来一代设备中的功率优势。
{"title":"Power benefit study of monolithic 3D IC at the 7nm technology node","authors":"Kyungwook Chang, Kartik Acharya, S. Sinha, B. Cline, G. Yeric, S. Lim","doi":"10.1109/ISLPED.2015.7273514","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273514","url":null,"abstract":"Monolithic 3D IC (M3D) is one potential technology to help with the challenges of continued circuit power and performance scaling. In this paper, for the first time, the power benefits of monolithic 3D IC (M3D) using a 7nm FinFET technology are investigated. The predictive 7nm Process Design Kit (PDK) and standard cell library for both high performance (HP) and low standby power (LSTP) device technologies are built based on NanGate 45nm PDK using accurate dimensional, material, and electrical parameters from publications and a commercial-grade tool flow. In addition, we implement full-chip M3D GDS layouts using both 7nm HP and LSTP cells and industry-standard physical design tools, and evaluate the resulting full-chip power, performance, and area metrics. Our study first shows that 7nm HP M3D designs outperform 7nm HP 2D designs by 16.8% in terms of iso-performance total power reduction. Moreover, 7nm LSTP M3D designs reduce the total power consumption by 14.3% compared to their 2D counterparts. This convincingly demonstrates the power benefits of M3D technologies in both high performance as well as low power future generation devices.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126584589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
A neuromorphic neural spike clustering processor for deep-brain sensing and stimulation systems 一种用于深层脑传感和刺激系统的神经形态神经尖峰聚类处理器
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273496
Beinuo Zhang, Zhewei Jiang, Qi Wang, Jae-sun Seo, Mingoo Seok
This paper presents algorithm and digital hardware design, inspired by biological spiking neural networks, to perform unsupervised, online spike-clustering with high accuracy and low-power consumption in the context of deep-brain sensing and stimulation systems. The proposed hardware contains 1220 digital neurons and 4.86k latch-based synapses, and achieves the average sorting accuracy of 91% whereas the conventional hardware based on the Osort algorithm achieves 69% for the same datasets. Implemented in a 65nm high-Vth, the processor exhibits a footprint of 0.25mm2/ch. and a power consumption of 9.3μW/ch. at VDD of 0.3V.
本文提出了一种受生物峰值神经网络启发的算法和数字硬件设计,用于在深度脑传感和刺激系统中进行高精度、低功耗的无监督在线峰值聚类。该硬件包含1220个数字神经元和4.86k个闩锁突触,在相同的数据集上,基于Osort算法的传统硬件的平均排序准确率为91%,而基于Osort算法的传统硬件的平均排序准确率为69%。该处理器采用65nm高vth工艺,占地面积为0.25mm2/ch。功耗为9.3μW/ch。VDD为0.3V时。
{"title":"A neuromorphic neural spike clustering processor for deep-brain sensing and stimulation systems","authors":"Beinuo Zhang, Zhewei Jiang, Qi Wang, Jae-sun Seo, Mingoo Seok","doi":"10.1109/ISLPED.2015.7273496","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273496","url":null,"abstract":"This paper presents algorithm and digital hardware design, inspired by biological spiking neural networks, to perform unsupervised, online spike-clustering with high accuracy and low-power consumption in the context of deep-brain sensing and stimulation systems. The proposed hardware contains 1220 digital neurons and 4.86k latch-based synapses, and achieves the average sorting accuracy of 91% whereas the conventional hardware based on the Osort algorithm achieves 69% for the same datasets. Implemented in a 65nm high-Vth, the processor exhibits a footprint of 0.25mm2/ch. and a power consumption of 9.3μW/ch. at VDD of 0.3V.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127930709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Fine-grained write scheduling for PCM performance improvement under write power budget 在写功率预算下,用于改进PCM性能的细粒度写调度
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273484
Chun-Hao Lai, Shun-Chih Yu, Chia-Lin Yang, Hsiang-Pang Li
Phase-change memory (PCM) has gained much attention recently since it offers several advantages over DRAM, such as high cell density and low leakage power. PCM has similar read power and latency as DRAM; however, its write power and latency are significantly higher than DRAM. Therefore, one challenge with PCM is how to increase write throughput under write power budget constraints. To increase write concurrency, PCM often adopts division programming, where a write occurs in a series of divisions, so that writes to different banks proceed concurrently. In this study, we observe that since the write scheduling granularity in the memory controller differs from the actual write granularity in PCM chips, i.e., requests vs. divisions, the available power budget cannot be fully utilized. We therefore propose enhancing the interface between the memory controller and PCM chips to allow the memory controller to schedule writes in the division granularity. To further increase power budget utilization, we design a variable-length division mechanism to allow the division granularity to be adjusted at runtime according to the available write power budget. Our experimental results show that these techniques improve system performance by up to 65%.
相变存储器(PCM)由于具有高单元密度和低泄漏功率等优点,近年来受到了广泛的关注。PCM具有与DRAM相似的读取功率和延迟;但是,它的写功率和延迟明显高于DRAM。因此,PCM的一个挑战是如何在写入功率预算限制下提高写入吞吐量。为了增加写并发性,PCM通常采用除法编程,其中写操作发生在一系列除法中,因此对不同银行的写操作可以并发进行。在本研究中,我们观察到,由于内存控制器中的写入调度粒度与PCM芯片中的实际写入粒度不同,即请求与分割,因此无法充分利用可用的功率预算。因此,我们建议增强存储器控制器和PCM芯片之间的接口,以允许存储器控制器在除法粒度中调度写入。为了进一步提高功耗预算利用率,我们设计了一种可变长度分割机制,允许在运行时根据可用的写功耗预算调整分割粒度。我们的实验结果表明,这些技术提高了系统性能高达65%。
{"title":"Fine-grained write scheduling for PCM performance improvement under write power budget","authors":"Chun-Hao Lai, Shun-Chih Yu, Chia-Lin Yang, Hsiang-Pang Li","doi":"10.1109/ISLPED.2015.7273484","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273484","url":null,"abstract":"Phase-change memory (PCM) has gained much attention recently since it offers several advantages over DRAM, such as high cell density and low leakage power. PCM has similar read power and latency as DRAM; however, its write power and latency are significantly higher than DRAM. Therefore, one challenge with PCM is how to increase write throughput under write power budget constraints. To increase write concurrency, PCM often adopts division programming, where a write occurs in a series of divisions, so that writes to different banks proceed concurrently. In this study, we observe that since the write scheduling granularity in the memory controller differs from the actual write granularity in PCM chips, i.e., requests vs. divisions, the available power budget cannot be fully utilized. We therefore propose enhancing the interface between the memory controller and PCM chips to allow the memory controller to schedule writes in the division granularity. To further increase power budget utilization, we design a variable-length division mechanism to allow the division granularity to be adjusted at runtime according to the available write power budget. Our experimental results show that these techniques improve system performance by up to 65%.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125676182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Lucid infrared thermography of thermally-constrained processors 热约束处理器的清晰红外热成像
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273538
H. Amrouch, J. Henkel
Thermal analysis is a prerequisite for developing reliability increasing techniques for thermally-constrained processors, i.e. processors with a high power density. For that purpose, infrared (IR) camera measurement setups have been deployed with the purpose to provide direct feedback of the impact that thermal mitigation techniques have. To obtain lucid IR images1, the IR-opaque cooling must be removed and hence, an alternative IR-transparent cooling needs to be provided to protect the chip. To this end, the majority of state-of-the-art employs an IR coolant liquid to prevent the chip from overheating. The problem is that several aspects like thermal convection may interfere with the measured IR radiations resulting in equivocal IR images. Thus, they decrease the accuracy in a way that leads to incorrectly estimating reliability. Solving this prominent problem, we introduce an IR-transparent cooling that cools the chip from its rear side allowing the camera to perspicuously capture the IR emissions as no additional layer in between impedes the radiation. It maintains the on-chip temperatures within a safe range equivalent to the original heat sink-based cooling. We demonstrate how state-of-the-art inaccurate thermal analysis results in incorrectly estimating reliability. Our setup is the most accurate, least intrusive one that has been both proposed and actually applied to state-of-the-art multi-cores (Intel 45nm dual-core and 22nm octa-core).
热分析是开发热约束处理器(即具有高功率密度的处理器)可靠性提高技术的先决条件。为此目的,部署了红外(IR)相机测量装置,目的是提供热缓解技术影响的直接反馈。为了获得清晰的红外图像1,必须去除红外不透明冷却,因此,需要提供另一种红外透明冷却来保护芯片。为此,大多数最先进的技术采用红外冷却液来防止芯片过热。问题是,热对流等几个方面可能会干扰测量的红外辐射,导致模糊的红外图像。因此,它们以一种导致不正确估计可靠性的方式降低了准确性。为了解决这个突出的问题,我们引入了一种红外透明冷却技术,从芯片的背面冷却,使相机能够清晰地捕捉到红外辐射,因为两者之间没有额外的层阻碍辐射。它将芯片上的温度保持在一个安全的范围内,相当于原来的基于散热器的冷却。我们展示了最先进的不准确的热分析如何导致不正确地估计可靠性。我们的设置是最精确的,最小干扰的,已经提出并实际应用于最先进的多核(英特尔45纳米双核和22纳米八核)。
{"title":"Lucid infrared thermography of thermally-constrained processors","authors":"H. Amrouch, J. Henkel","doi":"10.1109/ISLPED.2015.7273538","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273538","url":null,"abstract":"Thermal analysis is a prerequisite for developing reliability increasing techniques for thermally-constrained processors, i.e. processors with a high power density. For that purpose, infrared (IR) camera measurement setups have been deployed with the purpose to provide direct feedback of the impact that thermal mitigation techniques have. To obtain lucid IR images1, the IR-opaque cooling must be removed and hence, an alternative IR-transparent cooling needs to be provided to protect the chip. To this end, the majority of state-of-the-art employs an IR coolant liquid to prevent the chip from overheating. The problem is that several aspects like thermal convection may interfere with the measured IR radiations resulting in equivocal IR images. Thus, they decrease the accuracy in a way that leads to incorrectly estimating reliability. Solving this prominent problem, we introduce an IR-transparent cooling that cools the chip from its rear side allowing the camera to perspicuously capture the IR emissions as no additional layer in between impedes the radiation. It maintains the on-chip temperatures within a safe range equivalent to the original heat sink-based cooling. We demonstrate how state-of-the-art inaccurate thermal analysis results in incorrectly estimating reliability. Our setup is the most accurate, least intrusive one that has been both proposed and actually applied to state-of-the-art multi-cores (Intel 45nm dual-core and 22nm octa-core).","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131064897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
ThermTap: An online power analyzer and thermal simulator for Android devices ThermTap:用于Android设备的在线功率分析仪和热模拟器
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273537
M. Dousti, Majid Ghasemi-Gol, M. Nazemi, M. Pedram
This paper introduces ThermTap, which enables system and software developers to monitor the power consumption and temperature of various hardware components in an Android device as a function of running applications and processes. ThermTap comprises of a power analyzer, called PowerTap, and an online thermal simulator, called Therminator 2. With accurate power macro-models, PowerTap collates activity profiles of major components of a portable device from the OS kernel device drivers in an event-driven manner to generate power traces. In turn, Therminator 2 reads these traces and, using a compact thermal model of the device, generates various temperature maps including those for the device components and device skin. Fast thermal simulation techniques enable Therminator 2 to be executed in realtime. With precise per-process and per-application temperature maps that ThermTap produces, it enables software and system developers to find thermal bugs in their software. A case study is presented on identifying a thermal bug in the software running on an Android device.
本文介绍了ThermTap,它使系统和软件开发人员能够监控Android设备中各种硬件组件的功耗和温度,作为运行应用程序和进程的功能。ThermTap由一个名为PowerTap的功率分析仪和一个名为Therminator 2的在线热模拟器组成。通过精确的功率宏模型,PowerTap以事件驱动的方式从操作系统内核设备驱动程序中整理便携式设备主要组件的活动配置文件,以生成功率跟踪。反过来,Therminator 2读取这些迹线,并使用设备的紧凑热模型,生成各种温度图,包括设备组件和设备皮肤的温度图。快速热模拟技术使Therminator 2能够实时执行。通过ThermTap生成的精确的每个进程和每个应用程序的温度图,它使软件和系统开发人员能够发现他们软件中的热缺陷。本文介绍了如何识别在Android设备上运行的软件中的热bug的案例研究。
{"title":"ThermTap: An online power analyzer and thermal simulator for Android devices","authors":"M. Dousti, Majid Ghasemi-Gol, M. Nazemi, M. Pedram","doi":"10.1109/ISLPED.2015.7273537","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273537","url":null,"abstract":"This paper introduces ThermTap, which enables system and software developers to monitor the power consumption and temperature of various hardware components in an Android device as a function of running applications and processes. ThermTap comprises of a power analyzer, called PowerTap, and an online thermal simulator, called Therminator 2. With accurate power macro-models, PowerTap collates activity profiles of major components of a portable device from the OS kernel device drivers in an event-driven manner to generate power traces. In turn, Therminator 2 reads these traces and, using a compact thermal model of the device, generates various temperature maps including those for the device components and device skin. Fast thermal simulation techniques enable Therminator 2 to be executed in realtime. With precise per-process and per-application temperature maps that ThermTap produces, it enables software and system developers to find thermal bugs in their software. A case study is presented on identifying a thermal bug in the software running on an Android device.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130564441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Analysis and optimization of CMOS switched-capacitor converters CMOS开关电容变换器的分析与优化
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273535
V. Sathe, Jae-sun Seo
Energy-efficiency continues to limit peak computational performance in digital systems. To drive continued energy-improvements, designers of modern digital systems are relying on multiple, smaller voltage domains for enhanced voltage-scaling. Switched-capacitor (SC) voltage converters are a promising alternative to traditional switched-inductor regulators due to their suitability for efficient, fully-integrated regulation of finer voltage domains. However, several important problems regarding the analysis and optimization of SC converter design remain unaddressed. This paper develops a comprehensive analysis of SC converter output resistance to establish the optimal switching frequency and switch resistance for maximum converter efficiency. The proposed analysis is validated through simulation experiments conducted using an industrial 65nm CMOS technology.
能源效率继续限制数字系统的峰值计算性能。为了推动持续的能源改进,现代数字系统的设计者依靠多个更小的电压域来增强电压缩放。开关电容(SC)电压变换器是传统开关电感调节器的一个很有前途的替代品,因为它们适合于更精细的电压域的高效、完全集成的调节。然而,关于SC变换器设计的分析和优化的几个重要问题仍然没有得到解决。本文对SC变换器的输出电阻进行了全面的分析,以确定最大变换器效率的最佳开关频率和开关电阻。通过采用工业65纳米CMOS技术进行的仿真实验验证了所提出的分析。
{"title":"Analysis and optimization of CMOS switched-capacitor converters","authors":"V. Sathe, Jae-sun Seo","doi":"10.1109/ISLPED.2015.7273535","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273535","url":null,"abstract":"Energy-efficiency continues to limit peak computational performance in digital systems. To drive continued energy-improvements, designers of modern digital systems are relying on multiple, smaller voltage domains for enhanced voltage-scaling. Switched-capacitor (SC) voltage converters are a promising alternative to traditional switched-inductor regulators due to their suitability for efficient, fully-integrated regulation of finer voltage domains. However, several important problems regarding the analysis and optimization of SC converter design remain unaddressed. This paper develops a comprehensive analysis of SC converter output resistance to establish the optimal switching frequency and switch resistance for maximum converter efficiency. The proposed analysis is validated through simulation experiments conducted using an industrial 65nm CMOS technology.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129993577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Collaborative gate implementation selection and adaptivity assignment for robust combinational circuits 鲁棒组合电路的协同门实现选择与自适应分配
Pub Date : 2015-07-22 DOI: 10.1109/ISLPED.2015.7273501
Hao He, Jiafan Wang, Jiang Hu
Adaptive design is a power-efficient approach to variation resilience in VLSI circuits. However, its implementation, especially that of fine-grained adaptivity, can easily result in large overhead. Although numerous previous works have demonstrated the effectiveness of adaptive design, very few works have emphasized its overhead control. In order to make adaptive design a truly practical approach, we develop a method that systematically optimizes adaptivity assignment with consideration of overhead reduction. At the same time, a variability aware gate implementation selection technique is investigated and applied in conjunction with the adaptivity assignment. Experimental results on benchmark circuits indicate that our approach can greatly decrease adaptivity overhead while satisfy performance and robustness constraints.
在VLSI电路中,自适应设计是一种低功耗的抗变弹性方法。然而,它的实现,特别是细粒度自适应的实现,很容易导致巨大的开销。虽然已有大量的研究证明了自适应设计的有效性,但很少有研究强调其开销控制。为了使自适应设计成为一种真正实用的方法,我们开发了一种考虑降低开销的系统优化自适应分配的方法。同时,研究了一种可变性感知门实现选择技术,并将其与自适应赋值相结合。在基准电路上的实验结果表明,该方法在满足性能和鲁棒性约束的同时,大大降低了自适应开销。
{"title":"Collaborative gate implementation selection and adaptivity assignment for robust combinational circuits","authors":"Hao He, Jiafan Wang, Jiang Hu","doi":"10.1109/ISLPED.2015.7273501","DOIUrl":"https://doi.org/10.1109/ISLPED.2015.7273501","url":null,"abstract":"Adaptive design is a power-efficient approach to variation resilience in VLSI circuits. However, its implementation, especially that of fine-grained adaptivity, can easily result in large overhead. Although numerous previous works have demonstrated the effectiveness of adaptive design, very few works have emphasized its overhead control. In order to make adaptive design a truly practical approach, we develop a method that systematically optimizes adaptivity assignment with consideration of overhead reduction. At the same time, a variability aware gate implementation selection technique is investigated and applied in conjunction with the adaptivity assignment. Experimental results on benchmark circuits indicate that our approach can greatly decrease adaptivity overhead while satisfy performance and robustness constraints.","PeriodicalId":421236,"journal":{"name":"2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117268251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1