首页 > 最新文献

ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.最新文献

英文 中文
Probabilistic dual-Vth leakage optimization under variability 变异性下概率双v值泄漏优化
A. Davoodi, Ankur Srivastava
In this paper we address the problem of growing leakage variability through effective dual-threshold voltage assignment. We propose a probabilistic dynamic programming-based method to assign dual-threshold voltages such that the overall expected leakage is minimized under a given probability of violating the timing constraint (timing yield). The key characteristics of our strategy are two pruning criteria that stochastically identify pareto-optimal solutions and prune the sub-optimal ones. Compared to other variability-driven dual-threshold voltage assignment schemes, the main advantages of our approach are 1) considering correlations due to common sources of variation, 2) providing controllable runtime, which in one of the proposed strategies is comparable to the deterministic algorithm, and 3) performing optimization based on all the signal paths simultaneously, as opposed to one path at a time. Experimental results indicate that the proposed probabilistic scheme is significantly better than a comparable deterministic dual-threshold voltage assignment, both in terms of expected leakage and the probability of violating the timing constraint
在本文中,我们通过有效的双阈值电压分配来解决日益增长的泄漏变异性问题。我们提出了一种基于概率动态规划的方法来分配双阈值电压,使得在给定的违反时序约束的概率(时序屈服)下,总体期望泄漏最小化。我们的策略的关键特征是两个修剪标准,随机识别帕累托最优解和修剪次最优解。与其他可变性驱动的双阈值电压分配方案相比,我们的方法的主要优点是1)考虑由于共同变化源引起的相关性,2)提供可控制的运行时间,这在所提出的策略之一中可与确定性算法相比较,以及3)同时基于所有信号路径执行优化,而不是一次执行一个路径。实验结果表明,在期望泄漏和违反时间约束的概率方面,所提出的概率方案明显优于同类的确定性双阈值电压分配方案
{"title":"Probabilistic dual-Vth leakage optimization under variability","authors":"A. Davoodi, Ankur Srivastava","doi":"10.1145/1077603.1077641","DOIUrl":"https://doi.org/10.1145/1077603.1077641","url":null,"abstract":"In this paper we address the problem of growing leakage variability through effective dual-threshold voltage assignment. We propose a probabilistic dynamic programming-based method to assign dual-threshold voltages such that the overall expected leakage is minimized under a given probability of violating the timing constraint (timing yield). The key characteristics of our strategy are two pruning criteria that stochastically identify pareto-optimal solutions and prune the sub-optimal ones. Compared to other variability-driven dual-threshold voltage assignment schemes, the main advantages of our approach are 1) considering correlations due to common sources of variation, 2) providing controllable runtime, which in one of the proposed strategies is comparable to the deterministic algorithm, and 3) performing optimization based on all the signal paths simultaneously, as opposed to one path at a time. Experimental results indicate that the proposed probabilistic scheme is significantly better than a comparable deterministic dual-threshold voltage assignment, both in terms of expected leakage and the probability of violating the timing constraint","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":" 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113951844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
PARE: a power-aware hardware data prefetching engine PARE:功率感知的硬件数据预取引擎
Yao Guo, M. Naser, C. A. Moritz
Aggressive hardware prefetching often significantly increases energy consumption in the memory system. Experiments show that a major fraction of prefetching energy degradation is due to the hardware history table related energy costs. In this paper, the authors presented PARE, a power-aware prefetching engine that uses a newly designed indexed hardware history table. Compared to the conventional single table design, the new prefetching table consumes 7-11X less power per access. With the help of compiler-based location-set analysis, it is shown that the proposed PARE design improves energy consumption by as much as 40% in the data memory systems in 70nm processor designs.
激进的硬件预取通常会显著增加内存系统的能耗。实验表明,预取能量退化的主要原因是硬件历史表相关的能量消耗。在本文中,作者提出了PARE,一个功率感知预取引擎,它使用新设计的索引硬件历史表。与传统的单表设计相比,新的预取表每次访问可节省7-11倍的功耗。在基于编译器的位置集分析的帮助下,结果表明,在70nm处理器设计的数据存储系统中,提出的PARE设计将能耗提高了40%。
{"title":"PARE: a power-aware hardware data prefetching engine","authors":"Yao Guo, M. Naser, C. A. Moritz","doi":"10.1145/1077603.1077686","DOIUrl":"https://doi.org/10.1145/1077603.1077686","url":null,"abstract":"Aggressive hardware prefetching often significantly increases energy consumption in the memory system. Experiments show that a major fraction of prefetching energy degradation is due to the hardware history table related energy costs. In this paper, the authors presented PARE, a power-aware prefetching engine that uses a newly designed indexed hardware history table. Compared to the conventional single table design, the new prefetching table consumes 7-11X less power per access. With the help of compiler-based location-set analysis, it is shown that the proposed PARE design improves energy consumption by as much as 40% in the data memory systems in 70nm processor designs.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114782292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Energy efficient strategies for deployment of a two-level wireless sensor network 两级无线传感器网络部署的节能策略
A. Iranli, M. Maleki, Massoud Pedram
We investigate and develop energy-efficient strategies for deployment of wireless sensor networks (WSN) for the purpose of monitoring some phenomenon of interest in a coverage region. We first describe a two-level WSN structure where the sensors in the lower level monitor their surrounding environment and the micro-servers in the top level provide connectivity between the sensors and a base station. We then formulate and solve the problem of assigning positions and initial energy levels to the micro-servers and concurrently partitioning the sensors into clusters assigned to individual micro-servers so as maximize the monitoring lifetime of the two-level WSN subject to a total energy budget. This problem, called MDEA, is solved for both collinear deployment and planar deployment situations. Our experimental results show that the design and deployment of such a two-level WSN increase the network lifetime by a factor of two or more compared to a flat WSN with the same total initial energy and quality of monitoring.
我们研究和开发了无线传感器网络(WSN)部署的节能策略,目的是监测覆盖区域内的一些感兴趣的现象。我们首先描述了一个两层的WSN结构,其中底层的传感器监控其周围环境,顶层的微服务器提供传感器与基站之间的连接。然后,我们制定并解决了为微服务器分配位置和初始能量水平的问题,并同时将传感器划分为分配给单个微服务器的集群,从而在总能量预算的情况下最大化两级WSN的监测寿命。这个问题称为MDEA,可用于共线部署和平面部署情况。我们的实验结果表明,与具有相同总初始能量和监测质量的平面WSN相比,这种两级WSN的设计和部署使网络寿命增加了两倍或更多。
{"title":"Energy efficient strategies for deployment of a two-level wireless sensor network","authors":"A. Iranli, M. Maleki, Massoud Pedram","doi":"10.1145/1077603.1077659","DOIUrl":"https://doi.org/10.1145/1077603.1077659","url":null,"abstract":"We investigate and develop energy-efficient strategies for deployment of wireless sensor networks (WSN) for the purpose of monitoring some phenomenon of interest in a coverage region. We first describe a two-level WSN structure where the sensors in the lower level monitor their surrounding environment and the micro-servers in the top level provide connectivity between the sensors and a base station. We then formulate and solve the problem of assigning positions and initial energy levels to the micro-servers and concurrently partitioning the sensors into clusters assigned to individual micro-servers so as maximize the monitoring lifetime of the two-level WSN subject to a total energy budget. This problem, called MDEA, is solved for both collinear deployment and planar deployment situations. Our experimental results show that the design and deployment of such a two-level WSN increase the network lifetime by a factor of two or more compared to a flat WSN with the same total initial energy and quality of monitoring.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"3 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132805920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 55
LAP: a logic activity packing methodology for leakage power-tolerant FPGAs LAP:一种用于容漏功率fpga的逻辑活动封装方法
Hassan Hassan, M. Anis, M. Elmasry
As FPGAs enter the nanometer regime, several modifications are needed to reduce the increasing leakage power dissipation. Hence, this work presents some modifications to the FPGAs CAD flow to mitigate leakage power dissipation through the use of multi-threshold CMOS technologies to pack and place logic blocks that exhibit similar idleness close to each other so they can be turned off during their idle time. The modifications are integrated into the VPR flow and tested on several FPGA benchmarks using a CMOS 0.13/spl mu/m dual-V/sub th/ technology, resulting in an average leakage power savings of at least 20%.
随着fpga进入纳米时代,需要对其进行一些改进以降低不断增加的泄漏功耗。因此,这项工作提出了对fpga CAD流程的一些修改,通过使用多阈值CMOS技术来封装和放置具有相似空闲的逻辑块,从而减少泄漏功耗,从而可以在空闲时间关闭它们。这些改进被集成到VPR流程中,并使用CMOS 0.13/spl mu/m双v /sub /技术在多个FPGA基准测试中进行了测试,平均泄漏功耗节省至少20%。
{"title":"LAP: a logic activity packing methodology for leakage power-tolerant FPGAs","authors":"Hassan Hassan, M. Anis, M. Elmasry","doi":"10.1145/1077603.1077664","DOIUrl":"https://doi.org/10.1145/1077603.1077664","url":null,"abstract":"As FPGAs enter the nanometer regime, several modifications are needed to reduce the increasing leakage power dissipation. Hence, this work presents some modifications to the FPGAs CAD flow to mitigate leakage power dissipation through the use of multi-threshold CMOS technologies to pack and place logic blocks that exhibit similar idleness close to each other so they can be turned off during their idle time. The modifications are integrated into the VPR flow and tested on several FPGA benchmarks using a CMOS 0.13/spl mu/m dual-V/sub th/ technology, resulting in an average leakage power savings of at least 20%.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130369360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A low power current steering digital to analog converter in 0.18 micron CMOS 一种0.18微米CMOS的低功率电流转向数模转换器
D. Mercer
This paper discusses a number of circuit techniques which address the DC and AC distortion performance of a low power current steering digital-to-analog converter design. The design provides 14 bit resolution and 200 MSPS conversion rate in a 1P4M 0.18 micron CMOS process, with optional 3.3 volt compatible devices, while operating over a wide 3.6 to 1.8 volt supply range. A power dissipation/conversion rate figure of merit of as low as 0.17 mW/MSPS was achieved for 1.8V operation and as low as 0.28 mW/MSPS at 3.3V. SFDR of 70 dB is achieved at a 50 MHz output frequency.
本文讨论了解决低功率电流转向数模转换器直流和交流失真性能的一些电路技术。该设计在1P4M 0.18微米CMOS工艺中提供14位分辨率和200 MSPS转换率,可选3.3伏兼容器件,同时在3.6至1.8伏的宽供电范围内工作。在1.8V工作时,功耗/转换率可低至0.17 mW/MSPS,在3.3V工作时可低至0.28 mW/MSPS。在50mhz输出频率下实现70db的SFDR。
{"title":"A low power current steering digital to analog converter in 0.18 micron CMOS","authors":"D. Mercer","doi":"10.1145/1077603.1077621","DOIUrl":"https://doi.org/10.1145/1077603.1077621","url":null,"abstract":"This paper discusses a number of circuit techniques which address the DC and AC distortion performance of a low power current steering digital-to-analog converter design. The design provides 14 bit resolution and 200 MSPS conversion rate in a 1P4M 0.18 micron CMOS process, with optional 3.3 volt compatible devices, while operating over a wide 3.6 to 1.8 volt supply range. A power dissipation/conversion rate figure of merit of as low as 0.17 mW/MSPS was achieved for 1.8V operation and as low as 0.28 mW/MSPS at 3.3V. SFDR of 70 dB is achieved at a 50 MHz output frequency.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131442279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Energy efficient SEU-tolerance in DVS-enabled real-time systems through information redundancy 通过信息冗余,在支持dvs的实时系统中实现高能效的seu容忍度
A. Ejlali, M. Schmitz, B. Al-Hashimi, S. Miremadi, P. Rosinger
Concerns about the reliability of real-time embedded systems that employ dynamic voltage scaling has recently been highlighted [R. Melhem, D. Mosse, E. Elnozahy (2004), Y. Zhang, K. Chakrabarty (2004) and D. Zhu, R. Melhem, D. Mosse (2004)], focusing on transient-fault-tolerance techniques based on time-redundancy. In this paper we analyze the usage of information redundancy in DVS-enabled systems with the aim of improving both the system tolerance to transient faults as well as the energy consumption. We demonstrate through a case study that it is possible to achieve both higher fault-tolerance and less energy using a combination of information and time redundancy when compared with using time redundancy alone. This even holds despite the impact of the information redundancy hardware overhead and its associated switching activities.
对采用动态电压标度的实时嵌入式系统可靠性的关注最近得到了强调[R]。Melhem, D. Mosse, E. Elnozahy (2004), Y. Zhang, K. Chakrabarty(2004)和D. Zhu, R. Melhem, D. Mosse(2004)],重点研究了基于时间冗余的瞬态容错技术。本文分析了信息冗余在支持dvs系统中的应用,目的是提高系统对暂态故障的容忍度和能量消耗。我们通过一个案例研究证明,与单独使用时间冗余相比,使用信息和时间冗余的组合可以实现更高的容错性和更少的能量。即使不考虑信息冗余硬件开销及其相关的交换活动的影响,这一点也是成立的。
{"title":"Energy efficient SEU-tolerance in DVS-enabled real-time systems through information redundancy","authors":"A. Ejlali, M. Schmitz, B. Al-Hashimi, S. Miremadi, P. Rosinger","doi":"10.1145/1077603.1077669","DOIUrl":"https://doi.org/10.1145/1077603.1077669","url":null,"abstract":"Concerns about the reliability of real-time embedded systems that employ dynamic voltage scaling has recently been highlighted [R. Melhem, D. Mosse, E. Elnozahy (2004), Y. Zhang, K. Chakrabarty (2004) and D. Zhu, R. Melhem, D. Mosse (2004)], focusing on transient-fault-tolerance techniques based on time-redundancy. In this paper we analyze the usage of information redundancy in DVS-enabled systems with the aim of improving both the system tolerance to transient faults as well as the energy consumption. We demonstrate through a case study that it is possible to achieve both higher fault-tolerance and less energy using a combination of information and time redundancy when compared with using time redundancy alone. This even holds despite the impact of the information redundancy hardware overhead and its associated switching activities.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130975809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Cost-effective low-power processor-in-memory-based reconfigurable datapath for multimedia applications 多媒体应用的低成本、低功耗、基于内存处理器的可重构数据路径
M. Lanuzza, M. Margala, P. Corsonello
Multimedia applications have become a dominant computing workload for computer systems as well as for wireless-based devices. Due to their repetitive computing and memory intensive nature, they can take effective advantage from processor-in-memory (PIM) technology. In this paper, a new low-power PIM-based 32-bit reconfigurable datapath optimized for multimedia applications is presented. The new circuit efficiently performs parallel arithmetic operations on either 8-, 16-, or 32-bit integer data or on 32-bit single precision floating-point data. As a result, high flexibility is provided at a very low hardware cost. When implemented using the UMC 0.18 /spl mu/m 1.8 V CMOS technology, the proposed datapath exhibits a 285 MHz running frequency, dissipates just 0.12 mW/MHz and occupies a silicon area of only 107,323 /spl mu/m/sub 2/. When performing 2D-DCT, proposed architecture consumes 74% less power and is 28% more power efficient compared to top-of-the-line commercial TI DSP.
多媒体应用程序已经成为计算机系统以及基于无线的设备的主要计算工作负载。由于它们的重复计算和内存密集型特性,它们可以有效地利用内存中的处理器(PIM)技术。本文提出了一种新的低功耗、基于pim的32位可重构多媒体数据路径。该电路对8位、16位或32位整数数据或32位单精度浮点数据有效地执行并行算术运算。因此,以非常低的硬件成本提供了高灵活性。当使用UMC 0.18 /spl mu/m 1.8 V CMOS技术实现时,所提出的数据路径具有285 MHz的运行频率,功耗仅为0.12 mW/MHz,并且占用的硅面积仅为107,323 /spl mu/m/sub 2/。在执行2D-DCT时,与顶级商用TI DSP相比,所提出的架构功耗降低74%,功耗效率提高28%。
{"title":"Cost-effective low-power processor-in-memory-based reconfigurable datapath for multimedia applications","authors":"M. Lanuzza, M. Margala, P. Corsonello","doi":"10.1145/1077603.1077645","DOIUrl":"https://doi.org/10.1145/1077603.1077645","url":null,"abstract":"Multimedia applications have become a dominant computing workload for computer systems as well as for wireless-based devices. Due to their repetitive computing and memory intensive nature, they can take effective advantage from processor-in-memory (PIM) technology. In this paper, a new low-power PIM-based 32-bit reconfigurable datapath optimized for multimedia applications is presented. The new circuit efficiently performs parallel arithmetic operations on either 8-, 16-, or 32-bit integer data or on 32-bit single precision floating-point data. As a result, high flexibility is provided at a very low hardware cost. When implemented using the UMC 0.18 /spl mu/m 1.8 V CMOS technology, the proposed datapath exhibits a 285 MHz running frequency, dissipates just 0.12 mW/MHz and occupies a silicon area of only 107,323 /spl mu/m/sub 2/. When performing 2D-DCT, proposed architecture consumes 74% less power and is 28% more power efficient compared to top-of-the-line commercial TI DSP.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123314732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Cascaded carry-select adder (C/sup 2/SA): a new structure for low-power CSA design 级联载波选择加法器(C/sup 2/SA):一种用于低功耗载波选择加法器设计的新结构
Yiran Chen, Hai Helen Li, K. Roy, Cheng-Kok Koh
In this paper we propose a novel low-power carry-select adder (CSA) design called cascaded CSA (C/sup 2/SA). Based on the prediction of the critical path delay of current operation, C/sup 2/SA can automatically work with one or two clock-cycle latency and a scaled supply voltage to achieve power improvement. Post-layout simulations of a 64-bit C/sup 2/SA in 180nm technology show that C/sup 2/SA can operate at a lower supply voltage, attaining 40.7% energy saving, while maintaining a similar (average) latency per operation (LPO) compared to standard CSA.
本文提出了一种新型的低功耗载波选择加法器(CSA)设计,称为级联式CSA (C/sup 2/SA)。基于对电流运行关键路径延迟的预测,C/sup 2/SA可以在一个或两个时钟周期的延迟和按比例调整的电源电压下自动工作,从而实现功率提升。基于180nm技术的64位C/sup 2/SA布局后仿真表明,C/sup 2/SA可以在较低的电源电压下工作,节能40.7%,同时保持与标准CSA相似的(平均)每次操作延迟(LPO)。
{"title":"Cascaded carry-select adder (C/sup 2/SA): a new structure for low-power CSA design","authors":"Yiran Chen, Hai Helen Li, K. Roy, Cheng-Kok Koh","doi":"10.1145/1077603.1077634","DOIUrl":"https://doi.org/10.1145/1077603.1077634","url":null,"abstract":"In this paper we propose a novel low-power carry-select adder (CSA) design called cascaded CSA (C/sup 2/SA). Based on the prediction of the critical path delay of current operation, C/sup 2/SA can automatically work with one or two clock-cycle latency and a scaled supply voltage to achieve power improvement. Post-layout simulations of a 64-bit C/sup 2/SA in 180nm technology show that C/sup 2/SA can operate at a lower supply voltage, attaining 40.7% energy saving, while maintaining a similar (average) latency per operation (LPO) compared to standard CSA.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121466550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A non-uniform cache architecture for low power system design 一种用于低功耗系统设计的非统一缓存架构
T. Ishihara, F. Fallah
This paper proposes a non-uniform cache architecture for reducing the power consumption of memory systems. The non-uniform cache allows having different associativity values (i.e., the number of cache-ways) for different cache-sets. An algorithm determines the optimum number of cache-ways for each cache-set and generates object code suitable for the non-uniform cache memory. The paper also proposes a compiler technique for reducing redundant cache-way accesses and cache-tag accesses. Experiments demonstrate that the technique can reduce the power consumption of memory systems by up to 76% compared to the best result achieved by the conventional method.
为了降低存储系统的功耗,本文提出了一种非统一缓存架构。非统一缓存允许不同的缓存集具有不同的关联值(即缓存方式的数量)。一种算法确定每个缓存集的最佳缓存路径数,并生成适合于非均匀缓存的目标代码。本文还提出了一种减少冗余缓存路径访问和缓存标签访问的编译技术。实验表明,与传统方法相比,该技术可将存储系统的功耗降低76%。
{"title":"A non-uniform cache architecture for low power system design","authors":"T. Ishihara, F. Fallah","doi":"10.1145/1077603.1077690","DOIUrl":"https://doi.org/10.1145/1077603.1077690","url":null,"abstract":"This paper proposes a non-uniform cache architecture for reducing the power consumption of memory systems. The non-uniform cache allows having different associativity values (i.e., the number of cache-ways) for different cache-sets. An algorithm determines the optimum number of cache-ways for each cache-set and generates object code suitable for the non-uniform cache memory. The paper also proposes a compiler technique for reducing redundant cache-way accesses and cache-tag accesses. Experiments demonstrate that the technique can reduce the power consumption of memory systems by up to 76% compared to the best result achieved by the conventional method.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124351461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Low-power fanout optimization using multiple threshold voltage inverters 使用多个阈值电压逆变器的低功耗风扇输出优化
B. Amelifard, F. Fallah, Massoud Pedram
This paper addresses the problem of low-power fanout optimization with multiple threshold voltage inverters. Introducing splitting and merging conversions that preserve delay, power, and input capacitance, the fanout tree is converted to a set of inverter chains and for each chain the optimal sizes and threshold voltages are determined. Experimental results show that using this technique, the power dissipation of fanout tree is reduced by an average of 33% for a state-of-the-art CMOS technology.
本文研究了具有多个阈值电压逆变器的低功率风扇输出优化问题。引入分离和合并转换,以保持延迟、功率和输入电容,将扇出树转换为一组逆变器链,并为每个链确定最佳尺寸和阈值电压。实验结果表明,采用该技术,扇出树的功耗平均降低33%,达到最先进的CMOS技术。
{"title":"Low-power fanout optimization using multiple threshold voltage inverters","authors":"B. Amelifard, F. Fallah, Massoud Pedram","doi":"10.1145/1077603.1077628","DOIUrl":"https://doi.org/10.1145/1077603.1077628","url":null,"abstract":"This paper addresses the problem of low-power fanout optimization with multiple threshold voltage inverters. Introducing splitting and merging conversions that preserve delay, power, and input capacitance, the fanout tree is converted to a set of inverter chains and for each chain the optimal sizes and threshold voltages are determined. Experimental results show that using this technique, the power dissipation of fanout tree is reduced by an average of 33% for a state-of-the-art CMOS technology.","PeriodicalId":256018,"journal":{"name":"ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.","volume":"128 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126909659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
ISLPED '05. Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1