首页 > 最新文献

Proceedings of 1997 International Symposium on Low Power Electronics and Design最新文献

英文 中文
Reducing TLB power requirements 降低TLB电源需求
Toni Juan, T. Lang, J. Navarro
Translation look-aside buffers (TLBs) are small caches to speed-up address translation in processors with virtual memory. This paper considers two issues: (1) a comparison of the power consumption of fully-associative, set-associative, and direct mapped TLBs for the same miss rate and (2) the proposal of modifications of the basic cells and of the structure of set-associative TLBs to reduce the power. The power evaluation is done using a model and the miss rates are obtained from simulations of the SPEC92 benchmark. With respect to (1) we conclude that for small TLBs (high miss rates) fully-associative TLBs consume less power but for larger TLBs (low miss rates) set-associative TLBs are better. Moreover, the proposed modifications produce significant reductions in power consumption. Our evaluations show a reduction of 40 to 60% compared to the best traditional TLB. The proposed TLB implementation produces an increase in delay and in area. However, these increases are tolerable because the cycle time is determined by the slower cache and because the TLB area corresponds to only a small portion of the chip area.
转换暂置缓冲区(tlb)是一种小型缓存,用于在具有虚拟内存的处理器中加速地址转换。本文考虑了两个问题:(1)在相同脱靶率下,比较了全关联tlb、集关联tlb和直接映射tlb的功耗;(2)提出了修改集关联tlb的基本单元和结构以降低功耗的建议。利用模型进行了功率评估,并通过对SPEC92基准的仿真得到了脱靶率。关于(1),我们得出结论,对于小tlb(高缺失率),全关联tlb消耗更少的功率,但对于大tlb(低缺失率),集关联tlb更好。此外,所提出的修改可以显著减少电力消耗。我们的评估显示,与最好的传统TLB相比,减少了40%至60%。拟议的TLB实施会增加延迟和面积。然而,这些增加是可以容忍的,因为周期时间是由较慢的缓存决定的,因为TLB区域只对应芯片区域的一小部分。
{"title":"Reducing TLB power requirements","authors":"Toni Juan, T. Lang, J. Navarro","doi":"10.1145/263272.263332","DOIUrl":"https://doi.org/10.1145/263272.263332","url":null,"abstract":"Translation look-aside buffers (TLBs) are small caches to speed-up address translation in processors with virtual memory. This paper considers two issues: (1) a comparison of the power consumption of fully-associative, set-associative, and direct mapped TLBs for the same miss rate and (2) the proposal of modifications of the basic cells and of the structure of set-associative TLBs to reduce the power. The power evaluation is done using a model and the miss rates are obtained from simulations of the SPEC92 benchmark. With respect to (1) we conclude that for small TLBs (high miss rates) fully-associative TLBs consume less power but for larger TLBs (low miss rates) set-associative TLBs are better. Moreover, the proposed modifications produce significant reductions in power consumption. Our evaluations show a reduction of 40 to 60% compared to the best traditional TLB. The proposed TLB implementation produces an increase in delay and in area. However, these increases are tolerable because the cycle time is determined by the slower cache and because the TLB area corresponds to only a small portion of the chip area.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"11 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132906855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 109
Exploiting the locality of memory references to reduce the address bus energy 利用内存引用的局部性来减少地址总线能量
E. Musoll, T. Lang, J. Cortadella
The energy consumption at the I/O pins is a significant part of the overall chip consumption. This paper presents a method for encoding an external address bus which lowers its activity and, thus, decreases the energy. This method relies on the locality of memory references. Since applications favor a few working zones of their address space at each instant, for an address to one of these zones only the offset of this reference with respect to the previous reference to that zone needs to be sent over the bus, along with an identifier of the current working zone. This is combined with a modified one-shot encoding for the offset. An estimate of the area and energy overhead of the encoder/decoder are given; their effect is small. The approach has been applied to two memory-intensive examples, obtaining a bus-activity reduction of about 2/3 in both of them. Comparisons are given with previous methods for bus encoding, showing significant improvement.
I/O引脚的能量消耗是整个芯片消耗的一个重要部分。本文提出了一种对外部地址总线进行编码的方法,该方法降低了外部地址总线的活动性,从而降低了总线的能量。此方法依赖于内存引用的位置。由于应用程序在每个瞬间倾向于使用其地址空间的几个工作区域,因此对于其中一个区域的地址,只需通过总线发送此引用相对于先前对该区域的引用的偏移量,以及当前工作区域的标识符。这与对偏移量进行修改的一次性编码相结合。给出了编码器/解码器的面积和能量开销的估计;它们的影响很小。该方法已应用于两个内存密集型示例,在这两个示例中,总线活动都减少了约2/3。与以前的总线编码方法进行了比较,显示出显著的改进。
{"title":"Exploiting the locality of memory references to reduce the address bus energy","authors":"E. Musoll, T. Lang, J. Cortadella","doi":"10.1145/263272.263334","DOIUrl":"https://doi.org/10.1145/263272.263334","url":null,"abstract":"The energy consumption at the I/O pins is a significant part of the overall chip consumption. This paper presents a method for encoding an external address bus which lowers its activity and, thus, decreases the energy. This method relies on the locality of memory references. Since applications favor a few working zones of their address space at each instant, for an address to one of these zones only the offset of this reference with respect to the previous reference to that zone needs to be sent over the bus, along with an identifier of the current working zone. This is combined with a modified one-shot encoding for the offset. An estimate of the area and energy overhead of the encoder/decoder are given; their effect is small. The approach has been applied to two memory-intensive examples, obtaining a bus-activity reduction of about 2/3 in both of them. Comparisons are given with previous methods for bus encoding, showing significant improvement.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124704112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 100
An object code compression approach to embedded processors 嵌入式处理器的目标代码压缩方法
Y. Yoshida, Bao-Yu Song, H. Okuhata, T. Onoye, I. Shirakawa
A low-power processor architecture is described dedicatedly for embedded application programs by means of an object code compression approach. This approach unifies duplicated instructions existing in the embedded program and assigns a compressed object code to such an instruction. An instruction decompressor is constructed so as to generate an object code from each compressed object code (pseudo code) input. A single-chip implementation of this decompressor together with a processor core can effectively reduce the bandwidth required for the I/O interface. To demonstrate the practicability of the proposed approach, experiments are applied to an embedded processor ARM610 which attains 62.5% code compression, and hence 42.3% of the power consumption of instruction memory can be reduced.
利用目标代码压缩的方法,专门描述了一种嵌入式应用程序的低功耗处理器体系结构。这种方法将存在于嵌入式程序中的重复指令统一起来,并将压缩后的目标代码分配给这样的指令。构造指令解压缩器,以便从每个压缩的目标码(伪码)输入生成目标码。该解压缩器的单芯片实现与处理器核心一起可以有效地减少I/O接口所需的带宽。为了证明该方法的实用性,在嵌入式处理器ARM610上进行了实验,达到了62.5%的代码压缩率,从而降低了42.3%的指令存储器功耗。
{"title":"An object code compression approach to embedded processors","authors":"Y. Yoshida, Bao-Yu Song, H. Okuhata, T. Onoye, I. Shirakawa","doi":"10.1145/263272.263349","DOIUrl":"https://doi.org/10.1145/263272.263349","url":null,"abstract":"A low-power processor architecture is described dedicatedly for embedded application programs by means of an object code compression approach. This approach unifies duplicated instructions existing in the embedded program and assigns a compressed object code to such an instruction. An instruction decompressor is constructed so as to generate an object code from each compressed object code (pseudo code) input. A single-chip implementation of this decompressor together with a processor core can effectively reduce the bandwidth required for the I/O interface. To demonstrate the practicability of the proposed approach, experiments are applied to an embedded processor ARM610 which attains 62.5% code compression, and hence 42.3% of the power consumption of instruction memory can be reduced.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"140 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129652791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 89
Engineering change for power optimization using global sensitivity and synthesis flexibility 利用全局灵敏度和综合灵活性进行功率优化的工程变更
Premal Buch, C. Lennard, A. Newton
A technology dependent power optimization technique is proposed which formulates the problem of hot spot reduction as a variant of the engineering change (EC) problem. A technique is presented for determining the sensitivity of circuit power dissipation to functional changes considering both local and global effects. This sensitivity is combined with a measure of synthesis flexibility to identify hot regions in the circuit which have a lot of flexibility in making functional changes and for whom a small functional change can greatly affect the overall power dissipation. An incompletely specified target function is constructed for the hot region such that any implementation satisfying it is expected to reduce power. A rewiring algorithm is used to solve the resulting EC problem without affecting circuit area, gate capacitance or delay under the unit delay model. Experimental results on a set of MCNC benchmark circuits show that the proposed approach can give up to 13% reduction in power dissipation with an average reduction of 4%.
提出了一种技术依赖的功率优化技术,该技术将热点减小问题表述为工程变更问题的变体。提出了一种既考虑局部效应又考虑全局效应的电路功耗灵敏度测定方法。这种灵敏度与合成灵活性的测量相结合,以识别电路中的热区,这些热区在功能变化方面具有很大的灵活性,并且对其来说,小的功能变化可以极大地影响整体功耗。为热区构造了一个不完全指定的目标函数,使得任何满足该函数的实现都有望降低功耗。在不影响电路面积、栅极电容或单位延迟模型下的延迟的情况下,采用一种重新布线算法来解决由此产生的EC问题。在一组MCNC基准电路上的实验结果表明,该方法可使功耗降低13%,平均降低4%。
{"title":"Engineering change for power optimization using global sensitivity and synthesis flexibility","authors":"Premal Buch, C. Lennard, A. Newton","doi":"10.1145/263272.263290","DOIUrl":"https://doi.org/10.1145/263272.263290","url":null,"abstract":"A technology dependent power optimization technique is proposed which formulates the problem of hot spot reduction as a variant of the engineering change (EC) problem. A technique is presented for determining the sensitivity of circuit power dissipation to functional changes considering both local and global effects. This sensitivity is combined with a measure of synthesis flexibility to identify hot regions in the circuit which have a lot of flexibility in making functional changes and for whom a small functional change can greatly affect the overall power dissipation. An incompletely specified target function is constructed for the hot region such that any implementation satisfying it is expected to reduce power. A rewiring algorithm is used to solve the resulting EC problem without affecting circuit area, gate capacitance or delay under the unit delay model. Experimental results on a set of MCNC benchmark circuits show that the proposed approach can give up to 13% reduction in power dissipation with an average reduction of 4%.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121363259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hybrid dual-threshold design techniques for high-performance processors with low-power features 低功耗高性能处理器的混合双阈值设计技术
U. Ko, Andrew Pua, A. Hill, Pranjal Srivastava
This paper investigates delay, power and area of several critical library components for high-performance, low-power microprocessor designs. To improve performance of a 0.18-/spl mu/m technology at a supply voltage of 1.8 V, the proposed hybrid dual-V/sub t/ (HDVT) circuit architectures enhance speed of low-V/sub t/ by 21% while reducing leakage power dissipation of low-V/sub t/ by an order of magnitude for combinatorial logic. For sequential elements, a HDVT split-slave dual-path (HSSDP) and Push-Pull Isolation (HPPI) registers are proposed to improve 29-92% performance over an HDVT-conventional registers with 20-89% less energy consumption. For the datapath, a HDVT hierarchical, reduced-swing, dual-V/sub t/ logic (HHRSL) comparator is proposed to improve the delay of prior arts by up to 50%.
本文研究了用于高性能、低功耗微处理器设计的几个关键库元件的延迟、功耗和面积。为了提高电源电压为1.8 V时0.18-/spl mu/m技术的性能,所提出的混合双V/sub / (HDVT)电路架构将低V/sub /的速度提高了21%,同时将低V/sub /的泄漏功耗降低了一个数量级。对于顺序元件,提出了HDVT分离-从机双路径(HSSDP)和推挽隔离(HPPI)寄存器,与HDVT传统寄存器相比,性能提高29-92%,能耗降低20-89%。对于数据路径,提出了一种HDVT分层、减摆、双v /sub /逻辑(HHRSL)比较器,可将现有技术的延迟提高高达50%。
{"title":"Hybrid dual-threshold design techniques for high-performance processors with low-power features","authors":"U. Ko, Andrew Pua, A. Hill, Pranjal Srivastava","doi":"10.1145/263272.263362","DOIUrl":"https://doi.org/10.1145/263272.263362","url":null,"abstract":"This paper investigates delay, power and area of several critical library components for high-performance, low-power microprocessor designs. To improve performance of a 0.18-/spl mu/m technology at a supply voltage of 1.8 V, the proposed hybrid dual-V/sub t/ (HDVT) circuit architectures enhance speed of low-V/sub t/ by 21% while reducing leakage power dissipation of low-V/sub t/ by an order of magnitude for combinatorial logic. For sequential elements, a HDVT split-slave dual-path (HSSDP) and Push-Pull Isolation (HPPI) registers are proposed to improve 29-92% performance over an HDVT-conventional registers with 20-89% less energy consumption. For the datapath, a HDVT hierarchical, reduced-swing, dual-V/sub t/ logic (HHRSL) comparator is proposed to improve the delay of prior arts by up to 50%.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124105447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Charge-pump assisted low-power/low-voltage CMOS op amp design 电荷泵辅助的低功耗/低压CMOS运放设计
Jianjun J. Zhou, R. M. Ziazadeh, H. Ng, Hiok-Tiaq Ng, D. Allstot
Low-voltage/low-power requirements have become essential considerations in modern mixed-signal designs. In this paper, we present a 1.8 V multi-stage opamp designed using a standard 0.6 /spl mu/m n-well CMOS process. The op amp designs include a new high efficiency DC-DC converter used in a low-voltage frequency compensation configuration that tracks process, temperature, and supply voltage variations and enables accurate pole-zero cancellation. Low-power operation is compared in passive and active g/sub m/ feedforward compensation techniques. An automatic threshold calibration scheme is also described.
低电压/低功耗要求已成为现代混合信号设计中必不可少的考虑因素。在本文中,我们提出了一个1.8 V多级运放,采用标准的0.6 /spl mu/m n阱CMOS工艺设计。运算放大器设计包括一个新的高效率DC-DC转换器,用于低压频率补偿配置,跟踪工艺、温度和电源电压变化,并实现精确的极零抵消。比较了无源和有源g/sub /前馈补偿技术的低功耗运行。描述了一种自动阈值校准方案。
{"title":"Charge-pump assisted low-power/low-voltage CMOS op amp design","authors":"Jianjun J. Zhou, R. M. Ziazadeh, H. Ng, Hiok-Tiaq Ng, D. Allstot","doi":"10.1145/263272.263299","DOIUrl":"https://doi.org/10.1145/263272.263299","url":null,"abstract":"Low-voltage/low-power requirements have become essential considerations in modern mixed-signal designs. In this paper, we present a 1.8 V multi-stage opamp designed using a standard 0.6 /spl mu/m n-well CMOS process. The op amp designs include a new high efficiency DC-DC converter used in a low-voltage frequency compensation configuration that tracks process, temperature, and supply voltage variations and enables accurate pole-zero cancellation. Low-power operation is compared in passive and active g/sub m/ feedforward compensation techniques. An automatic threshold calibration scheme is also described.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126355310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Power reduction techniques for a spread spectrum based correlator 扩频相关器的功率降低技术
David Garrett, M. Stan
This paper presents the design of a low power spread spectrum correlator. We look at two major approaches and evaluate the best alternative for power reduction. We first consider a shift register FIFO implementation and look at reducing the switching activity for the arithmetic operations with a change in the addition algorithm. The correlation calculation can be modified to include storage of the previous result so that arithmetic circuits need only compute the difference between the present and next value. A binary adder tree with bypass can then reduce power by shutting off unnecessary computations. We then look at minimizing the power for sample storage by limiting the amount of data moved per cycle. This can be achieved by using a register file FIFO implementation. Interestingly the two power minimization techniques, bypass adder tree and register file FIFO implementation, were found to be strongly non-orthogonal, with the final effect that the register file changes the data statistics in such a way that it cancels the savings for the adder tree with bypass. The final solution of a register file with standard adder tree was found to have the lowest power dissipation. Using Bus-Invert for encoding the data as it enters the FIFO further reduces the power consumption due to the global bus of the register file.
本文介绍了一种低功率扩频相关器的设计。我们研究了两种主要的方法,并评估了节能的最佳替代方案。我们首先考虑移位寄存器FIFO实现,并通过更改加法算法来减少算术运算的切换活动。相关计算可以修改为包括前一个结果的存储,这样算术电路只需要计算当前值和下一个值之间的差。带旁路的二加法器树可以通过关闭不必要的计算来降低功耗。然后,我们将通过限制每个周期移动的数据量来最小化样本存储的功率。这可以通过使用寄存器文件FIFO实现来实现。有趣的是,两种功率最小化技术,旁路加法器树和寄存器文件FIFO实现,被发现是强烈非正交的,最终的效果是,寄存器文件以这样一种方式改变数据统计,它取消了旁路加法器树的节省。采用标准加法器树的寄存器文件的最终解具有最低的功耗。使用总线反转编码数据,因为它进入先进先出进一步降低功耗,由于寄存器文件的全局总线。
{"title":"Power reduction techniques for a spread spectrum based correlator","authors":"David Garrett, M. Stan","doi":"10.1145/263272.263339","DOIUrl":"https://doi.org/10.1145/263272.263339","url":null,"abstract":"This paper presents the design of a low power spread spectrum correlator. We look at two major approaches and evaluate the best alternative for power reduction. We first consider a shift register FIFO implementation and look at reducing the switching activity for the arithmetic operations with a change in the addition algorithm. The correlation calculation can be modified to include storage of the previous result so that arithmetic circuits need only compute the difference between the present and next value. A binary adder tree with bypass can then reduce power by shutting off unnecessary computations. We then look at minimizing the power for sample storage by limiting the amount of data moved per cycle. This can be achieved by using a register file FIFO implementation. Interestingly the two power minimization techniques, bypass adder tree and register file FIFO implementation, were found to be strongly non-orthogonal, with the final effect that the register file changes the data statistics in such a way that it cancels the savings for the adder tree with bypass. The final solution of a register file with standard adder tree was found to have the lowest power dissipation. Using Bus-Invert for encoding the data as it enters the FIFO further reduces the power consumption due to the global bus of the register file.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126494765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Clocked CMOS adiabatic logic with integrated single-phase power-clock supply: experimental results 时钟CMOS绝热逻辑集成单相电源时钟:实验结果
D. Maksimović, V. Oklobdzija, B. Nikolić, K. Current
In this paper we describe the design and experimental evaluation of a clocked CMOS adiabatic logic (CAL). CAL is a dual-rail logic that operates from a single-phase AC power-clock supply in the 'adiabatic' mode, or from a DC power supply in the 'non-adiabatic' mode. In the adiabatic mode, the power-clock supply waveform is generated using an on-chip switching transistor and a small external inductor between the chip and a low-voltage DC supply. Circuit operation and performance are evaluated using a chain of inverters realized in 1.2 /spl mu/m technology. Experimental results show energy savings in the adiabatic mode versus the non-adiabatic mode at clock frequencies up to about 40 MHz.
本文描述了一种时钟型CMOS绝热逻辑(CAL)的设计和实验评估。CAL是一种双轨逻辑,在“绝热”模式下由单相交流电源时钟供电,或在“非绝热”模式下由直流电源供电。在绝热模式下,电源时钟波形是通过片上开关晶体管和芯片与低压直流电源之间的小型外部电感产生的。使用1.2 /spl mu/m技术实现的逆变器链来评估电路的运行和性能。实验结果表明,在时钟频率高达约40 MHz时,绝热模式比非绝热模式节能。
{"title":"Clocked CMOS adiabatic logic with integrated single-phase power-clock supply: experimental results","authors":"D. Maksimović, V. Oklobdzija, B. Nikolić, K. Current","doi":"10.1145/263272.263365","DOIUrl":"https://doi.org/10.1145/263272.263365","url":null,"abstract":"In this paper we describe the design and experimental evaluation of a clocked CMOS adiabatic logic (CAL). CAL is a dual-rail logic that operates from a single-phase AC power-clock supply in the 'adiabatic' mode, or from a DC power supply in the 'non-adiabatic' mode. In the adiabatic mode, the power-clock supply waveform is generated using an on-chip switching transistor and a small external inductor between the chip and a low-voltage DC supply. Circuit operation and performance are evaluated using a chain of inverters realized in 1.2 /spl mu/m technology. Experimental results show energy savings in the adiabatic mode versus the non-adiabatic mode at clock frequencies up to about 40 MHz.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127438656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
High-performance, low-power design techniques for dynamic to static logic interface 动态到静态逻辑接口的高性能、低功耗设计技术
June Jiang, Kan Lu, U. Ko
To optimize performance and power of a processor with both precharged and static circuit styles, a self-timed modified cascode latch (MCL) is proposed for dual-rail domino to static logic interface. Compared to conventional self-timed cascode and cross-coupled NAND latches, the innovative MCL achieves the highest performance and lowest power dissipation with reasonable noise immunity. Ease of embedding logic functions in these self-timed latches is also studied. For interfacing single-rail domino to static logic, the pseudo-inverter latch (PIL) is the most power efficient latch when compared with the conventional transparent and cross-coupled NAND latches. Based on a 0.18 /spl mu/m CMOS nominal process with a 1.6 V supply voltage, effects on these latches' power dissipation and delay from scaling supply voltage and output load are presented respectively.
为了优化具有预充电和静态电路风格的处理器的性能和功耗,提出了一种自定时修改级联码锁存器(MCL),用于双轨多米诺到静态逻辑接口。与传统的自定时级联码和交叉耦合NAND锁存器相比,创新的MCL实现了最高的性能和最低的功耗,并具有合理的抗噪性。研究了在这些自定时锁存器中嵌入逻辑函数的难易性。对于将单轨多米诺骨牌连接到静态逻辑,与传统的透明和交叉耦合NAND锁存器相比,伪逆变锁存器(PIL)是最节能的锁存器。以1.6 V电源电压下0.18 /spl mu/m CMOS标称工艺为基础,分别研究了电源电压和输出负载对锁存器功耗和延迟的影响。
{"title":"High-performance, low-power design techniques for dynamic to static logic interface","authors":"June Jiang, Kan Lu, U. Ko","doi":"10.1145/263272.263275","DOIUrl":"https://doi.org/10.1145/263272.263275","url":null,"abstract":"To optimize performance and power of a processor with both precharged and static circuit styles, a self-timed modified cascode latch (MCL) is proposed for dual-rail domino to static logic interface. Compared to conventional self-timed cascode and cross-coupled NAND latches, the innovative MCL achieves the highest performance and lowest power dissipation with reasonable noise immunity. Ease of embedding logic functions in these self-timed latches is also studied. For interfacing single-rail domino to static logic, the pseudo-inverter latch (PIL) is the most power efficient latch when compared with the conventional transparent and cross-coupled NAND latches. Based on a 0.18 /spl mu/m CMOS nominal process with a 1.6 V supply voltage, effects on these latches' power dissipation and delay from scaling supply voltage and output load are presented respectively.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133924394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A low-power design method using multiple supply voltages 一种使用多个电源电压的低功耗设计方法
M. Igarashi, K. Usami, K. Nogami, F. Minami, Y. Kawasaki, T. Aoki, M. Takano, C. Mizuno, T. Ishikawa, M. Kanazawa, Shinji Sonoda, M. Ichida, N. Hatanaka
We present a low-power design method that utilizes the multiple supply voltages. The proposed method reduces the power consumption of random logic circuits by 47% on the average, with up to 15% area overhead, by the combination of "Clustered Voltage Scaling (CVS) scheme" and "Row by Row optimized Power Supply (RRPS) scheme". By the CVS scheme, the optimal netlist, that uses the minimized number of the level converters and the maximized number of the low Vdd gates under the timing constraints, is generated. To avoid the wiring resource consumption and the increase of the interconnect delay caused by the layout constraints of the multiple-supply-voltage design, a new power bus wiring scheme called "RRPS scheme" is proposed. The proposed method is applied to a media processor chip Mpact/sup TM/ and achieved the above mentioned results. In this paper, the emphasis is put on the interrelation between the generation of the two-supply-voltage netlist with the CVS scheme and the layout technology, such as the power supply scheme and the placement of the multiple-supply-voltage gates. The clocking scheme for the multiple supply voltages is also discussed.
我们提出了一种利用多个电源电压的低功耗设计方法。该方法通过“集束电压缩放(CVS)方案”和“逐行优化电源(RRPS)方案”的结合,将随机逻辑电路的功耗平均降低47%,面积开销高达15%。通过CVS方案,生成了在时序约束下,电平变换器数量最少,低Vdd门数量最多的最优网络列表。为了避免多电源电压设计的布线限制所造成的布线资源消耗和互连延迟的增加,提出了一种新的电源母线布线方案“RRPS方案”。将该方法应用于媒体处理器芯片Mpact/sup TM/上,取得了上述结果。本文重点讨论了用CVS方案生成双供压网表与供电方案、多供压门的布置等布局技术的相互关系。讨论了多电源电压的时钟方案。
{"title":"A low-power design method using multiple supply voltages","authors":"M. Igarashi, K. Usami, K. Nogami, F. Minami, Y. Kawasaki, T. Aoki, M. Takano, C. Mizuno, T. Ishikawa, M. Kanazawa, Shinji Sonoda, M. Ichida, N. Hatanaka","doi":"10.1145/263272.263279","DOIUrl":"https://doi.org/10.1145/263272.263279","url":null,"abstract":"We present a low-power design method that utilizes the multiple supply voltages. The proposed method reduces the power consumption of random logic circuits by 47% on the average, with up to 15% area overhead, by the combination of \"Clustered Voltage Scaling (CVS) scheme\" and \"Row by Row optimized Power Supply (RRPS) scheme\". By the CVS scheme, the optimal netlist, that uses the minimized number of the level converters and the maximized number of the low Vdd gates under the timing constraints, is generated. To avoid the wiring resource consumption and the increase of the interconnect delay caused by the layout constraints of the multiple-supply-voltage design, a new power bus wiring scheme called \"RRPS scheme\" is proposed. The proposed method is applied to a media processor chip Mpact/sup TM/ and achieved the above mentioned results. In this paper, the emphasis is put on the interrelation between the generation of the two-supply-voltage netlist with the CVS scheme and the layout technology, such as the power supply scheme and the placement of the multiple-supply-voltage gates. The clocking scheme for the multiple supply voltages is also discussed.","PeriodicalId":334688,"journal":{"name":"Proceedings of 1997 International Symposium on Low Power Electronics and Design","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127762060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 84
期刊
Proceedings of 1997 International Symposium on Low Power Electronics and Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1