首页 > 最新文献

Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.最新文献

英文 中文
A low-power design methodology for high-resolution pipelined analog-to-digital converters 高分辨率流水线模数转换器的低功耗设计方法
R. Lotfi, M. Taherzadeh‐Sani, M. Azizi, O. Shoaei
In this paper a general method to design a pipelined ADC with minimum power consumption is presented. By expressing the total static power consumption and the total input-referred noise of the converter as functions of the capacitor values and the resolutions of the converter stages, a simple optimization algorithm is employed to calculate the optimum values of these parameters, which lead to minimum power consumption while a. specified noise requirement is satisfied. To determine the bias current values of operational amplifiers, a novel optimal choice for settling and slewing time parameters is proposed applicable to both single-stage and two-stage Miller-compensated opamp, structures. Using the proposed methodology, the optimum values for capacitors, the resolutions and the opamp device sizes of all stages are determined in order to minimize the total power consumption. Design examples are presented and compared with conventional approaches to show the effectiveness of the proposed methodology.
本文提出了一种设计具有最小功耗的流水线ADC的通用方法。通过将变换器的总静态功耗和总输入参考噪声表示为电容值和变换器级分辨率的函数,采用简单的优化算法计算出这些参数的最优值,从而在满足指定噪声要求的情况下使功耗最小。为了确定运放的偏置电流值,提出了一种适用于单级和两级米勒补偿运放结构的最优沉降和旋转时间参数。使用所提出的方法,确定电容器的最佳值,分辨率和所有级的运放器件尺寸,以尽量减少总功耗。给出了设计实例,并与传统方法进行了比较,以表明所提出方法的有效性。
{"title":"A low-power design methodology for high-resolution pipelined analog-to-digital converters","authors":"R. Lotfi, M. Taherzadeh‐Sani, M. Azizi, O. Shoaei","doi":"10.1109/LPE.2003.1231890","DOIUrl":"https://doi.org/10.1109/LPE.2003.1231890","url":null,"abstract":"In this paper a general method to design a pipelined ADC with minimum power consumption is presented. By expressing the total static power consumption and the total input-referred noise of the converter as functions of the capacitor values and the resolutions of the converter stages, a simple optimization algorithm is employed to calculate the optimum values of these parameters, which lead to minimum power consumption while a. specified noise requirement is satisfied. To determine the bias current values of operational amplifiers, a novel optimal choice for settling and slewing time parameters is proposed applicable to both single-stage and two-stage Miller-compensated opamp, structures. Using the proposed methodology, the optimum values for capacitors, the resolutions and the opamp device sizes of all stages are determined in order to minimize the total power consumption. Design examples are presented and compared with conventional approaches to show the effectiveness of the proposed methodology.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126179993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Pipeline muffling and a priori current ramping: architectural techniques to reduce high-frequency inductive noise 管道消声和先验电流斜坡:降低高频电感噪声的结构技术
Michael D. Powell, T. N. Vijaykumar
While circuit and package designers have addressed microprocessor inductive noise issues in the past, multi-gigahertz clock frequencies and billion-transistor-level integration are exacerbating the problem, necessitating microarchitectural solutions. The large net on-die decoupling capacitance used to address this noise throughout the chip consumes substantial area and can cause a large leakage current. This paper proposes microarchitectural techniques to reduce high-frequency current variability, reducing the need for decoupling capacitors. We observe that we can control inductive noise by reducing current variability either in space (i.e., variability in usage of circuit blocks) or in time (i.e., variability within a circuit block across clock cycles). We propose pipeline muffling, a novel technique to reduce changes in the number of resources being utilized by controlling instruction issue, trading off some energy and performance to control di/dt in space. We also extend a previous technique, which incurs performance and energy degradation, and propose a priori current ramping to allow the current of a resource to ramp up ahead of usage, with virtually no performance loss, and ramp down immediately after usage, with little energy loss. Our techniques guarantee a worst-case bound on the di/dt, which is required to reduce the demand for decoupling capacitors, saving area and reducing leakage.
虽然电路和封装设计师过去已经解决了微处理器的感应噪声问题,但千兆赫时钟频率和十亿晶体管级集成正在加剧这一问题,因此需要微架构解决方案。用于解决整个芯片噪声的大净片上去耦电容消耗了大量面积,并可能导致大泄漏电流。本文提出了微结构技术,以减少高频电流的可变性,减少对去耦电容器的需求。我们观察到,我们可以通过减少空间(即电路块使用的可变性)或时间(即电路块跨时钟周期的可变性)上的电流可变性来控制电感噪声。我们提出了管道消声,这是一种通过控制指令发布来减少资源使用数量变化的新技术,它牺牲了一些能量和性能来控制空间中的di/dt。我们还扩展了先前的技术,该技术会导致性能和能量下降,并提出了一个先验的电流斜坡,以允许资源的电流在使用之前上升,几乎没有性能损失,并在使用后立即下降,几乎没有能量损失。我们的技术保证了di/dt的最坏情况边界,这是减少去耦电容器需求,节省面积和减少泄漏所必需的。
{"title":"Pipeline muffling and a priori current ramping: architectural techniques to reduce high-frequency inductive noise","authors":"Michael D. Powell, T. N. Vijaykumar","doi":"10.1145/871506.871562","DOIUrl":"https://doi.org/10.1145/871506.871562","url":null,"abstract":"While circuit and package designers have addressed microprocessor inductive noise issues in the past, multi-gigahertz clock frequencies and billion-transistor-level integration are exacerbating the problem, necessitating microarchitectural solutions. The large net on-die decoupling capacitance used to address this noise throughout the chip consumes substantial area and can cause a large leakage current. This paper proposes microarchitectural techniques to reduce high-frequency current variability, reducing the need for decoupling capacitors. We observe that we can control inductive noise by reducing current variability either in space (i.e., variability in usage of circuit blocks) or in time (i.e., variability within a circuit block across clock cycles). We propose pipeline muffling, a novel technique to reduce changes in the number of resources being utilized by controlling instruction issue, trading off some energy and performance to control di/dt in space. We also extend a previous technique, which incurs performance and energy degradation, and propose a priori current ramping to allow the current of a resource to ramp up ahead of usage, with virtually no performance loss, and ramp down immediately after usage, with little energy loss. Our techniques guarantee a worst-case bound on the di/dt, which is required to reduce the demand for decoupling capacitors, saving area and reducing leakage.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128190835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
Integrated architectural/physical planning approach for minimization of current surge in high performance clock-gated microprocessors 在高性能时钟门控微处理器中实现电流浪涌最小化的集成架构/物理规划方法
Yiran Chen, K. Roy, Cheng-Kok Koh
We propose an integrated architectural/physical planning approach to reduce the power supply noise due to current surge in high performance, general-purpose, clock-gated microprocessors. The proposed approach combines dynamic selection of functional units on-the-fly, dynamic issue width scaling and physical planning with soft module, to balance the current demand across layout. Experimental results show that the proposed approach could reduce the peak noise by 6.54% and consequently, the decoupling capacitance requirement by 21.8%. The degradation in IPC (instruction Per Cycle) due to the selection logic and issue width scaling is only 1.86e-7 (without increasing clock cycle period) in 0.18 /spl mu/m technology.
我们提出了一种集成的架构/物理规划方法,以减少高性能、通用、时钟门控微处理器中由于电流浪涌而产生的电源噪声。该方法将功能单元的动态选择、问题宽度的动态缩放和物理规划与软模块相结合,实现了当前需求跨布局的平衡。实验结果表明,该方法可将峰值噪声降低6.54%,去耦电容要求降低21.8%。在0.18 /spl mu/m技术中,由于选择逻辑和问题宽度缩放,IPC(每周期指令)的退化仅为1.86e-7(不增加时钟周期)。
{"title":"Integrated architectural/physical planning approach for minimization of current surge in high performance clock-gated microprocessors","authors":"Yiran Chen, K. Roy, Cheng-Kok Koh","doi":"10.1145/871506.871563","DOIUrl":"https://doi.org/10.1145/871506.871563","url":null,"abstract":"We propose an integrated architectural/physical planning approach to reduce the power supply noise due to current surge in high performance, general-purpose, clock-gated microprocessors. The proposed approach combines dynamic selection of functional units on-the-fly, dynamic issue width scaling and physical planning with soft module, to balance the current demand across layout. Experimental results show that the proposed approach could reduce the peak noise by 6.54% and consequently, the decoupling capacitance requirement by 21.8%. The degradation in IPC (instruction Per Cycle) due to the selection logic and issue width scaling is only 1.86e-7 (without increasing clock cycle period) in 0.18 /spl mu/m technology.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"151 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127286879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Row-by-row dynamic source-line voltage control (RRDSV) scheme for two orders of magnitude leakage current reduction of sub-1-V-V/sub DD/ SRAM's 逐行动态源线电压控制(RRDSV)方案,可将sub-1- v /sub DD/ SRAM的漏电流降低两个数量级
K. Min, K. Kanda, H. Kawaguchi, K. Inagaki, F. R. Saliba, Hoon-Dae Choi, Hyunjun Choi, D. Kim, D. M. Kim, T. Sakurai
A new row-by-row dynamic source-line voltage control (RRDSV) scheme is proposed to reduce the active leakage as well as the stand-by leakage in SRAM. By dynamically controlling the source-line voltage of cells row by row, the cell leakage through inactive cells can be reduced by two orders of magnitude. Moreover, the bit-line leakage through pass transistors can be completely cut off. This leakage reduction is caused from the cooperation of reverse body-to-source biasing and drain induced barrier lowering (DIBL) effects. A test chip has been fabricated using 0.18-/spl mu/m triple-well CMOS technology to verify the data retention capability of this RRDSV scheme. The minimum retention voltage in the RRDSV is measured to be reduced by more than 60 mV, when shielding metal is inserted to protect the memory cell nodes from bit-line coupling noise. It can reduce the leakage by another 50% in addition to the reduction by two orders of magnitude.
提出了一种新的逐行动态源线电压控制(RRDSV)方案,以降低SRAM中的有源漏和备用漏。通过逐行动态控制电池的电源电压,可以将非活性电池的漏液降低两个数量级。此外,可以完全切断通型晶体管的位线泄漏。这种泄漏减少是由反向体源偏置和漏极诱导势垒降低(DIBL)效应的共同作用造成的。采用0.18-/spl mu/m三孔CMOS工艺制作了测试芯片,验证了该RRDSV方案的数据保留能力。当插入屏蔽金属以保护存储单元节点免受位线耦合噪声的影响时,测量到RRDSV中的最小保持电压降低了60 mV以上。在减少两个数量级的基础上,再减少50%的泄漏。
{"title":"Row-by-row dynamic source-line voltage control (RRDSV) scheme for two orders of magnitude leakage current reduction of sub-1-V-V/sub DD/ SRAM's","authors":"K. Min, K. Kanda, H. Kawaguchi, K. Inagaki, F. R. Saliba, Hoon-Dae Choi, Hyunjun Choi, D. Kim, D. M. Kim, T. Sakurai","doi":"10.1145/871506.871526","DOIUrl":"https://doi.org/10.1145/871506.871526","url":null,"abstract":"A new row-by-row dynamic source-line voltage control (RRDSV) scheme is proposed to reduce the active leakage as well as the stand-by leakage in SRAM. By dynamically controlling the source-line voltage of cells row by row, the cell leakage through inactive cells can be reduced by two orders of magnitude. Moreover, the bit-line leakage through pass transistors can be completely cut off. This leakage reduction is caused from the cooperation of reverse body-to-source biasing and drain induced barrier lowering (DIBL) effects. A test chip has been fabricated using 0.18-/spl mu/m triple-well CMOS technology to verify the data retention capability of this RRDSV scheme. The minimum retention voltage in the RRDSV is measured to be reduced by more than 60 mV, when shielding metal is inserted to protect the memory cell nodes from bit-line coupling noise. It can reduce the leakage by another 50% in addition to the reduction by two orders of magnitude.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130535438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
New optimal design strategies and analysis of ultra-low leakage circuits for nano-scale SOI technology 纳米级SOI技术超低漏电电路优化设计新策略及分析
K. Das, R. Joshi, C. Chuang, P. Cook, Richard B. Brown
This paper proposes new SOI circuit strategies for simultaneous reduction of standby gate and sub-threshold leakages. Various enhanced MTCMOS design alternatives are analyzed. A new method for assigning the V/sub TH/ and sizes of header and footer transistors is proposed, and stacking of headers/footers is analyzed. The optimum stacking height and tapering/sizing ratio under various design constraints are determined. Our strategies reduce MTCMOS standby leakage further by as much as 20/spl times/ and reduce virtual supply noise by 15%.
本文提出了同时减少待机栅极和亚阈值泄漏的新型SOI电路策略。分析了各种增强型MTCMOS设计方案。提出了一种新的头脚晶体管V/sub / TH和尺寸分配方法,并对头脚晶体管的堆叠进行了分析。确定了各种设计约束条件下的最佳堆垛高度和锥形/浆料比。我们的策略将MTCMOS待机泄漏进一步降低了20倍,并将虚拟电源噪声降低了15%。
{"title":"New optimal design strategies and analysis of ultra-low leakage circuits for nano-scale SOI technology","authors":"K. Das, R. Joshi, C. Chuang, P. Cook, Richard B. Brown","doi":"10.1109/LPE.2003.1231855","DOIUrl":"https://doi.org/10.1109/LPE.2003.1231855","url":null,"abstract":"This paper proposes new SOI circuit strategies for simultaneous reduction of standby gate and sub-threshold leakages. Various enhanced MTCMOS design alternatives are analyzed. A new method for assigning the V/sub TH/ and sizes of header and footer transistors is proposed, and stacking of headers/footers is analyzed. The optimum stacking height and tapering/sizing ratio under various design constraints are determined. Our strategies reduce MTCMOS standby leakage further by as much as 20/spl times/ and reduce virtual supply noise by 15%.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122763065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Reducing data cache energy consumption via cached load/store queue 通过缓存负载/存储队列减少数据缓存能耗
D. Nicolaescu, A. Veidenbaum, A. Nicolau
High-performance processors use a large set-associative L1 data cache with multiple ports. As clock speeds and size increase such a cache consumes a significant percentage of the total processor energy. This paper proposes a method of saving energy by reducing the number of data cache accesses. It does so by modifying the load/store queue design to allow "caching" of previously accessed data values on both loads and stores after the corresponding memory access instruction has been committed. It is shown that a 32 entry modified LSQ design allows an average of 38.5% of the loads in the SpecINT95 benchmarks and 18.9% in the SpecFP95 benchmarks to get their data from the LSQ. The reduction in the number of Ll cache accesses results in up to a 40% reduction in the L1 data cache energy consumption and in an up to a 16% improvement in the energy-delay product while requiring almost no additional hardware or complex control logic.
高性能处理器使用具有多个端口的大型集关联L1数据缓存。随着时钟速度和大小的增加,这样的缓存会消耗处理器总能量的很大一部分。本文提出了一种通过减少数据缓存访问次数来节约能源的方法。它通过修改加载/存储队列设计来实现这一点,以便在提交相应的内存访问指令后,允许在加载和存储上“缓存”先前访问的数据值。结果表明,32项修改后的LSQ设计允许SpecINT95基准测试中平均38.5%的负载和SpecFP95基准测试中18.9%的负载从LSQ获取数据。l缓存访问次数的减少导致L1数据缓存能耗降低40%,能量延迟产品提高16%,同时几乎不需要额外的硬件或复杂的控制逻辑。
{"title":"Reducing data cache energy consumption via cached load/store queue","authors":"D. Nicolaescu, A. Veidenbaum, A. Nicolau","doi":"10.1145/871506.871569","DOIUrl":"https://doi.org/10.1145/871506.871569","url":null,"abstract":"High-performance processors use a large set-associative L1 data cache with multiple ports. As clock speeds and size increase such a cache consumes a significant percentage of the total processor energy. This paper proposes a method of saving energy by reducing the number of data cache accesses. It does so by modifying the load/store queue design to allow \"caching\" of previously accessed data values on both loads and stores after the corresponding memory access instruction has been committed. It is shown that a 32 entry modified LSQ design allows an average of 38.5% of the loads in the SpecINT95 benchmarks and 18.9% in the SpecFP95 benchmarks to get their data from the LSQ. The reduction in the number of Ll cache accesses results in up to a 40% reduction in the L1 data cache energy consumption and in an up to a 16% improvement in the energy-delay product while requiring almost no additional hardware or complex control logic.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123087265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Low-voltage low-power high dB-Linear CMOS exponential function generator using highly-linear V-I converter 采用高线性V-I变换器的低电压低功率高db -线性CMOS指数函数发生器
Quoc-Hoang Duong, Trung‐Kien Nguyen, Sang-Gug Lee
A CMOS voltage-to-current converter with exponential characteristic is presented in this paper. The concept of Taylor series expansion is used for realizing the exponential characteristic. The proposed exponential V-I converter is composed of a current-to-current squarer and a linear V-I converter with the use of linearization technique. Based on a 0.25 /spl mu/m CMOS process, simulations show a 23 dB of linear-output current range and the linearity within 20 dB with error less than /spl plusmn/0.5dB is achieved. The total power consumption is below 0.2 mW with 1.25 V supply voltage. The proposed circuit can be used for the design of an extremely low-voltage low-power variable gain amplifier (VGA).
本文介绍了一种具有指数特性的CMOS电压电流变换器。利用泰勒级数展开的概念来实现指数特性。所提出的指数型V-I变换器由电流-电流平方器和线性型V-I变换器组成,采用线性化技术。基于0.25 /spl mu/m的CMOS工艺,仿真结果表明,该电路的线性输出电流范围为23 dB,线性度在20 dB以内,误差小于/spl plusmn/0.5dB。总功耗低于0.2 mW,电源电压为1.25 V。该电路可用于超低电压低功率可变增益放大器(VGA)的设计。
{"title":"Low-voltage low-power high dB-Linear CMOS exponential function generator using highly-linear V-I converter","authors":"Quoc-Hoang Duong, Trung‐Kien Nguyen, Sang-Gug Lee","doi":"10.1109/LPE.2003.1231912","DOIUrl":"https://doi.org/10.1109/LPE.2003.1231912","url":null,"abstract":"A CMOS voltage-to-current converter with exponential characteristic is presented in this paper. The concept of Taylor series expansion is used for realizing the exponential characteristic. The proposed exponential V-I converter is composed of a current-to-current squarer and a linear V-I converter with the use of linearization technique. Based on a 0.25 /spl mu/m CMOS process, simulations show a 23 dB of linear-output current range and the linearity within 20 dB with error less than /spl plusmn/0.5dB is achieved. The total power consumption is below 0.2 mW with 1.25 V supply voltage. The proposed circuit can be used for the design of an extremely low-voltage low-power variable gain amplifier (VGA).","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116062546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
On load latency in low-power caches 低功耗缓存中的负载延迟
Soontae Kim, N. Vijaykrishnan, M. J. Irwin, L. John
Many of the recently proposed techniques to reduce power consumption in caches introduce an additional level of nondeterminism in cache access latency. Due to this additional latency, instructions speculatively issued and dependent on a non-deterministic load must be re-executed. Our experiments show that there is a large performance degradation and associated energy wastage due to these effects of instruction re-execution. To address this problem, we propose an early cache set resolution scheme. It is based on the observation that the displacement values used for address generation are generally small. Our experimental evaluation shows that this technique is quite effective in mitigating this problem.
最近提出的许多降低缓存功耗的技术在缓存访问延迟中引入了额外的不确定性。由于这种额外的延迟,必须重新执行依赖于不确定性负载的推测性发出的指令。我们的实验表明,由于指令重执行的这些影响,存在很大的性能下降和相关的能量浪费。为了解决这个问题,我们提出了一个早期缓存集解析方案。这是基于观察到用于地址生成的位移值通常很小。我们的实验评估表明,该技术在缓解这一问题上是相当有效的。
{"title":"On load latency in low-power caches","authors":"Soontae Kim, N. Vijaykrishnan, M. J. Irwin, L. John","doi":"10.1145/871506.871570","DOIUrl":"https://doi.org/10.1145/871506.871570","url":null,"abstract":"Many of the recently proposed techniques to reduce power consumption in caches introduce an additional level of nondeterminism in cache access latency. Due to this additional latency, instructions speculatively issued and dependent on a non-deterministic load must be re-executed. Our experiments show that there is a large performance degradation and associated energy wastage due to these effects of instruction re-execution. To address this problem, we propose an early cache set resolution scheme. It is based on the observation that the displacement values used for address generation are generally small. Our experimental evaluation shows that this technique is quite effective in mitigating this problem.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"358 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122814072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Low-power high-level synthesis for FPGA architectures FPGA架构的低功耗高级合成
Deming Chen, J. Cong, Yiping Fan
This paper addresses two aspects of low-power design for FPGA circuits. First, we present an RT-level power estimator for FPGAs with consideration of wire length. The power estimator closely reflects both dynamic and static power contributed by various FPGA components in 0.1 /spl mu/m technology. The power estimation error is 16.2% on average. Second, we present a low power high level synthesis system, named LOPASS, for FPGA designs. It includes two algorithms for power consumption reduction: (i) a simulated annealing engine that carries out resource selection function unit binding, scheduling, register binding, and data pat. generation simultaneously to effectively reduce power; (ii) an enhanced weighted bipartite matching algorithm that is able to reduce the total amount of MUX ports by 22.7%. Experimental results show that LOPASS is able to reduce-power consumption by 35.8% compared to the results-of -Synopsys' Behavioral Compiler.
本文从两个方面论述了FPGA电路的低功耗设计。首先,我们提出了一种考虑导线长度的fpga rt级功率估计器。功率估计器在0.1 /spl mu/m的技术下,紧密地反映了各种FPGA组件所贡献的动态和静态功率。功率估计误差平均为16.2%。其次,我们提出了一种用于FPGA设计的低功耗高电平合成系统,称为LOPASS。它包括两种降低功耗的算法:(i)模拟退火引擎,实现资源选择功能单元绑定、调度、寄存器绑定和数据部分。同时发电,有效降低功率;(ii)一种增强的加权二部匹配算法,能够将MUX端口总数减少22.7%。实验结果表明,与synopsys行为编译器的结果相比,LOPASS可以降低35.8%的功耗。
{"title":"Low-power high-level synthesis for FPGA architectures","authors":"Deming Chen, J. Cong, Yiping Fan","doi":"10.1145/871506.871541","DOIUrl":"https://doi.org/10.1145/871506.871541","url":null,"abstract":"This paper addresses two aspects of low-power design for FPGA circuits. First, we present an RT-level power estimator for FPGAs with consideration of wire length. The power estimator closely reflects both dynamic and static power contributed by various FPGA components in 0.1 /spl mu/m technology. The power estimation error is 16.2% on average. Second, we present a low power high level synthesis system, named LOPASS, for FPGA designs. It includes two algorithms for power consumption reduction: (i) a simulated annealing engine that carries out resource selection function unit binding, scheduling, register binding, and data pat. generation simultaneously to effectively reduce power; (ii) an enhanced weighted bipartite matching algorithm that is able to reduce the total amount of MUX ports by 22.7%. Experimental results show that LOPASS is able to reduce-power consumption by 35.8% compared to the results-of -Synopsys' Behavioral Compiler.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131176349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 114
Energy-efficient instruction set synthesis for application-specific processors 针对特定应用处理器的节能指令集合成
Jongeun Lee, Kiyoung Choi, N. Dutt
Several techniques have been proposed to enhance the energy-efficiency of ASIPs (Application-Specific Instruction set Processors). While those techniques can reduce the energy consumption with a minimal change in the instruction set (IS), they fail to exploit the opportunity of designing the entire IS from the energy-efficiency perspective. In this paper, we present an energy-efficient IS synthesis technique that can comprehensively reduce the energy-delay product (EDP) of ASIPs through optimal instruction encoding, considering both the instruction bitwidth and the dynamic instruction count. Experimental results with a typical embedded RISC processor show that our technique can generate application-specific IS's that are up to 40% more energy-efficient over the native IS for several application benchmarks.
已经提出了几种技术来提高专用指令集处理器(Application-Specific Instruction set processor, asip)的能效。虽然这些技术可以通过对指令集(IS)进行最小的更改来降低能耗,但它们无法利用从能效角度设计整个IS的机会。在本文中,我们提出了一种节能的IS合成技术,该技术可以在考虑指令位宽和动态指令数的情况下,通过优化指令编码,全面降低asip的能量延迟积(EDP)。在典型嵌入式RISC处理器上的实验结果表明,我们的技术可以生成特定应用的IS,在几个应用基准测试中,比本地IS节能高达40%。
{"title":"Energy-efficient instruction set synthesis for application-specific processors","authors":"Jongeun Lee, Kiyoung Choi, N. Dutt","doi":"10.1145/871506.871588","DOIUrl":"https://doi.org/10.1145/871506.871588","url":null,"abstract":"Several techniques have been proposed to enhance the energy-efficiency of ASIPs (Application-Specific Instruction set Processors). While those techniques can reduce the energy consumption with a minimal change in the instruction set (IS), they fail to exploit the opportunity of designing the entire IS from the energy-efficiency perspective. In this paper, we present an energy-efficient IS synthesis technique that can comprehensively reduce the energy-delay product (EDP) of ASIPs through optimal instruction encoding, considering both the instruction bitwidth and the dynamic instruction count. Experimental results with a typical embedded RISC processor show that our technique can generate application-specific IS's that are up to 40% more energy-efficient over the native IS for several application benchmarks.","PeriodicalId":355883,"journal":{"name":"Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115332409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
期刊
Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003. ISLPED '03.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1