首页 > 最新文献

ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)最新文献

英文 中文
Cycle-accurate energy consumption measurement and analysis: Case study of ARM7TDMI 周期精确的能耗测量与分析:以ARM7TDMI为例
N. Chang, Kwanho Kim, H. Lee
We introduce an energy consumption analysis of complex digital systems through a case study of ARM7TDMI RISC processor by using a new energy measurement technique. We developed a cycle-accurate energy consumption measurement system based on charge transfer which is robust to spiky noise and is capable of collecting a range of power consumption profiles in real time. The relative energy variation of the RISC core is measured by changing the op-code, the instruction fetch address, the register number, the register value, the data fetch address, and the immediate operand value in each pipeline stage, respectively. We demonstrated energy characterization of a pipelined RISC processor for high-level power reduction.
本文以ARM7TDMI RISC处理器为例,介绍了一种新的能量测量技术对复杂数字系统的能耗分析。我们开发了一种基于电荷转移的循环精确能耗测量系统,该系统对尖噪声具有鲁棒性,并且能够实时收集一系列功耗曲线。通过改变每个流水线阶段的操作码、指令取地址、寄存器号、寄存器值、数据取地址和直接操作数值来测量RISC内核的相对能量变化。我们演示了一种流水线RISC处理器的能量表征,以实现高水平的功耗降低。
{"title":"Cycle-accurate energy consumption measurement and analysis: Case study of ARM7TDMI","authors":"N. Chang, Kwanho Kim, H. Lee","doi":"10.1145/344166.344576","DOIUrl":"https://doi.org/10.1145/344166.344576","url":null,"abstract":"We introduce an energy consumption analysis of complex digital systems through a case study of ARM7TDMI RISC processor by using a new energy measurement technique. We developed a cycle-accurate energy consumption measurement system based on charge transfer which is robust to spiky noise and is capable of collecting a range of power consumption profiles in real time. The relative energy variation of the RISC core is measured by changing the op-code, the instruction fetch address, the register number, the register value, the data fetch address, and the immediate operand value in each pipeline stage, respectively. We demonstrated energy characterization of a pipelined RISC processor for high-level power reduction.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124684894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 106
Achieving utility arbitrarily close to the optimal with limited energy 以有限的能量实现任意接近最优的效用
G. Qu, M. Potkonjak
Energy is one of the limited resources for modern systems, especially battery-operated devices and personal digital assistants. The backlog in new technologies for a more powerful battery is changing the traditional system design philosophies. For example, due to the limitation on battery life, it is more realistic to design for the optimal benefit from limited resource rather than design to meet all the applications' requirement. We consider the following problem: a system achieves a certain amount of utility from a set of applications by providing certain levels of quality of service (QoS). We want to allocate the limited system resources to get the maximal system utility. We formulate this utility maximization problem, which is NP-hard in general, and propose heuristic algorithms that are capable of finding solutions provably arbitrarily close to the optimal. We have also derived explicit formulae to guide the allocation of resources to actually achieve such solutions. Simulation shows that our approach can use 99.9% of the given resource to achieve 25.6% and 32.17% more system utilities over two other heuristics, while providing QoS guarantees to the application program.
能源是现代系统的有限资源之一,尤其是电池驱动的设备和个人数字助理。为制造更强大的电池而积压的新技术正在改变传统的系统设计理念。例如,由于电池寿命的限制,从有限的资源中获得最佳效益的设计比满足所有应用需求的设计更现实。我们考虑以下问题:系统通过提供一定级别的服务质量(QoS),从一组应用程序中获得一定数量的效用。我们希望分配有限的系统资源以获得最大的系统效用。我们提出了这种效用最大化问题,它通常是np困难的,并提出了启发式算法,能够找到可证明的任意接近最优解的解。我们还推导出明确的公式来指导资源的分配,以实际实现这种解决办法。仿真结果表明,该方法在使用99.9%的给定资源的情况下,比其他两种启发式方法分别获得25.6%和32.17%的系统效用,同时为应用程序提供了QoS保证。
{"title":"Achieving utility arbitrarily close to the optimal with limited energy","authors":"G. Qu, M. Potkonjak","doi":"10.1145/344166.344545","DOIUrl":"https://doi.org/10.1145/344166.344545","url":null,"abstract":"Energy is one of the limited resources for modern systems, especially battery-operated devices and personal digital assistants. The backlog in new technologies for a more powerful battery is changing the traditional system design philosophies. For example, due to the limitation on battery life, it is more realistic to design for the optimal benefit from limited resource rather than design to meet all the applications' requirement. We consider the following problem: a system achieves a certain amount of utility from a set of applications by providing certain levels of quality of service (QoS). We want to allocate the limited system resources to get the maximal system utility. We formulate this utility maximization problem, which is NP-hard in general, and propose heuristic algorithms that are capable of finding solutions provably arbitrarily close to the optimal. We have also derived explicit formulae to guide the allocation of resources to actually achieve such solutions. Simulation shows that our approach can use 99.9% of the given resource to achieve 25.6% and 32.17% more system utilities over two other heuristics, while providing QoS guarantees to the application program.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126121561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A low power unified cache architecture providing power and performance flexibility 低功耗统一缓存架构,提供功耗和性能灵活性
Afzal Malik, B. Moyer, D. Čermák
Advances in technology have allowed portable electronic devices to become smaller and more complex, placing stringent power and performance requirements on the device's components. The M.CORE M3 architecture was developed specifically for these embedded applications. To address the growing need for longer battery life and higher performance, an 8-Kbyte, 4-way set-associative, unified (instruction and data) cache with programmable features was added to the M3 core. These features allow the architecture to be optimized based on the application's requirements. In this paper we focus on the features of the M340 cache sub-system and illustrate the effect on power and performance through benchmark analysis and actual silicon measurements.
技术的进步使便携式电子设备变得更小、更复杂,对设备的组件提出了严格的功率和性能要求。M.CORE M3架构是专门为这些嵌入式应用程序开发的。为了满足日益增长的对更长的电池寿命和更高性能的需求,M3内核中添加了一个具有可编程功能的8kbyte、4路集合关联、统一(指令和数据)缓存。这些特性允许根据应用程序的需求对体系结构进行优化。本文重点介绍了M340高速缓存子系统的特点,并通过基准分析和实际硅测量说明其对功耗和性能的影响。
{"title":"A low power unified cache architecture providing power and performance flexibility","authors":"Afzal Malik, B. Moyer, D. Čermák","doi":"10.1145/344166.344610","DOIUrl":"https://doi.org/10.1145/344166.344610","url":null,"abstract":"Advances in technology have allowed portable electronic devices to become smaller and more complex, placing stringent power and performance requirements on the device's components. The M.CORE M3 architecture was developed specifically for these embedded applications. To address the growing need for longer battery life and higher performance, an 8-Kbyte, 4-way set-associative, unified (instruction and data) cache with programmable features was added to the M3 core. These features allow the architecture to be optimized based on the application's requirements. In this paper we focus on the features of the M340 cache sub-system and illustrate the effect on power and performance through benchmark analysis and actual silicon measurements.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128063233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 271
Voltage dependent gate capacitance and its impact in estimating power and delay of CMOS digital circuits with low supply voltage 电压依赖性门电容及其对低电源电压CMOS数字电路功率和时延估计的影响
K. Nose, S. Chae, T. Sakurai
Gate capacitance has complex voltage dependency on terminal voltages but the impact of this voltage dependency of gate capacitance on power and delay has not been fully investigated, especially, in low-voltage, low-power designs. Introducing an effective gate capacitance, C/sub G,eff/ it is shown that the power and delay of CMOS digital circuit can be estimated accurately. C/sub G,eff/ is a strong function of V/sub TH//V/sub DD/ and V/sub TH//V/sub DD/ tends to increase in low-voltage region. Hence, the effective capacitance relative to oxide capacitance, C/sub OX/, is decreasing in low-voltage, low-power designs. Therefore, considering C/sub G,eff/ in accurate power and delay estimation becomes more important in the future.
门电容对终端电压有复杂的电压依赖性,但对这种电压依赖性对功率和延迟的影响尚未得到充分研究,特别是在低压、低功耗设计中。通过引入有效栅极电容C/sub G、eff/,可以准确估计CMOS数字电路的功率和时延。C/sub G,eff/是V/sub TH//V/sub DD/的强函数,V/sub TH//V/sub DD/在低压区趋于增大。因此,在低电压、低功耗设计中,相对于氧化物电容的有效电容C/sub OX/正在减小。因此,考虑C/sub / G、eff/ in准确的功率和时延估计在未来变得更加重要。
{"title":"Voltage dependent gate capacitance and its impact in estimating power and delay of CMOS digital circuits with low supply voltage","authors":"K. Nose, S. Chae, T. Sakurai","doi":"10.1145/344166.344601","DOIUrl":"https://doi.org/10.1145/344166.344601","url":null,"abstract":"Gate capacitance has complex voltage dependency on terminal voltages but the impact of this voltage dependency of gate capacitance on power and delay has not been fully investigated, especially, in low-voltage, low-power designs. Introducing an effective gate capacitance, C/sub G,eff/ it is shown that the power and delay of CMOS digital circuit can be estimated accurately. C/sub G,eff/ is a strong function of V/sub TH//V/sub DD/ and V/sub TH//V/sub DD/ tends to increase in low-voltage region. Hence, the effective capacitance relative to oxide capacitance, C/sub OX/, is decreasing in low-voltage, low-power designs. Therefore, considering C/sub G,eff/ in accurate power and delay estimation becomes more important in the future.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"212 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117119650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Operating-system directed power reduction 操作系统导向的功率降低
Yung-Hsiang Lu, L. Benini, G. Micheli
This paper presents a new approach for power reduction by taking a global, software-centric view. It analyzes the sources of power consumption: tasks that require services from hardware components. When a component is not used by any task, it can enter a sleeping state to save power. Operating systems have detailed information about tasks; therefore, OS is the best place for power reduction. Our technique is effective in identifying hardware idleness and shutting down unused components. We implement this technique in Linux and show that it can save more than 50% power compared to traditional hardware-centric shutdown techniques.
本文提出了一种以全局、以软件为中心的观点来降低功耗的新方法。它分析功耗的来源:需要硬件组件提供服务的任务。当一个组件没有被任何任务使用时,它可以进入休眠状态以节省电量。操作系统有关于任务的详细信息;因此,操作系统是降低功耗的最佳场所。我们的技术在识别硬件空闲和关闭未使用的组件方面是有效的。我们在Linux中实现了这种技术,并证明与传统的以硬件为中心的关机技术相比,它可以节省50%以上的电力。
{"title":"Operating-system directed power reduction","authors":"Yung-Hsiang Lu, L. Benini, G. Micheli","doi":"10.1145/344166.344189","DOIUrl":"https://doi.org/10.1145/344166.344189","url":null,"abstract":"This paper presents a new approach for power reduction by taking a global, software-centric view. It analyzes the sources of power consumption: tasks that require services from hardware components. When a component is not used by any task, it can enter a sleeping state to save power. Operating systems have detailed information about tasks; therefore, OS is the best place for power reduction. Our technique is effective in identifying hardware idleness and shutting down unused components. We implement this technique in Linux and show that it can save more than 50% power compared to traditional hardware-centric shutdown techniques.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"216 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114322858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 146
High-level power estimation with interconnect effects 具有互连效应的高级功率估计
Kavel M. Büyüksahin, F. Najm
We extend earlier work on high-level average power estimation to include the power due to interconnect loading. The resulting technique is a combination of an RTL-level gate count prediction method and average interconnect estimation based on Rent's rule. The method can be adapted to be used with different place and route engines and standard cell libraries. For a number of benchmark circuits, the method is verified by extracting wire lengths from a layout of each circuit and then comparing the predicted (at RTL) power against that measured using SPICE. An average error of 14.4% is obtained for the average interconnect length, and an average error of 25.8% is obtained for average power estimation including interconnect effects.
我们扩展了先前关于高级平均功率估计的工作,以包括由于互连负载引起的功率。由此产生的技术是rtl级门数预测方法和基于Rent规则的平均互连估计的结合。该方法可以适应不同的位置和路径引擎以及标准单元库。对于许多基准电路,通过从每个电路的布局中提取导线长度,然后将预测(在RTL)功率与使用SPICE测量的功率进行比较,验证了该方法。平均互连长度估计的平均误差为14.4%,考虑互连效应的平均功率估计的平均误差为25.8%。
{"title":"High-level power estimation with interconnect effects","authors":"Kavel M. Büyüksahin, F. Najm","doi":"10.1145/344166.344581","DOIUrl":"https://doi.org/10.1145/344166.344581","url":null,"abstract":"We extend earlier work on high-level average power estimation to include the power due to interconnect loading. The resulting technique is a combination of an RTL-level gate count prediction method and average interconnect estimation based on Rent's rule. The method can be adapted to be used with different place and route engines and standard cell libraries. For a number of benchmark circuits, the method is verified by extracting wire lengths from a layout of each circuit and then comparing the predicted (at RTL) power against that measured using SPICE. An average error of 14.4% is obtained for the average interconnect length, and an average error of 25.8% is obtained for average power estimation including interconnect effects.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127420348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Design issues for Dynamic Voltage Scaling 动态电压缩放的设计问题
T. Burd, R. Brodersen
Processors in portable electronic devices generally have a computational load which has time-varying performance requirements. Dynamic Voltage Scaling is a method to vary the processor's supply voltage so that it consumes the minimal amount of energy by operating at the minimum performance level required by the active software processes. A dynamically varying supply voltage has implications on the processor circuit design and design flow, but with some minimal constraints it is straightforward to design a processor with this capability.
便携式电子设备中的处理器通常具有具有时变性能要求的计算负载。动态电压缩放是一种改变处理器供电电压的方法,以便通过在活动软件进程所需的最低性能水平上运行来消耗最少的能量。动态变化的电源电压对处理器电路设计和设计流程有影响,但在一些最小的限制下,设计具有这种能力的处理器是很简单的。
{"title":"Design issues for Dynamic Voltage Scaling","authors":"T. Burd, R. Brodersen","doi":"10.1145/344166.344181","DOIUrl":"https://doi.org/10.1145/344166.344181","url":null,"abstract":"Processors in portable electronic devices generally have a computational load which has time-varying performance requirements. Dynamic Voltage Scaling is a method to vary the processor's supply voltage so that it consumes the minimal amount of energy by operating at the minimum performance level required by the active software processes. A dynamically varying supply voltage has implications on the processor circuit design and design flow, but with some minimal constraints it is straightforward to design a processor with this capability.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121144454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 398
A recursive algorithm for low-power memory partitioning 低功耗内存分区的递归算法
L. Benini, A. Macii, M. Poncino
Memory-processor integration offers new opportunities for reducing the energy of a system. In the case of embedded systems, one solution consists of mapping the most frequently accessed addresses onto the on-chip SRAM to guarantee power and performance efficiency. This option is especially effective when memory access patterns can be profiled and studied at design time (as in typical real-time embedded systems). In this work, we propose an algorithm for the automatic partitioning of on-chip SRAM in multiple banks that can be independently accessed. Starting from the dynamic execution profile of an embedded application running on a given processor core, we synthesize a multi-banked SRAM architecture optimally fitted to the execution profile. The algorithm provides a globally optimum solution to the problem under realistic assumptions on the power cost metrics, and with constraints on the number of memory banks. Results, collected on a set of embedded applications for the ARM processor, have shown average energy savings around 42%.
内存处理器集成为降低系统能耗提供了新的机会。在嵌入式系统的情况下,一种解决方案包括将最频繁访问的地址映射到片上SRAM上,以保证功率和性能效率。当可以在设计时分析和研究内存访问模式时(如在典型的实时嵌入式系统中),此选项特别有效。在这项工作中,我们提出了一种可以独立访问的多组片上SRAM自动分区的算法。从运行在给定处理器核心上的嵌入式应用程序的动态执行配置文件开始,我们合成了一个最适合执行配置文件的多银行SRAM架构。该算法在实际的电力成本指标假设下,在存储库数量约束下,提供了问题的全局最优解。在一组ARM处理器的嵌入式应用程序上收集的结果显示,平均节能约为42%。
{"title":"A recursive algorithm for low-power memory partitioning","authors":"L. Benini, A. Macii, M. Poncino","doi":"10.1145/344166.344518","DOIUrl":"https://doi.org/10.1145/344166.344518","url":null,"abstract":"Memory-processor integration offers new opportunities for reducing the energy of a system. In the case of embedded systems, one solution consists of mapping the most frequently accessed addresses onto the on-chip SRAM to guarantee power and performance efficiency. This option is especially effective when memory access patterns can be profiled and studied at design time (as in typical real-time embedded systems). In this work, we propose an algorithm for the automatic partitioning of on-chip SRAM in multiple banks that can be independently accessed. Starting from the dynamic execution profile of an embedded application running on a given processor core, we synthesize a multi-banked SRAM architecture optimally fitted to the execution profile. The algorithm provides a globally optimum solution to the problem under realistic assumptions on the power cost metrics, and with constraints on the number of memory banks. Results, collected on a set of embedded applications for the ARM processor, have shown average energy savings around 42%.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134614436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
An asynchronous matrix-vector multiplier for discrete cosine transform 离散余弦变换的异步矩阵向量乘法器
Kyeounsoo Kim, P. Beerel, Youpyo Hong
This paper proposes an efficient asynchronous hardwired matrix-vector multiplier for the two-dimensional discrete cosine transform and inverse discrete cosine transform (DCT/IDCT). The design achieves low power and high performance by taking advantage of the typically large fraction of zero and small-valued data in DCT and IDCT applications. In particular, it skips multiplication by zero and dynamically activates/deactivates required bit-slices of fine-grain bit-partitioned adders using simplified, static-logic-based speculative completion sensing. The results extracted by both bit-level analysis and HSPICE simulations indicate significant improvements compared to traditional designs.
针对二维离散余弦变换和离散逆余弦变换(DCT/IDCT),提出了一种高效的异步硬连线矩阵向量乘法器。该设计通过利用DCT和IDCT应用中典型的大比例零和小值数据,实现了低功耗和高性能。特别是,它跳过了乘零,并使用简化的、基于静态逻辑的推测完井感知来动态激活/取消激活所需的细粒度位分割加法器的位片。比特级分析和HSPICE仿真结果表明,与传统设计相比,该设计有显著改进。
{"title":"An asynchronous matrix-vector multiplier for discrete cosine transform","authors":"Kyeounsoo Kim, P. Beerel, Youpyo Hong","doi":"10.1145/344166.344621","DOIUrl":"https://doi.org/10.1145/344166.344621","url":null,"abstract":"This paper proposes an efficient asynchronous hardwired matrix-vector multiplier for the two-dimensional discrete cosine transform and inverse discrete cosine transform (DCT/IDCT). The design achieves low power and high performance by taking advantage of the typically large fraction of zero and small-valued data in DCT and IDCT applications. In particular, it skips multiplication by zero and dynamically activates/deactivates required bit-slices of fine-grain bit-partitioned adders using simplified, static-logic-based speculative completion sensing. The results extracted by both bit-level analysis and HSPICE simulations indicate significant improvements compared to traditional designs.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128767109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
High-speed dynamic logic styles for scaled-down CMOS and MTCMOS technologies 用于缩小CMOS和MTCMOS技术的高速动态逻辑样式
M. Allam, M. Anis, M. Elmasry
A new high-speed domino circuit, called HS-Domino is developed. HS-Domino resolves the trade-off between performance and noise margins in conventional CD-Domino logic while dissipating low dynamic power with minimal area overhead. A dual-threshold (MTCMOS) implementation of HS-Domino and DDCVS logic is also devised. This implementation achieves low leakage values during standby, while maintaining high performance and low dynamic power during the active mode.
开发了一种新的高速多米诺电路,称为HS-Domino。HS-Domino解决了传统CD-Domino逻辑中性能和噪声裕度之间的权衡,同时以最小的面积开销消耗低动态功率。还设计了HS-Domino和DDCVS逻辑的双阈值(MTCMOS)实现。这种实现在待机时实现低泄漏值,同时在活动模式下保持高性能和低动态功率。
{"title":"High-speed dynamic logic styles for scaled-down CMOS and MTCMOS technologies","authors":"M. Allam, M. Anis, M. Elmasry","doi":"10.1145/344166.344562","DOIUrl":"https://doi.org/10.1145/344166.344562","url":null,"abstract":"A new high-speed domino circuit, called HS-Domino is developed. HS-Domino resolves the trade-off between performance and noise margins in conventional CD-Domino logic while dissipating low dynamic power with minimal area overhead. A dual-threshold (MTCMOS) implementation of HS-Domino and DDCVS logic is also devised. This implementation achieves low leakage values during standby, while maintaining high performance and low dynamic power during the active mode.","PeriodicalId":188020,"journal":{"name":"ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117212540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 129
期刊
ISLPED'00: Proceedings of the 2000 International Symposium on Low Power Electronics and Design (Cat. No.00TH8514)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1