首页 > 最新文献

ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)最新文献

英文 中文
Fast, predictable and low energy memory references through architecture-aware compilation 通过架构感知编译实现快速、可预测和低能耗的内存引用
P. Marwedel, L. Wehmeyer, Manish Verma, S. Steinke, Urs Helmig
The design of future high-performance embedded systems is hampered by two problems: First, the required hardware needs more energy than is available from batteries. Second, current cache-based approaches for bridging the increasing speed gap between processors and memories cannot guarantee predictable real-time behavior. A contribution to solving both problems is made which describes a comprehensive set of algorithms that can be applied at design time in order to maximally exploit scratch pad memories (SPMs). We show that both the energy consumption as well as the computed worst case execution time (WCET) can be reduced by up to to 80% and 48%, respectively, by establishing a strong link between the memory architecture and the compiler.
未来高性能嵌入式系统的设计受到两个问题的阻碍:首先,所需的硬件需要比电池提供的能量更多的能量。其次,当前用于弥合处理器和内存之间日益增长的速度差距的基于缓存的方法不能保证可预测的实时行为。为解决这两个问题作出了贡献,它描述了一套全面的算法,可以在设计时应用,以最大限度地利用刮擦板存储器(spm)。我们表明,通过在内存架构和编译器之间建立强大的联系,能耗和计算出的最坏情况执行时间(WCET)分别可以减少高达80%和48%。
{"title":"Fast, predictable and low energy memory references through architecture-aware compilation","authors":"P. Marwedel, L. Wehmeyer, Manish Verma, S. Steinke, Urs Helmig","doi":"10.1109/ASPDAC.2004.1337530","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337530","url":null,"abstract":"The design of future high-performance embedded systems is hampered by two problems: First, the required hardware needs more energy than is available from batteries. Second, current cache-based approaches for bridging the increasing speed gap between processors and memories cannot guarantee predictable real-time behavior. A contribution to solving both problems is made which describes a comprehensive set of algorithms that can be applied at design time in order to maximally exploit scratch pad memories (SPMs). We show that both the energy consumption as well as the computed worst case execution time (WCET) can be reduced by up to to 80% and 48%, respectively, by establishing a strong link between the memory architecture and the compiler.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123988110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
Minimization of memory size for heterogeneous MDDs 最小化异构mdd的内存大小
Shinobu Nagayama, Tsutomu Sasao
In this paper, we pmpose exact and heuristic algorithms for minimizing the memory size for heterogeneous Multivalued Decision Diagrams (MDh). In a heterogeneous MDD, each multi-valued variable can take a different domain. To represen1 a binary logic hnclion using a heterogeneous MDD, we partition the binary variables into gmnps, and treat the groups as multi-valued variables. Therefore, the memory size of a hetemgeneous MDD depends on the partition of the binary variables. Our experimental results show that heterogeneous MDDs repuim smaller memory size than Reduced Ordered Binary Decision Diagrams (ROBDDs) and Free BDDs (FBDDs).
在本文中,我们提出了精确的启发式算法来最小化异构多值决策图(MDh)的内存大小。在异构MDD中,每个多值变量可以采用不同的域。为了使用异构MDD表示二进制逻辑节点,我们将二进制变量划分为gmnps,并将这些组视为多值变量。因此,异构MDD的内存大小取决于二进制变量的分区。我们的实验结果表明,异构mdd比简化有序二进制决策图(robdd)和自由bdd (fbdd)需要更小的内存大小。
{"title":"Minimization of memory size for heterogeneous MDDs","authors":"Shinobu Nagayama, Tsutomu Sasao","doi":"10.1109/ASPDAC.2004.1337717","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337717","url":null,"abstract":"In this paper, we pmpose exact and heuristic algorithms for minimizing the memory size for heterogeneous Multivalued Decision Diagrams (MDh). In a heterogeneous MDD, each multi-valued variable can take a different domain. To represen1 a binary logic hnclion using a heterogeneous MDD, we partition the binary variables into gmnps, and treat the groups as multi-valued variables. Therefore, the memory size of a hetemgeneous MDD depends on the partition of the binary variables. Our experimental results show that heterogeneous MDDs repuim smaller memory size than Reduced Ordered Binary Decision Diagrams (ROBDDs) and Free BDDs (FBDDs).","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130780164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A reliable low-power fast skew-compensation circuit 一种可靠的低功耗快速偏斜补偿电路
Yi-Ming Wang, Jinn-Shyan Wang
A reliable low-power fast skew-compensation circuit is proposed. Operating on the clock with a 50% duty cycle, the new design is more reliable compared to conventional SMD-based circuits [1]-[3], which can operate only on the pulsed clock. This new circuit also gets phase locking within two clock cycles. The test circuit works successfully between 600-MHz ~ 800-MHz with a power consumption of 25-μW/MHz ~ 26-μW/MHz. When measured at 616.9-MHz and 791.4-MHz, the static phase is 76.8-ps and 124.5-ps, respectively.
提出了一种可靠的低功耗快速偏斜补偿电路。与传统的基于smd的电路相比,新设计在50%占空比的时钟上工作更加可靠[1]-[3],传统的smd电路只能在脉冲时钟上工作。这种新电路也能在两个时钟周期内实现锁相。测试电路工作在600 ~ 800 MHz范围内,功耗为25 μ w /MHz ~ 26 μ w /MHz。当测量频率为616.9 mhz和791.4 mhz时,静态相位分别为76.8 ps和124.5 ps。
{"title":"A reliable low-power fast skew-compensation circuit","authors":"Yi-Ming Wang, Jinn-Shyan Wang","doi":"10.1109/ASPDAC.2004.1337642","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337642","url":null,"abstract":"A reliable low-power fast skew-compensation circuit is proposed. Operating on the clock with a 50% duty cycle, the new design is more reliable compared to conventional SMD-based circuits [1]-[3], which can operate only on the pulsed clock. This new circuit also gets phase locking within two clock cycles. The test circuit works successfully between 600-MHz ~ 800-MHz with a power consumption of 25-μW/MHz ~ 26-μW/MHz. When measured at 616.9-MHz and 791.4-MHz, the static phase is 76.8-ps and 124.5-ps, respectively.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"47 45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127531086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using positive equality to prove liveness for pipelined microprocessors 用正等式证明流水线微处理器的活动性
M. Velev
We present an indirect method to automatically prove liveness for pipelined microprocessors. This is done by first proving safety-correctness for one step, starting from an arbitrary initial state that is possibly restricted by invariant constraints. By induction, the implementation will be correct for any number of steps; we need to prove that for some fixed number of steps, n, the implementation will fetch at least one instruction that will be completed. This was proved efficiently by using the property of positive equality. Modeling restrictions made the method applicable to designs with exceptions and branch prediction. The indirect method and the modeling restrictions resulted in 4 orders of magnitude speedup, enabling the automatic live-ness proof for dual-issue superscalar and VLIW designs.
我们提出了一种间接的方法来自动证明流水线微处理器的活动性。要做到这一点,首先要证明一步的安全正确性,从可能受到不变约束限制的任意初始状态开始。通过归纳,实现对于任何数量的步骤都是正确的;我们需要证明,对于某个固定的步数n,实现将获取至少一条将被完成的指令。利用正等式的性质有效地证明了这一点。建模限制使得该方法适用于具有异常和分支预测的设计。间接方法和建模限制使速度提高了4个数量级,实现了双问题超标量和VLIW设计的自动动态证明。
{"title":"Using positive equality to prove liveness for pipelined microprocessors","authors":"M. Velev","doi":"10.1109/ASPDAC.2004.1337588","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337588","url":null,"abstract":"We present an indirect method to automatically prove liveness for pipelined microprocessors. This is done by first proving safety-correctness for one step, starting from an arbitrary initial state that is possibly restricted by invariant constraints. By induction, the implementation will be correct for any number of steps; we need to prove that for some fixed number of steps, n, the implementation will fetch at least one instruction that will be completed. This was proved efficiently by using the property of positive equality. Modeling restrictions made the method applicable to designs with exceptions and branch prediction. The indirect method and the modeling restrictions resulted in 4 orders of magnitude speedup, enabling the automatic live-ness proof for dual-issue superscalar and VLIW designs.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127537403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Dynamic voltage scaling of periodic and aperiodic tasks in priority-driven systems 优先级驱动系统中周期性和非周期性任务的动态电压缩放
Dongkun Shin, Jihong Kim
We describe dynamic voltage scaling (DVS) algorithms for real-time systems with both periodic and aperiodic tasks. Although many DVS algorithms have been developed for real-time systems with periodic tasks, none of them can he used for the system with both periodic and aperiodic tasks because of arbitrary temporal hehaviors of aperiodic tasks. We propose an off-line DVS algorithm and on-line DVS algorithms that are based on existing DVS algorithms. The proposed algorithms utilize the execution behaviors of scheduling server for aperiodic tasks. Experimental results show that the proposed algorithms reduce the energy consumption by 12% and 32% under the RM scheduling policy and the EDF scheduling policy, respectively.
我们描述了具有周期性和非周期性任务的实时系统的动态电压缩放(DVS)算法。尽管针对具有周期性任务的实时系统已经开发了许多分布式交换机算法,但由于非周期性任务的任意时间行为,它们都不能用于同时具有周期性和非周期性任务的系统。我们提出了基于现有分布式交换机算法的离线分布式交换机算法和在线分布式交换机算法。该算法利用调度服务器对非周期任务的执行行为。实验结果表明,该算法在RM调度策略和EDF调度策略下分别降低了12%和32%的能耗。
{"title":"Dynamic voltage scaling of periodic and aperiodic tasks in priority-driven systems","authors":"Dongkun Shin, Jihong Kim","doi":"10.1109/ASPDAC.2004.1337673","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337673","url":null,"abstract":"We describe dynamic voltage scaling (DVS) algorithms for real-time systems with both periodic and aperiodic tasks. Although many DVS algorithms have been developed for real-time systems with periodic tasks, none of them can he used for the system with both periodic and aperiodic tasks because of arbitrary temporal hehaviors of aperiodic tasks. We propose an off-line DVS algorithm and on-line DVS algorithms that are based on existing DVS algorithms. The proposed algorithms utilize the execution behaviors of scheduling server for aperiodic tasks. Experimental results show that the proposed algorithms reduce the energy consumption by 12% and 32% under the RM scheduling policy and the EDF scheduling policy, respectively.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126794469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
Instruction set and functional unit synthesis for SIMD processor cores SIMD处理器核心的指令集和功能单元合成
N. Togawa, K. Tachikake, Yuichiro Miyaoka, M. Yanagisawa, T. Ohtsuki
This paper focuses on SlMD processor synthesis and proposes a SIMD instruction setlfunctional unit synthesis algorithm. Given an initial assembly code and a timing constraint, the proposed algorithm synthesizes an area-optimized processor core with optimal SIMD functional units. It also synthesizes a SIMD instruction set. The input initial assemhly code is assumed to run on a full-resource SIMD processor (virtual processor) which has all the possible SIMD functional units. In our algorithm, we introduce the SIMD operation decomposition and apply it to the initial assembly code and the full-resource SIMD processor. By gradually reducing SIMD operations or decomposing SIMD operations, we can finally find a processor core with small area under the given timing constraint. The promising experimental results are also shown.
针对SlMD处理器的综合,提出了一种SIMD指令集功能单元综合算法。在给定初始汇编代码和时间约束的情况下,该算法综合了具有最优SIMD功能单元的区域优化处理器内核。它还综合了一个SIMD指令集。假定输入的初始汇编代码在具有所有可能的SIMD功能单元的全资源SIMD处理器(虚拟处理器)上运行。在该算法中,我们引入了SIMD运算分解,并将其应用于初始汇编代码和全资源SIMD处理器。通过逐步减少SIMD操作或分解SIMD操作,我们最终可以在给定的时间约束下找到一个面积较小的处理器核心。并给出了有希望的实验结果。
{"title":"Instruction set and functional unit synthesis for SIMD processor cores","authors":"N. Togawa, K. Tachikake, Yuichiro Miyaoka, M. Yanagisawa, T. Ohtsuki","doi":"10.1109/ASPDAC.2004.1337692","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337692","url":null,"abstract":"This paper focuses on SlMD processor synthesis and proposes a SIMD instruction setlfunctional unit synthesis algorithm. Given an initial assembly code and a timing constraint, the proposed algorithm synthesizes an area-optimized processor core with optimal SIMD functional units. It also synthesizes a SIMD instruction set. The input initial assemhly code is assumed to run on a full-resource SIMD processor (virtual processor) which has all the possible SIMD functional units. In our algorithm, we introduce the SIMD operation decomposition and apply it to the initial assembly code and the full-resource SIMD processor. By gradually reducing SIMD operations or decomposing SIMD operations, we can finally find a processor core with small area under the given timing constraint. The promising experimental results are also shown.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114555760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimal design of high fan-in multiplexers via mixed-integer nonlinear programming 基于混合整数非线性规划的高扇入多路复用器优化设计
H. Huang, Cheng-Yeh Wang, Jing-Yang Jou
A novel strategy for designing the heterogeneous-tree multiplexer is proposed. We build the multiplexer delay model by curve fitting and then formulate the heterogeneous-tree multiplexer design problem as a special type of optimization problem called mixed-integer nonlinear programming (MINLP). A new design parameter, the switch size in each stage, is introduced to improve the speed of the heterogeneous-tree multiplexer. The proposed strategy can determine the multiplexer architecture and the switch size in each stage simultaneously. Three optimization methods are provided to synthesize the heterogeneous-tree multiplexer according to the design specifications.
提出了一种设计异构树多路复用器的新策略。采用曲线拟合的方法建立了多路复用器的时延模型,并将异构树多路复用器设计问题表述为一种特殊的优化问题——混合整数非线性规划(MINLP)。为了提高异构树多路复用器的速度,引入了一种新的设计参数——每级开关尺寸。该策略可以同时确定每一级的多路复用器结构和开关大小。根据设计规范,提出了三种优化方法来综合异构树多路复用器。
{"title":"Optimal design of high fan-in multiplexers via mixed-integer nonlinear programming","authors":"H. Huang, Cheng-Yeh Wang, Jing-Yang Jou","doi":"10.1109/ASPDAC.2004.1337580","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337580","url":null,"abstract":"A novel strategy for designing the heterogeneous-tree multiplexer is proposed. We build the multiplexer delay model by curve fitting and then formulate the heterogeneous-tree multiplexer design problem as a special type of optimization problem called mixed-integer nonlinear programming (MINLP). A new design parameter, the switch size in each stage, is introduced to improve the speed of the heterogeneous-tree multiplexer. The proposed strategy can determine the multiplexer architecture and the switch size in each stage simultaneously. Three optimization methods are provided to synthesize the heterogeneous-tree multiplexer according to the design specifications.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"34 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114099942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple specifications radio-frequency integrated circuit design with automatic template-driven layout retargeting 多规格射频集成电路设计与自动模板驱动布局重定位
N. Jangkrajarng, S. Bhattacharya, R. Hartono, C. Shi
We present an automatic layout retargeting tool that generates analog and RF layouts incorporating new device sizes and geometries based on new circuit specifications. A graph-based symbolic template is automatically constructed from a practical layout such that expert designer knowledge embedded in the layout is preserved. The template can be solved for multiple layouts based on different device sizes and geometries, satisfying several different specifications. Symmetry conservation and passive device modification are also embedded in the tool. The retargeting tool is demonstrated on a voltage controlled oscillator to generate three layouts with different target goals. While manual redesign is known to take days to finish, the automatic layout retargeting tool takes a few hours to generate a reusable template and takes minutes to generate comparable layouts.
我们提出了一种自动布局重定位工具,可根据新的电路规格生成包含新器件尺寸和几何形状的模拟和RF布局。基于图形的符号模板从实际布局自动构造,从而保留了嵌入在布局中的专家设计人员知识。该模板可以解决基于不同设备尺寸和几何形状的多种布局,满足几种不同的规格。该工具还嵌入了对称守恒和被动器件修改。在电压控制振荡器上演示了重定位工具,以生成具有不同目标的三种布局。众所周知,手动重新设计需要几天才能完成,而自动布局重定向工具需要几个小时来生成可重用的模板,需要几分钟来生成可比的布局。
{"title":"Multiple specifications radio-frequency integrated circuit design with automatic template-driven layout retargeting","authors":"N. Jangkrajarng, S. Bhattacharya, R. Hartono, C. Shi","doi":"10.1109/ASPDAC.2004.1337607","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337607","url":null,"abstract":"We present an automatic layout retargeting tool that generates analog and RF layouts incorporating new device sizes and geometries based on new circuit specifications. A graph-based symbolic template is automatically constructed from a practical layout such that expert designer knowledge embedded in the layout is preserved. The template can be solved for multiple layouts based on different device sizes and geometries, satisfying several different specifications. Symmetry conservation and passive device modification are also embedded in the tool. The retargeting tool is demonstrated on a voltage controlled oscillator to generate three layouts with different target goals. While manual redesign is known to take days to finish, the automatic layout retargeting tool takes a few hours to generate a reusable template and takes minutes to generate comparable layouts.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121543482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Exploiting program execution phases to trade power and performance for media workload 利用程序执行阶段来交换功率和性能以换取媒体工作负载
Subhasish Banerjee, G. Surendra, S. Nandy
Processing streaming media comprises several program phases (often distinct) that are periodic and independent of application data. Here we characterize execution of such programs into execution phases based on their dynamic IPC (instruction per cycle) profile. We show that program execution of selected phases can be dynamically boosted by activating additional standby functional units which are otherwise powered down for saving energy. Through simulation we show that speedup ranging from 1.1 to 1.25 can be achieved while reducing the energy-delay product (EDP) for most of the media benchmarks evaluated. Additionally we show that artificially introduced stalls during phases of processor underutilization reduces power by around 2 to 4%.
处理流媒体包括几个周期性且独立于应用程序数据的程序阶段(通常不同)。在这里,我们根据这些程序的动态IPC(每周期指令)配置文件将它们的执行分为执行阶段。我们表明,通过激活额外的备用功能单元,可以动态地提高选定阶段的程序执行速度,否则这些功能单元将被关闭以节省能源。通过仿真,我们表明,在降低能量延迟积(EDP)的同时,可以实现1.1到1.25的加速范围。此外,我们还表明,在处理器未充分利用的阶段人为引入停机可以减少大约2%到4%的功耗。
{"title":"Exploiting program execution phases to trade power and performance for media workload","authors":"Subhasish Banerjee, G. Surendra, S. Nandy","doi":"10.1109/ASPDAC.2004.1337605","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337605","url":null,"abstract":"Processing streaming media comprises several program phases (often distinct) that are periodic and independent of application data. Here we characterize execution of such programs into execution phases based on their dynamic IPC (instruction per cycle) profile. We show that program execution of selected phases can be dynamically boosted by activating additional standby functional units which are otherwise powered down for saving energy. Through simulation we show that speedup ranging from 1.1 to 1.25 can be achieved while reducing the energy-delay product (EDP) for most of the media benchmarks evaluated. Additionally we show that artificially introduced stalls during phases of processor underutilization reduces power by around 2 to 4%.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114973170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A low power asynchronous java processor for contactless smart card 用于非接触式智能卡的低功耗异步java处理器
Chun-Pong Yu, O. Choy, Hao Min, C. Chan, K. Pun
This paper presents the design of a low power 16-hit asynchronous java processor for contactless smart card. It can directly execute the java bytecodes in a subset ofthe instruction set defined in the Java Card Virtual Machine specification 111 The remaining java bytecodes are handled by software routines. Also, we intend to use asynchronous circuit design technique to reduce the power Consumption of the java processor core. It has been fabricated in a CMOS 0.35-μm technology and the experimental result shows that it is suitable for the contactless smart card.
本文设计了一种低功耗16路异步java非接触式智能卡处理器。它可以直接执行java卡虚拟机规范中定义的指令集子集中的java字节码,其余的java字节码由软件例程处理。此外,我们打算使用异步电路设计技术来降低java处理器核心的功耗。采用CMOS 0.35 μm工艺制作了该芯片,实验结果表明该芯片适用于非接触式智能卡。
{"title":"A low power asynchronous java processor for contactless smart card","authors":"Chun-Pong Yu, O. Choy, Hao Min, C. Chan, K. Pun","doi":"10.1109/ASPDAC.2004.1337645","DOIUrl":"https://doi.org/10.1109/ASPDAC.2004.1337645","url":null,"abstract":"This paper presents the design of a low power 16-hit asynchronous java processor for contactless smart card. It can directly execute the java bytecodes in a subset ofthe instruction set defined in the Java Card Virtual Machine specification 111 The remaining java bytecodes are handled by software routines. Also, we intend to use asynchronous circuit design technique to reduce the power Consumption of the java processor core. It has been fabricated in a CMOS 0.35-μm technology and the experimental result shows that it is suitable for the contactless smart card.","PeriodicalId":426349,"journal":{"name":"ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115287006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1