2014 Design, Automation & Test in Europe Conference & Exhibition (DATE): Latest Papers

Energy optimization in Android applications through wakelock placement
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.101
Faisal Alam, P. Panda, Nikhil Tripathi, Namita Sharma, Sanjiv Narayan
Energy efficiency is a critical factor in mobile systems, and a significant body of recent research has focused on reducing energy dissipation in mobile hardware and applications. The Android OS Power Manager provides programming interface routines called wakelocks for controlling the activation state of devices on a mobile system. Appropriate placement of wakelock acquire and release functions in an application can make a significant difference to its energy consumption. In this paper, we propose a strategy based on data flow analysis for determining the placement of wakelock statements corresponding to the uses of devices in an application. Our experimental evaluation on a set of Android applications shows significant (up to 32%) energy savings with the proposed optimization strategy.
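As a rough illustration of the placement idea in this abstract, the Python sketch below marks the region of a control-flow graph in which a device wakelock would need to be held, then reports the edges where acquire and release calls could go. It is not the authors' algorithm: the graph, the node names (`read_gps`, `upload`, and so on) and the function `place_wakelock` are hypothetical, and the analysis is a simple may-reachability approximation of a data-flow pass.

```python
def reachable(start_nodes, edges):
    """Return every node reachable from start_nodes (inclusive) along edges."""
    seen = set(start_nodes)
    stack = list(start_nodes)
    while stack:
        node = stack.pop()
        for nxt in edges.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def place_wakelock(succ, uses):
    """Hypothetical helper: find where a wakelock must be held in a CFG.

    succ maps each node to its successor list; uses is the set of nodes
    that touch the device. Returns (hold, acquire_edges, release_edges).
    """
    # Build the predecessor map so the graph can be walked backwards.
    pred = {}
    for node, targets in succ.items():
        for target in targets:
            pred.setdefault(target, []).append(node)

    did_use = reachable(uses, succ)   # forward: a use may already have happened
    will_use = reachable(uses, pred)  # backward: a use may still happen
    hold = did_use & will_use         # lock needed between first and last use

    # Acquire on edges entering the hold region, release on edges leaving it.
    acquire = sorted((p, n) for p, ts in succ.items()
                     for n in ts if n in hold and p not in hold)
    release = sorted((n, s) for n, ts in succ.items()
                     for s in ts if n in hold and s not in hold)
    return hold, acquire, release


if __name__ == "__main__":
    # Toy app: a GPS polling loop followed by an upload that needs no GPS.
    cfg = {
        "entry": ["init"],
        "init": ["read_gps"],
        "read_gps": ["process"],
        "process": ["read_gps", "upload"],
        "upload": ["done"],
        "done": [],
    }
    hold, acquire, release = place_wakelock(cfg, {"read_gps"})
    print("hold:", sorted(hold))
    print("acquire on edges:", acquire)
    print("release on edges:", release)
```

In this toy graph the lock is held only around the polling loop, so the upload step runs without it. A use located at the entry node has no incoming edge, so a real tool would need a separate rule for acquiring at function entry.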
Citations: 27
SuperRange: Wide operational range power delivery design for both STV and NTV computing
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.159
Xin He, Guihai Yan, Yinhe Han, Xiaowei Li
The load power range of modern processors has grown considerably because many advanced power management techniques, such as dynamic voltage frequency scaling, Turbo boosting, and Near Threshold Voltage technologies, are now incorporated. However, the power saving may be offset by power loss in power delivery; moreover, as the efficiency of power delivery varies greatly with load conditions, conventional power delivery designs cannot maintain high efficiency over the entire voltage range. We propose SuperRange, a wide operational range power delivery scheme. SuperRange complements the power delivery capabilities of the on-chip and off-chip voltage regulators. Experimental results show that SuperRange has an average 70% power conversion efficiency over a wide operational range, which outperforms conventional power delivery schemes. It also exhibits superior resilience to power-constrained systems.
Citations: 8
Characterizing power delivery systems with on/off-chip voltage regulators for many-core processors
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.060
Xuan Wang, Jiang Xu, Zhe Wang, K. J. Chen, Xiaowen Wu, Zhehui Wang
The design of the power delivery system has a great influence on power management in many-core processor systems. Moving voltage regulators from off-chip to on-chip is attracting growing interest in power delivery system design, because it can provide fast voltage scaling and multiple power domains. Previous work has proposed power-efficient on-chip regulators. It is also important to analyze the characteristics of the entire power delivery system to explore the tradeoff between the promising properties and the costs of employing on-chip regulators. In this work, we develop an analytical model to evaluate important characteristics of the power delivery system, including on-chip/off-chip voltage regulators and the passive on-chip/on-board parasitics. Compared with SPICE simulations, our model achieves a fast system-level evaluation with comparable accuracy. Based on the model, geometric programming is used to find the optimal power efficiency of different power delivery system architectures under constraints on output voltage stability and area. Experiments show that, compared with the conventional architecture using off-chip regulators, the hybrid architecture using both on-chip and off-chip voltage regulators achieves a 1.0% power efficiency improvement and a 68% reduction in voltage regulator area on average. We conclude that the hybrid architecture has potential for high power efficiency and small area under heavy workload, but careful accounting for the overhead of on-chip regulators is needed.
Citations: 6
Fast and accurate computation using stochastic circuits
Pub Date : 2014-03-24 DOI: 10.7873/DATE2014.089
Armin Alaghi, J. Hayes
Stochastic computing (SC) is a low-cost design technique that has great promise in applications such as image processing. SC enables arithmetic operations to be performed on stochastic bit-streams using ultra-small and low-power circuitry. However, accurate computations tend to require long run-times due to the random fluctuations inherent in stochastic numbers (SNs). We present novel techniques for SN generation that lead to better accuracy/run-time trade-offs. First, we analyze a property called progressive precision (PP) which allows computational accuracy to grow systematically with run-time. Second, borrowing from Monte Carlo methods, we show that SC performance can be greatly improved by replacing the usual pseudo-random number sources by low-discrepancy (LD) sequences that are predictably progressive. Finally, we evaluate the use of LD stochastic numbers in SC, and show they can produce significantly faster and more accurate results than existing stochastic designs.
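To make the accuracy/run-time trade-off concrete, here is a minimal Python sketch of unipolar SC multiplication (a bitwise AND of two bit-streams), with the stochastic numbers generated either from a pseudo-random source or from low-discrepancy van der Corput sequences. The choice of bases 2 and 3 for the two inputs, the stream length, and all function names are assumptions made for illustration; the abstract does not say which LD sequences the authors use.

```python
import random


def van_der_corput(i, base):
    """Return the i-th element of the base-`base` van der Corput sequence in [0, 1)."""
    value, denom = 0.0, 1.0
    while i > 0:
        i, digit = divmod(i, base)
        denom *= base
        value += digit / denom
    return value


def sn_from_samples(p, samples):
    """Encode probability p as a bit-stream: emit 1 when the sample falls below p."""
    return [1 if s < p else 0 for s in samples]


def sc_multiply(stream_a, stream_b):
    """Unipolar SC multiply: AND the streams bit by bit, then take the fraction of ones."""
    ones = sum(a & b for a, b in zip(stream_a, stream_b))
    return ones / len(stream_a)


def multiply_ld(pa, pb, n):
    """Multiply using LD sources; different bases keep the two streams uncorrelated."""
    a = sn_from_samples(pa, [van_der_corput(i, 2) for i in range(n)])
    b = sn_from_samples(pb, [van_der_corput(i, 3) for i in range(n)])
    return sc_multiply(a, b)


def multiply_prng(pa, pb, n, seed=0):
    """Multiply using a seeded pseudo-random source for both streams."""
    rng = random.Random(seed)
    a = sn_from_samples(pa, [rng.random() for _ in range(n)])
    b = sn_from_samples(pb, [rng.random() for _ in range(n)])
    return sc_multiply(a, b)


if __name__ == "__main__":
    exact = 0.5 * 0.5
    for n in (16, 144, 1296):
        ld_err = abs(multiply_ld(0.5, 0.5, n) - exact)
        pr_err = abs(multiply_prng(0.5, 0.5, n) - exact)
        print(f"n={n:5d}  LD error={ld_err:.4f}  PRNG error={pr_err:.4f}")
```

For inputs 0.5 and 0.5 with 1,296 bits, the LD version lands within 0.01 of the exact product 0.25, while the pseudo-random result varies with the seed. The LD streams are also deterministic, so a run can be truncated early and still give a usable lower-precision estimate, which is the flavour of the progressive precision property the abstract describes.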
Citations: 99
Modeling and analysis of digital linear dropout regulators with adaptive control for high efficiency under wide dynamic range digital loads
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.160
S. Gangopadhyay, Youngtak Lee, Saad Bin Nasir, A. Raychowdhury
Discrete time digital linear regulators, including low dropout regulators (LDOs), have become competitive in multi-Vcc digital systems for fine-grained spatio-temporal voltage regulation and distribution. However, the wide dynamic current range of the digital load circuits poses serious problems in maintaining stability and high efficiency at all corners. In this paper we present a control model for discrete time LDOs and demonstrate how online adaptive control can be employed for consistent performance and high efficiency across the load current range.
Citations: 30
Hybrid memory architecture for voltage scaling in ultra-low power multi-core biomedical processors
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.182
Daniele Bortolotti, Andrea Bartolini, C. Weis, D. Rossi, L. Benini
Technology scaling today enables the design of sensor-based ultra-low cost chips well suited for emerging applications such as wireless body sensor networks and urban life and environment monitoring. Energy consumption is the key limiting factor of this upcoming revolution, and memories are often the energy bottleneck, mainly due to leakage power. This paper proposes an ultra-low power multi-core architecture targeting eHealth monitoring systems, where applications involve the collection of sequences of slow biomedical signals and highly parallel computations at very low voltage. We propose a hybrid memory architecture that combines 6T-SRAM and 8T-SRAM operating in the same voltage domain. At high voltage it supports normal operation; at low voltage it provides a small, fully reliable memory partition (8T) while the rest of the memory (6T) is state-retentive. Our architecture offers significant energy savings with a low area overhead in typical eHealth Compressed Sensing-based applications.
Citations: 18
Thermal management of manycore systems with silicon-photonic networks
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.320
Tiansheng Zhang, José L. Abellán, A. Joshi, A. Coskun
Silicon-photonic network-on-chips (NoCs) provide high bandwidth density; therefore, they are promising candidates to replace electrical NoCs in manycore systems. The silicon-photonic NoCs, however, are sensitive to the temperature gradients that typically occur on the chip, and hence, require proactive thermal management. This paper first provides a design space exploration of silicon-photonic networks in manycore systems and quantifies the performance impact of the temperature gradients for various network bandwidths. The paper then introduces a novel job allocation technique that minimizes the temperature gradients among the ring modulators/filters to improve the application performance. Experimental results for a single-chip 256-core system demonstrate that our policy is able to maintain the maximum network bandwidth. Compared to existing workload allocation policies, the proposed policy improves system performance by up to 26.1% when running a single application and 18.3% for multi-program scenarios.
Citations: 42
Efficient simulation and modelling of non-rectangular NoC topologies
Pub Date : 2014-03-24 DOI: 10.7873/DATE2014.298
Ji Qi, Mark Zwolinski
With increasing chip complexity, Networks-on-Chip (NoCs) are becoming a central platform for future on-chip communications. Many regular NoC architectures have been proposed to eliminate the communication bottlenecks of traditional bus-based networks. Non-rectangular and irregular architectures have also been proposed to increase performance. However, the complexity of designing custom non-rectangular networks leads to a rapid increase in design and verification times. To alleviate the conflict between performance and efficiency, this paper proposes a novel method that efficiently constructs virtual non-rectangular topologies on a mesh network by using time-regulated models to emulate irregular patterns. Data routings on a virtual hexagonal geometry and two irregular geometries validate the proposed method. An MPEG-4 decoder is used to exemplify the proposed method for media applications. Analysis of the results shows that the virtual topologies emulated by the proposed method can provide precise timing and energy performance.
Citations: 0
mDTM: Multi-objective dynamic thermal management for on-chip systems
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.343
Heba Khdr, T. Ebi, M. Shafique, H. Amrouch, J. Henkel
Thermal hot spots and unbalanced temperatures between cores on a chip can degrade performance, severely impact reliability, or both. In this paper, we propose mDTM, a proactive dynamic thermal management technique for on-chip systems. It employs multi-objective management for migrating tasks in order both to prevent the system from hitting an undesirable thermal threshold and to balance the temperatures between the cores. Our evaluation on the Intel SCC platform shows that mDTM can successfully avoid a given thermal threshold and reduce spatial thermal variation by 22%. Compared to the state of the art, mDTM achieves up to 58% performance gain. Additionally, we deploy an FPGA- and IR-camera-based setup to analyze the effectiveness of our technique.
Citations: 40
Improving efficiency of extensible processors by using approximate custom instructions
Pub Date : 2014-03-24 DOI: 10.7873/DATE.2014.238
M. Kamal, Amin Ghasemazar, A. Afzali-Kusha, M. Pedram
In this paper, we propose to move the conventional extensible processor design flow to the approximate computing domain to gain more speedup. In this domain, the instruction set architecture (ISA) design flow selects both exact and approximate custom instructions (CIs). The proposed approach can be used for applications where imprecise results may be tolerated. In the CI identification phase of the flow, CIs that do not satisfy the maximum propagation delay but can provide approximate results may also be included in the CI candidate set. Next, in the selection phase, we propose a merit function that selects CIs with higher cycle savings and small error rates. The efficacy of the proposed approximate design flow is investigated using case studies of the discrete cosine transform (DCT) and inverse DCT (iDCT) of the MPEG2 application. The impact of process variation on the imprecision of the results is also investigated.
Citations: 17