首页 > 最新文献

2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)最新文献

英文 中文
An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing 一种降低实时视频处理中稀疏线性系统直接求解器复杂度的近似计算技术
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593082
Michael Schaffner, Frank K. Gürkaynak, A. Smolic, H. Kaeslin, L. Benini
Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems. Solving such systems in real time is very demanding. This paper focuses on reducing the computational complexity of a direct Cholesky-decomposition-based solver. Our approximation scheme builds on the observation that, in well-conditioned problems, many elements in the decomposition nearly vanish. Such elements may be pruned from the dependency graph with mild accuracy degradation. Using an example from image-domain warping, we show that pruning reduces the amount of operations per solve by over 75 %, resulting in significant savings in computing time, area or energy.
许多视频处理算法被表述为导致大型稀疏线性系统的最小二乘问题。实时解决这样的系统是非常困难的。本文的重点是降低直接基于cholesky分解的求解器的计算复杂度。我们的近似方案建立在观察的基础上,在条件良好的问题中,分解中的许多元素几乎消失。这样的元素可能会从依赖关系图中删除,但准确性会有轻微的降低。使用图像域翘曲的示例,我们表明修剪将每次求解的操作量减少了75%以上,从而显着节省了计算时间,面积或能量。
{"title":"An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing","authors":"Michael Schaffner, Frank K. Gürkaynak, A. Smolic, H. Kaeslin, L. Benini","doi":"10.1145/2593069.2593082","DOIUrl":"https://doi.org/10.1145/2593069.2593082","url":null,"abstract":"Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems. Solving such systems in real time is very demanding. This paper focuses on reducing the computational complexity of a direct Cholesky-decomposition-based solver. Our approximation scheme builds on the observation that, in well-conditioned problems, many elements in the decomposition nearly vanish. Such elements may be pruned from the dependency graph with mild accuracy degradation. Using an example from image-domain warping, we show that pruning reduces the amount of operations per solve by over 75 %, resulting in significant savings in computing time, area or energy.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122627864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
ePlace: Electrostatics based placement using Nesterov's method ePlace:使用Nesterov方法的基于静电的放置
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593133
Jingwei Lu, Pengwen Chen, Chin-Chih Chang, Lu Sha, D. J. Huang, C. Teng, Chung-Kuan Cheng
ePlace is a generalized analytic algorithm to handle large-scale standard-cell and mixed-size placement. We use a novel density function based on electrostatics to remove overlap and Nesterov's method to minimize the nonlinear cost. Steplength is estimated as the inverse of Lipschitz constant, which is determined by our dynamic prediction and backtracking method. An approximated preconditioner is proposed to resolve the difference between large macros and standard cells, while an annealing engine is devised to handle macro legalization followed by placement of standard cells. The above innovations are integrated into our placement prototype ePlace, which outperforms the leading-edge placers on respective standard-cell and mixed-size benchmark suites. Specifically, ePlace produces 2.83%, 4.59% and 7.13% shorter wirelength while runs 3.05×, 2.84× and 1.05× faster than BonnPlace, MAPLE and NTUplace3-unified in average of ISPD 2005, ISPD 2006 and MMS circuits, respectively.
ePlace是一种用于处理大规模标准单元和混合大小放置的广义解析算法。我们使用了一种新的基于静电的密度函数来消除重叠,并使用Nesterov方法来最小化非线性代价。步长估计为利普希茨常数的倒数,该常数由动态预测和回溯方法确定。提出了一种近似的预调节器来解决大宏和标准单元之间的差异,同时设计了一个退火引擎来处理宏合法化,然后放置标准单元。上述创新集成到我们的放置原型ePlace中,它在各自的标准单元和混合尺寸基准套件上优于领先的放置器。在ISPD 2005、ISPD 2006和MMS电路中,ePlace比BonnPlace、MAPLE和ntuplace3的平均运行速度分别快3.05倍、2.84倍和1.05倍,比BonnPlace、MAPLE和ntuplace3的平均运行速度短2.83%、4.59%和7.13%。
{"title":"ePlace: Electrostatics based placement using Nesterov's method","authors":"Jingwei Lu, Pengwen Chen, Chin-Chih Chang, Lu Sha, D. J. Huang, C. Teng, Chung-Kuan Cheng","doi":"10.1145/2593069.2593133","DOIUrl":"https://doi.org/10.1145/2593069.2593133","url":null,"abstract":"ePlace is a generalized analytic algorithm to handle large-scale standard-cell and mixed-size placement. We use a novel density function based on electrostatics to remove overlap and Nesterov's method to minimize the nonlinear cost. Steplength is estimated as the inverse of Lipschitz constant, which is determined by our dynamic prediction and backtracking method. An approximated preconditioner is proposed to resolve the difference between large macros and standard cells, while an annealing engine is devised to handle macro legalization followed by placement of standard cells. The above innovations are integrated into our placement prototype ePlace, which outperforms the leading-edge placers on respective standard-cell and mixed-size benchmark suites. Specifically, ePlace produces 2.83%, 4.59% and 7.13% shorter wirelength while runs 3.05×, 2.84× and 1.05× faster than BonnPlace, MAPLE and NTUplace3-unified in average of ISPD 2005, ISPD 2006 and MMS circuits, respectively.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132713482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
On the scheduling of fault-tolerant mixed-criticality systems 容错混合临界系统的调度问题
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593169
Pengcheng Huang, Hoeseok Yang, L. Thiele
We consider in this paper fault-tolerant mixed-criticality scheduling, where heterogeneous safety guarantees must be provided to functionalities (tasks) of varying criticalities (importances). We model explicitly the safety requirements for tasks of different criticalities according to safety standards, assuming hardware transient faults. We further provide analysis techniques to bound the effects of task killing and service degradation on the system safety and schedulability. Based on our model and analysis, we show that our problem can be converted to a conventional mixed-criticality scheduling problem. Thus, we broaden the scope of applicability of the conventional mixed-criticality scheduling techniques. Our proposed techniques are validated with a realistic flight management system application and extensive simulations.
在本文中,我们考虑容错混合临界调度,其中必须为不同临界(重要性)的功能(任务)提供异构安全保证。我们根据安全标准明确建模不同临界任务的安全要求,假设硬件瞬态故障。我们进一步提供了分析技术来绑定任务终止和服务降级对系统安全性和可调度性的影响。基于我们的模型和分析,我们表明我们的问题可以转化为一个传统的混合临界调度问题。从而拓宽了传统混合临界调度技术的适用范围。我们提出的技术通过实际的飞行管理系统应用和广泛的仿真得到了验证。
{"title":"On the scheduling of fault-tolerant mixed-criticality systems","authors":"Pengcheng Huang, Hoeseok Yang, L. Thiele","doi":"10.1145/2593069.2593169","DOIUrl":"https://doi.org/10.1145/2593069.2593169","url":null,"abstract":"We consider in this paper fault-tolerant mixed-criticality scheduling, where heterogeneous safety guarantees must be provided to functionalities (tasks) of varying criticalities (importances). We model explicitly the safety requirements for tasks of different criticalities according to safety standards, assuming hardware transient faults. We further provide analysis techniques to bound the effects of task killing and service degradation on the system safety and schedulability. Based on our model and analysis, we show that our problem can be converted to a conventional mixed-criticality scheduling problem. Thus, we broaden the scope of applicability of the conventional mixed-criticality scheduling techniques. Our proposed techniques are validated with a realistic flight management system application and extensive simulations.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134407186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Run-time technique for simultaneous aging and power optimization in GPGPUs gpgpu中同时老化和功耗优化的运行时技术
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593208
Xiaoming Chen, Yu Wang, Yun Liang, Yuan Xie, Huazhong Yang
High-performance general-purpose graphics processing units (GPGPUs) may suffer from serious power and negative bias temperature instability (NBTI) problems. In this paper, we propose a framework for run-time aging and power optimization. Our technique is based on the observation that many GPGPU applications achieve optimal performance with only a portion of cores due to either bandwidth saturation or shared resource contention. During run-time, given the dynamically tracked NBTI-induced threshold voltage shift and the problem size of GPGPU applications, our algorithm returns the optimal number of cores using detailed performance modeling. The unused cores are power-gated for power saving and NBTI recovery. Experiments show that our proposed technique achieves on average 34% reduction in NBTI-induced threshold voltage shift and 19% power reduction, while the average performance degradation is less than 1%.
高性能通用图形处理单元(gpgpu)可能会遭受严重的功率和负偏置温度不稳定性(NBTI)问题。在本文中,我们提出了一个运行时老化和功率优化的框架。我们的技术是基于这样一种观察,即由于带宽饱和或共享资源争用,许多GPGPU应用程序仅使用部分内核即可实现最佳性能。在运行期间,考虑到动态跟踪的nbti诱导的阈值电压偏移和GPGPU应用程序的问题大小,我们的算法使用详细的性能建模返回最优核数。未使用的核心是电源门控,以节省电力和NBTI恢复。实验表明,该技术平均降低了34%的nbti诱导阈值电压偏移和19%的功耗,而平均性能下降小于1%。
{"title":"Run-time technique for simultaneous aging and power optimization in GPGPUs","authors":"Xiaoming Chen, Yu Wang, Yun Liang, Yuan Xie, Huazhong Yang","doi":"10.1145/2593069.2593208","DOIUrl":"https://doi.org/10.1145/2593069.2593208","url":null,"abstract":"High-performance general-purpose graphics processing units (GPGPUs) may suffer from serious power and negative bias temperature instability (NBTI) problems. In this paper, we propose a framework for run-time aging and power optimization. Our technique is based on the observation that many GPGPU applications achieve optimal performance with only a portion of cores due to either bandwidth saturation or shared resource contention. During run-time, given the dynamically tracked NBTI-induced threshold voltage shift and the problem size of GPGPU applications, our algorithm returns the optimal number of cores using detailed performance modeling. The unused cores are power-gated for power saving and NBTI recovery. Experiments show that our proposed technique achieves on average 34% reduction in NBTI-induced threshold voltage shift and 19% power reduction, while the average performance degradation is less than 1%.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133023050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Branch-aware loop mapping on CGRAs CGRAs上的分支感知环路映射
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593100
M. Hamzeh, Aviral Shrivastava, S. Vrudhula
One of the challenges that all accelerators face, is to execute loops that have if-then-else constructs. There are three ways to accelerate loops with an if-then-else construct on a Coarse-grained reconfigurable architecture (CGRA): full predication, partial predication, and dual-issue scheme. In comparison with the other schemes, dual-issue scheme may achieve the best performance, but it requires compiler support - which does not exist. In this paper, we develop compiler techniques to map loops with conditionals on CGRA for the dual-issue scheme. Our experiments show: i) 40% of loops that can be accelerated on CGRA have conditionals, ii) The proposed dual-issue scheme enables our compiler to accelerate loops 40% faster than full predication scheme proposed in [12], and iii) Our compiler assisted dual issue scheme can exploit richer interconnects, if present.
所有加速器面临的挑战之一是执行带有if-then-else结构的循环。在粗粒度可重构体系结构(CGRA)上使用if-then-else构造加速循环有三种方法:完全预测、部分预测和双问题模式。与其他方案相比,双问题方案可能达到最佳性能,但它需要编译器的支持——而这种支持并不存在。在本文中,我们开发了一种编译器技术,用于在CGRA上映射带有条件的循环。我们的实验表明:i)可以在CGRA上加速的循环中有40%具有条件,ii)所提出的双发布方案使我们的编译器加速循环的速度比[12]中提出的全预测方案快40%,iii)我们的编译器辅助双发布方案可以利用更丰富的互连,如果存在的话。
{"title":"Branch-aware loop mapping on CGRAs","authors":"M. Hamzeh, Aviral Shrivastava, S. Vrudhula","doi":"10.1145/2593069.2593100","DOIUrl":"https://doi.org/10.1145/2593069.2593100","url":null,"abstract":"One of the challenges that all accelerators face, is to execute loops that have if-then-else constructs. There are three ways to accelerate loops with an if-then-else construct on a Coarse-grained reconfigurable architecture (CGRA): full predication, partial predication, and dual-issue scheme. In comparison with the other schemes, dual-issue scheme may achieve the best performance, but it requires compiler support - which does not exist. In this paper, we develop compiler techniques to map loops with conditionals on CGRA for the dual-issue scheme. Our experiments show: i) 40% of loops that can be accelerated on CGRA have conditionals, ii) The proposed dual-issue scheme enables our compiler to accelerate loops 40% faster than full predication scheme proposed in [12], and iii) Our compiler assisted dual issue scheme can exploit richer interconnects, if present.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125628807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Low power GPGPU computation with imprecise hardware
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593156
Hang Zhang, M. Putic, J. Lach
Massively parallel computation in GPUs significantly boosts performance of compute-intensive applications but creates power and thermal issues that limit further performance scaling. This paper demonstrates significant GPGPU power savings by relaxing application accuracy requirements and enabling the use of low power imprecise hardware (IHW). A synthesized set of novel imprecise floating point arithmetic units is presented. GPGPU-Sim and GPUWattch are used to estimate impacts of IHW units on output quality and system-level power consumption, providing a quality-power tradeoff model for application-specific optimization. Experimental results for a 45 nm process show up to 32% power savings with negligible impacts on output quality.
gpu中的大规模并行计算显著提高了计算密集型应用程序的性能,但也产生了功耗和热问题,限制了进一步的性能扩展。本文通过放松应用精度要求和允许使用低功耗不精确硬件(IHW)来演示显著的GPGPU功耗节省。提出了一种新型非精确浮点运算单元的合成集。GPGPU-Sim和gpuwatch用于估计IHW单元对输出质量和系统级功耗的影响,为特定应用的优化提供质量-功率权衡模型。45纳米制程的实验结果显示,在输出质量影响可以忽略不计的情况下,可节省高达32%的功率。
{"title":"Low power GPGPU computation with imprecise hardware","authors":"Hang Zhang, M. Putic, J. Lach","doi":"10.1145/2593069.2593156","DOIUrl":"https://doi.org/10.1145/2593069.2593156","url":null,"abstract":"Massively parallel computation in GPUs significantly boosts performance of compute-intensive applications but creates power and thermal issues that limit further performance scaling. This paper demonstrates significant GPGPU power savings by relaxing application accuracy requirements and enabling the use of low power imprecise hardware (IHW). A synthesized set of novel imprecise floating point arithmetic units is presented. GPGPU-Sim and GPUWattch are used to estimate impacts of IHW units on output quality and system-level power consumption, providing a quality-power tradeoff model for application-specific optimization. Experimental results for a 45 nm process show up to 32% power savings with negligible impacts on output quality.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129927520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Resource efficient mobile communications for crowd-sensing 用于人群感知的资源高效移动通信
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2596686
C. Wietfeld, Christoph Ide, Bjoern Dusza
Due to continuously growing communication traffic of emerging mobile phone applications, resource-efficient communication is a key to affordable network services with high Quality of Experience. While some communication traffic requires immediate resource allocations (such as voice services), an increasing number of mobile phone applications produce a lot of background communication traffic (e.g. social network apps, crowd sensing services). In this paper we discuss the predictive Channel-Aware Transmission (pCAT) scheme, which leverages the fact, that favorable channel conditions require much less spectrum resource allocation than bad channel conditions. By leveraging the knowledge of user trajectories and recurring spots with favorable channel conditions, the so-called LTE connectivity hot spots, background traffic transmissions can be scheduled by the client according to expected channel quality and application data priority. Thereby, the spectrum consumption of background traffic of applications can be reduced significantly. At the same time, the efficient usage of spectrum resources has also an impact on the battery lifetime of the mobile devices. By introducing the Context-Aware Power Consumption Model (CoPoMo) for LTE communications, we highlight, how decisions about the spectrum resource allocation by the network will impact the battery lifetime. One case study will show, that by simply changing the resource allocation scheme and without the need for spending more spectrum resources, the power consumption can be reduced by more than 70% using the Energy-Efficient Scheduling (EES) introduced in this paper.
由于新兴移动电话应用程序的通信流量不断增长,资源高效的通信是提供高质量体验的廉价网络服务的关键。虽然一些通信流量需要立即分配资源(例如语音服务),但越来越多的手机应用程序产生了大量的后台通信流量(例如社交网络应用程序,人群感知服务)。本文讨论了预测信道感知传输(pCAT)方案,该方案利用了有利信道条件比恶劣信道条件所需的频谱资源分配少得多的事实。通过利用对用户轨迹和具有有利信道条件的重复点(即所谓的LTE连接热点)的了解,客户端可以根据预期的信道质量和应用数据优先级来调度后台流量传输。因此,可以显著减少应用程序后台流量的频谱消耗。同时,频谱资源的有效利用也影响着移动设备的电池寿命。通过介绍LTE通信的情境感知功耗模型(CoPoMo),我们强调了网络关于频谱资源分配的决策将如何影响电池寿命。一个案例研究将表明,通过简单地改变资源分配方案,而不需要花费更多的频谱资源,使用本文介绍的节能调度(EES)可以减少70%以上的功耗。
{"title":"Resource efficient mobile communications for crowd-sensing","authors":"C. Wietfeld, Christoph Ide, Bjoern Dusza","doi":"10.1145/2593069.2596686","DOIUrl":"https://doi.org/10.1145/2593069.2596686","url":null,"abstract":"Due to continuously growing communication traffic of emerging mobile phone applications, resource-efficient communication is a key to affordable network services with high Quality of Experience. While some communication traffic requires immediate resource allocations (such as voice services), an increasing number of mobile phone applications produce a lot of background communication traffic (e.g. social network apps, crowd sensing services). In this paper we discuss the predictive Channel-Aware Transmission (pCAT) scheme, which leverages the fact, that favorable channel conditions require much less spectrum resource allocation than bad channel conditions. By leveraging the knowledge of user trajectories and recurring spots with favorable channel conditions, the so-called LTE connectivity hot spots, background traffic transmissions can be scheduled by the client according to expected channel quality and application data priority. Thereby, the spectrum consumption of background traffic of applications can be reduced significantly. At the same time, the efficient usage of spectrum resources has also an impact on the battery lifetime of the mobile devices. By introducing the Context-Aware Power Consumption Model (CoPoMo) for LTE communications, we highlight, how decisions about the spectrum resource allocation by the network will impact the battery lifetime. One case study will show, that by simply changing the resource allocation scheme and without the need for spending more spectrum resources, the power consumption can be reduced by more than 70% using the Energy-Efficient Scheduling (EES) introduced in this paper.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"97 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133721982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Data mining in EDA - Basic principles, promises, and constraints EDA中的数据挖掘——基本原则、承诺和约束
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2596675
Li-C. Wang, M. Abadir
This paper discusses the basic principles of applying data mining in Electronic Design Automation. It begins by introducing several important concepts in statistical learning and summarizes different types of learning algorithms. Then, the experience of developing a practical data mining application is described, including promises that are demonstrated through positive results based on industrial settings and constraints explained in their respective application contexts.
本文讨论了数据挖掘在电子设计自动化中应用的基本原理。它首先介绍了统计学习中的几个重要概念,并总结了不同类型的学习算法。然后,描述了开发实际数据挖掘应用程序的经验,包括通过基于各自应用环境中解释的工业设置和约束的积极结果来证明的承诺。
{"title":"Data mining in EDA - Basic principles, promises, and constraints","authors":"Li-C. Wang, M. Abadir","doi":"10.1145/2593069.2596675","DOIUrl":"https://doi.org/10.1145/2593069.2596675","url":null,"abstract":"This paper discusses the basic principles of applying data mining in Electronic Design Automation. It begins by introducing several important concepts in statistical learning and summarizes different types of learning algorithms. Then, the experience of developing a practical data mining application is described, including promises that are demonstrated through positive results based on industrial settings and constraints explained in their respective application contexts.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133243623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Protecting SRAM-based FPGAs against multiple bit upsets using erasure codes 使用擦除码保护基于sram的fpga免受多位干扰
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593191
Parthasarathy M. B. Rao, Mojtaba Ebrahimi, Razi Seyyedi, M. Tahoori
Multiple bit upsets due to radiation-induced soft errors are a major concern in nanoscale technology nodes. Once such errors occur in the configuration frames of an FPGA device, they permanently affect the functionality of the mapped design. The combination of error correction schemes and configuration scrubbing is an efficient approach to avoid such permanent errors. Existing solutions exploit coding techniques with considerably high overhead to protect configuration frames against multiple bit upsets. In this paper, we propose a generic scrubbing scheme which reconstructs the erroneous configuration frame based on the concept of erasure codes. Our proposed scheme does not require any changes to the FPGA architecture. Experimental results on a Xilinx Virtex-6 FPGA device show that the proposed scheme achieves error recovery coverage of 99.30% with only 3% resource occupation while the mean time to repair is comparable with previous schemes.
由于辐射引起的软误差引起的多次位失稳是纳米级技术节点的主要问题。一旦这种错误发生在FPGA器件的配置帧中,它们就会永久地影响映射设计的功能。纠错方案和配置清洗相结合是避免此类永久性错误的有效方法。现有的解决方案利用相当高的开销的编码技术来保护配置帧免受多位干扰。本文提出了一种基于擦除码概念重构错误配置帧的通用清洗方案。我们提出的方案不需要对FPGA架构进行任何更改。在Xilinx Virtex-6 FPGA器件上的实验结果表明,该方案在仅占用3%资源的情况下实现了99.30%的错误恢复覆盖率,而平均修复时间与之前的方案相当。
{"title":"Protecting SRAM-based FPGAs against multiple bit upsets using erasure codes","authors":"Parthasarathy M. B. Rao, Mojtaba Ebrahimi, Razi Seyyedi, M. Tahoori","doi":"10.1145/2593069.2593191","DOIUrl":"https://doi.org/10.1145/2593069.2593191","url":null,"abstract":"Multiple bit upsets due to radiation-induced soft errors are a major concern in nanoscale technology nodes. Once such errors occur in the configuration frames of an FPGA device, they permanently affect the functionality of the mapped design. The combination of error correction schemes and configuration scrubbing is an efficient approach to avoid such permanent errors. Existing solutions exploit coding techniques with considerably high overhead to protect configuration frames against multiple bit upsets. In this paper, we propose a generic scrubbing scheme which reconstructs the erroneous configuration frame based on the concept of erasure codes. Our proposed scheme does not require any changes to the FPGA architecture. Experimental results on a Xilinx Virtex-6 FPGA device show that the proposed scheme achieves error recovery coverage of 99.30% with only 3% resource occupation while the mean time to repair is comparable with previous schemes.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132624087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Space-efficient multiversion index scheme for PCM-based embedded database systems 基于pcm的嵌入式数据库系统的多版本索引方案
Pub Date : 2014-06-01 DOI: 10.1145/2593069.2593219
Yuan-Hung Kuan, Yuan-Hao Chang, Po-Chun Huang, K. Lam
Embedded database systems are widely adopted in various control and motoring systems, e.g., cyber-physical systems (CPSes). To support the functionality to access the historical data, a multiversion index is adopted to maintain multiple versions of data items and their index information. However, CPSes are usually battery-powered embedded systems that have limited energy, computing power, and storage space. In this work, we consider the systems with phase-change memory (PCM) as their storage due to its non-volatility and low energy consumption. In order to resolve the problem of the limited storage space and the fact that existing multiversion index designs are lack of space efficiency, we propose a space-efficient multiversion index scheme to enhance the space utilization and access performance of embedded multiversion database systems on PCM by utilizing the byte-addressability and write asymmetry of PCM. A series of experiments was conducted to evaluate the efficacy of the proposed scheme. The results show that the proposed scheme achieves very high space utilization and has good performance on serving update transactions and range queries.
嵌入式数据库系统广泛应用于各种控制和运动系统,例如网络物理系统(CPSes)。为了支持访问历史数据的功能,采用多版本索引来维护数据项及其索引信息的多个版本。然而,cpse通常是电池供电的嵌入式系统,具有有限的能量、计算能力和存储空间。在这项工作中,我们考虑相变存储器(PCM)作为其存储系统,因为它的非易失性和低能耗。为了解决存储空间有限和现有多版本索引设计缺乏空间效率的问题,我们提出了一种空间高效的多版本索引方案,利用PCM的字节可寻址性和写不对称性来提高嵌入式多版本数据库系统在PCM上的空间利用率和访问性能。通过一系列的实验来评价该方案的有效性。结果表明,该方案具有很高的空间利用率,并且在服务更新事务和范围查询方面具有良好的性能。
{"title":"Space-efficient multiversion index scheme for PCM-based embedded database systems","authors":"Yuan-Hung Kuan, Yuan-Hao Chang, Po-Chun Huang, K. Lam","doi":"10.1145/2593069.2593219","DOIUrl":"https://doi.org/10.1145/2593069.2593219","url":null,"abstract":"Embedded database systems are widely adopted in various control and motoring systems, e.g., cyber-physical systems (CPSes). To support the functionality to access the historical data, a multiversion index is adopted to maintain multiple versions of data items and their index information. However, CPSes are usually battery-powered embedded systems that have limited energy, computing power, and storage space. In this work, we consider the systems with phase-change memory (PCM) as their storage due to its non-volatility and low energy consumption. In order to resolve the problem of the limited storage space and the fact that existing multiversion index designs are lack of space efficiency, we propose a space-efficient multiversion index scheme to enhance the space utilization and access performance of embedded multiversion database systems on PCM by utilizing the byte-addressability and write asymmetry of PCM. A series of experiments was conducted to evaluate the efficacy of the proposed scheme. The results show that the proposed scheme achieves very high space utilization and has good performance on serving update transactions and range queries.","PeriodicalId":433816,"journal":{"name":"2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132884429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1