首页 > 最新文献

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)最新文献

英文 中文
An adaptively biased low-dropout regulator with transient enhancement 一种具有瞬态增强的自适应偏置低差调节器
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722166
Chenchang Zhan, W. Ki
An output-capacitor-free adaptively biased low-dropout regulator with transient enhancement (ABTE LDR) is proposed. Techniques of Q-reduction compensation, adaptive biasing, and transient enhancement achieve low-voltage high-precision regulation with low quiescent current consumption while significantly improving the line and load transient responses and power supply rejections. The features of the ABTE LDR are experimentally verified by a 0.35-μm CMOS prototype.
提出了一种无输出电容的暂态增强自适应偏置低差稳压器(ABTE LDR)。降q补偿、自适应偏置和瞬态增强技术实现了低静态电流消耗的低压高精度调节,同时显著改善了线路和负载的瞬态响应和电源抑制。在0.35 μm CMOS样机上验证了ABTE LDR的特性。
{"title":"An adaptively biased low-dropout regulator with transient enhancement","authors":"Chenchang Zhan, W. Ki","doi":"10.1109/ASPDAC.2011.5722166","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722166","url":null,"abstract":"An output-capacitor-free adaptively biased low-dropout regulator with transient enhancement (ABTE LDR) is proposed. Techniques of Q-reduction compensation, adaptive biasing, and transient enhancement achieve low-voltage high-precision regulation with low quiescent current consumption while significantly improving the line and load transient responses and power supply rejections. The features of the ABTE LDR are experimentally verified by a 0.35-μm CMOS prototype.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129384694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
ILP-based inter-die routing for 3D ICs 基于ilp的3D集成电路芯片间路由
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722209
Chia-Jen Chang, Pao-Jen Huang, Tai-Chen Chen, C. Liu
The 3D IC is an emerging technology. The primary emphasis on 3D-IC routing is the interface issues across dies. To handle the interface issue of connections, the inter-die routing, which uses micro bumps and two single-layer RDLs (Re-Distribution Layers) to achieve the connection between adjacent dies, is adopted. In this paper, we present an inter-die routing algorithm for 3D ICs with a pre-defined netlist. Our algorithm is based on integer linear programming (ILP) and adopts a two-stage technique of micro-bump assignment followed by non-regular RDL routing. First, the micro-bump assignment selects suitable micro-bumps for the pre-defined netlist such that no crossing problem exists inside the bounding boxes of each net. After the micro-bump assignment, the netlist is divided into two sub-netlists, one is for the upper RDL and the other is for the lower RDL. Second, the non-regular RDL routing determines minimum and non-crossing global paths for sub-netlists in the upper and lower RDLs individually. Experimental results show that our approach can obtain optimal wirelength and achieve 100% routability under reasonable CPU times.
3D集成电路是一项新兴技术。3D-IC路由的主要重点是跨芯片的接口问题。为了解决连接的接口问题,采用了模具间路由,利用微凸点和两个单层重分布层来实现相邻模具之间的连接。本文提出了一种具有预定义网表的三维集成电路的芯片间路由算法。该算法基于整数线性规划(ILP),采用微碰撞分配和非规则RDL路由两阶段技术。首先,为预定义的网表选择合适的微凸点,使每个网的边界框内不存在交叉问题;在微碰撞分配后,将网络列表划分为两个子网络列表,一个用于上RDL,另一个用于下RDL。其次,非规则RDL路由分别确定上层RDL和下层RDL中的子网络列表的最小和非交叉全局路径。实验结果表明,在合理的CPU时间下,该方法可以获得最优的带宽和100%的路由可达性。
{"title":"ILP-based inter-die routing for 3D ICs","authors":"Chia-Jen Chang, Pao-Jen Huang, Tai-Chen Chen, C. Liu","doi":"10.1109/ASPDAC.2011.5722209","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722209","url":null,"abstract":"The 3D IC is an emerging technology. The primary emphasis on 3D-IC routing is the interface issues across dies. To handle the interface issue of connections, the inter-die routing, which uses micro bumps and two single-layer RDLs (Re-Distribution Layers) to achieve the connection between adjacent dies, is adopted. In this paper, we present an inter-die routing algorithm for 3D ICs with a pre-defined netlist. Our algorithm is based on integer linear programming (ILP) and adopts a two-stage technique of micro-bump assignment followed by non-regular RDL routing. First, the micro-bump assignment selects suitable micro-bumps for the pre-defined netlist such that no crossing problem exists inside the bounding boxes of each net. After the micro-bump assignment, the netlist is divided into two sub-netlists, one is for the upper RDL and the other is for the lower RDL. Second, the non-regular RDL routing determines minimum and non-crossing global paths for sub-netlists in the upper and lower RDLs individually. Experimental results show that our approach can obtain optimal wirelength and achieve 100% routability under reasonable CPU times.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129467837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Parallel cross-layer optimization of high-level synthesis and physical design 高级合成与物理设计的并行跨层优化
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722235
James Williamson, Yinghai Lu, L. Shang, H. Zhou, Xuan Zeng
Integrated circuit (IC) design automation has traditionally followed a hierarchical approach. Modern IC design flow is divided into sequentially-addressed design and optimization layers; each successively finer in design detail and data granularity while increasing in computational complexity. Eventual agreement across the design layers signals design closure. Obtaining design closure is a continual problem, as lack of awareness and interaction between layers often results in multiple design flow iterations. In this work, we propose parallel cross-layer optimization, in which the boundaries between design layers are broken, allowing for a more informed and efficient exploration of the design space. We leverage the heterogeneous parallel computational power in current and upcoming multi-core/many-core computation platforms to suite the heterogeneous characteristics of multiple design layers. Specifically, we unify the highlevel and physical synthesis design layers for parallel cross-layer IC design optimization. In addition, we introduce a massively-parallel GPU floorplanner with local and global convergence test as the proposed physical synthesis design layer. Our results show average performance gains of 11X speed-up over state-of-the-art.
集成电路(IC)设计自动化传统上遵循分层方法。现代IC设计流程分为顺序寻址设计层和优化层;每一个都在设计细节和数据粒度上不断细化,同时增加了计算复杂性。设计层之间的最终一致标志着设计的结束。获得设计闭合是一个持续存在的问题,因为缺乏意识和层之间的交互通常会导致多个设计流迭代。在这项工作中,我们提出了平行的跨层优化,其中设计层之间的边界被打破,允许对设计空间进行更明智和有效的探索。我们利用当前和即将到来的多核/多核计算平台的异构并行计算能力来适应多个设计层的异构特性。具体来说,我们统一了高层和物理合成设计层,以实现并行跨层集成电路设计优化。此外,我们还引入了一个具有局部和全局收敛测试的大规模并行GPU floorplanner作为提议的物理合成设计层。我们的结果显示,与最先进的技术相比,平均性能提高了11倍。
{"title":"Parallel cross-layer optimization of high-level synthesis and physical design","authors":"James Williamson, Yinghai Lu, L. Shang, H. Zhou, Xuan Zeng","doi":"10.1109/ASPDAC.2011.5722235","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722235","url":null,"abstract":"Integrated circuit (IC) design automation has traditionally followed a hierarchical approach. Modern IC design flow is divided into sequentially-addressed design and optimization layers; each successively finer in design detail and data granularity while increasing in computational complexity. Eventual agreement across the design layers signals design closure. Obtaining design closure is a continual problem, as lack of awareness and interaction between layers often results in multiple design flow iterations. In this work, we propose parallel cross-layer optimization, in which the boundaries between design layers are broken, allowing for a more informed and efficient exploration of the design space. We leverage the heterogeneous parallel computational power in current and upcoming multi-core/many-core computation platforms to suite the heterogeneous characteristics of multiple design layers. Specifically, we unify the highlevel and physical synthesis design layers for parallel cross-layer IC design optimization. In addition, we introduce a massively-parallel GPU floorplanner with local and global convergence test as the proposed physical synthesis design layer. Our results show average performance gains of 11X speed-up over state-of-the-art.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130565881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FPGA prototyping using behavioral synthesis for improving video processing algorithm and FHD TV SoC design FPGA原型设计利用行为综合改进视频处理算法和FHD电视SoC设计
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722291
Masaru Takahashi
The System on Chip (SoC) can include Full High Definition (FHD) video processing, however the turn around time of algorithm improvement have been long. We provide the new method utilizing the behavioral synthesis. Therefore, the turn around time of the algorithm improvement and hardware implementation can be shorten.
片上系统(SoC)可以实现全高清(FHD)视频处理,但算法改进的周期较长。我们提出了利用行为综合的新方法。因此,可以缩短算法改进和硬件实现的周转时间。
{"title":"FPGA prototyping using behavioral synthesis for improving video processing algorithm and FHD TV SoC design","authors":"Masaru Takahashi","doi":"10.1109/ASPDAC.2011.5722291","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722291","url":null,"abstract":"The System on Chip (SoC) can include Full High Definition (FHD) video processing, however the turn around time of algorithm improvement have been long. We provide the new method utilizing the behavioral synthesis. Therefore, the turn around time of the algorithm improvement and hardware implementation can be shorten.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128117348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust and efficient baseband receiver design for MB-OFDM UWB system MB-OFDM UWB系统稳健高效的基带接收机设计
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722163
Wen Fan, O. Choy
Robust, efficient and low complexity design methodologies for high speed multi-band orthogonal frequency division multiplexing ultra-wideband (MB-OFDM UWB) is presented. The proposed design is implemented in 0.13μm CMOS technology with the core area of 2.66mm × 0.94mm. Operating at 132MHz clock frequency, the estimated power consumption is 170mW.
提出了一种鲁棒、高效、低复杂度的高速多频带正交频分复用超宽带(MB-OFDM)设计方法。该设计采用0.13μm CMOS工艺,核心面积为2.66mm × 0.94mm。在132MHz时钟频率下工作,估计功耗为170mW。
{"title":"Robust and efficient baseband receiver design for MB-OFDM UWB system","authors":"Wen Fan, O. Choy","doi":"10.1109/ASPDAC.2011.5722163","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722163","url":null,"abstract":"Robust, efficient and low complexity design methodologies for high speed multi-band orthogonal frequency division multiplexing ultra-wideband (MB-OFDM UWB) is presented. The proposed design is implemented in 0.13μm CMOS technology with the core area of 2.66mm × 0.94mm. Operating at 132MHz clock frequency, the estimated power consumption is 170mW.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114315343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the interplay of loop caching, code compression, and cache configuration 关于循环缓存、代码压缩和缓存配置的相互作用
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722191
M. Rawlins, A. Gordon-Ross
Even though much previous work explores varying instruction cache optimization techniques individually, little work explores the combined effects of these techniques (i.e., do they complement or obviate each other). In this paper we explore the interaction of three optimizations: loop caching, cache tuning, and code compression. Results show that loop caching increases energy savings by as much as 26% compared to cache tuning alone and reduces decompression energy by as much as 73%.
尽管以前的很多工作都是单独探索不同的指令缓存优化技术,但很少有工作是探索这些技术的综合效果(即,它们是相互补充还是相互排斥)。在本文中,我们探讨了三种优化的交互:循环缓存、缓存调优和代码压缩。结果表明,与单独进行缓存调优相比,循环缓存可节省多达26%的能量,并可减少多达73%的解压能量。
{"title":"On the interplay of loop caching, code compression, and cache configuration","authors":"M. Rawlins, A. Gordon-Ross","doi":"10.1109/ASPDAC.2011.5722191","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722191","url":null,"abstract":"Even though much previous work explores varying instruction cache optimization techniques individually, little work explores the combined effects of these techniques (i.e., do they complement or obviate each other). In this paper we explore the interaction of three optimizations: loop caching, cache tuning, and code compression. Results show that loop caching increases energy savings by as much as 26% compared to cache tuning alone and reduces decompression energy by as much as 73%.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123285847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Rapid layout pattern classification 快速布局模式分类
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722295
Jen-Yi Wuu, F. Pikus, A. Torres, M. Marek-Sadowska
Printability of layout objects becomes increasingly dependent on neighboring shapes within a larger and larger context window. In this paper, we propose a two-level hotspot pattern classification methodology that examines both central and peripheral patterns. Accuracy and runtime enhancement techniques are proposed, making our detection methodology robust and efficient as a fast physical verification tool that can be applied during early design stages to large-scale designs. We position our method as an approximate detection solution, similar to pattern matching-based tools widely adopted by the industry. In addition, our analyses of classification results reveal that the majority of non-hotspots falsely predicted as hotspots have printed CD barely over the minimum allowable CD threshold. Our method is verified on several 45 nm and 32 nm industrial designs.
布局对象的可打印性越来越依赖于越来越大的上下文窗口中的相邻形状。在本文中,我们提出了一个两级热点模式分类方法,同时检查中心和外围模式。提出了准确性和运行时间增强技术,使我们的检测方法作为一种快速的物理验证工具,可以在早期设计阶段应用于大规模设计。我们将我们的方法定位为近似检测解决方案,类似于行业广泛采用的基于模式匹配的工具。此外,我们对分类结果的分析显示,大多数被错误地预测为热点的非热点打印的CD几乎没有超过最小允许CD阈值。我们的方法在45 nm和32 nm工业设计上得到了验证。
{"title":"Rapid layout pattern classification","authors":"Jen-Yi Wuu, F. Pikus, A. Torres, M. Marek-Sadowska","doi":"10.1109/ASPDAC.2011.5722295","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722295","url":null,"abstract":"Printability of layout objects becomes increasingly dependent on neighboring shapes within a larger and larger context window. In this paper, we propose a two-level hotspot pattern classification methodology that examines both central and peripheral patterns. Accuracy and runtime enhancement techniques are proposed, making our detection methodology robust and efficient as a fast physical verification tool that can be applied during early design stages to large-scale designs. We position our method as an approximate detection solution, similar to pattern matching-based tools widely adopted by the industry. In addition, our analyses of classification results reveal that the majority of non-hotspots falsely predicted as hotspots have printed CD barely over the minimum allowable CD threshold. Our method is verified on several 45 nm and 32 nm industrial designs.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129276134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Template-based memory access engine for accelerators in SoCs 用于soc中加速器的基于模板的内存访问引擎
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722175
Bin Li, Zhen Fang, R. Iyer
With the rapid progress in semiconductor technologies, more and more accelerators can be integrated onto a single SoC chip. In SoCs, accelerators often require deterministic data access. However, as more and more applications are running simultaneous, latency can vary significantly due to contention. To address this problem, we propose a template-based memory access engine (MAE) for accelerators in SoCs. The proposed MAE can handle several common memory access patterns observed for near-future accelerators. Our evaluation results show that the proposed MAE can significantly reduce memory access latency and jitter, thus very effective for accelerators in SoCs.
随着半导体技术的飞速发展,越来越多的加速器可以集成到单个SoC芯片上。在soc中,加速器通常需要确定性的数据访问。但是,随着越来越多的应用程序同时运行,由于争用,延迟可能会有很大差异。为了解决这个问题,我们提出了一个基于模板的内存访问引擎(MAE),用于soc中的加速器。所提出的MAE可以处理近期加速器观察到的几种常见的内存访问模式。我们的评估结果表明,所提出的MAE可以显著降低存储器访问延迟和抖动,因此对于soc中的加速器非常有效。
{"title":"Template-based memory access engine for accelerators in SoCs","authors":"Bin Li, Zhen Fang, R. Iyer","doi":"10.1109/ASPDAC.2011.5722175","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722175","url":null,"abstract":"With the rapid progress in semiconductor technologies, more and more accelerators can be integrated onto a single SoC chip. In SoCs, accelerators often require deterministic data access. However, as more and more applications are running simultaneous, latency can vary significantly due to contention. To address this problem, we propose a template-based memory access engine (MAE) for accelerators in SoCs. The proposed MAE can handle several common memory access patterns observed for near-future accelerators. Our evaluation results show that the proposed MAE can significantly reduce memory access latency and jitter, thus very effective for accelerators in SoCs.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129193506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Profile assisted online system-level performance and power estimation for dynamic reconfigurable embedded systems 动态可重构嵌入式系统的在线系统级性能和功率估计
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722285
Jingqing Mu, Roman L. Lysecky
Significant research has demonstrated the performance and power benefits of runtime dynamic reconfiguration of FPGAs and microprocessor/FPGA devices. For dynamically reconfigurable systems, in which the selection of hardware coprocessors to implement within the FPGA is determined at runtime, online estimation methods are needed to evaluate the performance and power consumption impact of the hardware coprocessor selection. In this paper, we present a profile assisted online system-level performance and power estimation framework for estimating the speedup and power consumption of dynamically reconfigurable embedded systems. We evaluate the accuracy and fidelity of our online estimation framework for dynamic hardware kernel selection to maximize performance or minimize system power consumption.
重要的研究已经证明了运行时动态重构FPGA和微处理器/FPGA器件的性能和功耗优势。对于动态可重构系统,在FPGA内实现的硬件协处理器的选择是在运行时确定的,需要在线估计方法来评估硬件协处理器选择对性能和功耗的影响。在本文中,我们提出了一个轮廓辅助在线系统级性能和功耗估计框架,用于估计动态可重构嵌入式系统的加速和功耗。我们评估了动态硬件内核选择的在线估计框架的准确性和保真度,以最大化性能或最小化系统功耗。
{"title":"Profile assisted online system-level performance and power estimation for dynamic reconfigurable embedded systems","authors":"Jingqing Mu, Roman L. Lysecky","doi":"10.1109/ASPDAC.2011.5722285","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722285","url":null,"abstract":"Significant research has demonstrated the performance and power benefits of runtime dynamic reconfiguration of FPGAs and microprocessor/FPGA devices. For dynamically reconfigurable systems, in which the selection of hardware coprocessors to implement within the FPGA is determined at runtime, online estimation methods are needed to evaluate the performance and power consumption impact of the hardware coprocessor selection. In this paper, we present a profile assisted online system-level performance and power estimation framework for estimating the speedup and power consumption of dynamically reconfigurable embedded systems. We evaluate the accuracy and fidelity of our online estimation framework for dynamic hardware kernel selection to maximize performance or minimize system power consumption.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122669993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reconfiguration-aware real-time scheduling under QoS constraint QoS约束下的重构感知实时调度
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722174
H. Kooti, D. Mishra, E. Bozorgzadeh
Due to the increase in demand for reconfigurability in embedded systems, schedulability in real-time task scheduling is challenged by non-negligible reconfiguration overheads. Reconfiguration of the system during task execution affects both deadline miss rate and deadline miss distribution. On the other hand, Quality of Service (QoS) in several embedded applications is not only determined by deadline miss rate but also the distribution of the tasks missing their deadlines (known as weakly-hard real-time systems). As a result, we propose to model QoS constraints as a set of constraints on dropout patterns (due to reconfiguration overhead) and present a novel online solution for the problem of reconfiguration-aware real-time scheduling. According to QoS constraints, we divide the ready instances of the tasks into two groups: critical and non-critical, then model each group as a network flow problem and provide an online scheduler for each group. We deployed our method on synthetic benchmarks as well as software defined radio implementation of VoIP on reconfigurable systems. Results show that our solution reduces the number of QoS violations by 19.01 times and 2.33 times (57.02%) in comparison with Bi-Modal Scheduler (BMS) [1] for synthetic benchmarks with low and high QoS constraint, respectively.
由于嵌入式系统对可重构性需求的增加,实时任务调度中的可调度性受到不可忽略的可重构开销的挑战。在任务执行过程中,系统的重新配置会影响截止日期缺失率和截止日期缺失分布。另一方面,一些嵌入式应用程序中的服务质量(QoS)不仅由错过截止日期率决定,而且还取决于错过截止日期的任务的分布(称为弱硬实时系统)。因此,我们建议将QoS约束建模为一组对退出模式的约束(由于重新配置开销),并为重新配置感知的实时调度问题提出了一种新的在线解决方案。根据QoS约束,我们将任务的就绪实例分为关键和非关键两组,然后将每组建模为网络流问题,并为每组提供在线调度程序。我们将我们的方法部署在综合基准测试以及可重构系统上VoIP的软件定义无线电实现上。结果表明,在低QoS约束和高QoS约束的综合基准测试中,与Bi-Modal Scheduler (BMS)[1]相比,我们的解决方案分别减少了19.01倍和2.33倍(57.02%)的QoS违规次数。
{"title":"Reconfiguration-aware real-time scheduling under QoS constraint","authors":"H. Kooti, D. Mishra, E. Bozorgzadeh","doi":"10.1109/ASPDAC.2011.5722174","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722174","url":null,"abstract":"Due to the increase in demand for reconfigurability in embedded systems, schedulability in real-time task scheduling is challenged by non-negligible reconfiguration overheads. Reconfiguration of the system during task execution affects both deadline miss rate and deadline miss distribution. On the other hand, Quality of Service (QoS) in several embedded applications is not only determined by deadline miss rate but also the distribution of the tasks missing their deadlines (known as weakly-hard real-time systems). As a result, we propose to model QoS constraints as a set of constraints on dropout patterns (due to reconfiguration overhead) and present a novel online solution for the problem of reconfiguration-aware real-time scheduling. According to QoS constraints, we divide the ready instances of the tasks into two groups: critical and non-critical, then model each group as a network flow problem and provide an online scheduler for each group. We deployed our method on synthetic benchmarks as well as software defined radio implementation of VoIP on reconfigurable systems. Results show that our solution reduces the number of QoS violations by 19.01 times and 2.33 times (57.02%) in comparison with Bi-Modal Scheduler (BMS) [1] for synthetic benchmarks with low and high QoS constraint, respectively.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121157006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1