首页 > 最新文献

1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)最新文献

英文 中文
Approximate algorithms for time separation of events 事件时间分离的近似算法
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643520
Supratik Chakraborty, D. Dill
We describe a polynomial-time approximate algorithm for computing minimum and maximum time separations between all pairs of events in systems specified by acyclic timing constraint graphs. Even for acyclic graphs, the problem is NP-complete. We propose finding an approximate solution by first approximating the non-convex feasible space with a suitable convex "envelope", and then solving the problem efficiently in the approximate convex space. Unlike previous works, our algorithm can handle both min and max type timing constraints in the same system, and has a computational complexity that is polynomial in the number of events. Although the computed separations are conservative in the worst-case, experiments indicate that our results are highly accurate in practice.
我们描述了一种多项式时间近似算法,用于计算由非循环时序约束图指定的系统中所有事件对之间的最小和最大时间间隔。即使对于非循环图,这个问题也是np完全的。我们提出先用一个合适的凸包络来逼近非凸可行空间,然后在近似凸空间中高效地求解问题,从而求得近似解。与以前的工作不同,我们的算法可以在同一系统中处理最小和最大类型的时间约束,并且计算复杂度是事件数量的多项式。虽然在最坏情况下计算的分离是保守的,但实验表明,我们的结果在实践中是非常准确的。
{"title":"Approximate algorithms for time separation of events","authors":"Supratik Chakraborty, D. Dill","doi":"10.1109/ICCAD.1997.643520","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643520","url":null,"abstract":"We describe a polynomial-time approximate algorithm for computing minimum and maximum time separations between all pairs of events in systems specified by acyclic timing constraint graphs. Even for acyclic graphs, the problem is NP-complete. We propose finding an approximate solution by first approximating the non-convex feasible space with a suitable convex \"envelope\", and then solving the problem efficiently in the approximate convex space. Unlike previous works, our algorithm can handle both min and max type timing constraints in the same system, and has a computational complexity that is polynomial in the number of events. Although the computed separations are conservative in the worst-case, experiments indicate that our results are highly accurate in practice.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128436836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Achievable bounds on signal transition activity 信号转换活动的可实现边界
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643387
S. Ramprasad, Naresh R Shanbhag, I. Hajj
Transitions on high capacitance busses in VLSI systems result in considerable system power dissipation. Therefore, various coding schemes have been proposed in the literature to encode the input signal in order to reduce the number of transitions. In this paper we derive achievable lower and upper bounds on the expected signal transition activity. These bounds are derived via an information-theoretic approach in which symbols generated by a source (possibly correlated) with entropy rate H are coded with an average of R bits/symbol. These results are applied to, (1) determine the activity reducing efficiency of different coding algorithms such as Entropy coding, Transition coding, and Bus-Invert coding, (2) bound the error in entropy-based power estimation schemes, and (3) determine the lower-bound on the power-delay product. Two examples are provided where transition activity within 4% and 8% of the lower bound is achieved when blocks of 8 and 13 symbols respectively are coded at a time.
VLSI系统中高电容母线上的转换导致相当大的系统功耗。因此,文献中提出了各种编码方案来对输入信号进行编码,以减少转换次数。本文给出了期望信号转移活度可实现的下界和上界。这些界限是通过信息论方法推导出来的,其中由熵率为H的源(可能相关)生成的符号以平均R位/符号编码。这些结果应用于:(1)确定不同编码算法(如熵编码、转换编码和总线逆变编码)的活动降低效率;(2)确定基于熵的功率估计方案的误差;(3)确定功率延迟积的下界。提供了两个示例,其中当一次分别编码8个和13个符号块时,在下限的4%和8%内实现了转换活动。
{"title":"Achievable bounds on signal transition activity","authors":"S. Ramprasad, Naresh R Shanbhag, I. Hajj","doi":"10.1109/ICCAD.1997.643387","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643387","url":null,"abstract":"Transitions on high capacitance busses in VLSI systems result in considerable system power dissipation. Therefore, various coding schemes have been proposed in the literature to encode the input signal in order to reduce the number of transitions. In this paper we derive achievable lower and upper bounds on the expected signal transition activity. These bounds are derived via an information-theoretic approach in which symbols generated by a source (possibly correlated) with entropy rate H are coded with an average of R bits/symbol. These results are applied to, (1) determine the activity reducing efficiency of different coding algorithms such as Entropy coding, Transition coding, and Bus-Invert coding, (2) bound the error in entropy-based power estimation schemes, and (3) determine the lower-bound on the power-delay product. Two examples are provided where transition activity within 4% and 8% of the lower bound is achieved when blocks of 8 and 13 symbols respectively are coded at a time.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128720534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
DSP address optimization using a minimum cost circulation technique 使用最低成本循环技术的DSP地址优化
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643380
C. Gebotys
This paper presents a new approach to solving the DSP address assignment problem. A minimum cost circulation approach is used to efficiently generate high performance addressing code in polynomial time. Addressing code size improvements of up to 7 times are obtained, accounting for up to 1.6 times improvement in code size and performance of compiler-generated DSP code. Results also show that memory layout has a small effect on code size and performance when optimal addressing is used. This research is important for industry since this value-added technique can improve code size, power dissipation and performance, without increasing cost.
本文提出了一种解决DSP地址分配问题的新方法。采用最小成本循环方法,在多项式时间内高效生成高性能寻址码。寻址代码大小提高了7倍,占编译器生成的DSP代码的代码大小和性能提高了1.6倍。结果还表明,当使用最优寻址时,内存布局对代码大小和性能的影响很小。这项研究对工业具有重要意义,因为这种增值技术可以在不增加成本的情况下改善代码大小,功耗和性能。
{"title":"DSP address optimization using a minimum cost circulation technique","authors":"C. Gebotys","doi":"10.1109/ICCAD.1997.643380","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643380","url":null,"abstract":"This paper presents a new approach to solving the DSP address assignment problem. A minimum cost circulation approach is used to efficiently generate high performance addressing code in polynomial time. Addressing code size improvements of up to 7 times are obtained, accounting for up to 1.6 times improvement in code size and performance of compiler-generated DSP code. Results also show that memory layout has a small effect on code size and performance when optimal addressing is used. This research is important for industry since this value-added technique can improve code size, power dissipation and performance, without increasing cost.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129775342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
Efficient circuit partitioning to extend cycle simulation beyond synchronous circuits 有效的电路划分扩展周期仿真超越同步电路
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643400
Charlie DeVane
Cycle simulation techniques, such as levelized compiled code, can ordinarily be applied only to synchronous designs. They usually cannot be applied to designs containing circuit features like combinational paths, multiple clock domains, generated clocks, asynchronous resets, and transparent latches. This paper presents a novel partitioning algorithm that partitions a non-cycle-simulatable circuit containing these features into simulation that can be cycle simulated. Cycle simulation techniques can be applied to the individual sub-circuits, and the whole collection of sub-circuits can be simulated together using conventional co-simulation techniques. Empirical results demonstrate that this approach brings the benefits of cycle simulation to circuits that were previously impossible to cycle simulate. The partitioning algorithm requires time and space linear in the size of the circuit, and in practice is very fast. We also discuss how the key ideas presented here can be applied to accelerate HDL simulation.
周期模拟技术,如水平化编译代码,通常只能应用于同步设计。它们通常不能应用于包含组合路径、多个时钟域、生成时钟、异步复位和透明锁存等电路特征的设计。本文提出了一种新的划分算法,将包含这些特征的不可循环仿真电路划分为可循环仿真的仿真电路。周期仿真技术可以应用于单个子电路,并且可以使用传统的联合仿真技术对整个子电路集合进行模拟。经验结果表明,这种方法为以前不可能进行循环模拟的电路带来了循环模拟的好处。分划算法要求电路的大小在时间和空间上是线性的,在实践中速度非常快。我们还讨论了如何将这里提出的关键思想应用于加速HDL仿真。
{"title":"Efficient circuit partitioning to extend cycle simulation beyond synchronous circuits","authors":"Charlie DeVane","doi":"10.1109/ICCAD.1997.643400","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643400","url":null,"abstract":"Cycle simulation techniques, such as levelized compiled code, can ordinarily be applied only to synchronous designs. They usually cannot be applied to designs containing circuit features like combinational paths, multiple clock domains, generated clocks, asynchronous resets, and transparent latches. This paper presents a novel partitioning algorithm that partitions a non-cycle-simulatable circuit containing these features into simulation that can be cycle simulated. Cycle simulation techniques can be applied to the individual sub-circuits, and the whole collection of sub-circuits can be simulated together using conventional co-simulation techniques. Empirical results demonstrate that this approach brings the benefits of cycle simulation to circuits that were previously impossible to cycle simulate. The partitioning algorithm requires time and space linear in the size of the circuit, and in practice is very fast. We also discuss how the key ideas presented here can be applied to accelerate HDL simulation.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132712724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A new high-order absolutely-stable explicit numerical integration algorithm for the time-domain simulation of nonlinear circuits 非线性电路时域仿真的一种新的高阶绝对稳定显式数值积分算法
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643531
J. R. Griffith, M. Nakhla
A new numerical integration method for the time-domain simulation of nonlinear circuits is presented. The new method does not belong to the traditional class of linear multistep methods. Consequently, it is free from Dahlquist's (1963) barriers in terms of stability and order. The new method is shown to be both A-stable and at the same time of arbitrarily high order. In addition, the method is explicit in nature and does not require matrix inversion at each time step. Examples of linear and nonlinear circuit simulation are included. The proposed method significantly speeds up the time-domain simulation of nonlinear circuits as it combines the efficiency of an explicit method with the accuracy and large step size possible with high order.
提出了一种新的非线性电路时域仿真的数值积分方法。该方法不属于传统的线性多步骤方法。因此,在稳定性和秩序方面,它不受Dahlquist(1963)的障碍的影响。结果表明,该方法是稳定的,同时具有任意高阶性。此外,该方法本质上是显式的,不需要在每个时间步进行矩阵反演。包括线性和非线性电路仿真实例。该方法结合了显式方法的效率、精度和高阶可能的大步长,大大加快了非线性电路的时域仿真速度。
{"title":"A new high-order absolutely-stable explicit numerical integration algorithm for the time-domain simulation of nonlinear circuits","authors":"J. R. Griffith, M. Nakhla","doi":"10.1109/ICCAD.1997.643531","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643531","url":null,"abstract":"A new numerical integration method for the time-domain simulation of nonlinear circuits is presented. The new method does not belong to the traditional class of linear multistep methods. Consequently, it is free from Dahlquist's (1963) barriers in terms of stability and order. The new method is shown to be both A-stable and at the same time of arbitrarily high order. In addition, the method is explicit in nature and does not require matrix inversion at each time step. Examples of linear and nonlinear circuit simulation are included. The proposed method significantly speeds up the time-domain simulation of nonlinear circuits as it combines the efficiency of an explicit method with the accuracy and large step size possible with high order.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114811141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
High-level scheduling model and control synthesis for a broad range of design applications 高级调度模型和控制综合广泛的设计应用
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643526
Chih-Tung Chen, Kayhan Küçükçakar
Presents a versatile scheduling model and an efficient control synthesis methodology which enables architectural (high-level) design/synthesis systems to seamlessly support a broad range of architectural design applications from datapath-dominated digital signal processing (DSP) to commodity ICs such as microprocessors/microcontrollers and control-dominated peripherals, utilizing multiphase clocking schemes, multiple threading, data-dependent delays, pipelining, and combinations of the above. The work presented in this paper is an enabling technology for high-level synthesis to go beyond traditional datapath-dominated DSP applications and to start becoming a viable and cost-effective design methodology for commodity ICs.
提出了一种通用的调度模型和有效的控制综合方法,使架构(高级)设计/综合系统能够无缝地支持广泛的架构设计应用,从数据路径主导的数字信号处理(DSP)到微处理器/微控制器和控制主导的外设等商品ic,利用多相时钟方案,多线程,数据依赖延迟,流水线和上述组合。本文提出的工作是一种使能技术,用于高级综合,超越传统的数据路径主导的DSP应用,并开始成为商品ic的可行且具有成本效益的设计方法。
{"title":"High-level scheduling model and control synthesis for a broad range of design applications","authors":"Chih-Tung Chen, Kayhan Küçükçakar","doi":"10.1109/ICCAD.1997.643526","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643526","url":null,"abstract":"Presents a versatile scheduling model and an efficient control synthesis methodology which enables architectural (high-level) design/synthesis systems to seamlessly support a broad range of architectural design applications from datapath-dominated digital signal processing (DSP) to commodity ICs such as microprocessors/microcontrollers and control-dominated peripherals, utilizing multiphase clocking schemes, multiple threading, data-dependent delays, pipelining, and combinations of the above. The work presented in this paper is an enabling technology for high-level synthesis to go beyond traditional datapath-dominated DSP applications and to start becoming a viable and cost-effective design methodology for commodity ICs.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121595117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Verifying correct pipeline implementation for microprocessors 验证微处理器的正确管道实现
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643402
J. Levitt, K. Olukotun
We introduce a general, automatic verification technique for pipelined designs. The technique is based on a scalable, formal methodology for analysing pipelines. The key advantages to our technique are: it specifically targets pipeline control, making it more efficient; it requires no explicit specification, since it compares hardware against itself; it can be used within the broader framework of hierarchical verification; and, it can be easily extended to handle certain "complex" pipelined structures.
我们为流水线设计引入了一种通用的自动验证技术。该技术基于一种可扩展的、形式化的方法来分析管道。我们的技术的主要优势是:它专门针对管道控制,使其更高效;它不需要明确的规范,因为它将硬件与自身进行比较;它可以在更广泛的层次验证框架内使用;而且,它可以很容易地扩展到处理某些“复杂”的流水线结构。
{"title":"Verifying correct pipeline implementation for microprocessors","authors":"J. Levitt, K. Olukotun","doi":"10.1109/ICCAD.1997.643402","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643402","url":null,"abstract":"We introduce a general, automatic verification technique for pipelined designs. The technique is based on a scalable, formal methodology for analysing pipelines. The key advantages to our technique are: it specifically targets pipeline control, making it more efficient; it requires no explicit specification, since it compares hardware against itself; it can be used within the broader framework of hierarchical verification; and, it can be easily extended to handle certain \"complex\" pipelined structures.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116200291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Approximate timing analysis of combinational circuits under the XBDO model XBDO模型下组合电路的近似时序分析
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643404
Y. Kukimoto, W. Gosti, A. Saldanha, R. Brayton
This paper is concerned with approximate delay computation algorithms for combinational circuits. As a result of intensive research in the early 90's efficient tools exist which can analyze circuits of thousands of gates in a few minutes or even in seconds for many cases. However, the computation time of these tools is not so predictable since the internal engine of the analysis is either a SAT solver or a modified ATPG algorithm, both of which are just heuristic algorithms for an NP-complete problem. Although they are highly tuned for CAD applications, there exists a class of problem instances which exhibits the worst-case exponential CPU time behavior. In the context of timing analysis, circuits with a high amount of reconvergence, e.g. C6288 of the ISCAS benchmark suite, are known to be difficult to analyze under sophisticated delay models even with state-of-the-art techniques. To make timing analysis of such corner case circuits feasible we propose an approximate computation scheme to the timing analysis problem as an extension to the exact analysis method proposed previously. Sensitization conditions are conservatively approximated in a selective fashion so that the size of SAT problems solved during analysis is controlled. Experimental results show that the approximation technique is effective in reducing the total analysis time without losing accuracy for the case where the exact approach takes much time or cannot complete.
本文研究了组合电路的近似延迟计算算法。由于90年代早期的深入研究,已经有了在许多情况下可以在几分钟甚至几秒钟内分析数千个门电路的有效工具。然而,这些工具的计算时间并不是那么可预测,因为分析的内部引擎要么是SAT求解器,要么是改进的ATPG算法,这两种算法都只是np完全问题的启发式算法。虽然它们对CAD应用程序进行了高度调整,但存在一类问题实例,它们表现出最坏情况下的指数CPU时间行为。在时序分析的背景下,具有大量再收敛的电路,例如ISCAS基准套件的C6288,即使使用最先进的技术,也很难在复杂的延迟模型下进行分析。为了使拐角电路的时序分析可行,我们提出了时序分析问题的一种近似计算方案,作为前面提出的精确分析方法的扩展。敏化条件以一种选择性的方式保守地近似,以便在分析过程中解决的SAT问题的大小得到控制。实验结果表明,在精确方法耗时较长或无法完成的情况下,近似方法在不损失精度的情况下,可以有效地减少总分析时间。
{"title":"Approximate timing analysis of combinational circuits under the XBDO model","authors":"Y. Kukimoto, W. Gosti, A. Saldanha, R. Brayton","doi":"10.1109/ICCAD.1997.643404","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643404","url":null,"abstract":"This paper is concerned with approximate delay computation algorithms for combinational circuits. As a result of intensive research in the early 90's efficient tools exist which can analyze circuits of thousands of gates in a few minutes or even in seconds for many cases. However, the computation time of these tools is not so predictable since the internal engine of the analysis is either a SAT solver or a modified ATPG algorithm, both of which are just heuristic algorithms for an NP-complete problem. Although they are highly tuned for CAD applications, there exists a class of problem instances which exhibits the worst-case exponential CPU time behavior. In the context of timing analysis, circuits with a high amount of reconvergence, e.g. C6288 of the ISCAS benchmark suite, are known to be difficult to analyze under sophisticated delay models even with state-of-the-art techniques. To make timing analysis of such corner case circuits feasible we propose an approximate computation scheme to the timing analysis problem as an extension to the exact analysis method proposed previously. Sensitization conditions are conservatively approximated in a selective fashion so that the size of SAT problems solved during analysis is controlled. Experimental results show that the approximation technique is effective in reducing the total analysis time without losing accuracy for the case where the exact approach takes much time or cannot complete.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115367059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Power optimization using divide-and-conquer techniques for minimization of the number of operations 使用分而治之技术进行功率优化,以最小化操作次数
Pub Date : 1997-11-13 DOI: 10.1109/iccad.1997.643384
Inki Hong, M. Potkonjak, R. Karri
We develop an approach to minimizing power consumption of portable wireless DSP applications using a set of compilation and architectural techniques. The key technical innovation is a novel divide-and-conquer compilation technique to minimize the number of operations for general DSP computations. Our technique optimizes not only a significantly wider set of computations than the previously published techniques, but also outperforms (or performs at least as well as other techniques) on all examples. Along the architectural dimension, we investigate coordinated impact of compilation techniques on the number of processors which provide optimal trade-off between cost and power. We demonstrate that proper compilation techniques can significantly reduce power with bounded hardware cost. The effectiveness of all techniques and algorithms is documented on numerous real-life designs.
我们开发了一种使用一套编译和架构技术来最小化便携式无线DSP应用程序功耗的方法。关键的技术创新是一种新的分治编译技术,以最大限度地减少通用DSP计算的运算次数。我们的技术不仅优化了比以前发表的技术更广泛的计算集,而且在所有示例上都优于(或至少与其他技术一样好)。沿着体系结构维度,我们研究了编译技术对处理器数量的协调影响,这些处理器在成本和功耗之间提供了最佳的权衡。我们证明了适当的编译技术可以在有限的硬件成本下显著降低功耗。所有技术和算法的有效性都记录在许多现实生活中的设计中。
{"title":"Power optimization using divide-and-conquer techniques for minimization of the number of operations","authors":"Inki Hong, M. Potkonjak, R. Karri","doi":"10.1109/iccad.1997.643384","DOIUrl":"https://doi.org/10.1109/iccad.1997.643384","url":null,"abstract":"We develop an approach to minimizing power consumption of portable wireless DSP applications using a set of compilation and architectural techniques. The key technical innovation is a novel divide-and-conquer compilation technique to minimize the number of operations for general DSP computations. Our technique optimizes not only a significantly wider set of computations than the previously published techniques, but also outperforms (or performs at least as well as other techniques) on all examples. Along the architectural dimension, we investigate coordinated impact of compilation techniques on the number of processors which provide optimal trade-off between cost and power. We demonstrate that proper compilation techniques can significantly reduce power with bounded hardware cost. The effectiveness of all techniques and algorithms is documented on numerous real-life designs.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129496390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Large scale circuit partitioning with loose/stable net removal and signal flow based clustering 基于松散/稳定网络去除和信号流聚类的大规模电路划分
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643573
J. Cong, H. Li, S. Lim, Toshiyuki Shibuya, D. Xu
In this paper, we present an efficient Iterative Improvement based Partitioning (IIP) algorithm called LSR/MFFS, that combines signal flow based Maximum Fanout Free Subgraph (MFFS) clustering algorithm with Loose and Stable net Removal (LSR) partitioning algorithm. The MFFS algorithm generalizes existing MFFC decomposition method from combinational circuits to general sequential circuits in order to handle cycles naturally. We also study the properties of the nets that straddle the cutline carefully, and introduce the concepts of the loose and stable nets as well as effective ways to remove them out of the cutset. The LSR/MFFS algorithm first applies LSR algorithm to clustered netlist generated by MFFS algorithm for global-level cutsize optimization and then declusters netlist for further cutsize refinement. As a result, the LSR/MFFS algorithm has achieved the best cutsize result among all the bipartitioning algorithms published in the literatures with very promising runtime performance. In particular, it outperforms the recent state-of-the-art IIP algorithms LA3-CDIP, CLIP-PROP/sub f/, Strawman, hMetis-FM, and MLc by 17.4%, 12.1%, 5.9%, 3.1%, and 1.9%, respectively. It also outperforms the state-of-the-art non-IIP algorithms Paraboli, FEB, and PANZA by 32.0%, 21.4%, and 1.4%, respectively.
本文提出了一种高效的基于迭代改进的分区(IIP)算法LSR/MFFS,该算法将基于信号流的最大扇出自由子图(MFFS)聚类算法与松散稳定的网络去除(LSR)分区算法相结合。MFFS算法将现有的MFFC分解方法从组合电路推广到一般顺序电路,以自然地处理周期。我们还仔细研究了跨越切线的网的特性,并介绍了松散和稳定网的概念以及将它们从切线中移除的有效方法。LSR/MFFS算法首先利用LSR算法对MFFS算法生成的聚类网表进行全局裁剪尺寸优化,然后对网表进行聚类,进一步细化裁剪尺寸。结果表明,LSR/MFFS算法在所有已发表的双分区算法中取得了最好的分割效果,并且具有很好的运行时性能。特别是,它比最新的最先进的IIP算法LA3-CDIP、CLIP-PROP/sub /、Strawman、hMetis-FM和MLc分别高出17.4%、12.1%、5.9%、3.1%和1.9%。它也比最先进的非iip算法抛物线、FEB和PANZA分别高出32.0%、21.4%和1.4%。
{"title":"Large scale circuit partitioning with loose/stable net removal and signal flow based clustering","authors":"J. Cong, H. Li, S. Lim, Toshiyuki Shibuya, D. Xu","doi":"10.1109/ICCAD.1997.643573","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643573","url":null,"abstract":"In this paper, we present an efficient Iterative Improvement based Partitioning (IIP) algorithm called LSR/MFFS, that combines signal flow based Maximum Fanout Free Subgraph (MFFS) clustering algorithm with Loose and Stable net Removal (LSR) partitioning algorithm. The MFFS algorithm generalizes existing MFFC decomposition method from combinational circuits to general sequential circuits in order to handle cycles naturally. We also study the properties of the nets that straddle the cutline carefully, and introduce the concepts of the loose and stable nets as well as effective ways to remove them out of the cutset. The LSR/MFFS algorithm first applies LSR algorithm to clustered netlist generated by MFFS algorithm for global-level cutsize optimization and then declusters netlist for further cutsize refinement. As a result, the LSR/MFFS algorithm has achieved the best cutsize result among all the bipartitioning algorithms published in the literatures with very promising runtime performance. In particular, it outperforms the recent state-of-the-art IIP algorithms LA3-CDIP, CLIP-PROP/sub f/, Strawman, hMetis-FM, and MLc by 17.4%, 12.1%, 5.9%, 3.1%, and 1.9%, respectively. It also outperforms the state-of-the-art non-IIP algorithms Paraboli, FEB, and PANZA by 32.0%, 21.4%, and 1.4%, respectively.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"211 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127040287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
期刊
1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1