首页 > 最新文献

1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)最新文献

英文 中文
Resource sharing in hierarchical synthesis 层次综合中的资源共享
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643537
O. Bringmann, W. Rosenstiel
This paper presents a new approach to hierarchical high-level synthesis with respect to internal register-transfer structures of complex components. Entire subdesigns can efficiently be used as complex components at a higher hierarchical level of the design. After synthesis, the calculated schedule of each subdesign is added to its register-transfer component model. This enables the sharing of unused subcomponents across different hierarchical levels of the design. Especially, subcomponents of autonomous components, with a separate controller can also be shared. As a result, the presented methodology offers a high degree of optimization to hierarchically specified designs.
本文提出了一种针对复杂元件内部寄存器转移结构的分层高级合成新方法。整个子设计可以有效地用作设计的更高层次上的复杂组件。综合后,将各子设计的计算进度添加到其寄存器-传输组件模型中。这允许在设计的不同层次级别之间共享未使用的子组件。特别是,具有独立控制器的自治组件的子组件也可以共享。因此,所提出的方法为分层指定的设计提供了高度的优化。
{"title":"Resource sharing in hierarchical synthesis","authors":"O. Bringmann, W. Rosenstiel","doi":"10.1109/ICCAD.1997.643537","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643537","url":null,"abstract":"This paper presents a new approach to hierarchical high-level synthesis with respect to internal register-transfer structures of complex components. Entire subdesigns can efficiently be used as complex components at a higher hierarchical level of the design. After synthesis, the calculated schedule of each subdesign is added to its register-transfer component model. This enables the sharing of unused subcomponents across different hierarchical levels of the design. Especially, subcomponents of autonomous components, with a separate controller can also be shared. As a result, the presented methodology offers a high degree of optimization to hierarchically specified designs.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115882758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
A new approach to simultaneous buffer insertion and wire sizing 一种同时进行缓冲区插入和导线大小调整的新方法
Lei He, A. Kahng, K. Tam, Jinjun Xiong
We present a completely new approach to the problem of delay minimization by simultaneous buffer insertion and wire sizing for a wire. We show that the problem can be formulated as a convex quadratic program, which is known to be solvable in polynomial time. Nevertheless, we explore some special properties of our problem and derive on optimal and very efficient algorithm to solve the resulting program. Given m buffers and a set of n discrete choices of wire width, the running time of our algorithm is O(mn/sup 2/) and is independent of the wire length in practice. For example, an instance of 100 buffers and 100 choices of wire width can be solved in 3 seconds. Besides, our formulation is so versatile that it is easy to consider other objectives like wire area or power dissipation, or to add constraints to the solution. Also, wire capacitance lookup tables, or very general wire capacitance models which can capture area capacitance, fringing capacitance, coupling capacitance, etc. can be used.
我们提出了一种全新的方法,通过同时插入缓冲器和导线尺寸来解决导线延迟最小化问题。我们证明了这个问题可以被表述为一个凸二次规划,它可以在多项式时间内求解。然而,我们探索了这个问题的一些特殊性质,并推导出一个最优的、非常有效的算法来解决所得到的程序。给定m个缓冲区和一组n个离散的导线宽度选择,我们的算法的运行时间为O(mn/sup 2/),并且在实践中与导线长度无关。例如,一个有100个缓冲器和100个导线宽度选择的实例可以在3秒内解决。此外,我们的公式是如此通用,很容易考虑其他目标,如线面积或功耗,或添加约束的解决方案。此外,还可以使用线电容查找表或非常通用的线电容模型,这些模型可以捕获面积电容、边缘电容、耦合电容等。
{"title":"A new approach to simultaneous buffer insertion and wire sizing","authors":"Lei He, A. Kahng, K. Tam, Jinjun Xiong","doi":"10.5555/266388.266564","DOIUrl":"https://doi.org/10.5555/266388.266564","url":null,"abstract":"We present a completely new approach to the problem of delay minimization by simultaneous buffer insertion and wire sizing for a wire. We show that the problem can be formulated as a convex quadratic program, which is known to be solvable in polynomial time. Nevertheless, we explore some special properties of our problem and derive on optimal and very efficient algorithm to solve the resulting program. Given m buffers and a set of n discrete choices of wire width, the running time of our algorithm is O(mn/sup 2/) and is independent of the wire length in practice. For example, an instance of 100 buffers and 100 choices of wire width can be solved in 3 seconds. Besides, our formulation is so versatile that it is easy to consider other objectives like wire area or power dissipation, or to add constraints to the solution. Also, wire capacitance lookup tables, or very general wire capacitance models which can capture area capacitance, fringing capacitance, coupling capacitance, etc. can be used.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"32 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123205401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Generalized matching from theory to application 广义匹配从理论到应用
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643255
P. Vuillod, L. Benini, G. Micheli
We present a novel approach for post-mapping optimization. We exploit the concept of generalised matching, a technique that finds symbolically all possible matching assignments of library cells to a multi-output network specified by a Boolean relation. Several objectives are targeted: area minimization under delay constraints; power minimization under delay constraints; and unconstrained delay minimization. We describe the theory of generalized matching and the algorithmic optimization required for its efficient and robust implementation. A tool based on generalized matching has been implemented and tested on large examples of the MCNC'91 benchmark suite. We obtain sizable improvements in: speed (6% in average, up to 20.7%); area under speed constraints (13.7% an average, up to 29.5%); and power under speed constraints (22.3% in average, up to 38.1%).
我们提出了一种新的映射后优化方法。我们利用广义匹配的概念,这是一种从符号上找到库单元到布尔关系指定的多输出网络的所有可能匹配分配的技术。有几个目标:延迟约束下的面积最小化;时延约束下的功率最小化以及无约束延迟最小化。我们描述了广义匹配的理论和算法优化所需的有效和鲁棒的实现。一个基于广义匹配的工具已经实现并在MCNC'91基准套件的大型示例上进行了测试。我们在以下方面获得了相当大的改进:速度(平均6%,最高20.7%);受速度限制的面积(平均13.7%,最高29.5%);速度限制下的动力(平均22.3%,最高38.1%)。
{"title":"Generalized matching from theory to application","authors":"P. Vuillod, L. Benini, G. Micheli","doi":"10.1109/ICCAD.1997.643255","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643255","url":null,"abstract":"We present a novel approach for post-mapping optimization. We exploit the concept of generalised matching, a technique that finds symbolically all possible matching assignments of library cells to a multi-output network specified by a Boolean relation. Several objectives are targeted: area minimization under delay constraints; power minimization under delay constraints; and unconstrained delay minimization. We describe the theory of generalized matching and the algorithmic optimization required for its efficient and robust implementation. A tool based on generalized matching has been implemented and tested on large examples of the MCNC'91 benchmark suite. We obtain sizable improvements in: speed (6% in average, up to 20.7%); area under speed constraints (13.7% an average, up to 29.5%); and power under speed constraints (22.3% in average, up to 38.1%).","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128914362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
FastPep: a fast parasitic extraction program for complex three-dimensional geometries FastPep:用于复杂三维几何形状的快速寄生提取程序
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643575
M. Kamon, N. Marques, Jacob K. White
In this paper we describe a computationally efficient approach to generating reduced-order models from PEEC-based three-dimensional electromagnetic analysis programs. It is shown that a recycled multipole-accelerated approach applied to recent model order reduction techniques requires nearly two orders of magnitude fewer floating point operations than direct techniques thus allowing the analysis of larger, more complex three-dimensional geometries.
在本文中,我们描述了一种计算效率的方法,从基于peec的三维电磁分析程序生成降阶模型。研究表明,应用于最近模型降阶技术的循环多极加速方法需要的浮点运算比直接技术少近两个数量级,从而允许分析更大、更复杂的三维几何形状。
{"title":"FastPep: a fast parasitic extraction program for complex three-dimensional geometries","authors":"M. Kamon, N. Marques, Jacob K. White","doi":"10.1109/ICCAD.1997.643575","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643575","url":null,"abstract":"In this paper we describe a computationally efficient approach to generating reduced-order models from PEEC-based three-dimensional electromagnetic analysis programs. It is shown that a recycled multipole-accelerated approach applied to recent model order reduction techniques requires nearly two orders of magnitude fewer floating point operations than direct techniques thus allowing the analysis of larger, more complex three-dimensional geometries.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130174899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
An exact gate decomposition algorithm for low-power technology mapping 低功耗技术映射的精确门分解算法
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643597
H. Zhou, Martin D. F. Wong
With the remarkable growth of portable application and the increasing frequency and integration density, power is being given comparable weight to speed and area in IC designs. In technology mapping, how decomposition is done can have a significant impact on the power dissipation of the final implementation. In the literature, only heuristic algorithms are given for the low power gate decomposition problem. We prove many properties an optimal decomposition tree must have. Based on these optimality properties, we design an efficient exact algorithm to solve the low power gate decomposition problem. Moreover the exact algorithm can be easily modified to a heuristic algorithm which performs much better than the known heuristics.
随着便携式应用的显著增长以及频率和集成密度的增加,在集成电路设计中,功率的重要性与速度和面积相当。在技术映射中,如何进行分解会对最终实现的功耗产生重大影响。在文献中,对于低功耗栅极分解问题,只给出了启发式算法。我们证明了最优分解树必须具备的许多性质。基于这些最优性,我们设计了一种高效精确的算法来解决低功耗栅极分解问题。此外,精确算法可以很容易地修改为启发式算法,其性能比已知的启发式算法要好得多。
{"title":"An exact gate decomposition algorithm for low-power technology mapping","authors":"H. Zhou, Martin D. F. Wong","doi":"10.1109/ICCAD.1997.643597","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643597","url":null,"abstract":"With the remarkable growth of portable application and the increasing frequency and integration density, power is being given comparable weight to speed and area in IC designs. In technology mapping, how decomposition is done can have a significant impact on the power dissipation of the final implementation. In the literature, only heuristic algorithms are given for the low power gate decomposition problem. We prove many properties an optimal decomposition tree must have. Based on these optimality properties, we design an efficient exact algorithm to solve the low power gate decomposition problem. Moreover the exact algorithm can be easily modified to a heuristic algorithm which performs much better than the known heuristics.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126556782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Transformational partitioning for co-design of multiprocessor systems 多处理器系统协同设计的转换划分
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643585
G. Marchioro, J. Daveau, A. Jerraya
This paper presents the underlying methodology of Cosmos, an interactive approach for hardware/software co-design capable of handling multiprocessor systems and distributed architectures. The approach covers the co-design process through a set of user guided transformations allowing semi-automatic partitioning. The transformations are based on a powerful set of primitives for functional partitioning, structural reorganization and communication transformation. It leads to a fast transformation of a system-level specification into an architecture with a short design time and fast exploration of design space. The application of this approach is illustrated using a design example starting from a system-level specification given in SDL to a distributed hardware/software architecture described in C/VHDL. We show that the use of transformational approach allows: 1) application of the expertise of the designer during partitioning; 2) the user to understand the results of the co-design process; 3) the process to take into account partial existing solutions; 4) large design space exploration; and 5) the designer to start from a very high-level specification language of the system to be designed.
本文介绍了Cosmos的基本方法,这是一种能够处理多处理器系统和分布式体系结构的硬件/软件协同设计的交互式方法。该方法通过一组允许半自动分区的用户引导转换涵盖了协同设计过程。这些转换基于一组强大的原语,用于功能划分、结构重组和通信转换。它可以在较短的设计时间和对设计空间的快速探索下,将系统级规范快速转换为体系结构。用一个设计实例说明了这种方法的应用,从SDL给出的系统级规范到C/VHDL描述的分布式硬件/软件体系结构。我们表明,使用转换方法允许:1)在分区期间应用设计师的专业知识;2)用户了解协同设计过程的结果;3)过程中考虑到部分现有解决方案;4)大设计空间探索;并且设计者要从一个非常高级的规范语言开始对所要设计的系统进行设计。
{"title":"Transformational partitioning for co-design of multiprocessor systems","authors":"G. Marchioro, J. Daveau, A. Jerraya","doi":"10.1109/ICCAD.1997.643585","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643585","url":null,"abstract":"This paper presents the underlying methodology of Cosmos, an interactive approach for hardware/software co-design capable of handling multiprocessor systems and distributed architectures. The approach covers the co-design process through a set of user guided transformations allowing semi-automatic partitioning. The transformations are based on a powerful set of primitives for functional partitioning, structural reorganization and communication transformation. It leads to a fast transformation of a system-level specification into an architecture with a short design time and fast exploration of design space. The application of this approach is illustrated using a design example starting from a system-level specification given in SDL to a distributed hardware/software architecture described in C/VHDL. We show that the use of transformational approach allows: 1) application of the expertise of the designer during partitioning; 2) the user to understand the results of the co-design process; 3) the process to take into account partial existing solutions; 4) large design space exploration; and 5) the designer to start from a very high-level specification language of the system to be designed.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134250725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Built-in test generation for synchronous sequential circuits 内置测试生成同步顺序电路
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643570
I. Pomeranz, S. Reddy
We consider the problem of built-in test generation for synchronous sequential circuits. The proposed scheme leaves the circuit flip-flops unmodified, and thus allows at-speed test application. We introduce a uniform, parametrized structure for test pattern generation. By matching the parameters of the test pattern generator to the circuit-under-test, high fault coverage is achieved. In many cases, the fault coverage is equal to the fault coverage that can be achieved by deterministic test sequences. We also investigate a method to minimize the size of the test pattern generator, and study its effectiveness alone and in conjunction with the insertion of test-points.
研究同步顺序电路的内置测试生成问题。所提出的方案不修改电路触发器,从而允许高速测试应用。我们引入了一种统一的、参数化的测试模式生成结构。通过将测试图发生器的参数与待测电路相匹配,实现了高故障覆盖率。在许多情况下,故障覆盖率等于通过确定性测试序列可以实现的故障覆盖率。我们还研究了一种最小化测试模式生成器大小的方法,并研究了它单独和与测试点插入相结合的有效性。
{"title":"Built-in test generation for synchronous sequential circuits","authors":"I. Pomeranz, S. Reddy","doi":"10.1109/ICCAD.1997.643570","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643570","url":null,"abstract":"We consider the problem of built-in test generation for synchronous sequential circuits. The proposed scheme leaves the circuit flip-flops unmodified, and thus allows at-speed test application. We introduce a uniform, parametrized structure for test pattern generation. By matching the parameters of the test pattern generator to the circuit-under-test, high fault coverage is achieved. In many cases, the fault coverage is equal to the fault coverage that can be achieved by deterministic test sequences. We also investigate a method to minimize the size of the test pattern generator, and study its effectiveness alone and in conjunction with the insertion of test-points.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115339615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Power sensitivity-a new method to estimate power dissipation considering uncertain specifications of primary inputs 功率灵敏度——一种考虑一次输入不确定参数的功耗估计方法
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643276
Zhanping Chen, K. Roy, T. Chou
Power dissipation in CMOS circuits heavily depends on the signal properties of the primary inputs. Due to uncertainties in specification of such properties, the average power should be specified between a maximum and a minimum possible value. Due to the complex nature of the problem, it is practically impossible to use traditional power estimation techniques to determine such bounds. We present a novel approach to accurately estimate the maximum and minimum bounds for average power using a technique which calculates the sensitivities of average power dissipation to primary input signal properties. The sensitivities are calculated using a novel statistical technique and can be obtained as a by-product of average power estimation using a Monte Carlo based approach. The signal properties are specified in terms of signal probability (probability of a signal being logic ONE) and signal activity (probability of signal switching). Results show that the maximum and minimum average power dissipation can vary widely if the primary input probabilities and activities are not specified accurately.
CMOS电路的功耗很大程度上取决于主输入的信号特性。由于这些特性的规格不确定,平均功率应在可能的最大值和最小值之间指定。由于问题的复杂性,使用传统的功率估计技术来确定这种边界实际上是不可能的。我们提出了一种新的方法,利用计算平均功率耗散对初级输入信号特性的灵敏度的技术来准确地估计平均功率的最大值和最小限值。灵敏度的计算采用了一种新颖的统计技术,并可以作为平均功率估计的副产品,使用基于蒙特卡罗的方法。信号属性是根据信号概率(信号是逻辑ONE的概率)和信号活动(信号切换的概率)来指定的。结果表明,如果不准确地指定主输入概率和活动,最大和最小平均功耗会有很大的变化。
{"title":"Power sensitivity-a new method to estimate power dissipation considering uncertain specifications of primary inputs","authors":"Zhanping Chen, K. Roy, T. Chou","doi":"10.1109/ICCAD.1997.643276","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643276","url":null,"abstract":"Power dissipation in CMOS circuits heavily depends on the signal properties of the primary inputs. Due to uncertainties in specification of such properties, the average power should be specified between a maximum and a minimum possible value. Due to the complex nature of the problem, it is practically impossible to use traditional power estimation techniques to determine such bounds. We present a novel approach to accurately estimate the maximum and minimum bounds for average power using a technique which calculates the sensitivities of average power dissipation to primary input signal properties. The sensitivities are calculated using a novel statistical technique and can be obtained as a by-product of average power estimation using a Monte Carlo based approach. The signal properties are specified in terms of signal probability (probability of a signal being logic ONE) and signal activity (probability of signal switching). Results show that the maximum and minimum average power dissipation can vary widely if the primary input probabilities and activities are not specified accurately.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115134897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Scheduling and binding bounds for RT-level symbolic execution rt级符号执行的调度和绑定边界
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643525
C. Monahan, F. Brewer
Generalizes ALAP (as late as possible) bounds for the exact scheduling problem on a pre-defined data path. Conventional bounds are inapplicable because of the possible requirement of re-computing operands for minimal schedule length. Efficient techniques are presented for constructing the new bounds which are sensitive to point-to-point delays via transitive memory units. An efficient operand mapping bound is also described. Based on these two bounds, time improvement factors of 50 have demonstrated in exact scheduling results.
为预定义数据路径上的精确调度问题泛化ALAP(尽可能晚)边界。由于可能需要为最小的调度长度重新计算操作数,因此传统的边界不适用。提出了利用传递存储单元构造对点对点延迟敏感的新边界的有效方法。还描述了一个有效的操作数映射边界。基于这两个边界,在精确的调度结果中证明了50的时间改进因子。
{"title":"Scheduling and binding bounds for RT-level symbolic execution","authors":"C. Monahan, F. Brewer","doi":"10.1109/ICCAD.1997.643525","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643525","url":null,"abstract":"Generalizes ALAP (as late as possible) bounds for the exact scheduling problem on a pre-defined data path. Conventional bounds are inapplicable because of the possible requirement of re-computing operands for minimal schedule length. Efficient techniques are presented for constructing the new bounds which are sensitive to point-to-point delays via transitive memory units. An efficient operand mapping bound is also described. Based on these two bounds, time improvement factors of 50 have demonstrated in exact scheduling results.","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122471102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Wavesched: a novel scheduling technique for control-flow intensive behavioral descriptions 波化:一种用于控制流密集行为描述的新型调度技术
Pub Date : 1997-11-13 DOI: 10.1109/ICCAD.1997.643527
G. Lakshminarayana, K. Khouri, N. Jha
Presents a novel scheduling algorithm targeted towards minimizing the average execution time of control-flow intensive behavioral descriptions. Our algorithm uses a control-data flow graph (CDFG) model, which preserves the parallelism inherent in the application. It explores previously unexplored regions of the solution space through its ability to overlap the schedules of independent iterative constructs whose bodies share resources. It also incorporates well-known optimization techniques like loop unrolling in a natural fashion. This is made possible by a general loop-handling technique which we have devised. Application of the algorithm to several common benchmarks demonstrates up to 4.8-fold improvement in expected schedule length over existing scheduling algorithms, without paying a price in terms of the best- and worst-case schedule lengths required to execute the behavioral description (in fact, frequently, the best/worst-case schedule lengths are also better for our algorithm).
提出了一种以最小化控制流密集行为描述的平均执行时间为目标的调度算法。我们的算法使用控制数据流图(CDFG)模型,该模型保留了应用程序固有的并行性。它通过重叠独立迭代构造的时间表的能力来探索以前未探索的解决方案空间区域,这些构造的主体共享资源。它还结合了著名的优化技术,如以自然的方式展开循环。这是通过我们设计的一般循环处理技术实现的。该算法在几个常见基准测试中的应用表明,与现有调度算法相比,预期调度长度提高了4.8倍,而执行行为描述所需的最佳和最坏情况调度长度没有付出代价(事实上,通常,最佳/最坏情况调度长度也更适合我们的算法)。
{"title":"Wavesched: a novel scheduling technique for control-flow intensive behavioral descriptions","authors":"G. Lakshminarayana, K. Khouri, N. Jha","doi":"10.1109/ICCAD.1997.643527","DOIUrl":"https://doi.org/10.1109/ICCAD.1997.643527","url":null,"abstract":"Presents a novel scheduling algorithm targeted towards minimizing the average execution time of control-flow intensive behavioral descriptions. Our algorithm uses a control-data flow graph (CDFG) model, which preserves the parallelism inherent in the application. It explores previously unexplored regions of the solution space through its ability to overlap the schedules of independent iterative constructs whose bodies share resources. It also incorporates well-known optimization techniques like loop unrolling in a natural fashion. This is made possible by a general loop-handling technique which we have devised. Application of the algorithm to several common benchmarks demonstrates up to 4.8-fold improvement in expected schedule length over existing scheduling algorithms, without paying a price in terms of the best- and worst-case schedule lengths required to execute the behavioral description (in fact, frequently, the best/worst-case schedule lengths are also better for our algorithm).","PeriodicalId":187521,"journal":{"name":"1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122946801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
期刊
1997 Proceedings of IEEE International Conference on Computer Aided Design (ICCAD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1