1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)最新文献

英文中文

Optimal allocation of carry-save-adders in arithmetic optimization 算术优化中进位加法器的最优分配

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810685

Junhyung Um, Taewhan Kim, C. Liu

Carry-save-adder(CSA) is one of the most widely used schemes for fast arithmetic in industry. This paper provides a solution to the problem of finding an optimal-timing allocation of CSAs. Specifically, we present a polynomial time algorithm which finds an optimal-timing CSA allocation for a given arithmetic expression. In addition, we extend our result for CSA allocation to the problem of optimizing arithmetic expressions across the boundary of design hierarchy by introducing a new concept, called auxiliary ports. Our algorithm can be used to carry out the CSA allocation step optimally and automatically, and this can be done within the context of a standard HDL synthesis environment.

进位节省加法器(CSA)是工业上应用最广泛的快速算法之一。本文提出了一种求解csa最优时序分配问题的方法。具体来说，我们提出了一个多项式时间算法，该算法可以找到给定算术表达式的最佳时序CSA分配。此外，我们通过引入一个称为辅助端口的新概念，将CSA分配的结果扩展到跨设计层次边界优化算术表达式的问题。我们的算法可以在标准HDL合成环境中最优和自动地执行CSA分配步骤。

引用次数: 23

Power minimization using system-level partitioning of applications with quality of service requirements 使用具有服务质量要求的应用程序的系统级分区来实现功耗最小化

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810672

G. Qu, M. Potkonjak

Design systems to provide various quality of service (QoS) guarantees have received a lot of attention due to the increasing popularity of real-time multimedia and wireless communication applications. Meanwhile, low power consumption is always one of the goals for system design, especially for battery-operated systems. With the design trend of integrating multiple processor cores and memory on a single chip, we address the problem of how to partition a set of applications among processors, such that all the individual QoS requirements are met and the total energy consumption is minimized. We exploit the advantages provided by the variable voltage design methodology to choose the voltage for each application on the same processor optimally for this purpose. We also discuss how to partition applications among the processors to achieve the same goal. We formulate the problem on an abstract QoS model and present how to allocate resources (e.g., CPU time) and determine the voltage profile for every single processor. Experiments on media benchmarks have also been studied.

随着实时多媒体和无线通信应用的日益普及，设计提供各种服务质量(QoS)保证的系统受到了广泛的关注。同时，低功耗一直是系统设计的目标之一，尤其是电池供电系统。随着多处理器核心和存储集成在一个芯片上的设计趋势，我们解决了如何在处理器之间划分一组应用的问题，以满足所有单独的QoS要求，并使总能耗最小化。我们利用可变电压设计方法提供的优势，为同一处理器上的每个应用选择最佳电压。我们还讨论了如何在处理器之间对应用程序进行分区以实现相同的目标。我们在一个抽象的QoS模型上提出了这个问题，并给出了如何分配资源(例如，CPU时间)和确定每个单个处理器的电压分布。对媒介基准的实验也进行了研究。

{"title":"Power minimization using system-level partitioning of applications with quality of service requirements","authors":"G. Qu, M. Potkonjak","doi":"10.1109/ICCAD.1999.810672","DOIUrl":"https://doi.org/10.1109/ICCAD.1999.810672","url":null,"abstract":"Design systems to provide various quality of service (QoS) guarantees have received a lot of attention due to the increasing popularity of real-time multimedia and wireless communication applications. Meanwhile, low power consumption is always one of the goals for system design, especially for battery-operated systems. With the design trend of integrating multiple processor cores and memory on a single chip, we address the problem of how to partition a set of applications among processors, such that all the individual QoS requirements are met and the total energy consumption is minimized. We exploit the advantages provided by the variable voltage design methodology to choose the voltage for each application on the same processor optimally for this purpose. We also discuss how to partition applications among the processors to achieve the same goal. We formulate the problem on an abstract QoS model and present how to allocate resources (e.g., CPU time) and determine the voltage profile for every single processor. Experiments on media benchmarks have also been studied.","PeriodicalId":6414,"journal":{"name":"1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)","volume":"315 1","pages":"343-346"},"PeriodicalIF":0.0,"publicationDate":"1999-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77624909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Function inlining under code size constraints for embedded processors 嵌入式处理器在代码大小限制下的函数内联

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.5555/339492.340022

R. Leupers, P. Marwedel

Function inlining is a compiler optimization that generally increases performance at the expense of larger code size. However, current inlining techniques do not meet the special demands in the design of embedded systems, since they are based on simple heuristics, and they generate code of unpredictable size. This paper presents a novel approach to function inlining in C compilers for embedded processors, which aims at maximum program speedup under a global limit on code size. The core of this approach is a branch-and-bound algorithm which allows one to quickly explore the large search space. In an application study, we show how this algorithm can be applied to maximize the execution speed of an application under a given code size constraint.

函数内联是一种编译器优化，通常以更大的代码大小为代价来提高性能。然而，目前的内联技术不能满足嵌入式系统设计的特殊要求，因为它们基于简单的启发式，并且它们生成的代码大小不可预测。本文提出了一种在嵌入式处理器的C编译器中实现函数内联的新方法，该方法的目的是在全局限制的代码大小下实现最大的程序加速。该方法的核心是分支定界算法，该算法允许快速探索大的搜索空间。在应用程序研究中，我们展示了如何在给定代码大小约束下应用此算法来最大化应用程序的执行速度。

引用次数: 49

Formal specification and verification of a dataflow processor array 数据流处理器阵列的正式规范和验证

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810700

T. Henzinger, Xiaojun Liu, S. Qadeer, S. Rajamani

We describe the formal specification and verification of the VGI parallel DSP chip (V. Srini et al., 1998), which contains 64 compute processors with /spl sim/30 K gates in each processor. Our effort coincided with the "informal" verification stage of the chip. By interacting with the designers, we produced an abstract but executable specification of the design which embodies the programmer's view of the system. Given the size of the design, an automatic check that even one of the 64 processors satisfies its specification is well beyond the scope of current verification tools. However, the check can be decomposed using assume-guarantee reasoning. For VGI, the implementation and specification operate at different time scales: several steps of the implementation correspond to a single step in the specification. We generalized both the assume-guarantee method and our model checker MOCHA to allow compositional verification for such applications. We used our proof rule to decompose the verification problem of the VGI chip into smaller proof obligations that were discharged automatically by MOCHA. Using our formal approach, we uncovered and fixed subtle bugs that were unknown to the designers.

我们描述了VGI并行DSP芯片的正式规范和验证(V. Srini等人，1998)，其中包含64个计算处理器，每个处理器中有/spl sim/ 30k门。我们的努力与芯片的“非正式”验证阶段相吻合。通过与设计师的互动，我们产生了一个抽象但可执行的设计规范，它体现了程序员对系统的看法。考虑到设计的大小，即使64个处理器中的一个满足其规格的自动检查也远远超出了当前验证工具的范围。但是，可以使用假设-保证推理对检查进行分解。对于VGI，实现和规范在不同的时间尺度上运行:实现的几个步骤对应于规范中的一个步骤。我们推广了假设保证方法和模型检查器MOCHA，以允许对此类应用程序进行组合验证。我们使用我们的证明规则将VGI芯片的验证问题分解为更小的证明义务，这些证明义务由MOCHA自动解除。使用我们的正式方法，我们发现并修复了设计师不知道的细微漏洞。

{"title":"Formal specification and verification of a dataflow processor array","authors":"T. Henzinger, Xiaojun Liu, S. Qadeer, S. Rajamani","doi":"10.1109/ICCAD.1999.810700","DOIUrl":"https://doi.org/10.1109/ICCAD.1999.810700","url":null,"abstract":"We describe the formal specification and verification of the VGI parallel DSP chip (V. Srini et al., 1998), which contains 64 compute processors with /spl sim/30 K gates in each processor. Our effort coincided with the \"informal\" verification stage of the chip. By interacting with the designers, we produced an abstract but executable specification of the design which embodies the programmer's view of the system. Given the size of the design, an automatic check that even one of the 64 processors satisfies its specification is well beyond the scope of current verification tools. However, the check can be decomposed using assume-guarantee reasoning. For VGI, the implementation and specification operate at different time scales: several steps of the implementation correspond to a single step in the specification. We generalized both the assume-guarantee method and our model checker MOCHA to allow compositional verification for such applications. We used our proof rule to decompose the verification problem of the VGI chip into smaller proof obligations that were discharged automatically by MOCHA. Using our formal approach, we uncovered and fixed subtle bugs that were unknown to the designers.","PeriodicalId":6414,"journal":{"name":"1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)","volume":"136 1","pages":"494-499"},"PeriodicalIF":0.0,"publicationDate":"1999-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77941895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 28

Efficient model reduction of interconnect via approximate system gramians 通过近似系统文法实现互连的有效模型约简

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810679

Jing-Rebecca Li, Jacob K. White

Krylov-subspace based methods for generating low-order models of complicated interconnect are extremely effective, but there is no optimality theory for the resulting models. Alternatively, methods based on truncating a balanced realization (TBR), in which the observability and controllability gramians have been diagonalized, do have an optimality property but are too computationally expensive to use on complicated problems. In this paper we present a method for computing reduced-order models of interconnect by projection via the orthogonalized union of the approximate dominant eigenspaces of the system's controllability and observability gramians. The approximate dominant eigenspaces are obtained efficiently using an iterative Lyapunov equation solver, Vector ADI, which requires only linear matrix-vector solves. A spiral inductor and a transmission line example are used to demonstrate that the new method accurately approximates the TBR results and gives much more accurate wideband models than Krylov subspace-based moment matching methods.

基于krylov -子空间的复杂互连低阶模型生成方法是非常有效的，但对于生成的模型没有最优性理论。另外，基于截断平衡实现(TBR)的方法，其中可观察性和可控性已经对角化，确实具有最优性，但计算成本太高，无法用于复杂问题。本文提出了一种利用系统可控性和可观测性矩阵的近似显性特征空间的正交化并来计算投影互联的降阶模型的方法。利用迭代Lyapunov方程求解器Vector ADI有效地获得了近似的显性特征空间，该方法只需要线性矩阵-向量求解。以螺旋电感和传输线为例，表明该方法能较好地逼近TBR结果，并能比基于Krylov子空间的矩匹配方法得到更精确的宽带模型。

引用次数: 60

Modeling design constraints and biasing in simulation using BDDs 利用bdd建模仿真中的设计约束和偏置

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810715

Jun Yuan, Kurt Shultz, C. Pixley, H. Miller, A. Aziz

Constraining and input biasing are frequently used techniques in functional verification methodologies based on randomized simulation generation. Constraints confine the simulation to a legal input space, while input biasing, which can be considered as a probabilistic constraint, makes it easier to cover interesting "corner" cases. In this paper, we propose to use constraints and biasing to form a simulation environment instead of using an explicit testbench in hierarchical functional verification. Both constraints and input biasing can depend on the state of the design and thus are very expressive in modeling the environment. We present a novel method that unifies the handling of constraints and biasing via the use of Binary Decision Diagrams (BDDs). The distribution of input vectors under the effect of constraints and input biasing are determined by what we refer to as the constrained probabilities. A BDD representing the constraints is first built, then an algorithm is applied to bias the branching probabilities in the BDD. During simulation, this annotated BDD is used to generate input vectors whose distribution matches their predetermined constrained probabilities. The simulation generation is a one-pass process, i.e., no backtracking or retry is needed. Also, we describe a partitioning method to minimize the size of BDDs used in simulation generation. Our techniques were used in the verification of a set of commercial designs; experimental results demonstrated their effectiveness.

约束和输入偏置是基于随机仿真生成的功能验证方法中常用的技术。约束将模拟限制在合法的输入空间中，而输入偏差(可以被视为概率约束)使其更容易覆盖有趣的“角落”情况。在本文中，我们建议使用约束和偏差来形成仿真环境，而不是在分层功能验证中使用显式测试平台。约束和输入偏差都取决于设计的状态，因此在环境建模中非常有表现力。本文提出了一种利用二元决策图(bdd)统一约束和偏置处理的新方法。在约束和输入偏置的影响下，输入向量的分布是由我们所说的约束概率决定的。首先建立一个表示约束的BDD，然后应用算法对BDD中的分支概率进行偏置。在仿真过程中，该带注释的BDD用于生成与其预定约束概率匹配的输入向量。模拟生成是一次通过的过程，也就是说，不需要回溯或重试。此外，我们还描述了一种分区方法，以最小化仿真生成中使用的bdd的大小。我们的技术被用于一系列商业设计的验证;实验结果证明了该方法的有效性。

{"title":"Modeling design constraints and biasing in simulation using BDDs","authors":"Jun Yuan, Kurt Shultz, C. Pixley, H. Miller, A. Aziz","doi":"10.1109/ICCAD.1999.810715","DOIUrl":"https://doi.org/10.1109/ICCAD.1999.810715","url":null,"abstract":"Constraining and input biasing are frequently used techniques in functional verification methodologies based on randomized simulation generation. Constraints confine the simulation to a legal input space, while input biasing, which can be considered as a probabilistic constraint, makes it easier to cover interesting \"corner\" cases. In this paper, we propose to use constraints and biasing to form a simulation environment instead of using an explicit testbench in hierarchical functional verification. Both constraints and input biasing can depend on the state of the design and thus are very expressive in modeling the environment. We present a novel method that unifies the handling of constraints and biasing via the use of Binary Decision Diagrams (BDDs). The distribution of input vectors under the effect of constraints and input biasing are determined by what we refer to as the constrained probabilities. A BDD representing the constraints is first built, then an algorithm is applied to bias the branching probabilities in the BDD. During simulation, this annotated BDD is used to generate input vectors whose distribution matches their predetermined constrained probabilities. The simulation generation is a one-pass process, i.e., no backtracking or retry is needed. Also, we describe a partitioning method to minimize the size of BDDs used in simulation generation. Our techniques were used in the verification of a set of commercial designs; experimental results demonstrated their effectiveness.","PeriodicalId":6414,"journal":{"name":"1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)","volume":"50 1","pages":"584-589"},"PeriodicalIF":0.0,"publicationDate":"1999-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91278363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 102

Deep submicron defect detection with the energy consumption ratio 基于能耗比的深亚微米缺陷检测

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810695

B. Vinnakota

Advances in technology and increasing integration are expected to degrade the effectiveness of I/sub ddq/ test. Total leakage currents per IC are expected to be very large, making it difficult to detect the impact of a single defect (T.W. Williams et al., 1996; M. Sachdev, 1997). Test methods which detect faults by monitoring the dynamic supply current have been suggested as one alternative. Almost all the dynamic current techniques proposed in the literature are adversely affected by the impact of process variations. Many are also susceptible to timing and magnitude errors in measurement. The energy consumption ratio (ECR) is a new dynamic current-based test metric (B. Vinnakota et al., 1998), that addresses some of these problems. ECR-based test offers several advantages over other dynamic test methods and the I/sub ddq/ test such as tolerance to process variations, reduced test process complexity and a proven ability to detect faults that escape other techniques. The ECR has been validated through extensive simulation and through application to a 50K gate sub-micron biomedical IC (W. Jiang and B. Vinnakota, 1999). Our contributions are directed towards further validating the real quality of the ECR.

技术的进步和一体化程度的提高预计会降低I/sub ddq/测试的有效性。每个集成电路的总泄漏电流预计会非常大，因此很难检测到单个缺陷的影响(T.W. Williams等人，1996;M. Sachdev, 1997)。通过监测动态电源电流来检测故障的测试方法已被建议作为一种替代方法。文献中提出的几乎所有动态电流技术都受到工艺变化的不利影响。许多在测量中也容易受到时间和幅度误差的影响。能量消耗比(ECR)是一种新的基于电流的动态测试指标(B. Vinnakota et al.， 1998)，它解决了其中的一些问题。与其他动态测试方法和I/sub ddq/测试相比，基于ecr的测试提供了几个优势，例如对过程变化的容忍度，降低了测试过程的复杂性，以及检测其他技术无法检测到的故障的能力。ECR已经通过广泛的模拟和50K栅极亚微米生物医学IC的应用得到验证(W. Jiang和B. Vinnakota, 1999)。我们的贡献是为了进一步验证ECR的真实质量。

{"title":"Deep submicron defect detection with the energy consumption ratio","authors":"B. Vinnakota","doi":"10.1109/ICCAD.1999.810695","DOIUrl":"https://doi.org/10.1109/ICCAD.1999.810695","url":null,"abstract":"Advances in technology and increasing integration are expected to degrade the effectiveness of I/sub ddq/ test. Total leakage currents per IC are expected to be very large, making it difficult to detect the impact of a single defect (T.W. Williams et al., 1996; M. Sachdev, 1997). Test methods which detect faults by monitoring the dynamic supply current have been suggested as one alternative. Almost all the dynamic current techniques proposed in the literature are adversely affected by the impact of process variations. Many are also susceptible to timing and magnitude errors in measurement. The energy consumption ratio (ECR) is a new dynamic current-based test metric (B. Vinnakota et al., 1998), that addresses some of these problems. ECR-based test offers several advantages over other dynamic test methods and the I/sub ddq/ test such as tolerance to process variations, reduced test process complexity and a proven ability to detect faults that escape other techniques. The ECR has been validated through extensive simulation and through application to a 50K gate sub-micron biomedical IC (W. Jiang and B. Vinnakota, 1999). Our contributions are directed towards further validating the real quality of the ECR.","PeriodicalId":6414,"journal":{"name":"1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)","volume":"103 1","pages":"467-470"},"PeriodicalIF":0.0,"publicationDate":"1999-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90746583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

An integrated algorithm for combined placement and libraryless technology mapping 一种组合布局与无库技术映射的集成算法

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810630

Yanbin Jiang, S. Sapatnekar

This paper presents a new solution for combining technology mapping with placement, coupling the two into one phase. The original aspects of our work are the use of libraryless mapping and a state-space search mechanism that is used to find the best solution. Several heuristics are presented for speeding up the search. Comparisons with a more conventional approach show that these strategies provide improvements of about 20%, with reasonable CPU times, on benchmark circuits.

本文提出了一种将技术映射与布局相结合的新方案，将两者耦合到一个阶段。我们工作的原始方面是使用无库映射和用于寻找最佳解决方案的状态空间搜索机制。提出了几种提高搜索速度的启发式方法。与传统方法的比较表明，在合理的CPU时间下，这些策略在基准电路上提供了约20%的改进。

引用次数: 13

Robust optimization based backtrace method for analog circuits 基于鲁棒优化的模拟电路回溯方法

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810666

A. Gomes, A. Chatterjee

We propose a new robust approach to signal backtrace for efficiently testing embedded analog modules in a large system. The proposed signal backtrace method is formulated as a solution to a multi-point boundary value problem (BVP), with constraints on the output state and the input. This error constraint minimizes large spurious deviations in the input signal and the convergence problems that arise if multiple solutions exist or if the desired signal does not exist in the feasible signal space. As an additional attractive advantage, this formulation preserves the core iteration structure of a SPICE-like simulator without modifications, greatly easing implementation.

我们提出了一种新的鲁棒的信号回溯方法，以有效地测试大型系统中的嵌入式模拟模块。所提出的信号回溯方法是一个对输出状态和输入状态都有约束的多点边值问题的解。这种误差约束最小化了输入信号中较大的杂散偏差，以及在存在多个解或在可行信号空间中不存在所需信号时出现的收敛问题。作为一个额外的吸引人的优势，这个公式保留了类似spice的模拟器的核心迭代结构，而无需修改，大大简化了实现。

引用次数: 4

Clock skew scheduling for improved reliability via quadratic programming 通过二次规划提高可靠性的时钟偏差调度

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

Pub Date : 1999-11-07 DOI: 10.1109/ICCAD.1999.810655

I. Kourtev, E. Friedman

This paper considers the problem of determining an optimal clock skew schedule for a synchronous VLSI circuit. A novel formulation of clock skew scheduling as a constrained quadratic programming (QP) problem is introduced. The concept of a permissible range, or a valid interval, for the clock skew of each local data path is key to this QP approach. From a reliability perspective, the ideal clock schedule corresponds to each clock skew within the circuit being at the center of the respective permissible range. However, this ideal clock schedule is nor practically implementable because of limitations imposed by the connectivity among the registers within the circuit. To evaluate the reliability, a quadratic cost function is introduced as the Euclidean distance between the ideal schedule and a given practically feasible clock schedule. This cost function is the minimization objective of the described algorithms for the solution of the previously mentioned quadratic program. Furthermore, the work described here substantially differs from previous research in that it permits complete control over specific clock signal delays or skews within the circuit. Specifically, the algorithms described here can be employed to obtain results with explicitly specified target values of important clock delays/skews with a circuit, such as for example, the clock delays/skews for I/O registers. An additional benefit is a potential reduction in clock period of up to 10%. An efficient mathematical algorithm is derived for the solution of the QP problem with O(r/sup 3/) run time complexity and O(r/sup 2/) storage complexity, where r is the number of registers in the circuit. The algorithm is implemented as a C++ program and demonstrated on the ISCAS'89 suite of benchmark circuits as well as on a number of industrial circuits. The work described here yields additional insights into the correlation between circuit structure and circuit timing by characterizing the degree to which specific signal paths limit the overall performance and reliability of a circuit. This information is directly applicable to logic and architectural synthesis.

本文研究了同步VLSI电路的最佳时钟偏差调度问题。提出了一种基于约束二次规划(QP)问题的时钟倾斜调度新公式。每个本地数据路径的时钟偏差的允许范围或有效间隔的概念是这种QP方法的关键。从可靠性的角度来看，理想的时钟调度对应于电路中处于各自允许范围中心的每个时钟偏差。然而，由于电路中寄存器之间的连接所施加的限制，这种理想的时钟调度实际上是不可实现的。为了评估可靠性，引入二次代价函数作为理想调度与给定实际可行时钟调度之间的欧氏距离。这个代价函数是前面提到的二次规划的解所描述的算法的最小化目标。此外，这里描述的工作与以前的研究有很大的不同，因为它允许完全控制电路内特定的时钟信号延迟或偏差。具体来说，这里描述的算法可以用来获得具有显式指定的重要时钟延迟/偏差的目标值的结果，例如，I/O寄存器的时钟延迟/偏差。一个额外的好处是时钟周期可能减少高达10%。推导了一种求解运行时间复杂度为0 (r/sup 3/)、存储复杂度为0 (r/sup 2/)的QP问题的有效数学算法，其中r为电路中的寄存器数。该算法以c++程序的形式实现，并在ISCAS’89系列基准电路和一些工业电路上进行了验证。这里描述的工作通过描述特定信号路径限制电路整体性能和可靠性的程度，对电路结构和电路时序之间的相关性产生了额外的见解。此信息可直接应用于逻辑和架构综合。

{"title":"Clock skew scheduling for improved reliability via quadratic programming","authors":"I. Kourtev, E. Friedman","doi":"10.1109/ICCAD.1999.810655","DOIUrl":"https://doi.org/10.1109/ICCAD.1999.810655","url":null,"abstract":"This paper considers the problem of determining an optimal clock skew schedule for a synchronous VLSI circuit. A novel formulation of clock skew scheduling as a constrained quadratic programming (QP) problem is introduced. The concept of a permissible range, or a valid interval, for the clock skew of each local data path is key to this QP approach. From a reliability perspective, the ideal clock schedule corresponds to each clock skew within the circuit being at the center of the respective permissible range. However, this ideal clock schedule is nor practically implementable because of limitations imposed by the connectivity among the registers within the circuit. To evaluate the reliability, a quadratic cost function is introduced as the Euclidean distance between the ideal schedule and a given practically feasible clock schedule. This cost function is the minimization objective of the described algorithms for the solution of the previously mentioned quadratic program. Furthermore, the work described here substantially differs from previous research in that it permits complete control over specific clock signal delays or skews within the circuit. Specifically, the algorithms described here can be employed to obtain results with explicitly specified target values of important clock delays/skews with a circuit, such as for example, the clock delays/skews for I/O registers. An additional benefit is a potential reduction in clock period of up to 10%. An efficient mathematical algorithm is derived for the solution of the QP problem with O(r/sup 3/) run time complexity and O(r/sup 2/) storage complexity, where r is the number of registers in the circuit. The algorithm is implemented as a C++ program and demonstrated on the ISCAS'89 suite of benchmark circuits as well as on a number of industrial circuits. The work described here yields additional insights into the correlation between circuit structure and circuit timing by characterizing the degree to which specific signal paths limit the overall performance and reliability of a circuit. This information is directly applicable to logic and architectural synthesis.","PeriodicalId":6414,"journal":{"name":"1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)","volume":"70 1","pages":"239-243"},"PeriodicalIF":0.0,"publicationDate":"1999-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83743606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 49

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

1999 IEEE/ACM International Conference on Computer-Aided Design. Digest of Technical Papers (Cat. No.99CH37051)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀