Proceedings of International Conference on Computer Aided Design最新文献

英文中文

A design for testability technique for RTL circuits using control/data flow extraction 采用控制/数据流提取的RTL电路可测试性技术设计

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-10 DOI: 10.1109/ICCAD.1996.569798

Indradeep Ghosh, A. Raghunathan, N. Jha

We present a technique for extracting functional (control/dataflow) information from register transfer level (RTL) controller/data path circuits and illustrate its use in design for hierarchical testability of these circuits. This testing procedure and design for testability (DFT) technique is general enough to handle RTL control flow intensive circuits like protocol handlers as well as data flow intensive circuits like digital filters. It makes the combined controller-data path highly testable and does not require any external behavioral information. This scheme has the advantages of low area/delay/power overheads (average of 3.2%, 0.9% and 4.1%, respectively, for benchmarks), high fault coverage (over 99% for most cases), very low test generation times (because it is independent of bit-width), and the advantage of at-speed testing. Experiments show a 2-to-4 (1-to-3) orders of magnitude test generation time advantage over an efficient gate-level sequential test generator (combinational test generator that assumes full scan).

我们提出了一种从寄存器传输级(RTL)控制器/数据路径电路中提取功能(控制/数据流)信息的技术，并说明了它在设计这些电路的分层可测试性中的应用。这个测试过程和可测试性设计(DFT)技术足够通用，可以处理RTL控制流密集型电路，如协议处理程序，以及数据流密集型电路，如数字滤波器。它使组合的控制器-数据路径高度可测试，并且不需要任何外部行为信息。该方案具有低面积/延迟/功耗开销(基准测试的平均值分别为3.2%，0.9%和4.1%)，高故障覆盖率(大多数情况下超过99%)，非常低的测试生成时间(因为它与位宽度无关)以及高速测试的优势。实验表明，与有效的门级顺序测试生成器(假设完全扫描的组合测试生成器)相比，2到4(1到3)个数量级的测试生成时间优势。

引用次数: 55

VLSI circuit partitioning by cluster-removal using iterative improvement techniques 利用迭代改进技术去除簇的VLSI电路划分

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-10 DOI: 10.1109/ICCAD.1996.569591

S. Dutt, W. Deng

Move-based iterative improvement partitioning methods such as the Fiduccia-Mattheyses (FM) algorithm and Krishnamurthy's Look-Ahead (LA) algorithm are widely used in VLSI CAD applications largely due to their time efficiency and ease of implementation. This class of algorithms is of the "local improvement" type. They generate relatively high quality results for small and medium size circuits. However, as VLSI circuits become larger, these algorithms are not so effective on them as direct partitioning tools. We propose new iterative-improvement methods that select cells to move with a view to moving clusters that straddle the two subsets of a partition into one of the subsets. The new algorithms significantly improve partition quality while preserving the advantage of time efficiency. Experimental results on 25 medium to large size ACM/SIGDA benchmark circuits show up to 70% improvement over FM in cutsize, with an average of per-circuit percent improvements of about 25%, and a total cut improvement of about 35%. They also outperform the recent placement-based partitioning tool Paraboli and the spectral partitioner MELO by about 17% and 23%, respectively, with less CPU time. This demonstrates the potential of iterative improvement algorithms in dealing with the increasing complexity of modern VLSI circuitry.

基于移动的迭代改进划分方法，如fiduccia - mattheses (FM)算法和Krishnamurthy's looking - ahead (LA)算法，主要是因为它们的时间效率和易于实现而广泛应用于VLSI CAD应用。这类算法属于“局部改进”类型。它们对中小型电路产生相对高质量的结果。然而，随着VLSI电路变得越来越大，这些算法在它们上不如直接划分工具那么有效。我们提出了新的迭代改进方法，选择要移动的单元格，以将跨越分区的两个子集的集群移动到其中一个子集。新算法在保持时间效率优势的同时，显著提高了分区质量。在25个中大型ACM/SIGDA基准电路上的实验结果表明，与FM相比，切割尺寸提高了70%，平均每个电路百分比提高约25%，总切割改进约35%。它们的性能也比最近的基于位置的分区工具抛物线和频谱分区器MELO分别高出17%和23%，而且CPU时间更少。这证明了迭代改进算法在处理日益复杂的现代VLSI电路方面的潜力。

{"title":"VLSI circuit partitioning by cluster-removal using iterative improvement techniques","authors":"S. Dutt, W. Deng","doi":"10.1109/ICCAD.1996.569591","DOIUrl":"https://doi.org/10.1109/ICCAD.1996.569591","url":null,"abstract":"Move-based iterative improvement partitioning methods such as the Fiduccia-Mattheyses (FM) algorithm and Krishnamurthy's Look-Ahead (LA) algorithm are widely used in VLSI CAD applications largely due to their time efficiency and ease of implementation. This class of algorithms is of the \"local improvement\" type. They generate relatively high quality results for small and medium size circuits. However, as VLSI circuits become larger, these algorithms are not so effective on them as direct partitioning tools. We propose new iterative-improvement methods that select cells to move with a view to moving clusters that straddle the two subsets of a partition into one of the subsets. The new algorithms significantly improve partition quality while preserving the advantage of time efficiency. Experimental results on 25 medium to large size ACM/SIGDA benchmark circuits show up to 70% improvement over FM in cutsize, with an average of per-circuit percent improvements of about 25%, and a total cut improvement of about 35%. They also outperform the recent placement-based partitioning tool Paraboli and the spectral partitioner MELO by about 17% and 23%, respectively, with less CPU time. This demonstrates the potential of iterative improvement algorithms in dealing with the increasing complexity of modern VLSI circuitry.","PeriodicalId":408850,"journal":{"name":"Proceedings of International Conference on Computer Aided Design","volume":"958 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127031169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 155

Statistical sampling and regression analysis for RT-Level power evaluation RT-Level功率评估的统计抽样与回归分析

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-10 DOI: 10.1109/ICCAD.1996.569914

Cheng-Ta Hsieh, Qing Wu, Chih-Shun Ding, Massoud Pedram

In this paper, we propose a statistical power evaluation framework at the RT-level. We first discuss the power macro-modeling formulation, and then propose a simple random sampling technique to alleviate the the overhead of macro-modeling during RTL simulation. Next, we describe a regression estimator to reduce the error of the macro-modeling approach. Experimental results indicate that the execution time of the simple random sampling combined with power macro-modeling is 50 X lower than that of conventional macro-modeling while the percentage error of regression estimation combined with power macro-modeling is 16 X lower than that of conventional macro-modeling. Hence, we provide the designer with options to either improve the accuracy or the execution time when using power macro-modeling in the context of RTL simulation.

在本文中，我们提出了一个在rt水平上的统计能力评估框架。我们首先讨论了功率宏建模公式，然后提出了一种简单的随机抽样技术，以减轻RTL仿真过程中宏建模的开销。接下来，我们描述了一个回归估计器，以减少宏观建模方法的误差。实验结果表明，简单随机抽样结合功率宏观建模的执行时间比常规宏观建模的执行时间低50倍，回归估计结合功率宏观建模的执行时间比常规宏观建模的执行时间低16倍。因此，我们为设计人员提供了在RTL仿真环境中使用功率宏建模时提高准确性或缩短执行时间的选项。

引用次数: 36

Heterogeneous built-in resiliency of application specific programmable processors 特定应用程序可编程处理器的异构内置弹性

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-01 DOI: 10.1109/ICCAD.1996.569830

Kyosun Kim, R. Karri, M. Potkonjak

Using the flexibility provided by multiple functionalities we have developed a new approach for permanent fault-tolerance: Heterogeneous Built-In-Resiliency (HBIR). HBIR processor synthesis imposes several unique tasks on the synthesis process: (i) latency determination targeting k-unit fault-tolerance, (ii) application-to-faulty-unit matching and (iii) HBIR scheduling and assignment algorithms. We address each of them and demonstrate the effectiveness of the overall approach, the synthesis algorithms, and software implementations on a number of designs.

利用多种功能提供的灵活性，我们开发了一种永久性容错的新方法:异构内置弹性(HBIR)。HBIR处理器合成在合成过程中施加了几个独特的任务:(i)针对k单元容错的延迟确定，(ii)应用程序到故障单元的匹配，以及(iii) HBIR调度和分配算法。我们解决了其中的每一个问题，并展示了整体方法的有效性，综合算法，以及在许多设计上的软件实现。

引用次数: 8

Unit delay simulation with the inversion algorithm 用反演算法进行单位时延仿真

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-01 DOI: 10.1109/ICCAD.1996.569831

William J. Schilp, P. Maurer

The Inversion Algorithm is an event driven algorithm whose performance meets or exceeds that of Levelized Compiled Code simulation, even when the activity rate is unrealistically high. Existing implementations of the Inversion Algorithm are based on the Zero Delay model. This paper extends the algorithm to more realistic timing models. The main problems discussed in this paper are avoiding scheduling conflicts, and minimizing the amount of storage space. These problems are made considerably more difficult by the deletion of NOT gates and the collapsing of various connections. These optimizations transform the simulation into a multi-delay simulation under the transport delay model. A complete solution to the scheduling problem is presented under these conditions.

反转算法是一种事件驱动算法，即使在活动率高得不切实际的情况下，其性能也能达到或超过Levelized Compiled Code模拟。现有的反演算法都是基于零延迟模型。本文将该算法扩展到更现实的时序模型。本文讨论的主要问题是避免调度冲突和最小化存储空间。由于非门的删除和各种连接的崩溃，这些问题变得相当困难。这些优化将仿真转化为传输延迟模型下的多延迟仿真。在这些条件下，给出了调度问题的完整解。

引用次数: 7

A new method to express functional permissibilities for LUT based FPGAs and its applications 基于LUT的fpga功能权限表达新方法及其应用

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-01 DOI: 10.1109/ICCAD.1996.569635

S. Yamashita, Hiroshi Sawada, A. Nagoya

This paper presents a new method to express functional permissibilities for look-up table (LUT) based field programmable gate arrays (FPGAs). The method represents functional permissibilities by using sets of pairs of functions, not by incompletely specified functions. It makes good use of the properties of LUTs such that their internal logics can be freely changed. The permissibilities expressed by the proposed method have the desired property that at many points of a network they can be simultaneously treated. Applications of the proposed method are also presented; a method to optimize networks and a method to remove connections that are obstacles at the routing step. Preliminary experimental results are given to show the effectiveness of our proposed method.

针对基于现场可编程门阵列(fpga)的查询表(LUT)，提出了一种表达功能权限的新方法。该方法通过使用函数对的集合来表示功能权限，而不是通过不完全指定的函数。它很好地利用了lut的属性，使得它们的内部逻辑可以自由地改变。所提出的方法所表示的许可具有在网络的许多点上可以同时处理的期望性质。最后介绍了该方法的应用。一种优化网络的方法和一种在路由步骤中移除障碍的连接的方法。初步的实验结果表明了该方法的有效性。

引用次数: 81

A coordinate-transformed Arnoldi algorithm for generating guaranteed stable reduced-order models of RLC circuits RLC电路保稳定降阶模型的坐标变换Arnoldi算法

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-11-01 DOI: 10.1109/ICCAD.1996.569710

L. M. Silveira, M. Kamon, I. Elfadel, Jacob K. White

Since the first papers on asymptotic waveform evaluation (AWE), Pade-based reduced order models have become standard for improving coupled circuit-interconnect simulation efficiency. Such models can be accurately computed using bi-orthogonalization algorithms like Pade via Lanczos (PVL), but the resulting Pade approximates can still be unstable even when generated from stable RLC circuits. For certain classes of RC circuits it has been shown that congruence transforms, like the Arnoldi algorithm, can generate guaranteed stable and passive reduced-order models. In this paper we present a computationally efficient model-order reduction technique, the coordinate-transformed Arnoldi algorithm, and show that this method generates arbitrarily accurate and guaranteed stable reduced-order models for RLC circuits. Examples are presented which demonstrates the enhanced stability and efficiency of the new method.

自第一批关于渐近波形评估(AWE)的论文发表以来，基于pade的降阶模型已经成为提高耦合电路互连仿真效率的标准。这样的模型可以使用像Pade via Lanczos (PVL)这样的双正交化算法精确计算，但是即使由稳定的RLC电路生成，所得到的Pade近似值仍然是不稳定的。对于某些类型的RC电路，证明了同余变换，如Arnoldi算法，可以生成保证稳定的无源降阶模型。本文提出了一种计算效率很高的模型降阶技术——坐标变换Arnoldi算法，并证明了该方法可以为RLC电路生成任意精确且保证稳定的降阶模型。算例表明，该方法提高了算法的稳定性和效率。

引用次数: 288

Clock-driven performance optimization in interactive behavioral synthesis 交互行为合成中时钟驱动的性能优化

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-04-10 DOI: 10.1109/ICCAD.1996.569538

H. Juan, D. Gajski, Viraphol Chaiyakul

In interactive behavioral synthesis, the designer can control the design process at every stage, including modifying the schedule of the design to improve its performance. In this paper, we present a methodology for performance optimization in interactive behavioral synthesis. Also proposed are several quality metrics and hints that can assist the user in utilizing the proposed methodology. When the user is optimizing the performance of the design, one important decision is the selection of a clock period. To facilitate clock selection by the user, we have developed an algorithm to estimate the effect of different clock periods on the execution time of the design. We have tested our methodology on several benchmarks. The experimental results support the proposed methodology by demonstrating an average improvement of 46.2% in design performance.

在交互行为综合中，设计师可以控制每个阶段的设计过程，包括修改设计的进度以提高其性能。在本文中，我们提出了一种交互式行为合成中性能优化的方法。还提出了一些质量指标和提示，可以帮助用户利用所建议的方法。当用户优化设计的性能时，一个重要的决定是时钟周期的选择。为了方便用户选择时钟，我们开发了一种算法来估计不同时钟周期对设计执行时间的影响。我们已经在几个基准测试中测试了我们的方法。实验结果表明，设计性能平均提高46.2%，支持所提出的方法。

引用次数: 11

Interchangeable pin routing with application to package layout 可互换引脚布线，适用于封装布局

Proceedings of International Conference on Computer Aided Design

Pub Date : 1996-04-01 DOI: 10.1109/ICCAD.1996.571349

Man-Fai Yu, J. Darnauer, W. Dai

Many practical routing problems such as BGA, PGA, pin redistribution and test fixture routing involve routing with interchangeable pins. These routing problems, especially package layout, are becoming more difficult to do manually due to increasing speed and I/O. Currently, no commercial or university router is available for this task. In this paper, we unify these different problems as instances of the interchangeable pin routing (IPR) problem, which is NP-complete. By representing the solution space with flows in a triangulated routing network instead of grids, we developed a min-cost max-flow heuristic considering only the most important cuts in the design. The heuristic handles multiple layers, prerouted nets, and all-angle, octilinear or rectilinear wiring styles. Experiments show that the heuristic is very effective on most practical examples. It had been used to route industry designs with thousands of interchangeable pins.

许多实际布线问题，如BGA, PGA，引脚重新分配和测试夹具布线涉及可互换引脚布线。由于速度和I/O的增加，这些路由问题，尤其是包布局，正变得越来越难以手动解决。目前，没有商用或大学路由器可用于此任务。在本文中，我们将这些不同的问题统一为可互换引脚布线(IPR)问题的实例，该问题是np完全的。通过用三角形路由网络而不是网格中的流来表示解空间，我们开发了一种最小成本最大流量启发式算法，仅考虑设计中最重要的切割。启发式处理多层、预路由网以及全角度、八边形或直线布线样式。实验表明，该启发式算法在大多数实际示例中都是非常有效的。它曾被用来为带有数千个可互换引脚的工业设计布线。

引用次数: 38

Timing verification of sequential domino circuits 时序多米诺电路的时序验证

Proceedings of International Conference on Computer Aided Design

Pub Date : 1900-01-01 DOI: 10.1109/iccad.1996.569418

D. V. Campenhout, T. Mudge, K. Sakallah

Two methods are presented for static timing verification of sequential circuits implemented as a mix of static and domino logic. Constraints for proper operation of domino gates are derived. An important observation is that input signals to domino gates may start changing near the end of the evaluate phase. The first method models domino gates explicitly, similar to latches. The second method treats domino gates only during pre- and post-processing steps. This method is shown to be more conservative, but easier to compute.

提出了两种方法，用于串行电路的静态定时验证，实现静态和多米诺逻辑的混合。导出了多米诺门正常运行的约束条件。一个重要的观察结果是，多米诺骨牌门的输入信号可能在评估阶段接近尾声时开始改变。第一种方法显式地模拟domino门，类似于闩锁。第二种方法仅在预处理和后处理步骤中处理domino门。这种方法被证明是更保守的，但更容易计算。

引用次数: 17

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of International Conference on Computer Aided Design

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀