33rd Design Automation Conference Proceedings, 1996最新文献

英文中文

Hardware emulation for functional verification of K5 K5功能验证的硬件仿真

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1145/240518.240578

G. Ganapathy, Ram Narayan, Glenn Jorden, D. Fernandez, Ming Wang, J. Nishimura

The K5 microprocessor is a 4 Million transistor superscalar, X86 microprocessor. The K5 microprocessor is an AMD original design, verifying compatibility with the existing X86 architecture and software is crucial to its success in the market place. The X86 architecture has been constantly evolving over several years without any published specification. The primary mechanism for functional design verification of an X86 processor is simulation. The ability to execute a good sample set of the X86 software base on a model of the processor architecture before tapeout is key to achieving very high confidence first silicon. The Quickturn Hardware Emulation system allows us to map a model of the design onto hardware resources and execute it at high speeds. In this paper we present the emulation methodology that was jointly developed for K5 and applied successfully to meet our functional verification goals.

K5微处理器是一个400万晶体管的超标量X86微处理器。K5微处理器是AMD的原创设计，验证与现有X86架构和软件的兼容性对其在市场上的成功至关重要。X86架构几年来一直在不断发展，没有发布任何规范。对X86处理器进行功能设计验证的主要机制是仿真。在处理器体系结构模型上执行良好的X86软件样本集的能力是实现非常高的置信度的关键。Quickturn硬件仿真系统允许我们将设计模型映射到硬件资源上，并以高速执行它。在本文中，我们提出了为K5共同开发并成功应用的仿真方法，以满足我们的功能验证目标。

引用次数: 45

A Boolean approach to performance-directed technology mapping for LUT-based FPGA designs 基于lut的FPGA设计中性能导向技术映射的布尔方法

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545669

C. Legl, B. Wurth, K. Eckl

This paper presents a novel Boolean approach to LUT-based FPGA technology mapping targeting high performance. At the core of the approach, we have developed a powerful functional decomposition algorithm. The impact of decomposition is enhanced by a preceding collapsing step. To decompose functions for small depth and area, we present an iterative, BDD-based variable partitioning procedure. The procedure optimizer the variable partition for each bound set size by iteratively exchanging variables between bound set and free set, and finally selects a good bound set size. Our decomposition algorithm extracts common subfunctions of multiple-output functions and thus further reduces area and the maximum interconnect lengths. Experimental results show that our new algorithm produces circuits with significantly smaller depths than other performance-oriented mappers. This advantage also holds for the actual delays after placement and routing.

本文提出了一种新颖的布尔方法来实现基于lut的高性能FPGA技术映射。在该方法的核心，我们开发了一个强大的函数分解算法。分解的影响通过前面的折叠步骤得到增强。为了分解小深度和小面积的函数，我们提出了一个迭代的、基于bdd的变量划分过程。该过程通过在绑定集和自由集之间迭代交换变量来优化每个绑定集大小的变量分区，最终选择一个较好的绑定集大小。我们的分解算法提取了多个输出函数的公共子函数，从而进一步减小了面积和最大互连长度。实验结果表明，我们的新算法产生的电路深度明显小于其他面向性能的映射器。这一优势也适用于放置和路由后的实际延迟。

引用次数: 39

Structural gate decomposition for depth-optimal technology mapping in LUT-based FPGA design 基于lut的FPGA设计中深度最优技术映射的结构门分解

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545668

J. Cong, Yean-Yow Hwang

In this paper, we study the problem of decomposing gates in fanin-unbounded or K-bounded networks such that the K-input LUT mapping solutions computed by a depth-optimal mapper have minimum depth. We show (1) any decomposition leads to a smaller or equal mapping depth regardless the decomposition algorithm used, and (2) the problem is NP-hard for unbounded networks when K/spl ges/3 and remains NP-hard for K-bounded networks when K/spl ges/5. We propose a gate decomposition algorithm, named DOGMA, which combines level-driven node packing technique (Chortle-d) and the network flow based optimal labeling technique (FlowMap). Experimental results show that networks decomposed by DOGMA allow depth-optimal technology mappers to improve the mapping solutions by up to 11% in depth and up to 35% in area comparing to the mapping results of networks decomposed by other existing decomposition algorithms.

在本文中，我们研究了在fanin-unbounded或K-bounded网络中分解门的问题，使得由深度最优映射器计算的k -输入LUT映射解具有最小深度。我们证明(1)无论使用哪种分解算法，任何分解都会导致更小或相等的映射深度，(2)当K/spl为3时，无界网络的问题是NP-hard，当K/spl为5时，K-有界网络的问题仍然是NP-hard。本文提出了一种门分解算法DOGMA，该算法结合了水平驱动节点打包技术(Chortle-d)和基于网络流的最优标注技术(FlowMap)。实验结果表明，与其他现有分解算法相比，DOGMA分解的网络使深度最优技术映射器的映射结果深度提高了11%，面积提高了35%。

引用次数: 29

A scalable formal verification methodology for pipelined microprocessors 流水线微处理器的可扩展形式化验证方法

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545638

J. Levitt, K. Olukotun

We describe a novel, formal verification technique for proving the correctness of a pipelined microprocessor that focuses specifically on pipeline control logic. We iteratively deconstruct a pipeline by merging adjacent pipeline stages, allowing for the verification to be done in several easier steps. We present an inductive proof methodology that verifies that pipeline behaviour is preserved as the pipeline depth is reduced via deconstruction; this inductive approach is less sensitive to pipeline depth and complexity than previous approaches. Invariants are used to simplify the proof, and datapath components are abstracted using validity checking with uninterpreted functions. We present experimental results from the formal verification of a DLX five-stage pipeline using our technique.

我们描述了一种新颖的形式化验证技术，用于证明流水线微处理器的正确性，该技术特别关注流水线控制逻辑。我们通过合并相邻的管道阶段来迭代地解构管道，从而允许在几个更简单的步骤中完成验证。我们提出了一种归纳证明方法，该方法验证了当管道深度通过解构减少时管道行为被保留;与以前的方法相比，这种归纳方法对管道深度和复杂性的敏感性较低。使用不变量来简化证明，并使用未解释函数进行有效性检查来抽象数据路径组件。我们介绍了使用我们的技术对DLX五级管道进行正式验证的实验结果。

引用次数: 32

Efficient full-wave electromagnetic analysis via model-order reduction of fast integral transforms 基于快速积分变换的模型阶降的高效全波电磁分析

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545605

Joel R. Philips, E. Chiprout, D. D. Ling

An efficient full-wave electromagnetic analysis tool would be useful in many aspects of engineering design. Development of integral-equation based tools has been hampered by the high computational complexity of dense matrix representations and difficulty in obtaining and utilizing the frequency-domain response. In this paper we demonstrate that an algorithm based on application of a novel model-order reduction scheme directly to the sparse model generated by a fast integral transform has significant advantages for frequency- and time-domain simulation.

一个高效的全波电磁分析工具将在工程设计的许多方面发挥重要作用。基于积分方程的工具的发展受到密集矩阵表示的高计算复杂度和难以获得和利用频域响应的阻碍。本文证明了将一种新的模型阶数降阶格式直接应用于由快速积分变换生成的稀疏模型的算法在频域和时域仿真方面具有显著的优势。

引用次数: 73

A parallel precorrected FFT based capacitance extraction program for signal integrity analysis 基于并行预校正FFT的电容提取程序，用于信号完整性分析

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1145/240518.240587

N. Aluru, V. Nadkarni, James White

In order to optimize interconnect to avoid signal integrity problems, very fast and accurate 3-D capacitance extraction is essential. Fast algorithms, such as the multipole or precorrected Fast Fourier Transform (FFT) accelerated methods in programs like FASTCAP, must be combined with techniques to exploit the emerging cluster-of-workstation based parallel computers like the IBM SP2. In this paper, we examine parallelizing the precorrected FFT algorithm for 3-D capacitance extraction and present several algorithms for balancing workload and reducing communication time. Results from a prototype implementation on an eight processor IBM SP2 are presented for several test examples, and the largest of these examples achieves nearly linear parallel speed-up.

为了优化互连以避免信号完整性问题，非常快速和准确的三维电容提取是必不可少的。快速算法，如多极或预校正的快速傅立叶变换(FFT)加速方法，在像FASTCAP这样的程序中，必须与利用新兴的基于工作站集群的并行计算机(如IBM SP2)的技术相结合。在本文中，我们研究了用于三维电容提取的预校正FFT算法的并行化，并提出了几种平衡工作负载和减少通信时间的算法。本文给出了在8处理器IBM SP2上的原型实现的几个测试示例的结果，其中最大的示例实现了近乎线性的并行加速。

引用次数: 25

Architectural retiming: pipelining latency-constrained circuits 架构重定时:流水线延迟约束电路

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1145/240518.240652

S. Hassoun, C. Ebeling

This paper presents a new optimization technique called architectural retiming which is able to improve the performance of many latency-constrained circuits. Architectural retiming achieves this by increasing the number of registers on the latency-constrained path while preserving the functionality and latency of the circuit. This is done using the concept of a negative register, which can be implemented using precomputation and prediction. We use the name architectural retiming since it both reschedules operations in time and modifies the structure of the circuit to preserve its functionality. We illustrate the use of architectural retiming on two realistic examples and present performance improvement results for a number of sample circuits.

本文提出了一种新的优化技术——体系结构重定时，它能够改善许多延迟受限电路的性能。架构重定时通过增加延迟受限路径上的寄存器数量，同时保留电路的功能和延迟来实现这一点。这是使用负寄存器的概念来完成的，负寄存器可以通过预计算和预测来实现。我们使用“架构重定时”这个名称，因为它既可以及时重新安排操作，又可以修改电路的结构以保持其功能。我们在两个实际示例中说明了架构重定时的使用，并给出了许多示例电路的性能改进结果。

引用次数: 50

Efficient approximation algorithms for floorplan area minimization 平面图面积最小化的有效近似算法

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545624

D. Chen, X. Hu

Approximation has been shown to be an effective method for reducing the time and space costs of solving various floorplan area minimization problems. In this paper, we present several approximation techniques for solving floorplan area minimization problems. These new techniques enable us to reduce both the time and space complexities of the previously best known approximation algorithms by more than a factor of n and n/sup 2/ for rectangular and L-shaped sub-floorplans, respectively (where n is the number of given implementations). The efficiency in the time and space complexities is critical to the applicability of such approximation techniques in floorplan area minimization algorithms. We also give a technique for enhancing the quality of approximation results.

逼近法已被证明是一种有效的减少求解各种平面面积最小化问题的时间和空间成本的方法。在本文中，我们提出了几种近似技术来解决平面面积最小化问题。这些新技术使我们能够将之前最著名的近似算法的时间和空间复杂性分别减少n和n/sup 2/以上，分别适用于矩形和l形子平面图(其中n是给定实现的数量)。在时间复杂度和空间复杂度上的效率是这种近似技术能否适用于平面面积最小化算法的关键。我们还给出了一种提高近似结果质量的技术。

引用次数: 6

Delay minimal decomposition of multiplexers in technology mapping 技术映射中多路复用器的延迟最小分解

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545582

Shashidhar Thakur, D. F. Wong, S. Krishnamoorthy

Technology mapping requires the unmapped logic network to be represented in terms of base functions, usually two-input NORs and inverters. Technology decomposition is the step that transforms arbitrary networks to this form. Typically, such decomposition schemes ignore the fact that certain circuit elements can be mapped more efficiently by treating them separately during decomposition. Multiplexers are one such category of circuit elements. They appear very naturally in circuits, in the form of datapath elements and as a result of synthesis of CASE statements in HDL specifications of control logic. Mapping them using multiplexers in technology libraries has many advantages. In this paper, we give an algorithm for optimally decomposing multiplexers, so as to minimize the delay of the network, and demonstrate its effectiveness in improving the quality of mapped circuits.

技术映射要求用基函数表示未映射的逻辑网络，通常是双输入NORs和逆变器。技术分解是将任意网络转换为这种形式的步骤。通常，这样的分解方案忽略了这样一个事实，即在分解过程中单独处理某些电路元件可以更有效地映射它们。多路复用器就是这样一类电路元件。它们很自然地出现在电路中，以数据路径元素的形式出现，并作为HDL控制逻辑规范中CASE语句的综合结果。使用技术库中的多路复用器对它们进行映射有许多优点。本文给出了一种最优分解多路复用器的算法，以使网络的延迟最小化，并证明了该算法在提高映射电路质量方面的有效性。

引用次数: 21

Extracting circuit models for large RC interconnections that are accurate up to a predefined signal frequency 提取电路模型的大型RC互连是精确到一个预定义的信号频率

33rd Design Automation Conference Proceedings, 1996

Pub Date : 1996-06-01 DOI: 10.1109/DAC.1996.545675

P. Elias, N. V. D. Meijs

This paper presents a technique that transforms large and complex RC networks into much smaller physically realizable RC networks. These models reflect the transmission behavior of the initial network accurately for frequencies up to a user-defined maximal signal frequency. This technique has been incorporated in a layout-to-circuit extractor, using a scan-line approach. The method guarantees numerical stability and performs excellently in modeling RC interconnects.

本文提出了一种将大型复杂RC网络转换成更小的物理可实现RC网络的技术。这些模型准确地反映了初始网络在用户定义的最大信号频率范围内的传输行为。该技术已被纳入一个布局电路提取器，使用扫描线的方法。该方法保证了数值稳定性，并能很好地模拟钢筋混凝土互连。

引用次数: 21

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

33rd Design Automation Conference Proceedings, 1996

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀