32nd Design Automation Conference最新文献

英文中文

Optimal ILP-based Approach for Throughput Optimization Using Simultaneous Algorithm/Architecture Matching and Retiming 基于最优ilp的并行算法/架构匹配和重定时吞吐量优化方法

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217516

Y. G. DeCastelo-Vide-e-Souza, M. Potkonjak, A. C. Parker

System level design and behavior transformations have been rapidly establishing themselves as design steps with the most inuential impact on final performance metrics, throughput and latency, of a design. In this paper we develop a formal ILP-based approach for throughput and latency optimization when algorithm-architecture matching, retiming, and pipelining are considered simultaneously. The effectiveness of the approach is demonstrated on several real-life examples.

系统级设计和行为转换已经迅速成为对设计的最终性能指标、吞吐量和延迟影响最大的设计步骤。在本文中，我们开发了一种正式的基于ilp的方法，用于同时考虑算法架构匹配、重定时和流水线时的吞吐量和延迟优化。通过几个实例验证了该方法的有效性。

引用次数: 8

Partial Scan with Pre-selected Scan Signals 部分扫描与预选的扫描信号

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217528

P. Pan, C. Liu

A partial scan approach proposed recently selects scan signals without considering the availability of the ip-ops (FFs). Such an approach can greatly reduce the number of scan signals since maximum freedom is allowed in scan signal selection. To actually scan the selected signals, we, however, must make them FF-driving signals. In this paper, we study the problem of modifying and retiming a circuit to make a pre-selected set of scan signals FF-driving signals while preserving the set of cycles being broken. We present a new approach for solving this problem. Based on the new approach we design an efficient algorithm. Unlike a previous algorithm which inherently has no control over the area overhead incurred during the modification, our algorithm explicitly minimizes the area overhead. The algorithm has been implemented and encouraging results were obtained.

最近提出的部分扫描方法在不考虑ip-ops (FFs)可用性的情况下选择扫描信号。这种方法可以极大地减少扫描信号的数量，因为在扫描信号的选择上有最大的自由度。然而，要真正扫描选定的信号，我们必须使它们成为ff驱动信号。在本文中，我们研究了修改和重新定时电路的问题，使一组预先选定的扫描信号成为ff驱动信号，同时保持一组周期被打破。我们提出了解决这个问题的新方法。在此基础上设计了一种高效的算法。不像以前的算法，固有地无法控制在修改过程中产生的面积开销，我们的算法显式地最小化面积开销。该算法已经实现，并取得了令人鼓舞的结果。

引用次数: 3

Extreme Delay Sensitivity and the Worst-Case Switching Activity in VLSI Circuitsy VLSI电路的极端延迟灵敏度与最坏情况下的开关活动

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217600

F. Najm, Michael Y. Zhang

We observe that the switching activity at a circuit node, also called the transition density, can be extremely sensitive to the circuit internal delays. As a result, slight delay variations can lead to several orders of magnitude changes in the node activity. This has important implications for CAD in that, if the transition density is estimated by simulation, then minor inaccuracies in the timing models can lead to very large errors in the estimated activity. As a solution, we propose an efficient technique for estimating an upper bound on the transition density at every node. While it is not always very tight, the upper bound is robust, in the sense that it is valid irrespective of delay variations and modeling errors. We will describe the technique and present experimental results based on a prototype implementation.

我们观察到电路节点的开关活动，也称为跃迁密度，对电路内部延迟非常敏感。因此，轻微的延迟变化会导致节点活动发生几个数量级的变化。这对CAD具有重要意义，因为如果通过模拟来估计过渡密度，那么定时模型中的微小误差可能导致估计活动中的非常大的误差。作为一种解决方案，我们提出了一种有效的技术来估计每个节点转移密度的上界。虽然它并不总是非常紧密，但上界是鲁棒的，在某种意义上它是有效的，而不考虑延迟变化和建模误差。我们将描述该技术并基于原型实现呈现实验结果。

引用次数: 54

A Method for Finding Good Ashenhurst Decompositions and Its Application to FPGA Synthesis 寻优Ashenhurst分解方法及其在FPGA合成中的应用

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217507

T. Stanion, C. Sechen

In this paper, we present an algorithm for finding a good Ashenhurst decomposition of a switching function. Most current methods for performing this type of decomposition are based on the Roth-Karp algorithm. The algorithm presented here is based on finding an optimal cut in a BDD. This algorithm differs from previous decomposition algorithms in that the cut determines the size and composition of the bound set and the free set. Other methods examine all possible bound sets of an arbitrary size. We have applied this method to decomposing functions into sets of k-variable functions. This is a required step when implementing a function using a lookup table (LUT) based FPGA. The results compare very favorably to existing implementations of Roth-Karp decomposition methods.

在本文中，我们提出了一种寻找切换函数的良好Ashenhurst分解的算法。目前执行这类分解的大多数方法都是基于Roth-Karp算法。本文提出的算法是基于在BDD中找到最优切割。该算法与以往的分解算法的不同之处在于，切割决定了约束集和自由集的大小和组成。其他方法检查任意大小的所有可能的边界集。我们已经将这种方法应用于将函数分解为k变量函数的集合。当使用基于查找表(LUT)的FPGA实现函数时，这是必需的步骤。结果与现有的Roth-Karp分解方法的实现相比非常有利。

引用次数: 24

Verification of Arithmetic Circuits with Binary Moment Diagrams 用二元矩图验证算术电路

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217583

R. Bryant, Yirng-An Chen

Binary Moment Diagrams (BMDs) provide a canonical representations for linear functions similar to the way Binary Decision Diagrams (BDDs) represent Boolean functions. Within the class of linear functions, we can embed arbitrary functions from Boolean variables to integer values. BMDs can thus model the functionality of data path circuits operating over word-level data. Many important functions, including integermultiplication, that cannot be represented efficiently at the bit level with BDDs have simple representations at the word level with BMDs. Furthermore, BMDs can represent Boolean functions with around the same complexity as BDDs. We propose a hierarchical approach to verifying arithmetic circuits, where componentmodules are first shownto implement their word-level specifications. The overall circuit functionality is then verified by composing the component functions and comparing the result to the word-level circuit specification. Multipliers with word sizes of up to 256 bits have been verified by this technique.

二元矩图(bmd)提供了线性函数的规范表示，类似于二元决策图(bdd)表示布尔函数的方式。在线性函数类中，我们可以嵌入从布尔变量到整数值的任意函数。因此，bmd可以对操作字级数据的数据路径电路的功能进行建模。许多重要的函数，包括不能在位级上用bdd有效表示的积乘函数，可以在字级上用bdd简单表示。此外，bmd可以表示布尔函数，其复杂性与bdd大致相同。我们提出了一种分层方法来验证算术电路，其中组件模块首先显示实现其字级规范。然后通过组合组件功能并将结果与字级电路规格进行比较来验证整体电路功能。字长高达256位的乘法器已通过该技术得到验证。

引用次数: 357

A Design and Validation System for Asynchronous Circuits 异步电路的设计与验证系统

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217618

P. Vanbekbergen, Albert R. Wang, K. Keutzer

In this paper we present a completemethodology for the design and validation of asynchronous circuits starting from a formal specificationmodel that roughly correspondsto a timing diagram. The methodology is presented in such a way that it is easy to embed in the current methodology for synchronous circuits. The different steps of the synthesis process will just be briefly touched upon. The main part of the paper concentrates on the simulation and validation of asynchronous circuits. It discusses where the designer needs validation and how it can be done. It also explains how this process can be automated and embedded in the complete methodology.

在本文中，我们提出了异步电路设计和验证的完整方法，从一个大致对应于时序图的正式规格模型开始。该方法以一种易于嵌入到当前同步电路方法中的方式提出。合成过程的不同步骤将被简单地提及。论文的主要部分是对异步电路的仿真与验证。讨论了设计师在哪些方面需要验证，以及如何进行验证。它还解释了如何将此过程自动化并嵌入到完整的方法中。

引用次数: 15

Constrained Register Allocation in Bus Architectures 总线体系结构中的受限寄存器分配

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217525

E. Frank, S. Raje, M. Sarrafzadeh

Partitioned memory with bus interconnect architecture in its most general form consists of several functional units with associated memory accessible to the functional unit via local interconnect and global buses to communicate data values across from one functional unit to another. As can be expected, the time at which certain values are communicated affect the size of the local memories and the number of buses that are needed. We address the problem of scheduling communications in a bus architecture under memory constraints. We present here a network ow formulation for the problem and obtain an exact algorithm to schedule the communications, such that the constraint on the number of registers in each functional unit is satisfied. As an increasing number of architectures use multiple memories in addition to (or instead of) one central RAM, this work is especially interesting. Several authors have already studied this problem in related architectures, yet all use heuristic approaches to schedule the communications. Our technique is the first exact solution to the problem. Also, our graph theoretic formulation provides a clearer insight into the problem.

具有总线互连体系结构的分区内存在其最一般的形式中由几个功能单元组成，这些功能单元可以通过本地互连和全局总线访问相关的内存，以便将数据值从一个功能单元传递到另一个功能单元。正如可以预料的那样，传递某些值的时间会影响本地存储器的大小和所需总线的数量。我们解决了在内存约束下总线架构中的通信调度问题。本文给出了该问题的网络模型，并给出了一种精确的通信调度算法，使每个功能单元的寄存器数满足约束。随着越来越多的体系结构使用除了(或代替)一个中央RAM之外的多个内存，这项工作特别有趣。一些作者已经在相关的体系结构中研究了这个问题，但他们都使用启发式方法来调度通信。我们的技术是这个问题的第一个精确解决方案。此外，我们的图论公式提供了一个更清晰的洞察问题。

引用次数: 0

Spectral Partitioning: The More Eigenvectors, The Better 谱划分:特征向量越多越好

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217529

C. Alpert, So-Zen Yao

A spectral partitioning method uses the eigenvectors of a graph's adjacency or Laplacian matrix to construct a geometric representation (e.g., a linear ordering) which is then heuristically partitioned. We map each graph vertex to a vector in d-dimensional space, where d is the number of eigenvectors, such that these vectors constitute an instance of the vector partitioning problem. When all the eigenvectors are used, graph partitioning exactly reduces to vector partitioning. This result motivates a simple ordering heuristic that can be used to yield high-quality 2-way and multi-way partitionings. Our experiments suggest the vector partitioning perspective opens the door to new and effective heuristics.

谱划分方法使用图的邻接矩阵或拉普拉斯矩阵的特征向量来构造几何表示(例如，线性排序)，然后启发式划分。我们将每个图顶点映射到d维空间中的一个向量，其中d是特征向量的个数，使得这些向量构成向量划分问题的一个实例。当使用所有特征向量时，图的划分精确地简化为向量的划分。这个结果激发了一种简单的排序启发式，可用于产生高质量的双向和多路分区。我们的实验表明，向量划分视角打开了新的和有效的启发式的大门。

引用次数: 257

Symbolic Modeling and Evaluation of Data Paths 数据路径的符号建模与评估

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217560

C. Monahan, F. Brewer

We present an automata model which concisely captures the constraints imposed by a data-path, such as bus hazards, register constraints, and control encoding limitations. A set of uniform base components for depicting general data-paths and techniques for systematic translation of such depictions into Boolean functions are described. Finally, this model is expanded to represent the limitations of generating as well as moving operands by incorporating data-flow graphs. The benefits of this representation are demonstrated by modeling a commercial DSP microprocessor.

我们提出了一个自动机模型，它简明地捕获了数据路径所施加的约束，如总线危害、寄存器约束和控制编码限制。描述了一组用于描述一般数据路径的统一基本组件和将这种描述系统地转换为布尔函数的技术。最后，通过合并数据流图对该模型进行扩展，以表示生成和移动操作数的限制。通过对一个商用DSP微处理器进行建模，证明了这种表示的优点。

引用次数: 21

Transmission Line Synthesis 传输线综合

32nd Design Automation Conference

Pub Date : 1900-01-01 DOI: 10.1145/217474.217555

B. Krauter, Rohinish Gupta, J. Willis, L. Pileggi

RLC transmission line synthesis is a difficult problem because minimal net delay cannot be achieved or accurately predicted unless: 1) a termination scheme is chosen and properly implemented, and 2) output driver transition rates are constrained at or below the net's capability. This paper describes a method to concurrently find, for any RLC net, an optimal termination value, a maximum source transition rate, and an approximate net delay using the first few response moments. When optimal termination is accomplished and transition rates do not exceed the capabilities of the net, the resulting delay metrics are an interesting extension of the popular Elmore delay metric for RC interconnects. The task of physical RLC interconnect design is facilitated by the ease with which these first few moments are calculated for generalized RLC lines.

RLC传输线合成是一个难题，因为最小的网络延迟无法实现或准确预测，除非:1)选择并适当实施终止方案，2)输出驱动器转换速率被限制在或低于网络的能力。本文描述了一种利用前几个响应矩同时求任意RLC网络的最优终止值、最大源跃迁率和近似净延迟的方法。当完成最佳终止并且转换速率不超过网络的能力时，所得到的延迟度量是RC互连中流行的Elmore延迟度量的有趣扩展。物理RLC互连设计的任务是方便的，这些最初的几个矩计算广义RLC线路。

引用次数: 24

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

32nd Design Automation Conference

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀