Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors最新文献

英文中文

A 13.3ns double-precision floating-point ALU and multiplier 13.3ns双精度浮点ALU和乘法器

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528909

H. Yamada, T. Hotta, T. Nishiyama, F. Murabayashi, T. Yamauchi, H. Sawamoto

One-bit pre-shifting before alignment shift, normalization with anticipated leading '1' bit and pre-rounding techniques have been developed for a floating-point arithmetic logic unit (ALU). In addition, carry select addition and pre-rounding techniques have been developed for a floating-point multiplier. A noise tolerant precharge (NTP) circuit was designed and applied to the ALU and multiplier. These techniques reduced the delay time of the critical path by 24%. Each unit was fabricated in 0.3 /spl mu/m 2.5 V four-layer-metal CMOS technology and achieved a two-cycle latency at 150 MHz.

针对浮点算术逻辑单元(ALU)，开发了对齐移位前的1位预移位、预期前导“1”位的归一化和预舍入技术。此外，还开发了浮点乘法器的进位选择、加法和预舍入技术。设计了一种容噪预充电路，并将其应用于ALU和乘法器。这些技术使关键路径的延迟时间减少了24%。每个单元采用0.3 /spl mu/m 2.5 V四层金属CMOS技术制造，并在150 MHz下实现了两周延迟。

引用次数: 21

Emulation verification of the Motorola 68060 摩托罗拉68060的仿真验证

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528804

J. Kumar, N. Strader, J. Freeman, Michael Miller

Large-scale hardware logic emulation using software configurable hardware provides a new means to significantly improve verification of complex integrated circuits such as today's advanced microprocessors. The essence of hardware logic emulation is the provision of a hardware prototype of the circuit being designed. Such a hardware prototype can execute both pseudo-random verification vectors and software application programs up to six orders-of-magnitude faster than conventional software logic simulators. Trillions of verification vectors can be run on the emulation model for verification in only a few weeks compared to the prior best practice of running only billions of verification vectors in many months. Application of hardware logic emulation requires a sound design methodology with an HDL model (RTL or at least gate-level), an unlimited source of vectors or software applications intended to exercise the design in a target system.

采用软件可配置硬件的大规模硬件逻辑仿真为显著提高复杂集成电路(如当今先进的微处理器)的验证提供了一种新的手段。硬件逻辑仿真的本质是提供被设计电路的硬件原型。这样的硬件原型可以执行伪随机验证向量和软件应用程序，比传统的软件逻辑模拟器快6个数量级。与之前的最佳实践相比，在几个月内只运行数十亿个验证向量，在仿真模型上运行数万亿个验证向量可以在短短几周内进行验证。硬件逻辑仿真的应用需要一个具有HDL模型(RTL或至少门级)的合理设计方法，一个无限的矢量源或旨在在目标系统中执行设计的软件应用程序。

引用次数: 22

Efficient state assignment framework for asynchronous state graphs 异步状态图的高效状态分配框架

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528943

C. Ykman-Couvreur, Bill Lin

This paper presents a new efficient state assignment framework for synthesizing asynchronous state graphs. This framework operates purely at the state graph level and is applicable to a broad class of behaviors. In this paper we focus the framework for solving the complete state coding problem. This method has been automated and applied to a large set of asynchronous circuits. It achieves significant improvements in terms of both circuit area and computation time.

本文提出了一种新的用于异步状态图合成的高效状态分配框架。该框架纯粹在状态图级别上运行，适用于广泛的行为类别。本文重点研究了解决完全状态编码问题的框架。该方法已实现自动化，并应用于大量异步电路。它在电路面积和计算时间方面都取得了显著的改进。

引用次数: 4

VLSI issues in memory-system design for video signal processors 视频信号处理器存储系统设计中的VLSI问题

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528914

S. Dutta, W. Wolf, A. Wolfe

This paper addresses the design of memory-system architectures for video signal processors. The memory subsystem is the bottleneck of most video computing systems and demands a careful analysis of the design tradeoffs related to area, cycle time, and utilization. We emphasize the need to consider technological and circuit-level issues during the design of a system architecture, particularly that of a video processor, and present a method whereby the conceptual organization of the memory architecture can be evaluated before a detailed design is undertaken. Our analysis suggests that the organization of an efficient memory hierarchy for video signal processors is different from the register-cache based hierarchy of general-purpose programmable microprocessors.

本文讨论了视频信号处理器的存储系统架构设计。内存子系统是大多数视频计算系统的瓶颈，需要仔细分析与面积、周期时间和利用率相关的设计权衡。我们强调在系统架构的设计过程中需要考虑技术和电路层面的问题，特别是视频处理器的设计，并提出了一种方法，可以在进行详细设计之前评估内存架构的概念组织。我们的分析表明，视频信号处理器的高效内存层次结构的组织不同于通用可编程微处理器的基于寄存器缓存的层次结构。

引用次数: 5

A self-timed redundant-binary number to binary number converter for digital arithmetic processors 用于数字算术处理器的自定时冗余二进制数到二进制数转换器

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528838

C. Wey, Haiyan Wang, Cheng-Ping Wang

This paper presents a self-timed converter circuit which converts an n-digit redundant binary number to an (n+1)-bit binary number. Self-timed refers to the fact that the conversion is problem-dependent and requires variable conversion time to complete the operation. The propagation delay of the proposed converter circuit does not increase with the number of digits to be converted, but it is determined by the maximum number of consecutive 0's in that number. This study shows that the statistical upper bound of the average maximum number of consecutive 0's is log/sub 3/n, or 3.78 for 64-digits. This implies that the proposed self-time circuit can be approximately 17 times faster than the ripple-type converter. Thus the proposed converter is well-suited to high-speed, long-word digital arithmetic processors.

本文提出了一种将n位冗余二进制数转换为(n+1)位二进制数的自定时转换电路。自定时是指转换依赖于问题，需要可变的转换时间来完成操作。所提出的转换电路的传播延迟不随要转换的数字数的增加而增加，而是由该数字中连续0的最大数量决定。本研究表明，连续0的平均最大值的统计上界为log/sub 3/n, 64位的统计上界为3.78。这意味着所提出的自时间电路可以比纹波型变换器快大约17倍。因此，所提出的转换器非常适合于高速、长字的数字运算处理器。

引用次数: 2

Connection-oriented net model and fuzzy clustering techniques for K-way circuit partitioning 面向连接的网络模型及k路电路划分的模糊聚类技术

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528816

Jin-Tai Yan

In this paper, we firstly propose a k-way connection-oriented net model, chain net model, to generalize the cut analysis for k-way circuit partitioning and to reduce the complexity of edges for the representation of a multiple-pin net between the transformation of a hypergraph and an edge-weighted graph. Furthermore, based on the techniques of fuzzy c-means clustering, we develop and propose fuzzy c-means graph clustering to obtain k groups of fuzzy memberships for the vertices in the mapped graph according to the global information of all the net connections. Finally, by the area information of any cell in the circuit netlist, these k groups of fuzzy memberships will lead to a cut-driven or balance-driven k-way circuit partitioning. As a result, k-way circuit partitioning has been implemented for testing MCNC circuit benchmarks and the experimental results show that the proposed partitioning approach generates effective results on the partitioning cut and the partitioning balance for these benchmarks.

本文首先提出了一种面向k路连接的网络模型——链网模型，以推广k路电路划分的割分析方法，并降低了在超图和边权图的转换之间表示多针网络的边的复杂度。在此基础上，基于模糊c均值聚类技术，发展并提出了模糊c均值图聚类，根据所有网络连接的全局信息获得映射图中顶点的k组模糊隶属度。最后，根据电路网络表中任何单元的面积信息，这k组模糊隶属关系将导致切割驱动或平衡驱动的k路电路划分。最后，将k-way电路划分方法应用于MCNC电路基准测试中，实验结果表明，所提出的划分方法对这些基准的划分切割和划分平衡产生了有效的结果。

引用次数: 2

Execution-time profiling for multiple-process behavioral synthesis 多进程行为综合的执行时间分析

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528803

J. Adams, J. Miller, D. E. Thomas

This paper presents a technique for back-annotating the results of high-level synthesis into the source description to produce a timing-accurate behavioral simulation model. The resulting simulation model exhibits the same cycle-by-cycle behavior as a register-transfer level model, but can be simulated in a fraction of the time. This idea has analogies both to software profiling and to back-annotation at lower levels of hardware design. Experimental results demonstrate that the annotated behavioral simulation models run two to three orders of magnitude faster than register-transfer level simulation models, and only about an order of magnitude slower than behavioral models with no timing information.

本文提出了一种将高级综合结果反向注释到源描述中的技术，以产生时序精确的行为仿真模型。所得到的仿真模型显示出与寄存器传输级模型相同的逐周期行为，但可以在一小部分时间内进行仿真。这个想法与软件分析和较低级别的硬件设计中的反向注释都有相似之处。实验结果表明，带注释的行为仿真模型的运行速度比寄存器-迁移级仿真模型快2 ~ 3个数量级，仅比没有时序信息的行为模型慢1个数量级。

引用次数: 1

Extending equivalence class computation to large FSMs 将等价类计算扩展到大型fsm

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528819

G. Cabodi, S. Quer, P. Camurati

Computing equivalence classes for finite state machines (FSMs) has several applications to synthesis and verification problems, like state minimization, automata reduction, and logic optimization with don't cares. Symbolic traversal techniques are applicable to medium-small circuits. This paper extends their use to large FSMs by means of cofactor-based enhancements to the state-of-the-art approaches and of underestimations of equivalence classes. The key to success is pruning the search space by constraining it. Experimental results on some of the larger ISCAS'89 and MCNC circuits show its applicability.

计算有限状态机(fsm)的等价类有几个应用于综合和验证问题，如状态最小化、自动机约简和不关心的逻辑优化。符号遍历技术适用于中小型电路。本文通过对最先进的方法的基于协因子的增强和对等价类的低估，将它们的使用扩展到大型fsm。成功的关键是通过限制搜索空间来缩减搜索空间。在一些较大的ISCAS'89和MCNC电路上的实验结果表明了该方法的适用性。

引用次数: 3

Incremental methods for FSM traversal FSM遍历的增量方法

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528928

G. Swamy, R. Brayton, V. Singhal

Computing the set of reachable states of a finite state machine, is an important component of many problems in the synthesis and formal verification of digital systems. The process of design is usually iterative, and the designer may modify and recompute information many times, and reachability is called each time the designer modifies the system because current methods for reachability analysis are not incremental. Unfortunately, the representation of the reachable states that is currently used in synthesis and verification, is inherently non updatable (O. Coudert and J.C. Madre, 1990). We solve this problem by presenting alternate ways to represent the reachable set, and incremental algorithms that can update the new representation each time the designer changes the system. The incremental algorithms use the reachable set computed at a previous iteration, and information about the changes to the system to update it, rather than compute the reachable set from the beginning. This results in computational savings, as demonstrated by the results.

有限状态机可达状态集的计算是数字系统综合与形式化验证中许多问题的重要组成部分。设计过程通常是迭代的，设计师可能会多次修改和重新计算信息，由于当前的可达性分析方法不是增量的，因此每次设计师修改系统时都称为可达性。不幸的是，目前用于合成和验证的可达状态的表示本质上是不可更新的(O. Coudert和J.C. Madre, 1990)。我们通过提出可达集合的替代表示方法和增量算法来解决这个问题，这些算法可以在设计者每次更改系统时更新新的表示。增量算法使用在前一次迭代中计算的可达集，以及有关系统更改的信息来更新它，而不是从一开始就计算可达集。正如结果所示，这样可以节省计算量。

引用次数: 13

Data parallel fault simulation 数据并行故障模拟

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

Pub Date : 1995-10-02 DOI: 10.1109/ICCD.1995.528931

M. Amin, B. Vinnakota

Fault simulation is a compute intensive problem. Data parallel simulation on multiple processors is one method to reduce fault simulation time. We discuss a novel technique to partition the fault set for data parallel fault simulation. When applied statically, the technique can scale well for up to eight processors. The fault set partitioning technique is simple, can itself be parallelized, and can be implemented with extreme ease. Therefore, the technique can be used on a low cost parallel resource, such as a network of workstations.

故障仿真是一个计算密集型问题。多处理器数据并行仿真是减少故障仿真时间的一种方法。讨论了一种数据并行故障模拟中故障集划分的新方法。当静态应用时，该技术可以很好地扩展到最多八个处理器。故障集分区技术很简单，本身可以并行化，并且可以非常容易地实现。因此，该技术可用于低成本的并行资源，如工作站网络。

引用次数: 31

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀