Modeling switching activity using cascaded Bayesian networks for correlated input streams
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106799
S. Bhanja, N. Ranganathan
We represent switching activity in VLSI circuits using a graphical probabilistic model based on cascaded Bayesian networks (CBNs). We develop an elegant method for maintaining probabilistic consistency at the interface boundaries across the CBNs during the inference process using a tree-dependent (TD) probability distribution function. A TD distribution is an approximation of the true joint probability function over the switching variables, with the constraint that the underlying Bayesian network representation is a tree. The tree approximation of the true joint probability function can be arrived at using a maximum weight spanning tree (MWST) built from the pairwise mutual information between switchings at pairs of signal lines. Further, we also develop a TD-distribution-based method to model correlations among the primary inputs, which is critical for accuracy in Bayesian modeling of switching activity. Experimental results for ISCAS circuits are presented to illustrate the efficacy of the proposed methods.
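A minimal sketch, in Python, of the MWST step described above: edge weights are the empirical pairwise mutual information between switching variables, and a maximum-weight spanning tree is extracted with Kruskal's algorithm (the sampling setup, variable names, and union-find details are illustrative, not the paper's).

    # Sketch: build a maximum-weight spanning tree over signal lines, with edge
    # weights given by pairwise mutual information between their switching samples
    # (the construction underlying tree-dependent distributions).
    from collections import Counter
    from itertools import combinations
    from math import log

    def mutual_information(x, y):
        """Empirical mutual information between two binary switching sequences."""
        n = len(x)
        pxy, px, py = Counter(zip(x, y)), Counter(x), Counter(y)
        return sum((c / n) * log((c / n) / ((px[a] / n) * (py[b] / n)))
                   for (a, b), c in pxy.items())

    def max_weight_spanning_tree(samples):
        """samples: {line_name: [0/1 switching samples]} -> list of tree edges."""
        lines = list(samples)
        edges = sorted(((mutual_information(samples[u], samples[v]), u, v)
                        for u, v in combinations(lines, 2)), reverse=True)
        parent = {v: v for v in lines}                    # union-find forest
        def find(v):
            while parent[v] != v:
                parent[v] = parent[parent[v]]
                v = parent[v]
            return v
        tree = []
        for w, u, v in edges:                             # Kruskal, heaviest edge first
            ru, rv = find(u), find(v)
            if ru != rv:
                parent[ru] = rv
                tree.append((u, v, w))
        return tree

How the resulting tree keeps the cascaded networks consistent at their boundaries, and the TD model of primary-input correlations, follow the paper itself; the sketch covers only the MWST construction.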
{"title":"Modeling switching activity using cascaded Bayesian networks for correlated input streams","authors":"S. Bhanja, N. Ranganathan","doi":"10.1109/ICCD.2002.1106799","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106799","url":null,"abstract":"We represent switching activity in VLSI circuits using a graphical probabilistic model based on cascaded Bayesian networks (CBNs). We develop an elegant method for maintaining probabilistic consistency in the interfacing boundaries across the CBNs during the inference process using a tree-dependent (TD) probability distribution function. A tree-dependent (TD) distribution is an approximation of the true joint probability function over the switching variables, with the constraint that the underlying Bayesian network representation is a tree. The tree approximation of the true joint probability function can be arrived at using a maximum weight spanning tree (MWST) built using pairwise mutual information between switchings at two signal lines. Further we also develop a TD distribution based method to model correlations among the primary inputs which is critical for accuracy in Bayesian modeling of switching activity. Experimental results for ISCAS circuits are presented to illustrate the efficacy of the proposed methods.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121301574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Environment synthesis for compositional model checking
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106750
Hong Peng, Y. Mokhtari, S. Tahar
Modeling the environment of a design module under verification is a known practical problem in compositional verification. In this paper, we propose an approach to translate an ACTL specification into such an environment. During the translation, we construct an efficient tableau for the full range of ACTL and synthesize the tableau into a Verilog HDL behavioral-level program. The synthesized program can be used to check the properties that the system's components must guarantee. We have used the proposed environment synthesis in the compositional verification of an ATM switch fabric from Nortel Networks. Experiments show that, despite the theoretical intractability limits of compositional verification, we can still verify industry-size designs.
{"title":"Environment synthesis for compositional model checking","authors":"Hong Peng, Y. Mokhtari, S. Tahar","doi":"10.1109/ICCD.2002.1106750","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106750","url":null,"abstract":"Modeling the environment of a design module under verification is a known practical problem in compositional verification. In this paper, we propose an approach to translate an ACTL specification into such an environment. Throughout the translation, we construct an efficient tableau for the full range of ACTL and synthesize the tableau into Verilog HDL behavior level program. The synthesized program can be used to check the properties that the system's components must guarantee. We have used the proposed environment synthesis in the compositional verification of an ATM switch fabric from Nortel Networks. Experiments show that given the theoretical compositional verification intractable limit, we can still manage to verify industry size designs.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122801424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On the coverage of delay faults in scan designs with multiple scan chains
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106771
I. Pomeranz, S. Reddy
The use of multiple scan chains in a scan design reduces the test application time by reducing the number of clock cycles required for a scan-in/scan-out operation. In this work, we show that the use of multiple scan chains also increases the fault coverage achievable for delay faults, which require two-pattern tests, under the scan-shift test application scheme. Under this scheme, the first pattern of a two-pattern test is scanned in, and the second pattern is obtained by shifting the scan chain once more. We also demonstrate that the specific way in which scan flip-flops are partitioned into scan chains affects the delay fault coverage, even if the order of the flip-flops within the scan chains remains the same. To demonstrate this point, we describe a procedure that partitions scan flip-flops into scan chains so as to maximize the coverage of transition faults.
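A small Python sketch of how the second pattern arises under the scan-shift scheme (the bit ordering relative to the scan input and the value of the extra scanned-in bit are illustrative assumptions):

    # Pattern v1 is scanned in; the launch pattern v2 is whatever the chain holds
    # after one more shift: each flip-flop takes its neighbour's value and a fresh
    # bit enters at the scan input.
    def scan_shift_pair(v1, scan_in_bit):
        """v1: bits held by the chain, index 0 nearest the scan input."""
        v2 = [scan_in_bit] + v1[:-1]      # one additional shift through the chain
        return v1, v2

    v1, v2 = scan_shift_pair([1, 0, 1, 1], scan_in_bit=0)
    print(v1, v2)                         # [1, 0, 1, 1] [0, 1, 0, 1]

Because v2 is constrained to be a shifted version of v1, which flip-flops share a chain determines which vector pairs can ever be applied, which is why the partitioning into multiple chains changes the achievable delay fault coverage.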
{"title":"On the coverage of delay faults in scan designs with multiple scan chains","authors":"I. Pomeranz, S. Reddy","doi":"10.1109/ICCD.2002.1106771","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106771","url":null,"abstract":"The use of multiple scan chains for a scan design reduces the test application time by reducing the number of clock cycles required for a scan-in/scan-out operation. In this work, we show that the use of multiple scan chains also increases the fault coverage achievable for delay faults, requiring two-pattern tests, under the scan-shift test application scheme. Under this scheme, the first pattern of a two-pattern test is scanned in, and the second pattern is obtained by shifting the scan chain once more. We also demonstrate that the specific way in which scan flip-flops are partitioned into scan chains affects the delay fault coverage. This is true even if the order of the flip-flops in the scan chains remains the same. To demonstrate this point, we describe a procedure that partitions scan flip-flops into scan chains so as to maximize the coverage of transition faults.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128717559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Power analysis of bipartition and dual-encoding architecture for pipelined circuits
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106790
S. Ruan, E. Naroska, Chia-Lin Ho, F. Lai
In this paper we propose a bipartition dual-encoding architecture for low-power pipelined circuits. Pipelined circuits consist of combinational logic blocks separated by registers, which usually consume a large amount of power. Although clock gating is a promising approach to reducing the switching activity of the pipeline registers, it is restricted by the placement of the registers and by the additional control signals that must be generated. Thus, we propose a technique for optimizing the power dissipation of a pipelined circuit that addresses registers and combinational logic blocks at the same time. Our approach modifies the registers using bipartition and encoding techniques. In our experiments, power consumption was reduced on average by 72.9% for the pipeline registers and by 30.4% for the total pipeline stage.
{"title":"Power analysis of bipartition and dual-encoding architecture for pipelined circuits","authors":"S. Ruan, E. Naroska, Chia-Lin Ho, F. Lai","doi":"10.1109/ICCD.2002.1106790","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106790","url":null,"abstract":"In this paper we propose a bipartition dual-encoding architecture for low power pipelined circuit. Pipelined circuits consist of combinational logic blocks separated by registers which usually consume a large amount of power Although the clock gated technique is a promising approach to reduce switching activities of the pipelined registers, this approach is restricted by the placement of the registers and the additional control signals that must be generated. Thus, we propose a technique for optimizing power dissipation of a pipelined circuit addressing registers and combinational logic blocks at the same time. Our approach modifies the registers using bipartition and encoding techniques. In our experiments power consumption were reduced by 72.9% for pipelined registers and 30.4% for the total pipelined stage on average.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129570329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fault dictionary size reduction through test response superposition
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106817
B. Arslan, A. Orailoglu
The exceedingly large size of fault dictionaries constitutes a fundamental obstacle to their usage. We outline a new method to significantly reduce the size of fault dictionaries. The proposed method partitions the test set and stores a combined signature for each partition. The new approach aims to provide high diagnostic resolution with a small number of combined signatures. The experimental results show a considerable decrease in the storage requirements of fault dictionaries.
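A hedged Python sketch of the storage scheme this describes: one combined signature per partition of the test set, rather than one response per test (here the per-test response words are superposed with XOR; the paper's actual superposition operator and partitioning heuristic may differ).

    # Sketch: instead of storing one response word per (fault, test) pair, store one
    # combined signature per (fault, partition), shrinking the dictionary roughly by
    # the partition size while still separating most fault pairs.
    from functools import reduce

    def build_compact_dictionary(responses, partitions):
        """responses: {fault: [per-test response words]}
           partitions: list of lists of test indices."""
        return {fault: [reduce(lambda a, b: a ^ b, (words[t] for t in part), 0)
                        for part in partitions]
                for fault, words in responses.items()}

    responses = {"f1": [0b1010, 0b0001, 0b1100],
                 "f2": [0b1010, 0b0011, 0b1100]}
    partitions = [[0, 1], [2]]            # three tests folded into two partitions
    print(build_compact_dictionary(responses, partitions))
    # {'f1': [11, 12], 'f2': [9, 12]} -- f1 and f2 remain distinguishable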
{"title":"Fault dictionary size reduction through test response superposition","authors":"B. Arslan, A. Orailoglu","doi":"10.1109/ICCD.2002.1106817","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106817","url":null,"abstract":"The exceedingly large size of fault dictionaries constitutes a fundamental obstacle to their usage. We outline a new method to reduce significantly, the size of fault dictionaries. The proposed method partitions the test set and a combined signature is stored for each partition. The new approach aims to provide high diagnostic resolution with a small number of combined signatures. The experimental results show a considerable decrease in the storage requirement of fault dictionaries.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129804507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Media processing applications on the Imagine stream processor
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106785
John Douglas Owens, S. Rixner, U. Kapasi, P. Mattson, Brian Towles, B. Serebrin, W. Dally
Media applications, such as image processing, signal processing, video, and graphics, require high computation rates and data bandwidths. The stream programming model is a natural and powerful way to describe these applications. Expressing media applications in this model allows hardware and software systems to take advantage of their concurrency and locality in order to meet their high computational demands. The Imagine stream programming system, a set of software tools and algorithms, is used to program media applications in the stream programming model. We achieve real-time performance on a variety of media processing applications with high computation rates (4-15 billion achieved operations per second) and high efficiency (84-95% occupancy on the arithmetic clusters).
{"title":"Media processing applications on the Imagine stream processor","authors":"John Douglas Owens, S. Rixner, U. Kapasi, P. Mattson, Brian Towles, B. Serebrin, W. Dally","doi":"10.1109/ICCD.2002.1106785","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106785","url":null,"abstract":"Media applications, such as image processing, signal processing, video, and graphics, require high computation rates and data bandwidths. The stream programming model is a natural and powerful way to describe these applications. Expressing media applications in this model allows hardware and software systems to take advantage of their concurrency and locality in order to meet their high computational demands. The Imagine stream programming system, a set of software tools and algorithms, is used to program media applications in the stream programming model. We achieve real-time performance on a variety of media processing applications with high computation rates (4-15 billion achieved operations per second) and high efficiency (84-95% occupancy on the arithmetic clusters).","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124128500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A framework for data prefetching using off-line training of Markovian predictors
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106792
Jinwoo Kim, K. Palem, W. Wong
An important technique for alleviating the memory bottleneck is data prefetching. Data prefetching solutions have been proposed ranging from purely software approaches, which insert prefetch instructions based on program analysis, to purely hardware mechanisms. The degree of success of these techniques depends on the nature of the application. The need for innovative approaches is growing rapidly with the introduction of applications, such as object-oriented applications, that exhibit dynamically changing memory access behavior. In this paper, we propose a novel framework in which data prefetchers are trained off-line using smart learning algorithms to produce prediction models that capture hidden memory access patterns. Once built, these prediction models are loaded into a data prefetching unit in the CPU at the appropriate point during runtime to drive the prefetching. Using a table of about 8 KB, we achieved a prediction accuracy of about 68% with our proposed learning method, and performance improved by about 37% on average on the benchmarks we tested. Furthermore, we believe our proposed framework is amenable to other predictors and can be implemented as a phase of a profiling-optimizing compiler.
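A minimal Python sketch of an off-line-trained Markovian predictor of this kind (first-order, address-to-address; the table layout, training trace, and top-k policy are illustrative assumptions rather than the paper's exact model).

    # Off-line phase: learn from a profiled address trace which addresses tend to
    # follow which. On-line phase: on an access to addr, prefetch the most likely
    # successors recorded in the learned table.
    from collections import Counter, defaultdict

    def train_markov_table(trace, top_k=2):
        counts = defaultdict(Counter)
        for cur, nxt in zip(trace, trace[1:]):
            counts[cur][nxt] += 1
        return {addr: [a for a, _ in c.most_common(top_k)]
                for addr, c in counts.items()}

    def prefetch_candidates(table, addr):
        return table.get(addr, [])

    trace = [0x100, 0x180, 0x100, 0x180, 0x100, 0x240, 0x180]
    table = train_markov_table(trace)
    print(prefetch_candidates(table, 0x100))    # [384, 576], i.e. 0x180 then 0x240

In the proposed framework the equivalent table would be produced during an off-line profiling phase and loaded into the prefetching unit in the CPU at run time, rather than queried in software as in this sketch.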
{"title":"A framework for data prefetching using off-line training of Markovian predictors","authors":"Jinwoo Kim, K. Palem, W. Wong","doi":"10.1109/ICCD.2002.1106792","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106792","url":null,"abstract":"An important technique for alleviating the memory bottleneck is data prefetching. Data prefetching solutions ranging from pure software approach by inserting prefetch instructions through program analysis to purely hardware mechanisms have been proposed. The degrees of success of those techniques are dependent on the nature of the applications. The need for innovative approach is rapidly growing with the introduction of applications such as object-oriented applications that show dynamically changing memory access behavior In this paper, we propose a novel framework for the use of data prefetchers that are trained off-line using smart learning algorithms to produce prediction models which captures hidden memory access patterns. Once built, those prediction models are loaded into a data prefetching unit in the CPU at the appropriate point during the runtime to drive the prefetching. On average by using table size of about 8KB size, we were able to achieve prediction accuracy of about 68% through our own proposed learning method and performance was boosted about 37% on average on the benchmarks we tested. Furthermore, we believe our proposed framework is amenable to other predictors and can be done as a phase of the profiling-optimizing-compiler.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128153762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Low power mixed-mode BIST based on mask pattern generation using dual LFSR re-seeding
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106816
P. Rosinger, B. Al-Hashimi, N. Nicolici
Low-power design techniques have been employed for more than two decades; an emerging problem, however, is satisfying test power constraints to avoid destructive testing and improve yield. Our research addresses this problem by proposing a new method which maintains the benefits of mixed-mode built-in self-test (BIST), namely low test application time and high fault coverage, while reducing the excessive power dissipation associated with scan-based test. This is achieved by employing dual linear feedback shift register (LFSR) re-seeding and generating mask patterns to reduce the switching activity. Theoretical analysis and experimental results show that the proposed method consistently reduces the switching activity by 25% when compared to traditional approaches, at the expense of a limited increase in storage requirements.
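As a loose illustration only, a Python sketch of two ingredients the method combines: expanding test patterns from stored LFSR seeds, and the switching-activity measure (bit transitions between consecutive patterns) that the generated mask patterns are intended to reduce. The LFSR polynomial, the seeds, and how masks are derived and applied are placeholders here, not the paper's.

    # Patterns are expanded from a seed by a Fibonacci LFSR; switching activity is
    # the number of bit transitions between consecutive patterns, the quantity the
    # mask patterns are designed to cut down.
    def lfsr_patterns(seed, taps, width, count):
        state, patterns = seed, []
        for _ in range(count):
            patterns.append(state)
            fb = 0
            for t in taps:                            # XOR the tapped bits
                fb ^= (state >> t) & 1
            state = ((state << 1) | fb) & ((1 << width) - 1)
        return patterns

    def switching_activity(patterns):
        return sum(bin(a ^ b).count("1") for a, b in zip(patterns, patterns[1:]))

    pats = lfsr_patterns(seed=0b1001, taps=(3, 0), width=4, count=8)
    print(pats, switching_activity(pats))

Re-seeding amounts to reloading the seed with stored values so that deterministic patterns are reproduced; the dual-LFSR and mask-generation details follow the paper itself.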
{"title":"Low power mixed-mode BIST based on mask pattern generation using dual LFSR re-seeding","authors":"P. Rosinger, B. Al-Hashimi, N. Nicolici","doi":"10.1109/ICCD.2002.1106816","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106816","url":null,"abstract":"Low power design techniques have been employed for more than two decades, however an emerging problem is satisfying the test power constraints for avoiding destructive test and improving the yield. Our research addresses this problem by proposing a new method which maintains the benefits of mixed-mode built-in self-test (BIST) (low test application time and high fault coverage), and reduces the excessive power dissipation associated with scan-based test. This is achieved by employing dual linear feedback shift register (LFSR) re-seeding and generating mask patterns to reduce the switching activity. Theoretical analysis and experimental results show that the proposed method consistently reduces the switching activity by 25% when compared to the traditional approaches, at the expense of a limited increase in storage requirements.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"197 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132875979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automotive virtual integration platforms: why's, what's, and how's
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106796
P. Giusto, J. Brunel, A. Ferrari, E. Fourgeau, L. Lavagno, A. Sangiovanni-Vincentelli
In this paper, we present the new concept of a virtual integration platform for automotive electronics. The platform provides the basis for a novel methodology in which the integration of sub-systems is performed much earlier in the design cycle. As a result, cost reductions can be achieved both in the final implementation and in the design process. In addition, early and repeatable fault analysis can be performed, easing the task of proving system safety.
{"title":"Automotive virtual integration platforms: why's, what's, and how's","authors":"P. Giusto, J. Brunel, A. Ferrari, E. Fourgeau, L. Lavagno, A. Sangiovanni-Vincentelli","doi":"10.1109/ICCD.2002.1106796","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106796","url":null,"abstract":"In this paper, we present the new concept of virtual integration platform for automotive electronics. The platform provides the basis for a novel methodology in which the integration of sub-systems is performed much earlier in the design cycle. As a result, cost reduction in the final implementation and in the design process can be achieved. In addition, early and repeatable fault analysis can be performed therefore easing the task of system safety proving.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130933837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Physical planning of on-chip interconnect architectures
Pub Date: 2002-09-16 | DOI: 10.1109/ICCD.2002.1106743
Hongyu Chen, B. Yao, Feng Zhou, Chung-Kuan Cheng
Interconnect architecture plays an important role in determining the throughput of meshed communication structures. We assume a mesh structure with uniform communication demand. A multi-commodity flow (MCF) model is proposed to find the throughput of several different routing architectures. The experimental results reveal several trends: (1) The throughput is limited by the capacity of the middle row and column of the mesh; simply enlarging the congested channels cannot produce better throughput, while a flexible chip shape provides around 30% throughput improvement over a square chip of equal area. (2) A 45-degree mesh allows 17% throughput improvement over a 90-degree mesh, and a mixed 90-degree/45-degree mesh provides 30% improvement. (3) To achieve maximum throughput on a mixed Manhattan and diagonal interconnect architecture, the best ratio of the capacity of the diagonal routing layers to the capacity of the Manhattan routing layers is 5.6. (4) Incorporating a simplified via model, interleaving diagonal and Manhattan routing layers is the best way to organize the wiring directions across layers.
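To make trend (1) concrete, a small back-of-the-envelope Python sketch of why the middle cut caps throughput under uniform demand (the mesh size and channel capacity are made-up numbers, and the constant factors are rough; the paper's MCF model computes the actual value).

    # Under uniform all-to-all demand, every flow between the left and right halves
    # of an n x n mesh must cross the middle column cut, so the per-pair rate is
    # bounded by (cut capacity) / (number of crossing pairs); widening channels away
    # from the cut does not raise this bound, matching trend (1).
    def middle_cut_bound(n, channel_capacity):
        nodes = n * n
        half = nodes // 2                      # nodes on each side of the cut
        crossing_pairs = 2 * half * half       # ordered source-destination pairs
        cut_capacity = n * channel_capacity    # n channels cross the vertical cut
        return cut_capacity / crossing_pairs   # max uniform per-pair injection rate

    print(middle_cut_bound(n=8, channel_capacity=16.0))   # 0.0625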
{"title":"Physical planning of on-chip interconnect architectures","authors":"Hongyu Chen, B. Yao, Feng Zhou, Chung-Kuan Cheng","doi":"10.1109/ICCD.2002.1106743","DOIUrl":"https://doi.org/10.1109/ICCD.2002.1106743","url":null,"abstract":"Interconnect architecture plays an important role in determining the throughput of meshed communication structures. We assume a mesh structure with uniform communication demand for communication. A multi-commodity flow (MCF) model is proposed to find the throughput for several different routing architectures. The experimental results reveal several trends: 1. The throughput is limited by the capacity of the middle row and column in the mesh, simply enlarging the congested channel cannot produce better throughput. A flexible chip shape provides around 30% throughput improvement over a square chip of equal area. 2. A 45-degree mesh allows 17% throughput improvement over 90-degree mesh and a 90-degree and 45-degree mixed mesh provides 30% throughput improvement. 3. To achieve maximum throughput on a mixed Manhattan and diagonal interconnect architecture, the best ratio of the capacity for diagonal routing layers and the capacity for Manhattan routing layers is 5.6. 4. Incorporating a simplified via model, interleaving diagonal routing layers and Manhattan routing layer is the best way to organize the wiring directions on different layers.","PeriodicalId":164768,"journal":{"name":"Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124359926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}