首页 > 最新文献

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors最新文献

英文 中文
Adaptive routing in Clos networks Clos网络中的自适应路由
P. Franaszek, C. J. Georgiou, Chung-Sheng Li
We describe a method of controlling a three-stage Clos nonblocking switch where "speculative" self-routing over the Clos fabric is augmented with reservations over a control network that connects controllers in the input and output stages of the switch. The effect is that most connections succeed over the speculative path while those subject to contention are processed over the control network. We present simulation result which indicate that the inclusion of a control network yields significant benefits under heavily nonuniform traffic conditions.
我们描述了一种控制三级Clos非阻塞交换机的方法,其中Clos结构上的“推测”自路由与连接交换机输入和输出阶段控制器的控制网络上的保留相结合。其结果是,大多数连接在推测路径上成功,而那些受争用影响的连接在控制网络上处理。我们给出的仿真结果表明,在严重不均匀的交通条件下,包含控制网络可以产生显着的效益。
{"title":"Adaptive routing in Clos networks","authors":"P. Franaszek, C. J. Georgiou, Chung-Sheng Li","doi":"10.1109/ICCD.1995.528820","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528820","url":null,"abstract":"We describe a method of controlling a three-stage Clos nonblocking switch where \"speculative\" self-routing over the Clos fabric is augmented with reservations over a control network that connects controllers in the input and output stages of the switch. The effect is that most connections succeed over the speculative path while those subject to contention are processed over the control network. We present simulation result which indicate that the inclusion of a control network yields significant benefits under heavily nonuniform traffic conditions.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127799800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Precise exception handling for a self-timed processor 自定时处理器的精确异常处理
W. Richardson, E. Brunvand
Self-timed systems structured as multiple concurrent processes and communicating through self-timed queues are a convenient way to implement decoupled computer architectures. Machines of this type can exploit instruction level parallelism in a natural way, and can be easily modified and extended. However, providing a precise exception model for a self-timed micropipelined processor can be difficult, since the processor state does not change at uniformly discrete intervals. We present a precise exception method implemented for Fred, a self-timed, decoupled, pipelined computer architecture with out-of-order instruction completion.
自定时系统结构为多个并发进程,并通过自定时队列进行通信,是实现解耦计算机体系结构的一种方便方法。这种类型的机器可以自然地利用指令级并行性,并且可以很容易地修改和扩展。然而,为自定时微流水线处理器提供精确的异常模型可能很困难,因为处理器状态不会以统一的离散间隔变化。我们提出了一个精确的异常方法,实现了弗雷德,一个自定时,解耦,流水线计算机体系结构与乱序指令完成。
{"title":"Precise exception handling for a self-timed processor","authors":"W. Richardson, E. Brunvand","doi":"10.1109/ICCD.1995.528787","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528787","url":null,"abstract":"Self-timed systems structured as multiple concurrent processes and communicating through self-timed queues are a convenient way to implement decoupled computer architectures. Machines of this type can exploit instruction level parallelism in a natural way, and can be easily modified and extended. However, providing a precise exception model for a self-timed micropipelined processor can be difficult, since the processor state does not change at uniformly discrete intervals. We present a precise exception method implemented for Fred, a self-timed, decoupled, pipelined computer architecture with out-of-order instruction completion.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132659191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A novel architecture for an ATM switch 一种新颖的ATM交换机架构
Jin Li, Chuan-lin Wu
Multicast function is essential for an ATM switch. We propose a novel architecture for an output-buffer ATM switch and a shared-buffer ATM switch to realize the multicast function in a more efficient way. In an output-buffer ATM switch, we dedicate a first-in and first-out (FIFO) shared buffer for all multicast cells to increase buffer utilization. In a shared-buffer ATM switch, we dedicate a FIFO address queue for all multicast cells to simplify the design of the control logic. Performance evaluation of the new switch is also provided. Since each multicast cell occupies only one buffer space, the proposed switch achieves a better cell-loss performance under multicast traffic loads without the need for complicated control circuitry.
多播功能是ATM交换机必不可少的功能。为了更有效地实现组播功能,我们提出了一种新的输出缓冲式ATM交换机和共享缓冲式ATM交换机结构。在输出缓冲区ATM交换机中,我们为所有多播单元指定了一个先进先出(FIFO)共享缓冲区,以提高缓冲区利用率。在共享缓冲区ATM交换机中,我们为所有多播单元指定了一个FIFO地址队列,以简化控制逻辑的设计。对新交换机进行了性能评价。由于每个组播小区只占用一个缓冲空间,因此该交换机在组播业务负载下具有较好的小区损耗性能,无需复杂的控制电路。
{"title":"A novel architecture for an ATM switch","authors":"Jin Li, Chuan-lin Wu","doi":"10.1109/ICCD.1995.528831","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528831","url":null,"abstract":"Multicast function is essential for an ATM switch. We propose a novel architecture for an output-buffer ATM switch and a shared-buffer ATM switch to realize the multicast function in a more efficient way. In an output-buffer ATM switch, we dedicate a first-in and first-out (FIFO) shared buffer for all multicast cells to increase buffer utilization. In a shared-buffer ATM switch, we dedicate a FIFO address queue for all multicast cells to simplify the design of the control logic. Performance evaluation of the new switch is also provided. Since each multicast cell occupies only one buffer space, the proposed switch achieves a better cell-loss performance under multicast traffic loads without the need for complicated control circuitry.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"293 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132745328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Memory organization for video algorithms on programmable signal processors 可编程信号处理器上视频算法的存储器组织
E. D. Greef, F. Catthoor, H. Man
In this paper, several DSP system design principles are presented which are valid for a large class of memory-intensive algorithms. Our main focus lies on the optimization of the memory and I/O, since these are dominant cost factors in the domain of video and imaging applications. This has resulted in several formalizable mapping principles, which allow to prevent the memory from becoming a bottleneck. First, it as shown that for this class of applications, compile-time data caching decisions not only have a large effect on the performance, but also can have an even larger effect on the overall system cost and power consumption. This is illustrated by means of experiments in which the whole range of no cache up to large cache sizes is scanned. Next, it is shown that when enforcing constant I/O rates to reduce buffer sizes, the area gain may be far more important than the small performance decrease associated with it. A technique to achieve this in an efficient way is proposed. The main test-vehicle which is used throughout the paper to demonstrate our approach is the class of motion estimation type algorithms.
本文提出了适用于大量内存密集型算法的DSP系统设计原则。我们主要关注内存和I/O的优化,因为它们是视频和成像应用领域的主要成本因素。这就产生了几种可形式化的映射原则,可以防止内存成为瓶颈。首先,对于这类应用程序,编译时数据缓存决策不仅对性能有很大影响,而且对总体系统成本和功耗的影响甚至更大。这是通过实验来说明的,在实验中,从无缓存到大缓存大小的整个范围被扫描。接下来,研究表明,当强制恒定I/O速率以减小缓冲区大小时,面积增益可能远比与之相关的小性能下降重要得多。提出了一种有效实现这一目标的技术。整篇论文中用来证明我们的方法的主要测试工具是运动估计类算法。
{"title":"Memory organization for video algorithms on programmable signal processors","authors":"E. D. Greef, F. Catthoor, H. Man","doi":"10.1109/ICCD.1995.528922","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528922","url":null,"abstract":"In this paper, several DSP system design principles are presented which are valid for a large class of memory-intensive algorithms. Our main focus lies on the optimization of the memory and I/O, since these are dominant cost factors in the domain of video and imaging applications. This has resulted in several formalizable mapping principles, which allow to prevent the memory from becoming a bottleneck. First, it as shown that for this class of applications, compile-time data caching decisions not only have a large effect on the performance, but also can have an even larger effect on the overall system cost and power consumption. This is illustrated by means of experiments in which the whole range of no cache up to large cache sizes is scanned. Next, it is shown that when enforcing constant I/O rates to reduce buffer sizes, the area gain may be far more important than the small performance decrease associated with it. A technique to achieve this in an efficient way is proposed. The main test-vehicle which is used throughout the paper to demonstrate our approach is the class of motion estimation type algorithms.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133398164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Performance assessment of embedded Hw/Sw systems 嵌入式软硬件系统的性能评估
J. P. Calvez, O. Pasquier
Performance assessment of embedded Hw/Sw systems built with various types of VLSI components, i.e. heterogeneous multi-processor architectures, is important to help the development of complex real-time applications. To design such a tool, two issues are to be solved, relevant information gathered simultaneously on several components without disturbing the application behavior, and the display of the performance results in a way which is easily interpreted by designers. This paper presents an interesting solution for the two above issues. We first describe what the goal for designers is and what kind of applications are concerned. Then we describe the principle of collecting an event trace and the technique to evaluate the selected performance indexes. The monitoring technique, based on a specific ASIC, is nonintrusive and allows to capture real-time event occurrences from software tasks and even from hardware functions implemented in ASICs. Each event is automatically time-stamped, collected and processed in real-time to evaluate the performance indexes selected by the designer. We also describe the display tool which clearly shows to the designer the results according to different representations. This technique and the associated real-time performance analyzer are integrated in a whole development process based on the MCSE methodology.
利用各种类型的VLSI组件(即异构多处理器架构)构建的嵌入式软硬件系统的性能评估对于帮助开发复杂的实时应用程序非常重要。要设计这样一个工具,需要解决两个问题:在不干扰应用程序行为的情况下,同时在多个组件上收集相关信息;以一种易于设计人员解释的方式显示性能结果。本文针对上述两个问题提出了一个有趣的解决方案。我们首先描述了设计人员的目标是什么以及所关注的应用程序类型。然后描述了收集事件跟踪的原理和评估所选性能指标的技术。基于特定ASIC的监控技术是非侵入式的,允许从软件任务甚至ASIC中实现的硬件功能中捕获实时事件发生。每个事件都自动打上时间戳,实时收集和处理,以评估设计者选择的性能指标。我们还描述了显示工具,可以根据不同的表示方式清晰地向设计者显示结果。该技术和相关的实时性能分析仪集成在基于MCSE方法的整个开发过程中。
{"title":"Performance assessment of embedded Hw/Sw systems","authors":"J. P. Calvez, O. Pasquier","doi":"10.1109/ICCD.1995.528790","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528790","url":null,"abstract":"Performance assessment of embedded Hw/Sw systems built with various types of VLSI components, i.e. heterogeneous multi-processor architectures, is important to help the development of complex real-time applications. To design such a tool, two issues are to be solved, relevant information gathered simultaneously on several components without disturbing the application behavior, and the display of the performance results in a way which is easily interpreted by designers. This paper presents an interesting solution for the two above issues. We first describe what the goal for designers is and what kind of applications are concerned. Then we describe the principle of collecting an event trace and the technique to evaluate the selected performance indexes. The monitoring technique, based on a specific ASIC, is nonintrusive and allows to capture real-time event occurrences from software tasks and even from hardware functions implemented in ASICs. Each event is automatically time-stamped, collected and processed in real-time to evaluate the performance indexes selected by the designer. We also describe the display tool which clearly shows to the designer the results according to different representations. This technique and the associated real-time performance analyzer are integrated in a whole development process based on the MCSE methodology.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130375269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Test generation for multiple state-table faults in finite-state machines 有限状态机中多状态表故障的测试生成
I. Pomeranz, S. Reddy
A test generation procedure to detect multiple state-table faults in finite-state machines is proposed. The importance of multiple state-table faults and their advantages as test generation objectives to avoid the need for checking experiments are considered. The proposed procedure is based on a new method for implicit enumeration of large numbers of multiple faults by using incompletely specified faulty machines. Experimental results are presented to demonstrate the effectiveness of implicit fault enumeration in detecting large numbers of multiple faults.
提出了一种检测有限状态机多状态表故障的测试生成方法。考虑了多状态表故障的重要性及其作为测试生成目标的优势,避免了检测实验的需要。该方法基于一种利用不完全指定的故障机隐式枚举大量多故障的新方法。实验结果证明了隐式故障枚举在检测大量多故障方面的有效性。
{"title":"Test generation for multiple state-table faults in finite-state machines","authors":"I. Pomeranz, S. Reddy","doi":"10.1109/ICCD.1995.528824","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528824","url":null,"abstract":"A test generation procedure to detect multiple state-table faults in finite-state machines is proposed. The importance of multiple state-table faults and their advantages as test generation objectives to avoid the need for checking experiments are considered. The proposed procedure is based on a new method for implicit enumeration of large numbers of multiple faults by using incompletely specified faulty machines. Experimental results are presented to demonstrate the effectiveness of implicit fault enumeration in detecting large numbers of multiple faults.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126346757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Testing-what's missing? An incomplete list of challenges 测试的失踪吗?一份不完整的挑战清单
S. Reddy
Summary form only given. As the testing area becomes mature, the challenges it poses shift. We describe some of these challenges and how they are addressed in recent works in various areas of testing. In recent years, the formulations of testing problems have changed from "given a problem, find a solution" to "given a problem and quality measures, find a high-quality solution". Quality guarantees in the form of lower and upper bounds and optimal solutions are derived, in addition to the more conventional demonstration of performance on benchmark circuits. Quality guarantees allow one to measure the distance between a given solution and an optimal solution, and provide criteria for evaluating a new procedure that are more effective than comparison to previously proposed procedures. We review several areas where bounds and optimal solutions have been found. Most procedures are specific to a given problem, and cannot be reused to solve other problems. In contrast, general-purpose paradigms allow a large variety of problems to be solved cost-effectively by plugging in the appropriate procedures into the same algorithm. Such paradigms allow faster program development and reuse of expertise acquired in solving other problems under the same paradigm. We describe several attempts at using existing paradigms and developing new ones, that successfully compete with special-purpose procedures. Recent works address testing issues at increasingly higher levels of the design cycle and offer an integrated treatment of design and test. High-level failure models are considered as well as solutions that are completely independent of a failure model. We describe some of these works and the advantages of the two directions. We conclude with an (incomplete) list of challenges for future research.
只提供摘要形式。随着测试领域的成熟,它所带来的挑战也发生了变化。我们描述了其中的一些挑战,以及如何在不同测试领域的近期工作中解决这些挑战。近年来,测试问题的表述从“给定问题,找到解决方案”转变为“给定问题和质量措施,找到高质量的解决方案”。除了在基准电路上进行更常规的性能演示外,还推导出了以下界和上界形式的质量保证和最优解。质量保证允许人们测量给定解决方案和最优解决方案之间的距离,并提供评估新程序的标准,该程序比以前提出的程序更有效。我们回顾了几个已经找到边界和最优解的领域。大多数过程都是特定于给定问题的,不能被重用来解决其他问题。相比之下,通用范例允许通过将适当的过程插入到相同的算法中,以经济有效的方式解决大量的问题。这样的范例允许更快的程序开发和重用在解决相同范例下的其他问题时获得的专业知识。我们描述了几种使用现有范例和开发新范例的尝试,这些范例成功地与专用程序竞争。最近的工作在设计周期的越来越高的层次上解决测试问题,并提供设计和测试的综合处理。考虑高级故障模型以及完全独立于故障模型的解决方案。我们介绍了其中的一些工作和两个方向的优势。最后,我们列出了未来研究面临的挑战(不完整)。
{"title":"Testing-what's missing? An incomplete list of challenges","authors":"S. Reddy","doi":"10.1109/ICCD.1995.528843","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528843","url":null,"abstract":"Summary form only given. As the testing area becomes mature, the challenges it poses shift. We describe some of these challenges and how they are addressed in recent works in various areas of testing. In recent years, the formulations of testing problems have changed from \"given a problem, find a solution\" to \"given a problem and quality measures, find a high-quality solution\". Quality guarantees in the form of lower and upper bounds and optimal solutions are derived, in addition to the more conventional demonstration of performance on benchmark circuits. Quality guarantees allow one to measure the distance between a given solution and an optimal solution, and provide criteria for evaluating a new procedure that are more effective than comparison to previously proposed procedures. We review several areas where bounds and optimal solutions have been found. Most procedures are specific to a given problem, and cannot be reused to solve other problems. In contrast, general-purpose paradigms allow a large variety of problems to be solved cost-effectively by plugging in the appropriate procedures into the same algorithm. Such paradigms allow faster program development and reuse of expertise acquired in solving other problems under the same paradigm. We describe several attempts at using existing paradigms and developing new ones, that successfully compete with special-purpose procedures. Recent works address testing issues at increasingly higher levels of the design cycle and offer an integrated treatment of design and test. High-level failure models are considered as well as solutions that are completely independent of a failure model. We describe some of these works and the advantages of the two directions. We conclude with an (incomplete) list of challenges for future research.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"52 27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124619820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-dimensional interleaving for time-and-memory design optimization 多维交错时间和记忆设计优化
N. Passos, E. Sha, L. Chao
This paper presents a novel optimization technique for the design of application specific integrated circuits dedicated to perform iterative or recursive time-critical sections of multi-dimensional problems, such as image processing applications. These sections are modeled as cyclic multi-dimensional data flow graphs (MDFGs). This new technique, called multi-dimensional interleaving consists of an expansion and compression of the iteration space while considering memory requirements. It guarantees that all functional elements of a circuitry can be executed simultaneously, and no additional memory queues proportional to the problem size are required. The algorithm runs in O(|E|) time, where E is the set of edges of the MDFG representing the circuit.
本文提出了一种新的优化技术,用于设计用于执行多维问题(如图像处理应用)的迭代或递归时间关键部分的特定应用集成电路。这些部分建模为循环多维数据流图(mdfg)。这种被称为多维交错的新技术包括在考虑内存需求的同时对迭代空间进行扩展和压缩。它保证电路的所有功能元件可以同时执行,并且不需要与问题大小成比例的额外内存队列。算法运行时间为O(|E|),其中E为表示电路的MDFG的边集。
{"title":"Multi-dimensional interleaving for time-and-memory design optimization","authors":"N. Passos, E. Sha, L. Chao","doi":"10.1109/ICCD.1995.528905","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528905","url":null,"abstract":"This paper presents a novel optimization technique for the design of application specific integrated circuits dedicated to perform iterative or recursive time-critical sections of multi-dimensional problems, such as image processing applications. These sections are modeled as cyclic multi-dimensional data flow graphs (MDFGs). This new technique, called multi-dimensional interleaving consists of an expansion and compression of the iteration space while considering memory requirements. It guarantees that all functional elements of a circuitry can be executed simultaneously, and no additional memory queues proportional to the problem size are required. The algorithm runs in O(|E|) time, where E is the set of edges of the MDFG representing the circuit.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129891493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Distributed automatic test pattern generation with a parallel FAN algorithm 基于并行FAN算法的分布式自动测试模式生成
Stefan Radtke, J. Bargfrede, W. Anheier
The generation of test patterns for digital circuits is known as an NP hard problem. Due to the backtracking mechanism in the sequential algorithms for test pattern generation it is difficult to speed up the process. In this paper we present a parallel formulation of the FAN algorithm implemented on a heterogeneous cluster of workstations. Two different methods are used to take into account easy- and hard-to-detect faults. We show the strategies for our parallel implementations as well as implementation details. Linear speedups are shown with the results. Furthermore we introduce a new method for test vector compaction using a genetic algorithm. This results in smaller test sets compared to traditional methods. The reader should be familiar with notations of the FAN algorithm.
数字电路测试图的生成是一个NP困难问题。由于序列算法中存在回溯机制,导致测试模式生成的速度难以加快。在本文中,我们提出了一个在异构工作站集群上实现的FAN算法的并行公式。采用两种不同的方法来考虑易检测和难检测的故障。我们将展示并行实现的策略以及实现细节。结果显示线性加速。在此基础上,提出了一种基于遗传算法的测试向量压缩方法。与传统方法相比,这可以产生更小的测试集。读者应该熟悉FAN算法的符号。
{"title":"Distributed automatic test pattern generation with a parallel FAN algorithm","authors":"Stefan Radtke, J. Bargfrede, W. Anheier","doi":"10.1109/ICCD.1995.528944","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528944","url":null,"abstract":"The generation of test patterns for digital circuits is known as an NP hard problem. Due to the backtracking mechanism in the sequential algorithms for test pattern generation it is difficult to speed up the process. In this paper we present a parallel formulation of the FAN algorithm implemented on a heterogeneous cluster of workstations. Two different methods are used to take into account easy- and hard-to-detect faults. We show the strategies for our parallel implementations as well as implementation details. Linear speedups are shown with the results. Furthermore we introduce a new method for test vector compaction using a genetic algorithm. This results in smaller test sets compared to traditional methods. The reader should be familiar with notations of the FAN algorithm.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128822374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The resource conflict methodology for early-stage design space exploration of superscalar RISC processors 超大规模RISC处理器早期设计空间探索的资源冲突方法
J. Wellman, E. Davidson
In this paper we propose a new execution trace driven simulation technique, called the Resource Conflict Methodology (RCM) for modeling and simulating computer systems early in the design cycle. By using a simplified hardware element model which allows the user to easily add or delete hardware elements in the model, RCM allows the user to readily change the machine design being investigated and to evaluate the resulting machine on a given workload. We describe the RCM model with reference to a family of superscalar processors and develop an RCM-based analysis program (called REAP) for this family of processors. Using REAP, we demonstrate the validity of our method by comparing its RCM performance estimates to those of a traditional early design stage timer model.
在本文中,我们提出了一种新的执行跟踪驱动的仿真技术,称为资源冲突方法(RCM),用于在设计周期的早期对计算机系统进行建模和仿真。通过使用简化的硬件元素模型,允许用户轻松地在模型中添加或删除硬件元素,RCM允许用户轻松地更改正在调查的机器设计,并在给定的工作负载上评估生成的机器。我们用一系列超标量处理器来描述RCM模型,并为这些处理器开发了一个基于RCM的分析程序(称为REAP)。使用REAP,我们通过比较其RCM性能估计与传统早期设计阶段定时器模型的性能估计来证明我们方法的有效性。
{"title":"The resource conflict methodology for early-stage design space exploration of superscalar RISC processors","authors":"J. Wellman, E. Davidson","doi":"10.1109/ICCD.1995.528798","DOIUrl":"https://doi.org/10.1109/ICCD.1995.528798","url":null,"abstract":"In this paper we propose a new execution trace driven simulation technique, called the Resource Conflict Methodology (RCM) for modeling and simulating computer systems early in the design cycle. By using a simplified hardware element model which allows the user to easily add or delete hardware elements in the model, RCM allows the user to readily change the machine design being investigated and to evaluate the resulting machine on a given workload. We describe the RCM model with reference to a family of superscalar processors and develop an RCM-based analysis program (called REAP) for this family of processors. Using REAP, we demonstrate the validity of our method by comparing its RCM performance estimates to those of a traditional early design stage timer model.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126581743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1