首页 > 最新文献

2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools最新文献

英文 中文
Multiple Bit Error Detection and Correction in Memory 内存中的多比特错误检测与校正
J. F. Tarillo, Nikolaos Mavrogiannakis, C. Lisbôa, C. Argyrides, L. Carro
Technology evolution provides ever increasing density of transistors in chips, lower power consumption and higher performance. In this environment the occurrence of multiple-bit upsets (MBUs) becomes a significant concern. Critical applications need high reliability, but traditional error mitigation techniques assume only the single error model, and only a few techniques to correct MBUs at algorithm level have been proposed. In this paper, a novel circuit level technique to detect and correct multiple errors in memory is proposed. Since it is implemented at circuit level, it is transparent to programmers. This technique is based in the Decimal Hamming coding and here it is compared to Reed Solomon coding at circuit level. Experimental results show that for memory words wider than 16 bits, the proposed technique is faster and imposes lower area overhead than optimized RS, while mitigating errors affecting up to 25% of the memory word.
技术的发展使芯片中的晶体管密度不断提高,功耗更低,性能更高。在这种环境下,多比特扰流(MBUs)的发生成为一个值得关注的问题。关键应用需要高可靠性,但传统的错误缓解技术只假设了单一的错误模型,在算法层面上的错误纠正技术也很少。本文提出了一种新的电路级技术来检测和纠正存储器中的多重错误。由于它是在电路级实现的,所以对程序员来说是透明的。这种技术是基于十进制汉明编码,这里是比较里德所罗门编码在电路水平。实验结果表明,对于大于16位的存储字,该技术比优化后的RS更快,占用的面积更小,同时减少了多达25%的存储字的错误。
{"title":"Multiple Bit Error Detection and Correction in Memory","authors":"J. F. Tarillo, Nikolaos Mavrogiannakis, C. Lisbôa, C. Argyrides, L. Carro","doi":"10.1109/DSD.2010.64","DOIUrl":"https://doi.org/10.1109/DSD.2010.64","url":null,"abstract":"Technology evolution provides ever increasing density of transistors in chips, lower power consumption and higher performance. In this environment the occurrence of multiple-bit upsets (MBUs) becomes a significant concern. Critical applications need high reliability, but traditional error mitigation techniques assume only the single error model, and only a few techniques to correct MBUs at algorithm level have been proposed. In this paper, a novel circuit level technique to detect and correct multiple errors in memory is proposed. Since it is implemented at circuit level, it is transparent to programmers. This technique is based in the Decimal Hamming coding and here it is compared to Reed Solomon coding at circuit level. Experimental results show that for memory words wider than 16 bits, the proposed technique is faster and imposes lower area overhead than optimized RS, while mitigating errors affecting up to 25% of the memory word.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129041256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Common Operator for FFT and Viterbi Algorithms FFT和Viterbi算法的通用算子
Malek Naoues, Laurent Alaus, D. Noguet
In the Software Radio context, the parametrization is becoming an important topic especially when it comes to multi-standard designs. This paper capitalizes on the Common Operator technique to present a new common structure for the FFT and Viterbi algorithms. A key benefit of exhibiting common operators is the regular architecture it brings when implemented in a Common Operator Bank (COB). This regularity makes the architecture open to future function mapping and adapted to accommodated silicon technology variability through dependable design. Global complexity impact is discussed in the paper.
在软件无线电的背景下,参数化成为一个重要的话题,特别是当涉及到多标准设计时。本文利用公算子技术为FFT和Viterbi算法提出了一种新的公共结构。展示公共操作符的一个关键好处是,它在公共操作符库(COB)中实现时带来的常规体系结构。这种规律性使得架构对未来的功能映射开放,并通过可靠的设计适应硅技术的可变性。本文讨论了全局复杂性影响。
{"title":"A Common Operator for FFT and Viterbi Algorithms","authors":"Malek Naoues, Laurent Alaus, D. Noguet","doi":"10.1109/DSD.2010.80","DOIUrl":"https://doi.org/10.1109/DSD.2010.80","url":null,"abstract":"In the Software Radio context, the parametrization is becoming an important topic especially when it comes to multi-standard designs. This paper capitalizes on the Common Operator technique to present a new common structure for the FFT and Viterbi algorithms. A key benefit of exhibiting common operators is the regular architecture it brings when implemented in a Common Operator Bank (COB). This regularity makes the architecture open to future function mapping and adapted to accommodated silicon technology variability through dependable design. Global complexity impact is discussed in the paper.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"196 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116365557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Automated Power Characterization for Run-Time Power Emulation of SoC Designs SoC设计运行时功率仿真的自动功率表征
Christian Bachmann, Andreas Genser, C. Steger, R. Weiss, J. Haid
With the advent of increasingly complex systems, the use of traditional power estimation approaches is rendered infeasible due to extensive simulation times. Hardware accelerated power emulation techniques, performing power estimation as a by-product of functional emulation, are a promising solution to this problem. However, only little attention has been awarded so far to the problem of devising a generic methodology capable of automatically enabling the power emulation of a given system-under-test. In this paper, we propose an automated power characterization and modeling methodology for high level power emulation. Our methodology automatically extracts relevant model parameters from training set data and generates an according power model. Furthermore, we investigate the automation of the power model hardware implementation and the automated integration into the overall system’s HDL description. For a smart card controller test-system the automatically created power model reduces the average estimation error from 11.78% to 4.71% as compared to a manually optimized one.
随着系统的日益复杂,传统的功率估计方法由于需要大量的仿真时间而变得不可行。硬件加速功率仿真技术作为功能仿真的副产品执行功率估计,是解决这一问题的一个很有前途的解决方案。然而,到目前为止,对于设计一种能够自动启用给定被测系统的功率仿真的通用方法的问题,很少给予注意。在本文中,我们提出了一种用于高层次功率仿真的自动化功率表征和建模方法。我们的方法从训练集数据中自动提取相关的模型参数,并生成相应的功率模型。此外,我们还研究了功率模型硬件实现的自动化以及与整个系统HDL描述的自动化集成。对于智能卡控制器测试系统,与手动优化相比,自动创建的功率模型将平均估计误差从11.78%降低到4.71%。
{"title":"Automated Power Characterization for Run-Time Power Emulation of SoC Designs","authors":"Christian Bachmann, Andreas Genser, C. Steger, R. Weiss, J. Haid","doi":"10.1109/DSD.2010.38","DOIUrl":"https://doi.org/10.1109/DSD.2010.38","url":null,"abstract":"With the advent of increasingly complex systems, the use of traditional power estimation approaches is rendered infeasible due to extensive simulation times. Hardware accelerated power emulation techniques, performing power estimation as a by-product of functional emulation, are a promising solution to this problem. However, only little attention has been awarded so far to the problem of devising a generic methodology capable of automatically enabling the power emulation of a given system-under-test. In this paper, we propose an automated power characterization and modeling methodology for high level power emulation. Our methodology automatically extracts relevant model parameters from training set data and generates an according power model. Furthermore, we investigate the automation of the power model hardware implementation and the automated integration into the overall system’s HDL description. For a smart card controller test-system the automatically created power model reduces the average estimation error from 11.78% to 4.71% as compared to a manually optimized one.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116696315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Test Patterns Compression Technique Based on a Dedicated SAT-Based ATPG 基于专用sat ATPG的测试模式压缩技术
Jiri Balcarek, P. Fiser, Jan Schmidt
In this paper we propose a new method of test patterns compression based on a design of a dedicated SAT-based ATPG (Automatic Test Pattern Generator). This compression method is targeted to systems on chip (SoCs)provided with the P1500 test standard. The RESPIN architecture can be used for test patterns decompression. The main idea is based on finding the best overlap of test patterns during the test generation, unlike other methods, which are based on efficient overlapping of pre-generated test patterns. The proposed algorithm takes advantage of an implicit test representation as SAT problem instances. The results of test patterns compression obtained for standard ISCAS’85 and ‘89benchmark circuits are shown and compared with competitive test compression methods.
本文提出了一种基于专用sat的ATPG(自动测试模式发生器)的测试模式压缩新方法。这种压缩方法针对的是P1500测试标准提供的片上系统(soc)。RESPIN架构可以用于测试模式的解压缩。其主要思想是基于在测试生成过程中找到测试模式的最佳重叠,而不像其他方法,它们是基于预先生成的测试模式的有效重叠。该算法利用隐式测试表示作为SAT问题实例。给出了标准ISCAS ' 85和' 89基准电路的测试模式压缩结果,并与竞争对手的测试压缩方法进行了比较。
{"title":"Test Patterns Compression Technique Based on a Dedicated SAT-Based ATPG","authors":"Jiri Balcarek, P. Fiser, Jan Schmidt","doi":"10.1109/DSD.2010.111","DOIUrl":"https://doi.org/10.1109/DSD.2010.111","url":null,"abstract":"In this paper we propose a new method of test patterns compression based on a design of a dedicated SAT-based ATPG (Automatic Test Pattern Generator). This compression method is targeted to systems on chip (SoCs)provided with the P1500 test standard. The RESPIN architecture can be used for test patterns decompression. The main idea is based on finding the best overlap of test patterns during the test generation, unlike other methods, which are based on efficient overlapping of pre-generated test patterns. The proposed algorithm takes advantage of an implicit test representation as SAT problem instances. The results of test patterns compression obtained for standard ISCAS’85 and ‘89benchmark circuits are shown and compared with competitive test compression methods.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116830410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
An Efficient Method to Reliable Data Transmission in Network-on-Chips 片上网络中可靠数据传输的有效方法
A. Patooghy, H. Tabkhi, S. Miremadi
Data transmission in Network-on-Chips (NoCs) is a serious problem due to cross talk faults happening in adjacent communication links. This paper proposes an efficient flow-control method to enhance the reliability of packet transmission in Network-on-Chips. The method investigates the opposite direction transitions appearing between flits of a packet to reorder the flits in the packet. Flits are reordered in a fixed-size window to reduce: 1) the probability of cross talk occurrence, and 2) the total power consumed for packet delivery. The proposed flow-control method is evaluated by a VHDL-based simulator under different window sizes and various channel widths. Simulation results enable NoC designers to make a trade-off between window size, reliability and power consumption of packet delivery. This method is also compared with other cross talk tolerant methods in terms of reliability and power consumption. Comparison results confirm that the method is a cost efficient solution to overcome the cross talk problem.
片上网络(noc)中的数据传输是一个严重的问题,因为相邻通信链路会发生串扰故障。为了提高片上网络中数据包传输的可靠性,提出了一种有效的流量控制方法。该方法研究在数据包的飞动之间出现的相反方向转换,以重新排序数据包中的飞动。Flits在固定大小的窗口中重新排序,以减少:1)串扰发生的概率,以及2)数据包传输所消耗的总功率。在不同的窗口尺寸和不同的通道宽度下,通过基于vhdl的模拟器对所提出的流量控制方法进行了评估。仿真结果使NoC设计人员能够在窗口大小、可靠性和数据包传输功耗之间做出权衡。本文还从可靠性和功耗两方面对该方法进行了比较。对比结果表明,该方法是一种经济有效的解决串扰问题的方法。
{"title":"An Efficient Method to Reliable Data Transmission in Network-on-Chips","authors":"A. Patooghy, H. Tabkhi, S. Miremadi","doi":"10.1109/DSD.2010.23","DOIUrl":"https://doi.org/10.1109/DSD.2010.23","url":null,"abstract":"Data transmission in Network-on-Chips (NoCs) is a serious problem due to cross talk faults happening in adjacent communication links. This paper proposes an efficient flow-control method to enhance the reliability of packet transmission in Network-on-Chips. The method investigates the opposite direction transitions appearing between flits of a packet to reorder the flits in the packet. Flits are reordered in a fixed-size window to reduce: 1) the probability of cross talk occurrence, and 2) the total power consumed for packet delivery. The proposed flow-control method is evaluated by a VHDL-based simulator under different window sizes and various channel widths. Simulation results enable NoC designers to make a trade-off between window size, reliability and power consumption of packet delivery. This method is also compared with other cross talk tolerant methods in terms of reliability and power consumption. Comparison results confirm that the method is a cost efficient solution to overcome the cross talk problem.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115253120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low Latency Recovery from Transient Faults for Pipelined Processor Architectures 流水线处理器体系结构瞬态故障的低延迟恢复
M. Jeitler, J. Lechner
Recent technology trends have made radiation-induced soft errors a growing threat to the reliability of microprocessors, a problem previously only known to the aerospace industry. Therefore, the ability to handle higher soft error rates in modern processor architectures is essential in order to allow further technology scaling. This paper presents an efficient fault-tolerance method for pipeline-based processors using temporal redundancy. Instructions are executed twice at each pipeline stage, which allows the detection of transient faults. Once a fault is detected the execution is stopped immediately and recovery is implicitly performed within the pipeline stages. Due to this fast reaction the fault is contained at its origin and no expensive rollback operation is required later on.
最近的技术趋势使得辐射引起的软错误对微处理器的可靠性构成越来越大的威胁,这个问题以前只有航空航天工业才知道。因此,为了允许进一步的技术扩展,在现代处理器架构中处理更高的软错误率的能力是必不可少的。本文提出了一种利用时间冗余对基于流水线的处理器进行容错的方法。指令在每个管道阶段执行两次,这允许检测瞬态故障。一旦检测到故障,执行将立即停止,并在管道阶段内隐式执行恢复。由于这种快速反应,故障被控制在原点,以后不需要昂贵的回滚操作。
{"title":"Low Latency Recovery from Transient Faults for Pipelined Processor Architectures","authors":"M. Jeitler, J. Lechner","doi":"10.1109/DSD.2010.87","DOIUrl":"https://doi.org/10.1109/DSD.2010.87","url":null,"abstract":"Recent technology trends have made radiation-induced soft errors a growing threat to the reliability of microprocessors, a problem previously only known to the aerospace industry. Therefore, the ability to handle higher soft error rates in modern processor architectures is essential in order to allow further technology scaling. This paper presents an efficient fault-tolerance method for pipeline-based processors using temporal redundancy. Instructions are executed twice at each pipeline stage, which allows the detection of transient faults. Once a fault is detected the execution is stopped immediately and recovery is implicitly performed within the pipeline stages. Due to this fast reaction the fault is contained at its origin and no expensive rollback operation is required later on.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114331952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Area and Speed Oriented Implementations of Asynchronous Logic Operating under Strong Constraints 强约束下异步逻辑操作的面向面积和速度的实现
I. Lemberski, P. Fiser
Asynchronous circuit implementations operating under strong constraints (DIMS, Direct Logic, some of NCL gates, etc.) are attractive due to: 1) regularity, 2) combined implementation of the functional and completion detection logics, what simplifies the design process, 3) circuit output latency is based on the actual gate delays of the unbounded nature, 4) absence of additional synchronization chains (even of a local nature). However, the area and speed penalty is rather high. In contrast to the state-of-the-art approaches, where simple (NAND, NOR, etc.) 2 input gates are used, we propose a synthesis method based on complex nodes, i.e., nodes implementing any function of an arbitrary number of inputs. Synchronous synthesis procedures may be freely adopted for this purpose. Numerous experiments on standard benchmarks were performed and the efficiency of the proposed complex gate based method is clearly shown. DIMS and Direct Logic based asynchronous designs are considered in the paper.
在强约束下运行的异步电路实现(DIMS, Direct Logic,一些NCL门等)具有吸引力,因为:1)规律性,2)功能和完成检测逻辑的组合实现,简化了设计过程,3)电路输出延迟基于无界性质的实际门延迟,4)缺乏额外的同步链(即使是局部性质)。然而,面积和速度的惩罚是相当高的。与使用简单(NAND, NOR等)2输入门的最先进方法相反,我们提出了一种基于复杂节点的综合方法,即实现任意数量输入的任何函数的节点。为此目的可自由采用同步合成方法。在标准基准上进行了大量的实验,并清楚地表明了所提出的基于复杂门的方法的有效性。本文考虑了基于DIMS和Direct Logic的异步设计。
{"title":"Area and Speed Oriented Implementations of Asynchronous Logic Operating under Strong Constraints","authors":"I. Lemberski, P. Fiser","doi":"10.1109/DSD.2010.82","DOIUrl":"https://doi.org/10.1109/DSD.2010.82","url":null,"abstract":"Asynchronous circuit implementations operating under strong constraints (DIMS, Direct Logic, some of NCL gates, etc.) are attractive due to: 1) regularity, 2) combined implementation of the functional and completion detection logics, what simplifies the design process, 3) circuit output latency is based on the actual gate delays of the unbounded nature, 4) absence of additional synchronization chains (even of a local nature). However, the area and speed penalty is rather high. In contrast to the state-of-the-art approaches, where simple (NAND, NOR, etc.) 2 input gates are used, we propose a synthesis method based on complex nodes, i.e., nodes implementing any function of an arbitrary number of inputs. Synchronous synthesis procedures may be freely adopted for this purpose. Numerous experiments on standard benchmarks were performed and the efficiency of the proposed complex gate based method is clearly shown. DIMS and Direct Logic based asynchronous designs are considered in the paper.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127344686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Test Data and Power Reductions for Transition Delay Tests for Massive-Parallel Scan Structures 大规模并行扫描结构转换延迟测试的测试数据和功耗降低
R. Kothe, H. Vierhaus
Test technologies for integrated circuits have traditionally tried to maximise test data compression rates, because these are essential for keeping test time and costs low. However, power consumption during the test process is a problem that has been addressed on recently. Excessive power consumption may result in thermal stress and increased voltage drops within the circuit, which implies increasing signal delays. Thereby even fully-functional circuits may fail during delay testing. Therefore, in this paper a flexible concept is proposed which combines test pattern compression using a scan controller concept and reduction of power consumption during the fast capture cycles of transition delay tests. Essentially, this concept consists of a Greedy algorithm, which fills X-rich pattern with 0s or 1s step-by-step, and an event-driven logic and power consumption simulator, which calculates the costs of these steps. The implemented concept is applied to X-rich test sets of ISCAS'89, ITC'99 benchmarks and OpenSparc cores. Results show a best case with 96 percent test data reduction combined with 32 percent less peak capture power. With this concept it is also possible to reduce the peak power for shift-in, launch and shift-out cycles by over 50 percent.
集成电路的测试技术传统上一直试图最大化测试数据压缩率,因为这对于保持低测试时间和成本至关重要。然而,在测试过程中的功耗是最近已经解决的一个问题。过多的功耗可能会导致电路中的热应力和增加的电压降,这意味着增加信号延迟。因此,即使功能齐全的电路也可能在延迟测试中失败。因此,本文提出了一种灵活的概念,该概念结合了使用扫描控制器的测试模式压缩概念和在转换延迟测试的快速捕获周期中降低功耗。本质上,这个概念包括一个Greedy算法,它一步一步地用0或1填充富x模式,以及一个事件驱动的逻辑和功耗模拟器,它计算这些步骤的成本。实现的概念应用于ISCAS'89, ITC'99基准和OpenSparc核心的X-rich测试集。结果显示,在最佳情况下,测试数据减少96%,峰值捕获功率减少32%。利用这一概念,还可以将换入、启动和换出周期的峰值功率降低50%以上。
{"title":"Test Data and Power Reductions for Transition Delay Tests for Massive-Parallel Scan Structures","authors":"R. Kothe, H. Vierhaus","doi":"10.1109/DSD.2010.89","DOIUrl":"https://doi.org/10.1109/DSD.2010.89","url":null,"abstract":"Test technologies for integrated circuits have traditionally tried to maximise test data compression rates, because these are essential for keeping test time and costs low. However, power consumption during the test process is a problem that has been addressed on recently. Excessive power consumption may result in thermal stress and increased voltage drops within the circuit, which implies increasing signal delays. Thereby even fully-functional circuits may fail during delay testing. Therefore, in this paper a flexible concept is proposed which combines test pattern compression using a scan controller concept and reduction of power consumption during the fast capture cycles of transition delay tests. Essentially, this concept consists of a Greedy algorithm, which fills X-rich pattern with 0s or 1s step-by-step, and an event-driven logic and power consumption simulator, which calculates the costs of these steps. The implemented concept is applied to X-rich test sets of ISCAS'89, ITC'99 benchmarks and OpenSparc cores. Results show a best case with 96 percent test data reduction combined with 32 percent less peak capture power. With this concept it is also possible to reduce the peak power for shift-in, launch and shift-out cycles by over 50 percent.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125489557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An Improved Automotive Multiple Target Tracking System Design 一种改进的汽车多目标跟踪系统设计
T. Lange, N. Harb, Haisheng Liu, S. Niar, R. B. Atitallah
Multiple Target Tracking (MTT) algorithms are widely used in various military and civilian applications but its use in automotive safety has little been investigated. In MTT algorithms, implemented in embedded systems, it is important to use the minimum required resources to allow the entire DAS system to be integrated on the same chip (data acquisition, MTT and alarm restitution). This allows the reduction of the System on Chip (SoC) complexity and cost. This paper presents an efficient Driver Assistance System (DAS) based on MTT application. To do so, we first identified the performance bottlenecks in the application. In this application, a set of optimizations were applied to reduce the MTT algorithm’s complexity. Tuning in conjunction the hardware and the software yielded to optimize the final system and to meet the functional requirements. The result is a complete embedded MTT application running on an embedded system that fits in a contemporary medium sized FPGA device.
多目标跟踪(MTT)算法广泛应用于各种军事和民用应用,但其在汽车安全方面的应用研究却很少。在嵌入式系统中实现的MTT算法中,重要的是使用最少所需的资源,以便将整个DAS系统集成在同一芯片上(数据采集、MTT和报警恢复)。这可以降低片上系统(SoC)的复杂性和成本。提出了一种基于MTT应用的高效驾驶辅助系统。为此,我们首先确定了应用程序中的性能瓶颈。在这个应用程序中,应用了一组优化来降低MTT算法的复杂性。将硬件和软件结合在一起进行调优,以优化最终系统并满足功能要求。其结果是一个完整的嵌入式MTT应用程序运行在嵌入式系统上,适合当代中型FPGA设备。
{"title":"An Improved Automotive Multiple Target Tracking System Design","authors":"T. Lange, N. Harb, Haisheng Liu, S. Niar, R. B. Atitallah","doi":"10.1109/DSD.2010.54","DOIUrl":"https://doi.org/10.1109/DSD.2010.54","url":null,"abstract":"Multiple Target Tracking (MTT) algorithms are widely used in various military and civilian applications but its use in automotive safety has little been investigated. In MTT algorithms, implemented in embedded systems, it is important to use the minimum required resources to allow the entire DAS system to be integrated on the same chip (data acquisition, MTT and alarm restitution). This allows the reduction of the System on Chip (SoC) complexity and cost. This paper presents an efficient Driver Assistance System (DAS) based on MTT application. To do so, we first identified the performance bottlenecks in the application. In this application, a set of optimizations were applied to reduce the MTT algorithm’s complexity. Tuning in conjunction the hardware and the software yielded to optimize the final system and to meet the functional requirements. The result is a complete embedded MTT application running on an embedded system that fits in a contemporary medium sized FPGA device.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129986775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
On Scaling Speedup with Coarse-Grain Coprocessor Accelerators on Reconfigurable Platforms 可重构平台上粗粒度协处理器加速器的比例加速研究
Georgios Kornaros, Antonios Motakis
Instruction set accelerator architectures have emerged recently as light-weight hardware coprocessors, so as to transparently improve applications performance. This paper investigates the effectiveness of adding hardware accelerators as refers to scaling, based on applications that show data level parallelism such as image edge detection and fractal applications. The implementation results using reconfigurable technology show that, by utilizing a number of hardware coprocessor units, applications such as Sobel edge detection can achieve speedup more than 37Í. Finally, architectural directions based on the developed case studies show that even better performance can be achieved when the overheads of communication, of serialized data accesses, shared memory and of bus protocols are reduced.
指令集加速器架构最近作为轻量级硬件协处理器出现,从而透明地提高应用程序的性能。本文基于图像边缘检测和分形应用等显示数据级并行性的应用,研究了在缩放方面添加硬件加速器的有效性。使用可重构技术的实现结果表明,通过使用多个硬件协处理器单元,Sobel边缘检测等应用可以实现比37Í更快的速度。最后,基于已开发的案例研究的体系结构方向表明,当通信、序列化数据访问、共享内存和总线协议的开销减少时,可以实现更好的性能。
{"title":"On Scaling Speedup with Coarse-Grain Coprocessor Accelerators on Reconfigurable Platforms","authors":"Georgios Kornaros, Antonios Motakis","doi":"10.1109/DSD.2010.79","DOIUrl":"https://doi.org/10.1109/DSD.2010.79","url":null,"abstract":"Instruction set accelerator architectures have emerged recently as light-weight hardware coprocessors, so as to transparently improve applications performance. This paper investigates the effectiveness of adding hardware accelerators as refers to scaling, based on applications that show data level parallelism such as image edge detection and fractal applications. The implementation results using reconfigurable technology show that, by utilizing a number of hardware coprocessor units, applications such as Sobel edge detection can achieve speedup more than 37Í. Finally, architectural directions based on the developed case studies show that even better performance can be achieved when the overheads of communication, of serialized data accesses, shared memory and of bus protocols are reduced.","PeriodicalId":356885,"journal":{"name":"2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools","volume":"290 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132600319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1