首页 > 最新文献

2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)最新文献

英文 中文
Exploration of system availability during software-based self-testing in many-core systems under test latency constraints 在测试延迟约束下的多核系统中基于软件的自测试期间系统可用性的探索
M. Skitsas, C. Nicopoulos, M. Michael
As technology scales, the increased vulnerability of modern systems due to unreliable components becomes a major problem in the era of multi-/many-core architectures. Recently, several on-line testing techniques have been proposed, aiming towards error detection of wear-out/aging-related defects that can appear during the lifetime of a system. In this work, we investigate the relation between system test latency and testtime overhead in multi-/many-core systems with shared LastLevel Cache (LLC) for periodic Software-Based Self-Testing (SBST), under different test scheduling policies. The investigated scheduling policies primarily vary the number of cores concurrently under test in the overall system testing session. Our extensive, workload-driven dynamic exploration reveals that there is an inverse relation between the two test measures; as the number of cores concurrently under test increases, system test latency decreases, but at the cost of significantly increased test time, which sacrifices system availability for running normal workloads. Under given system test latency constraints, which should be utilized in order to be able to control system recovery time in the event of an error detection, our exploration framework identifies the scheduling policy under which overall test time overhead is minimized and, hence, system availability is maximized. Without any loss of generality, a 16-core system is explored in a full-system, execution-driven simulation framework running multi-threaded PARSEC workloads [1].
随着技术的发展,由于不可靠的组件而增加的现代系统的脆弱性成为多核/多核架构时代的一个主要问题。最近,人们提出了几种在线测试技术,旨在对系统生命周期中可能出现的磨损/老化相关缺陷进行错误检测。在这项工作中,我们研究了在不同的测试调度策略下,在具有共享LastLevel Cache (LLC)的多核/多核系统中用于周期性基于软件的自测(SBST)的系统测试延迟和测试时间开销之间的关系。所研究的调度策略主要是改变整个系统测试会话中并发测试的核心数量。我们广泛的,工作负载驱动的动态探索揭示了两个测试度量之间的反比关系;随着测试中的并发核数的增加,系统测试延迟减少,但代价是显著增加了测试时间,从而牺牲了运行正常工作负载的系统可用性。在给定的系统测试延迟约束下(应该利用这些约束来控制发生错误检测时的系统恢复时间),我们的探索框架确定了调度策略,在该策略下,总体测试时间开销最小,因此系统可用性最大化。在没有任何一般性损失的情况下,我们在一个运行多线程PARSEC工作负载的全系统、执行驱动的仿真框架中探索了一个16核系统[1]。
{"title":"Exploration of system availability during software-based self-testing in many-core systems under test latency constraints","authors":"M. Skitsas, C. Nicopoulos, M. Michael","doi":"10.1109/DFT.2014.6962088","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962088","url":null,"abstract":"As technology scales, the increased vulnerability of modern systems due to unreliable components becomes a major problem in the era of multi-/many-core architectures. Recently, several on-line testing techniques have been proposed, aiming towards error detection of wear-out/aging-related defects that can appear during the lifetime of a system. In this work, we investigate the relation between system test latency and testtime overhead in multi-/many-core systems with shared LastLevel Cache (LLC) for periodic Software-Based Self-Testing (SBST), under different test scheduling policies. The investigated scheduling policies primarily vary the number of cores concurrently under test in the overall system testing session. Our extensive, workload-driven dynamic exploration reveals that there is an inverse relation between the two test measures; as the number of cores concurrently under test increases, system test latency decreases, but at the cost of significantly increased test time, which sacrifices system availability for running normal workloads. Under given system test latency constraints, which should be utilized in order to be able to control system recovery time in the event of an error detection, our exploration framework identifies the scheduling policy under which overall test time overhead is minimized and, hence, system availability is maximized. Without any loss of generality, a 16-core system is explored in a full-system, execution-driven simulation framework running multi-threaded PARSEC workloads [1].","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115647409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Unifying scan compression 统一扫描压缩
Swapnil Bahl, Shrey Rungta, Shray Khullar, R. Kapur, A. Chandra, S. Talluto, Pramod Notiyath, Ajay Rajagopalan
STMicroelectronics has been using scan compression for many years. With the vast variety of designs and the size of the company it is important to deploy an easy to use solution that works for all the conditions. Today we support many different compression schemes DFTMAX, DFTMAX Xtol, Serializer. Each of these solutions is strong in a segment of the designs. DFTMAX Ultra has a technology that provides a single solution for all needs. In this paper we discuss the variety of design scenarios seen in ST from the point of scan compression. Results of DFTMAX Ultra are then presented to show that it is a viable unified solution.
意法半导体多年来一直在使用扫描压缩技术。由于设计的多样性和公司的规模,重要的是部署一个易于使用的解决方案,适用于所有条件。今天我们支持许多不同的压缩方案DFTMAX, DFTMAX Xtol, Serializer。这些解决方案在设计的某个部分都很强大。DFTMAX Ultra拥有一种技术,可为所有需求提供单一解决方案。在本文中,我们从扫描压缩的角度讨论了在ST中看到的各种设计方案。然后给出了DFTMAX Ultra的结果,表明它是一种可行的统一解决方案。
{"title":"Unifying scan compression","authors":"Swapnil Bahl, Shrey Rungta, Shray Khullar, R. Kapur, A. Chandra, S. Talluto, Pramod Notiyath, Ajay Rajagopalan","doi":"10.1109/DFT.2014.6962079","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962079","url":null,"abstract":"STMicroelectronics has been using scan compression for many years. With the vast variety of designs and the size of the company it is important to deploy an easy to use solution that works for all the conditions. Today we support many different compression schemes DFTMAX, DFTMAX Xtol, Serializer. Each of these solutions is strong in a segment of the designs. DFTMAX Ultra has a technology that provides a single solution for all needs. In this paper we discuss the variety of design scenarios seen in ST from the point of scan compression. Results of DFTMAX Ultra are then presented to show that it is a viable unified solution.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126740937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
SAM: A comprehensive mechanism for accessing embedded sensors in modern SoCs 在现代soc中访问嵌入式传感器的综合机制
M. He, M. Tehranipoor
With technology scaling, the number of sensors integrated into modern system-on-chip (SoC) designs has increased greatly over the past several years. These sensors must be accessed for a number of reasons (test, configuration, calibration, etc.). This paper proposes a novel sensor access mechanism (SAM) to address sensor access in various modes, including manufacturing test mode, functional mode, built-in self-test (BIST) mode, silicon validation mode, and calibration mode. Moreover, SAM standardizes the testing and measurement of embedded sensors by providing easy and effective access to sensors distributed across the SoC. Further, SAM does not introduce a new pin, making it JTAG compatible and practice-oriented for easy industrial adoption. Various simulation results, collected by integrating SAM into several benchmarks demonstrate its high performance and low overhead.
随着技术的扩展,集成到现代片上系统(SoC)设计中的传感器数量在过去几年中大大增加。由于多种原因(测试、配置、校准等),必须访问这些传感器。本文提出了一种新的传感器访问机制(SAM),以解决传感器在制造测试模式、功能模式、内置自检(BIST)模式、硅验证模式和校准模式下的访问问题。此外,SAM通过提供对分布在SoC上的传感器的简单有效访问,标准化了嵌入式传感器的测试和测量。此外,SAM没有引入新的引脚,使其与JTAG兼容,并且面向实践,易于工业采用。通过将SAM集成到几个基准测试中收集的各种模拟结果证明了它的高性能和低开销。
{"title":"SAM: A comprehensive mechanism for accessing embedded sensors in modern SoCs","authors":"M. He, M. Tehranipoor","doi":"10.1109/DFT.2014.6962097","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962097","url":null,"abstract":"With technology scaling, the number of sensors integrated into modern system-on-chip (SoC) designs has increased greatly over the past several years. These sensors must be accessed for a number of reasons (test, configuration, calibration, etc.). This paper proposes a novel sensor access mechanism (SAM) to address sensor access in various modes, including manufacturing test mode, functional mode, built-in self-test (BIST) mode, silicon validation mode, and calibration mode. Moreover, SAM standardizes the testing and measurement of embedded sensors by providing easy and effective access to sensors distributed across the SoC. Further, SAM does not introduce a new pin, making it JTAG compatible and practice-oriented for easy industrial adoption. Various simulation results, collected by integrating SAM into several benchmarks demonstrate its high performance and low overhead.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114855056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
CSST: Preventing distribution of unlicensed and rejected ICs by untrusted foundry and assembly CSST:防止由不可信的代工和组装分发未经许可和被拒绝的ic
Md. Tauhidur Rahman, Domenic Forte, Quihang Shi, Gustavo K. Contreras, M. Tehranipoor
The globalization of the semiconductor design and fabrication industry (also known as the horizontal business model) has led to many well-documented issues associated with untrusted foundries and assemblies, including IC overproduction, cloning, and the shipping of improperly or insufficiently tested chips. Besides the loss in profits to Intellectual Property (IP) owners, such chips entering the supply chain can have catastrophic consequences for critical applications. We propose a new Secure Split-Test (SST) scheme called the Connecticut SST (CSST) in which the IP owner takes full control over testing. In CSST, each chip and its scan chains are locked during testing, and only the IP owner can interpret the locked test results and unlock passing chips. The new SST can prevent overproduced, defective, and cloned chips from reaching the supply chain. The proposed method considerably simplifies the communication required between the foundry/assembly and the IP owner compared to the original version of the SST. The results demonstrate that our new technique is more secure than the original and has lower communication overheads.
半导体设计和制造行业的全球化(也被称为横向商业模式)导致了许多与不可信的代工厂和组装相关的问题,包括IC生产过剩、克隆以及不适当或测试不足的芯片的运输。除了知识产权(IP)所有者的利润损失外,此类芯片进入供应链可能会对关键应用产生灾难性后果。我们提出了一种新的安全分离测试(SST)方案,称为康涅狄格SST (CSST),其中IP所有者完全控制测试。在CSST中,每个芯片及其扫描链在测试过程中被锁定,只有IP所有者才能解释锁定的测试结果并解锁通过的芯片。新的SST可以防止过度生产,缺陷和克隆芯片到达供应链。与原始版本的SST相比,所提出的方法大大简化了铸造厂/组装厂和IP所有者之间所需的通信。结果表明,我们的新技术比原来的更安全,并且具有更低的通信开销。
{"title":"CSST: Preventing distribution of unlicensed and rejected ICs by untrusted foundry and assembly","authors":"Md. Tauhidur Rahman, Domenic Forte, Quihang Shi, Gustavo K. Contreras, M. Tehranipoor","doi":"10.1109/DFT.2014.6962096","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962096","url":null,"abstract":"The globalization of the semiconductor design and fabrication industry (also known as the horizontal business model) has led to many well-documented issues associated with untrusted foundries and assemblies, including IC overproduction, cloning, and the shipping of improperly or insufficiently tested chips. Besides the loss in profits to Intellectual Property (IP) owners, such chips entering the supply chain can have catastrophic consequences for critical applications. We propose a new Secure Split-Test (SST) scheme called the Connecticut SST (CSST) in which the IP owner takes full control over testing. In CSST, each chip and its scan chains are locked during testing, and only the IP owner can interpret the locked test results and unlock passing chips. The new SST can prevent overproduced, defective, and cloned chips from reaching the supply chain. The proposed method considerably simplifies the communication required between the foundry/assembly and the IP owner compared to the original version of the SST. The results demonstrate that our new technique is more secure than the original and has lower communication overheads.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115858001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Reliability estimation at block-level granularity of spin-transfer-torque MRAMs 基于块级粒度的自旋-传递-扭矩mram可靠性估计
S. Carlo, Marco Indaco, P. Prinetto, E. Vatajelu, R. Rodríguez-Montañés, J. Figueras
In recent years, the Spin-Transfer-Torque Magnetic Random Access Memory (STT-MRAM) has emerged as a promising choice for embedded memories due to its reduced read/write latency and high CMOS integration capability. Under today aggressive technology scaling requirements, the STT-MRAM is affected by process variability and aging phenomena, making reliability prediction a growing concern. In this paper, we provide a methodology for predicting the reliability of an STT-MRAM based memory at block level for different block sizes and access rates. The proposed methodology also allows for an exploration of required error correction capabilities as function of code word size to achieve the desired reliability target for the memory under study.
近年来,自旋-转移-扭矩磁随机存取存储器(STT-MRAM)由于其减少的读/写延迟和高CMOS集成能力,已成为嵌入式存储器的一个有前途的选择。在当今激进的技术规模要求下,STT-MRAM受到工艺变异性和老化现象的影响,使得可靠性预测日益受到关注。在本文中,我们提供了一种方法来预测基于STT-MRAM的存储器在不同块大小和访问速率下的块级可靠性。所提出的方法还允许探索所需的纠错能力作为码字大小的函数,以实现所研究的存储器的期望可靠性目标。
{"title":"Reliability estimation at block-level granularity of spin-transfer-torque MRAMs","authors":"S. Carlo, Marco Indaco, P. Prinetto, E. Vatajelu, R. Rodríguez-Montañés, J. Figueras","doi":"10.1109/DFT.2014.6962093","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962093","url":null,"abstract":"In recent years, the Spin-Transfer-Torque Magnetic Random Access Memory (STT-MRAM) has emerged as a promising choice for embedded memories due to its reduced read/write latency and high CMOS integration capability. Under today aggressive technology scaling requirements, the STT-MRAM is affected by process variability and aging phenomena, making reliability prediction a growing concern. In this paper, we provide a methodology for predicting the reliability of an STT-MRAM based memory at block level for different block sizes and access rates. The proposed methodology also allows for an exploration of required error correction capabilities as function of code word size to achieve the desired reliability target for the memory under study.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117249714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fault tolerant and highly adaptive routing for 2D NoCs 二维noc的容错和高自适应路由
Manoj Kumar, V. Laxmi, M. Gaur, M. Daneshtalab, M. Ebrahimi, Mark Zwolinski
Networks-on-Chip (NoCs) are emerging as a promising communication paradigm to overcome bottleneck of traditional bus-based interconnects for current microarchitectures (MCSoC and CMP). One of the known current problems in NoC routing is the use of acyclic Channel Dependency Graph (CDG) for deadlock freedom. This requirement forces certain routing turns to be prohibited, thus, reducing the degree of adaptiveness. In this paper, we propose a novel non-minimal turn model which allows cycles in CDG provided that Extended Channel Dependency Graph (ECDG) remains acyclic. The proposed turn model reduces number of restrictions on routing turns, hence able to provide path diversity through additional minimal and non-minimal routes between source and destination. We also develop a fault tolerant and congestion-aware routing algorithm based on the proposed turn model to demonstrate the effectiveness. In this algorithm, a non-minimal route is used only when links in minimal routes are congested or faulty. Average performance gain of the proposed method is up to 26% across all selected benchmarks when compared with DRFT and 12% when compared with LEAR for 7 × 7 mesh.
片上网络(noc)正在成为一种有前途的通信模式,以克服当前微架构(MCSoC和CMP)中传统基于总线的互连的瓶颈。目前已知的NoC路由问题之一是使用无循环通道依赖图(CDG)来实现死锁自由。这个要求强制禁止某些路由转换,从而降低了自适应程度。在本文中,我们提出了一种新的非最小回合模型,该模型允许在扩展通道依赖图(ECDG)保持非循环的情况下,在CDG中存在循环。所提出的转弯模型减少了路由转弯的限制数量,从而能够通过在源和目的之间增加最小和非最小路径来提供路径多样性。我们还开发了一个基于所提出的转弯模型的容错和拥塞感知路由算法来证明该算法的有效性。在该算法中,只有当最小路由中的链路出现拥塞或故障时,才会使用非最小路由。与DRFT相比,该方法在所有选定的基准测试中的平均性能增益高达26%,与LEAR相比,在7 × 7网格下的平均性能增益高达12%。
{"title":"Fault tolerant and highly adaptive routing for 2D NoCs","authors":"Manoj Kumar, V. Laxmi, M. Gaur, M. Daneshtalab, M. Ebrahimi, Mark Zwolinski","doi":"10.1109/DFT.2014.6962100","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962100","url":null,"abstract":"Networks-on-Chip (NoCs) are emerging as a promising communication paradigm to overcome bottleneck of traditional bus-based interconnects for current microarchitectures (MCSoC and CMP). One of the known current problems in NoC routing is the use of acyclic Channel Dependency Graph (CDG) for deadlock freedom. This requirement forces certain routing turns to be prohibited, thus, reducing the degree of adaptiveness. In this paper, we propose a novel non-minimal turn model which allows cycles in CDG provided that Extended Channel Dependency Graph (ECDG) remains acyclic. The proposed turn model reduces number of restrictions on routing turns, hence able to provide path diversity through additional minimal and non-minimal routes between source and destination. We also develop a fault tolerant and congestion-aware routing algorithm based on the proposed turn model to demonstrate the effectiveness. In this algorithm, a non-minimal route is used only when links in minimal routes are congested or faulty. Average performance gain of the proposed method is up to 26% across all selected benchmarks when compared with DRFT and 12% when compared with LEAR for 7 × 7 mesh.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129629642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A runtime manager for gracefully degrading SoCs 用于优雅地降级soc的运行时管理器
S. Tzilis, I. Sourdis
The increasing number of transistors integrated on a single chip comes with the blessing of raw computational power and the curse of susceptibility to various kinds of faults. On top of increased defect densities, wearout effects mean that the testing verdict at fabrication time cannot be trusted throughout the chip lifetime. However, extra computational power presents the opportunity to build gracefully degrading MPSoCs. Re-configurable components and flexible workloads, along with runtime support, enable MPSoCs to deal with permanent faults degrading one or more system aspects, such as performance, energy efficiency and delivered functionality, instead of failing. In this manner, chip life is prolonged and safety is increased. In this work Graceful Degradation (GD) is formulated as an optimization problem in the context of MPSoCs. As such, its possible solutions can be evaluated in a parameterizable and consistent manner. An attempt at a runtime solution for a heterogeneous 4-core SoC is made and the resulting GD manager is evaluated in terms of speed and accuracy, with a use case combining essential automotive tasks and non-essential additional features. On average, it is found to produce a solution 89% as good as the optimal, in 4.3μsec running on one core of a common modern CPU.
集成在单个芯片上的晶体管数量不断增加,随之而来的是原始计算能力的优势和易受各种故障影响的诅咒。除了缺陷密度增加之外,磨损效应意味着制造时的测试结果在整个芯片寿命期间都不可信。然而,额外的计算能力提供了构建优雅地降级mpsoc的机会。可重新配置的组件和灵活的工作负载,以及运行时支持,使mpsoc能够处理降低一个或多个系统方面(如性能、能源效率和交付功能)的永久性故障,而不是出现故障。这样可以延长芯片的使用寿命,提高安全性。在这项工作中,优雅退化(GD)被制定为mpsoc背景下的优化问题。因此,可以用可参数化和一致的方式对其可能的解进行评估。对异构4核SoC的运行时解决方案进行了尝试,并根据速度和准确性评估了生成的GD管理器,并结合了基本汽车任务和非必要附加功能的用例。平均而言,在一个普通现代CPU的一个核心上运行4.3μsec,可以产生比最优解决方案好89%的解决方案。
{"title":"A runtime manager for gracefully degrading SoCs","authors":"S. Tzilis, I. Sourdis","doi":"10.1109/DFT.2014.6962106","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962106","url":null,"abstract":"The increasing number of transistors integrated on a single chip comes with the blessing of raw computational power and the curse of susceptibility to various kinds of faults. On top of increased defect densities, wearout effects mean that the testing verdict at fabrication time cannot be trusted throughout the chip lifetime. However, extra computational power presents the opportunity to build gracefully degrading MPSoCs. Re-configurable components and flexible workloads, along with runtime support, enable MPSoCs to deal with permanent faults degrading one or more system aspects, such as performance, energy efficiency and delivered functionality, instead of failing. In this manner, chip life is prolonged and safety is increased. In this work Graceful Degradation (GD) is formulated as an optimization problem in the context of MPSoCs. As such, its possible solutions can be evaluated in a parameterizable and consistent manner. An attempt at a runtime solution for a heterogeneous 4-core SoC is made and the resulting GD manager is evaluated in terms of speed and accuracy, with a use case combining essential automotive tasks and non-essential additional features. On average, it is found to produce a solution 89% as good as the optimal, in 4.3μsec running on one core of a common modern CPU.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130639163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A fault injection methodology and infrastructure for fast single event upsets emulation on Xilinx SRAM-based FPGAs 基于Xilinx sram的fpga上快速单事件故障仿真的故障注入方法和基础结构
S. Carlo, P. Prinetto, Daniele Rolfo, Pascal Trotta
Modern SRAM-based Field Programmable Gate Arrays (FPGAs) are increasingly employed in safety- and mission-critical applications. However, the aggressive technology scaling is highlighting the increasing sensitivity of such devices to Single Event Upsets (SEUs) caused by external radiation events. Assessing the reliability of FPGA-based systems in the early design stages is of upmost importance, allowing design exploration of different protection alternatives. This paper presents a Dynamic Partial Reconfiguration-based fault injection methodology implemented by an integrated infrastructure for SEUs emulation in the configuration memory of Xilinx SRAM-based FPGAs. The proposed methodology exploits the Xilinx Essential Bits technology to extremely speed-up fault injection, ensuring correct operations of the fault injection infrastructure during the whole injection process.
现代基于sram的现场可编程门阵列(fpga)越来越多地用于安全和关键任务应用。然而,积极的技术扩展突出了此类设备对外部辐射事件引起的单事件干扰(seu)的敏感性日益增加。在早期设计阶段评估基于fpga的系统的可靠性是最重要的,允许设计探索不同的保护方案。本文提出了一种基于动态部分重构的故障注入方法,该方法由集成的基础设施实现,用于seu仿真在赛灵思sram fpga的配置存储器中。该方法利用Xilinx Essential Bits技术极大地加快了断层注入速度,确保了断层注入基础设施在整个注入过程中的正确操作。
{"title":"A fault injection methodology and infrastructure for fast single event upsets emulation on Xilinx SRAM-based FPGAs","authors":"S. Carlo, P. Prinetto, Daniele Rolfo, Pascal Trotta","doi":"10.1109/DFT.2014.6962073","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962073","url":null,"abstract":"Modern SRAM-based Field Programmable Gate Arrays (FPGAs) are increasingly employed in safety- and mission-critical applications. However, the aggressive technology scaling is highlighting the increasing sensitivity of such devices to Single Event Upsets (SEUs) caused by external radiation events. Assessing the reliability of FPGA-based systems in the early design stages is of upmost importance, allowing design exploration of different protection alternatives. This paper presents a Dynamic Partial Reconfiguration-based fault injection methodology implemented by an integrated infrastructure for SEUs emulation in the configuration memory of Xilinx SRAM-based FPGAs. The proposed methodology exploits the Xilinx Essential Bits technology to extremely speed-up fault injection, ensuring correct operations of the fault injection infrastructure during the whole injection process.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126462725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
An instance-based SER analysis in the presence of PVTA variations 在存在PVTA变化的情况下,基于实例的SER分析
Bahareh J. Farahani, S. Safari
As semiconductor technology has entered into the nanoscale regime, Single Event Transient (SET) became one of the major challenging issues for silicon chips. Susceptibility to soft error is even becoming more severe in the presence of Process, Voltage, Temperature, and transistor Aging (PVTA) variations. In this paper, we model and analyze the impacts of PVTA on the susceptibility of VLSI chips to SET. We show that higher PVTA results in significant reduction of critical charge (i.e, higher glitch generation) of silicons while electrical masking (preventing glitch propagation) is improved. In addition, we propose a holistic instance-based systematic methodology to calculate the Soft Error Rate (SER) of combinationals considering PVTA variations. The simulation results for various ITC'99 benchmark circuits show that disregarding PVTA information results in 76% error in the estimated SER on average. Moreover, according to the results, SER increases by 70% on average in the first years of circuit lifetime due to transistor aging and then it is almost saturated.
随着半导体技术进入纳米级,单事件瞬态(SET)成为硅芯片面临的主要挑战之一。在工艺、电压、温度和晶体管老化(PVTA)变化的情况下,对软误差的敏感性甚至变得更加严重。在本文中,我们建立模型并分析了PVTA对VLSI芯片对SET敏感性的影响。我们表明,较高的PVTA导致硅的临界电荷显著降低(即更高的毛刺产生),同时电掩蔽(防止毛刺传播)得到改善。此外,我们提出了一种基于实例的整体系统方法来计算考虑PVTA变化的组合的软错误率(SER)。对各种ITC’99基准电路的仿真结果表明,忽略PVTA信息导致估计SER的平均误差为76%。此外,根据结果,由于晶体管老化,SER在电路寿命的头几年平均增加了70%,然后几乎饱和。
{"title":"An instance-based SER analysis in the presence of PVTA variations","authors":"Bahareh J. Farahani, S. Safari","doi":"10.1109/DFT.2014.6962081","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962081","url":null,"abstract":"As semiconductor technology has entered into the nanoscale regime, Single Event Transient (SET) became one of the major challenging issues for silicon chips. Susceptibility to soft error is even becoming more severe in the presence of Process, Voltage, Temperature, and transistor Aging (PVTA) variations. In this paper, we model and analyze the impacts of PVTA on the susceptibility of VLSI chips to SET. We show that higher PVTA results in significant reduction of critical charge (i.e, higher glitch generation) of silicons while electrical masking (preventing glitch propagation) is improved. In addition, we propose a holistic instance-based systematic methodology to calculate the Soft Error Rate (SER) of combinationals considering PVTA variations. The simulation results for various ITC'99 benchmark circuits show that disregarding PVTA information results in 76% error in the estimated SER on average. Moreover, according to the results, SER increases by 70% on average in the first years of circuit lifetime due to transistor aging and then it is almost saturated.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133411836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Security methods in fault tolerant modified line graph based networks 基于容错修正线图的网络安全方法
Prashant D. Joshi, S. Hamdioui
Many routing protocols make some assumptions on the correctness of the routing information in the router. This at times allows faults and malicious attacks in the networks. This paper describes a class of networks based on modified line graphs with many features to authenticate the data and controls of the message routing, and having properties of the shortest diameters and easy shortest path calculations. We describe this class of fault tolerant networks and the relevant properties. This helps understand the security mechanism, WISH ('What I See and Hear') that probes the data and the routing information to reduce the possibility of router problems, malicious or otherwise.
许多路由协议对路由器中路由信息的正确性做了一些假设。这有时会导致网络出现故障和恶意攻击。本文描述了一类基于修改线形图的网络,该网络具有许多特征来验证消息路由的数据和控制,并且具有最短直径和最短路径计算容易的特性。我们描述了这类容错网络及其相关性质。这有助于理解安全机制WISH(“我所看到和听到的”),它探测数据和路由信息,以减少路由器问题的可能性,无论是恶意的还是其他的。
{"title":"Security methods in fault tolerant modified line graph based networks","authors":"Prashant D. Joshi, S. Hamdioui","doi":"10.1109/DFT.2014.6962104","DOIUrl":"https://doi.org/10.1109/DFT.2014.6962104","url":null,"abstract":"Many routing protocols make some assumptions on the correctness of the routing information in the router. This at times allows faults and malicious attacks in the networks. This paper describes a class of networks based on modified line graphs with many features to authenticate the data and controls of the message routing, and having properties of the shortest diameters and easy shortest path calculations. We describe this class of fault tolerant networks and the relevant properties. This helps understand the security mechanism, WISH ('What I See and Hear') that probes the data and the routing information to reduce the possibility of router problems, malicious or otherwise.","PeriodicalId":414665,"journal":{"name":"2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126263749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1