首页 > 最新文献

2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)最新文献

英文 中文
Predicting Single Event Effects in DRAM 预测DRAM中的单事件效应
Donald Kline, Stephen Longofono, R. Melhem, A. Jones
The ability to leverage commodity memory in harsh environments due to radiation has the potential advance computing capability for aerospace and nuclear applications, among others. In this work, we provide the first demonstration of the existence of a small number of weak cells to single event effects for DDR3 memory when exposed to radiation. Thus, a high proportion of single event faults are actually not entirely random and can be predicted with high accuracy. We also demonstrate a classification of single event effects into predictable single cell, unpredictable single cell, and correlated multi-cell persistent faults, the latter due to latch-up effects. We further show that through classification, we can partition faults, which allows the development of a holistic framework to provide enhanced protection of the DRAM memory. This framework leverages a fault map with bit sparing to protect against faults from weak cells in conjunction with Chipkill ECC to effectively correct chip-level and random errors. This protection provides a potential path to the use of commodity DRAM memory in high radiation environments with extremely low fault rates. Our results, based on data from a multi-day radiation beam experiment, indicate a reduction in uncorrectable bit error rate for rows containing a weak cell by a factor of $geq 10^{7}$ compared to Chipkill alone.
在受辐射影响的恶劣环境中利用商品存储器的能力,对航空航天和核应用等领域具有潜在的先进计算能力。在这项工作中,我们首次证明了当暴露于辐射时,存在少量弱细胞对DDR3记忆的单事件效应。因此,很大比例的单事件故障实际上并不是完全随机的,可以以很高的精度进行预测。我们还展示了单事件效应的分类,分为可预测的单细胞、不可预测的单细胞和相关的多细胞持续故障,后者是由于闭锁效应。我们进一步表明,通过分类,我们可以对故障进行分区,从而允许开发一个整体框架,以提供对DRAM存储器的增强保护。该框架利用具有位保留的故障映射来防止弱单元的故障,并结合Chipkill ECC有效地纠正芯片级和随机错误。这种保护为在高辐射环境中以极低的故障率使用商品DRAM存储器提供了潜在的途径。基于多日辐射束实验的数据,我们的结果表明,与单独Chipkill相比,包含弱单元的行不可校正比特误码率降低了$geq 10^{7}$。
{"title":"Predicting Single Event Effects in DRAM","authors":"Donald Kline, Stephen Longofono, R. Melhem, A. Jones","doi":"10.1109/DFT.2019.8875328","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875328","url":null,"abstract":"The ability to leverage commodity memory in harsh environments due to radiation has the potential advance computing capability for aerospace and nuclear applications, among others. In this work, we provide the first demonstration of the existence of a small number of weak cells to single event effects for DDR3 memory when exposed to radiation. Thus, a high proportion of single event faults are actually not entirely random and can be predicted with high accuracy. We also demonstrate a classification of single event effects into predictable single cell, unpredictable single cell, and correlated multi-cell persistent faults, the latter due to latch-up effects. We further show that through classification, we can partition faults, which allows the development of a holistic framework to provide enhanced protection of the DRAM memory. This framework leverages a fault map with bit sparing to protect against faults from weak cells in conjunction with Chipkill ECC to effectively correct chip-level and random errors. This protection provides a potential path to the use of commodity DRAM memory in high radiation environments with extremely low fault rates. Our results, based on data from a multi-day radiation beam experiment, indicate a reduction in uncorrectable bit error rate for rows containing a weak cell by a factor of $geq 10^{7}$ compared to Chipkill alone.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132115641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Analog Test Interface for IEEE 1687 Employing Split SAR Architecture to Support Embedded Instrument Dependability Applications 采用分割SAR架构的IEEE 1687模拟测试接口支持嵌入式仪器可靠性应用
J. Pathrose, L. V. D. Logt, H. Kerkhoff
Embedded instruments have become ubiquitous in modern day System-on-Chips for test and monitoring purposes. IEEE 1687 or IJTAG addresses the standardization of access and operation of these embedded instruments. Recently, there has been a lot of interest in employing embedded instruments for dependability purposes. Many of these embedded instruments are required to monitor physical quantities which are analog in nature. A cost-effective architecture to integrate these analog instruments into the IEEE 1687 infrastructure is a bottleneck and has not yet been standardized. This paper presents a time and area efficient architecture to interface analog embedded instruments onto the IEEE 1687 network especially for dependability applications. The architecture mitigates the drawbacks associated with utilizing an analog test bus and enables periodic sampling with minimal hardware overhead. The simulations to illustrate the concept have been conducted with TSMC 40nm CMOS technology.
嵌入式仪器在现代芯片系统中已经无处不在,用于测试和监控目的。IEEE 1687或IJTAG解决了这些嵌入式仪器的访问和操作的标准化问题。最近,出于可靠性的考虑,人们对采用嵌入式仪器产生了浓厚的兴趣。许多这些嵌入式仪器都需要监测本质上是模拟量的物理量。将这些模拟仪器集成到IEEE 1687基础设施中的成本效益架构是一个瓶颈,尚未标准化。本文提出了一种既省时又省地的结构,将模拟嵌入式仪器连接到IEEE 1687网络上,特别是在可靠性应用方面。该体系结构减轻了与使用模拟测试总线相关的缺点,并以最小的硬件开销支持周期性采样。采用台积电40nm CMOS技术进行了仿真,以说明该概念。
{"title":"Analog Test Interface for IEEE 1687 Employing Split SAR Architecture to Support Embedded Instrument Dependability Applications","authors":"J. Pathrose, L. V. D. Logt, H. Kerkhoff","doi":"10.1109/DFT.2019.8875372","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875372","url":null,"abstract":"Embedded instruments have become ubiquitous in modern day System-on-Chips for test and monitoring purposes. IEEE 1687 or IJTAG addresses the standardization of access and operation of these embedded instruments. Recently, there has been a lot of interest in employing embedded instruments for dependability purposes. Many of these embedded instruments are required to monitor physical quantities which are analog in nature. A cost-effective architecture to integrate these analog instruments into the IEEE 1687 infrastructure is a bottleneck and has not yet been standardized. This paper presents a time and area efficient architecture to interface analog embedded instruments onto the IEEE 1687 network especially for dependability applications. The architecture mitigates the drawbacks associated with utilizing an analog test bus and enables periodic sampling with minimal hardware overhead. The simulations to illustrate the concept have been conducted with TSMC 40nm CMOS technology.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134604390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A State Assignment Method to Improve Transition Fault Coverage for Controllers 一种提高控制器过渡故障覆盖率的状态分配方法
Masayoshi Yoshimura, Yukihiko Takeuchi, Hiroshi Yamazaki, Toshinori Hosokawa
Recently, it is indispensable to test in transition fault model due to timing defects increase along with complication and high speed of VLSI. However, the transition fault coverage tends to be lower than the stuck-at fault coverage due to untestable faults caused by the circuit structure. Low transition fault coverage may not be able to detect potential timing defects. Therefore, it is important to design-for-testability (DFT) to improve transition fault coverage. In this paper, we show that transition fault coverages depend on state assignment to a controller in RTL netlists. We propose a QDT value which is an evaluation index on transition fault coverage for state assignment. Experimental results show that state assignment with high evaluation index has high transition fault coverages.
近年来,随着超大规模集成电路的复杂性和高速发展,时序缺陷不断增加,对过渡故障模型进行测试已成为必不可少的工作。然而,由于电路结构引起的不可测试故障,过渡故障覆盖率往往低于卡滞故障覆盖率。低转换故障覆盖率可能无法检测潜在的时序缺陷。因此,采用可测试性设计(DFT)来提高转换故障覆盖率是非常重要的。在本文中,我们证明了在RTL网络列表中,转换故障覆盖依赖于对控制器的状态分配。我们提出了一个QDT值作为状态分配中过渡故障覆盖率的评价指标。实验结果表明,评价指标高的状态分配具有较高的过渡断层覆盖率。
{"title":"A State Assignment Method to Improve Transition Fault Coverage for Controllers","authors":"Masayoshi Yoshimura, Yukihiko Takeuchi, Hiroshi Yamazaki, Toshinori Hosokawa","doi":"10.1109/DFT.2019.8875322","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875322","url":null,"abstract":"Recently, it is indispensable to test in transition fault model due to timing defects increase along with complication and high speed of VLSI. However, the transition fault coverage tends to be lower than the stuck-at fault coverage due to untestable faults caused by the circuit structure. Low transition fault coverage may not be able to detect potential timing defects. Therefore, it is important to design-for-testability (DFT) to improve transition fault coverage. In this paper, we show that transition fault coverages depend on state assignment to a controller in RTL netlists. We propose a QDT value which is an evaluation index on transition fault coverage for state assignment. Experimental results show that state assignment with high evaluation index has high transition fault coverages.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124640427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient Error-Tolerant Quantized Neural Network Accelerators 高效容错量化神经网络加速器
Giulio Gambardella, Johannes Kappauf, Michaela Blott, Christoph Doehring, M. Kumm, P. Zipf, K. Vissers
Neural Networks are currently one of the most widely deployed machine learning algorithms. In particular, Convolutional Neural Networks (CNNs), are gaining popularity and are evaluated for deployment in safety critical applications such as self driving vehicles. Modern CNNs feature enormous memory bandwidth and high computational needs, challenging existing hardware platforms to meet throughput, latency and power requirements. Functional safety and error tolerance need to be considered as additional requirement in safety critical systems. In general, fault tolerant operation can be achieved by adding redundancy to the system, which is further exacerbating the computational demands. Furthermore, the question arises whether pruning and quantization methods for performance scaling turn out to be counterproductive with regards to fail safety requirements. In this work we present a methodology to evaluate the impact of permanent faults affecting Quantized Neural Networks (QNNs) and how to effectively decrease their effects in hardware accelerators. We use FPGA-based hardware accelerated error injection, in order to enable the fast evaluation. A detailed analysis is presented showing that QNNs containing convolutional layers are by far not as robust to faults as commonly believed and can lead to accuracy drops of up to 10%. To circumvent that, we propose two different methods to increase their robustness: 1) selective channel replication which adds significantly less redundancy than used by the common triple modular redundancy and 2) a fault-aware scheduling of processing elements for folded implementations.
神经网络是目前应用最广泛的机器学习算法之一。特别是卷积神经网络(cnn)越来越受欢迎,并被评估用于自动驾驶汽车等安全关键应用。现代cnn具有巨大的内存带宽和高计算需求,挑战现有的硬件平台,以满足吞吐量,延迟和功耗要求。在安全关键系统中,功能安全和容错性需要作为附加要求加以考虑。一般来说,可以通过增加冗余来实现容错操作,这进一步加剧了计算需求。此外,出现的问题是,对于故障安全要求而言,用于性能缩放的修剪和量化方法是否会适得其反。在这项工作中,我们提出了一种方法来评估永久故障对量化神经网络(QNNs)的影响,以及如何有效地减少它们在硬件加速器中的影响。我们使用基于fpga的硬件加速错误注入,以实现快速评估。详细的分析表明,包含卷积层的qnn到目前为止对故障的鲁棒性不如通常认为的那样,并且可能导致准确率下降高达10%。为了避免这种情况,我们提出了两种不同的方法来增加它们的鲁棒性:1)选择性通道复制,它比常见的三模冗余增加的冗余少得多;2)折叠实现的处理元素的故障感知调度。
{"title":"Efficient Error-Tolerant Quantized Neural Network Accelerators","authors":"Giulio Gambardella, Johannes Kappauf, Michaela Blott, Christoph Doehring, M. Kumm, P. Zipf, K. Vissers","doi":"10.1109/DFT.2019.8875314","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875314","url":null,"abstract":"Neural Networks are currently one of the most widely deployed machine learning algorithms. In particular, Convolutional Neural Networks (CNNs), are gaining popularity and are evaluated for deployment in safety critical applications such as self driving vehicles. Modern CNNs feature enormous memory bandwidth and high computational needs, challenging existing hardware platforms to meet throughput, latency and power requirements. Functional safety and error tolerance need to be considered as additional requirement in safety critical systems. In general, fault tolerant operation can be achieved by adding redundancy to the system, which is further exacerbating the computational demands. Furthermore, the question arises whether pruning and quantization methods for performance scaling turn out to be counterproductive with regards to fail safety requirements. In this work we present a methodology to evaluate the impact of permanent faults affecting Quantized Neural Networks (QNNs) and how to effectively decrease their effects in hardware accelerators. We use FPGA-based hardware accelerated error injection, in order to enable the fast evaluation. A detailed analysis is presented showing that QNNs containing convolutional layers are by far not as robust to faults as commonly believed and can lead to accuracy drops of up to 10%. To circumvent that, we propose two different methods to increase their robustness: 1) selective channel replication which adds significantly less redundancy than used by the common triple modular redundancy and 2) a fault-aware scheduling of processing elements for folded implementations.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134640048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
A Fault-Tolerant MPSoC For CubeSats 用于立方体卫星的容错MPSoC
C. Fuchs, Pai H. Chou, X. Wen, N. Murillo, G. Furano, S. Holst, A. Tavoularis, Shyue-Kung Lu, A. Plaat, K. Marinis
We present the implementation of a fault-tolerant MP-SoC for very small satellites (<100kg) based upon commercial components and library IP. This MPSoC is the result of a codesign process and is designed as an ideal platform for software-implemented fault-tolerance measures. It enforces strong isolation between processors, and combines fault-tolerance measures across the embedded stack within an FPGA. This allows us to assure robustness for a satellite on-board computer consisting of modern semiconductors manufactured in fine technology nodes, for which traditional fault-tolerance concepts are ineffective. We successfully implemented this design on several Xilinx UltraScale and UltraScale+ FPGAs with modest utilization. We show that a 4-core implementation is possible with just 1.93 $W$ of total power consumption, which for the first time enables true fault-tolerance for very small spacecraft such as CubeSats. For critical space missions aboard heavier satellites, we implemented an MPSoC-variant for the space-grade XQRKU060 part together with the Xilinx Radiation Testing Consortium. The MPSoC was developed for a 4-year ESA project. It can satisfy the high performance requirements of future scientific and commercial space missions at low cost while offering the strong fault-coverage necessary for platform control for missions with a long duration.
我们提出了一种基于商业组件和库IP的用于非常小的卫星(<100kg)的容错MP-SoC的实现。该MPSoC是协同设计过程的结果,被设计为软件实现容错措施的理想平台。它加强了处理器之间的强隔离,并结合了FPGA内跨嵌入式堆栈的容错措施。这使我们能够确保由精细技术节点制造的现代半导体组成的卫星机载计算机的鲁棒性,而传统的容错概念对于这些计算机是无效的。我们成功地在多个Xilinx UltraScale和UltraScale+ fpga上实现了该设计,利用率适中。我们展示了一个4核的实现是可能的,只有1.93美元的总功耗,这是第一次实现真正的容错非常小的航天器,如立方体卫星。对于重型卫星上的关键空间任务,我们与赛灵思辐射测试联盟一起为空间级XQRKU060部件实施了mpsoc变体。MPSoC是为一个为期4年的ESA项目开发的。它可以以低成本满足未来科学和商业空间任务的高性能要求,同时为长时间任务的平台控制提供必要的强故障覆盖。
{"title":"A Fault-Tolerant MPSoC For CubeSats","authors":"C. Fuchs, Pai H. Chou, X. Wen, N. Murillo, G. Furano, S. Holst, A. Tavoularis, Shyue-Kung Lu, A. Plaat, K. Marinis","doi":"10.1109/DFT.2019.8875417","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875417","url":null,"abstract":"We present the implementation of a fault-tolerant MP-SoC for very small satellites (<100kg) based upon commercial components and library IP. This MPSoC is the result of a codesign process and is designed as an ideal platform for software-implemented fault-tolerance measures. It enforces strong isolation between processors, and combines fault-tolerance measures across the embedded stack within an FPGA. This allows us to assure robustness for a satellite on-board computer consisting of modern semiconductors manufactured in fine technology nodes, for which traditional fault-tolerance concepts are ineffective. We successfully implemented this design on several Xilinx UltraScale and UltraScale+ FPGAs with modest utilization. We show that a 4-core implementation is possible with just 1.93 $W$ of total power consumption, which for the first time enables true fault-tolerance for very small spacecraft such as CubeSats. For critical space missions aboard heavier satellites, we implemented an MPSoC-variant for the space-grade XQRKU060 part together with the Xilinx Radiation Testing Consortium. The MPSoC was developed for a 4-year ESA project. It can satisfy the high performance requirements of future scientific and commercial space missions at low cost while offering the strong fault-coverage necessary for platform control for missions with a long duration.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125160081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Reliability Evaluation of Polyphase-filter based Decimators Implemented on SRAM-FPGAs 基于sram - fpga的多相滤波器抽取器可靠性评估
Zhen Gao, Jinhua Zhu, Lina Yan, Tong Yan, P. Reviriego
Decimation is widely used in digital communication systems to reduce the oversampling rate of the base band signal. The structure of Poly Phase Filters (PPFs) provides an efficient implementation of the decimator. This paper studies the effects of Single Event Upsets (SEUs) on PPFs based decimators implemented on SRAM-FPGAs. Fault injection experiments are performed to evaluate the reliability of the decimator to SEUs on filter coefficients and on the configuration memory. For the first part, experiment results show that only SEUs on the two most significant bits would cause non negligible SNR loss in the output. For SEUs on the essential bits in the configuration memory, about 20% of them would affect the results, among which 70% of the SEUs would cause negligible SNR loss. The percentage of SEUs that cause a SNR loss larger than 5dB are 7%, 13% and 16% for decimation rates of 16, 8 and 4, respectively. This can be explained as the decimator is composed of a number of parallel filters that is equal to the decimation rate, and each filter contributes a fraction of the final output. Therefore, a SEU on one filter would not cause large degradation to the decimated signal, and the larger the decimation rate is, the smaller the degradation of decimated signal introduced by the SEU.
抽取被广泛应用于数字通信系统中,以降低基带信号的过采样率。多相滤波器(PPFs)的结构提供了一种高效的十进制数实现方法。本文研究了单事件干扰(SEUs)对基于ppf的抽取器在sram - fpga上实现的影响。通过故障注入实验,评估了抽取器在滤波器系数和组态存储器方面对seu的可靠性。对于第一部分,实验结果表明,只有两个最有效位上的seu会导致输出中不可忽略的信噪比损失。对于配置内存中关键位上的seu,大约20%的seu会影响结果,其中70%的seu会造成可以忽略不计的信噪比损失。在抽取率为16、8和4时,导致信噪比损失大于5dB的seu比例分别为7%、13%和16%。这可以解释为十进制数由许多等于抽取率的并行滤波器组成,并且每个滤波器贡献最终输出的一小部分。因此,一个滤波器上的SEU不会对抽取后的信号造成大的退化,而且抽取率越大,SEU对抽取后的信号的退化越小。
{"title":"Reliability Evaluation of Polyphase-filter based Decimators Implemented on SRAM-FPGAs","authors":"Zhen Gao, Jinhua Zhu, Lina Yan, Tong Yan, P. Reviriego","doi":"10.1109/DFT.2019.8875316","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875316","url":null,"abstract":"Decimation is widely used in digital communication systems to reduce the oversampling rate of the base band signal. The structure of Poly Phase Filters (PPFs) provides an efficient implementation of the decimator. This paper studies the effects of Single Event Upsets (SEUs) on PPFs based decimators implemented on SRAM-FPGAs. Fault injection experiments are performed to evaluate the reliability of the decimator to SEUs on filter coefficients and on the configuration memory. For the first part, experiment results show that only SEUs on the two most significant bits would cause non negligible SNR loss in the output. For SEUs on the essential bits in the configuration memory, about 20% of them would affect the results, among which 70% of the SEUs would cause negligible SNR loss. The percentage of SEUs that cause a SNR loss larger than 5dB are 7%, 13% and 16% for decimation rates of 16, 8 and 4, respectively. This can be explained as the decimator is composed of a number of parallel filters that is equal to the decimation rate, and each filter contributes a fraction of the final output. Therefore, a SEU on one filter would not cause large degradation to the decimated signal, and the larger the decimation rate is, the smaller the degradation of decimated signal introduced by the SEU.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127079873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Protecting RSA Hardware Accelerators against Differential Fault Analysis through Residue Checking 通过残差检测保护RSA硬件加速器不受差分故障分析的影响
Ana Lasheras, R. Canal, Eva Rodríguez, Luca Cassano
Hardware accelerators for cryptographic algorithms are ubiquitously deployed in nowadays consumer and industrial products. Unfortunately, the HW implementations of such algorithms often suffer from vulnerabilities that expose systems to a number of attacks, among which differential fault analysis (DFA). It is therefore crucial to protect cryptographic circuits against such attacks in a cost-effective and power-efficient way. In this paper, we propose a lightweight technique for protecting circuits implementing the RSA algorithm against DFA. The proposed solution borrows residue checking from the traditional fault tolerance and applies it to RSA circuits in order to first detect the occurrence a fault and then to react to the attack by obfuscating the output values. An experimental campaign demonstrated that the proposed solution detects the 100% of the possible fault attacks while leading to a 2.85% area overhead, a 16.67% power consumption increase and with no operating frequency decrease.
用于加密算法的硬件加速器在当今的消费和工业产品中无处不在。不幸的是,这种算法的硬件实现经常存在漏洞,使系统暴露于许多攻击,其中包括差分故障分析(DFA)。因此,以经济高效的方式保护加密电路免受此类攻击至关重要。在本文中,我们提出了一种轻量级的技术来保护实现RSA算法的电路免受DFA的侵害。该方案从传统的容错机制中借鉴了剩余检测,并将其应用于RSA电路中,首先检测故障的发生,然后通过混淆输出值来对攻击做出反应。实验表明,该方案检测到100%可能的故障攻击,同时导致2.85%的面积开销,16.67%的功耗增加,并且没有降低工作频率。
{"title":"Protecting RSA Hardware Accelerators against Differential Fault Analysis through Residue Checking","authors":"Ana Lasheras, R. Canal, Eva Rodríguez, Luca Cassano","doi":"10.1109/DFT.2019.8875320","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875320","url":null,"abstract":"Hardware accelerators for cryptographic algorithms are ubiquitously deployed in nowadays consumer and industrial products. Unfortunately, the HW implementations of such algorithms often suffer from vulnerabilities that expose systems to a number of attacks, among which differential fault analysis (DFA). It is therefore crucial to protect cryptographic circuits against such attacks in a cost-effective and power-efficient way. In this paper, we propose a lightweight technique for protecting circuits implementing the RSA algorithm against DFA. The proposed solution borrows residue checking from the traditional fault tolerance and applies it to RSA circuits in order to first detect the occurrence a fault and then to react to the attack by obfuscating the output values. An experimental campaign demonstrated that the proposed solution detects the 100% of the possible fault attacks while leading to a 2.85% area overhead, a 16.67% power consumption increase and with no operating frequency decrease.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"405 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133153389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High Performance Memory Repair 高性能内存修复
F. Merchant, Anandraj Devarajan, A. Basu, David Ashen, Brandon Yelton, Prashant D. Joshi
As process technology dimensions shrink, manufacturing defect density is increasing, adversely impacting product yield. Products have typically built redundancy and repair features in SRAM. Register File arrays (RFs) can also benefit from redundancy and repair. There are various types of repair techniques used in SRAMs today which can also be employed on RFs. However all known techniques (column, row, 1-bit, multi-bit) incur a performance loss of at least two gate delays due to the addition of logic either on the memory address path or on the read output path. This paper describes a row repair scheme that incurs virtually no performance penalty. In simulations conducted in recent process nodes, we noted a performance impact of less than half a gate delay
随着工艺技术尺寸的缩小,制造缺陷密度增加,对产品良率产生不利影响。产品通常在SRAM中内置冗余和修复功能。寄存器文件阵列(RFs)也可以从冗余和修复中获益。目前在sram中使用了各种类型的修复技术,这些技术也可以用于rf。然而,所有已知的技术(列、行、1位、多位)都会由于在内存地址路径或读输出路径上添加逻辑而导致至少两个门延迟的性能损失。本文描述了一种几乎没有性能损失的行修复方案。在最近的流程节点中进行的模拟中,我们注意到不到一半的门延迟对性能的影响
{"title":"High Performance Memory Repair","authors":"F. Merchant, Anandraj Devarajan, A. Basu, David Ashen, Brandon Yelton, Prashant D. Joshi","doi":"10.1109/DFT.2019.8875490","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875490","url":null,"abstract":"As process technology dimensions shrink, manufacturing defect density is increasing, adversely impacting product yield. Products have typically built redundancy and repair features in SRAM. Register File arrays (RFs) can also benefit from redundancy and repair. There are various types of repair techniques used in SRAMs today which can also be employed on RFs. However all known techniques (column, row, 1-bit, multi-bit) incur a performance loss of at least two gate delays due to the addition of logic either on the memory address path or on the read output path. This paper describes a row repair scheme that incurs virtually no performance penalty. In simulations conducted in recent process nodes, we noted a performance impact of less than half a gate delay","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116313505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-relation Scan Attack Analysis (COSAA) on AES: A Comprehensive Approach AES的相关扫描攻击分析(COSAA):一种综合方法
Dipojjwal Ray, Siddharth Singh, Sk Subidh Ali, S. Biswas
The chiller system in a building accounts for the main part of the building's total energy consumption, so it is critical to optimize the operation of chiller system for the purpose of saving energy. A chiller system generally consists of evaporation side and condensing side. The condensing side includes chillers, cooling water pumps and cooling towers, which form a subsystem of cooling water. It is a typical optimization problem to minimize the total energy consumption of cooling water subsystem. This paper discusses how to solve the optimization problem of the cooling water subsystem and proposes a simple optimal control rule for field operation based on the optimization results.
建筑中的冷水机组系统占建筑总能耗的主要部分,因此优化冷水机组系统的运行以达到节能的目的至关重要。冷水机系统一般由蒸发侧和冷凝侧组成。冷凝侧包括冷水机组、冷却水泵和冷却塔,构成冷却水子系统。冷却水子系统总能耗最小化是一个典型的优化问题。本文讨论了如何解决冷却水分系统的优化问题,并根据优化结果提出了简单的现场运行最优控制规则。
{"title":"Co-relation Scan Attack Analysis (COSAA) on AES: A Comprehensive Approach","authors":"Dipojjwal Ray, Siddharth Singh, Sk Subidh Ali, S. Biswas","doi":"10.1109/DFT.2019.8875272","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875272","url":null,"abstract":"The chiller system in a building accounts for the main part of the building's total energy consumption, so it is critical to optimize the operation of chiller system for the purpose of saving energy. A chiller system generally consists of evaporation side and condensing side. The condensing side includes chillers, cooling water pumps and cooling towers, which form a subsystem of cooling water. It is a typical optimization problem to minimize the total energy consumption of cooling water subsystem. This paper discusses how to solve the optimization problem of the cooling water subsystem and proposes a simple optimal control rule for field operation based on the optimization results.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129301218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
On the Reliability of Convolutional Neural Network Implementation on SRAM-based FPGA 基于sram的FPGA上卷积神经网络实现的可靠性研究
B. Du, S. Azimi, C. D. Sio, Ludovica Bozzoli, L. Sterpone
In recent years, topics around machine learning and artificial intelligence (AI) have (re-)gained a lot of interest due to high demand in industrial automation applications in various areas such as medical, automotive and space and the increasing computational power offered by technology advancements. One common task for these applications is object recognition/classification whose input is usually an image taken from camera and output is whether an object is present and the class of the object. In industrial pipeline, this task could be used to identify possible defects in products; in automotive application, such task could be deployed to detect pedestrians for Advanced Driver-Assistance Systems (ADAS). When the task is safety-critical as in automotive application, the reliability of the task implementation is crucial and has to be evaluated before final deployment. On the other hand, Field Programmable Gate Array (FPGA) devices are gaining increasing attention in the hardware acceleration part for machine learning applications due to their high flexibility and increasing computational power. When the SRAM-based FPGA is considered, Single Event Upset (SEU) in configuration memory induced by radiation particle is one of the major concerns even at sea level. In this paper, we present the fault injection results on a Convolutional Neural Network (CNN) implementation on Xilinx SRAM-based FPGA which demonstrate that though there exists built-in redundancy in CNN implementation one SEU in configuration memory can still impact the task execution results while the possibility of Single Event Multiple Upsets (SEMU) must also be taken into consideration.
近年来,由于医疗、汽车和空间等各个领域的工业自动化应用需求旺盛,以及技术进步带来的计算能力不断提高,围绕机器学习和人工智能(AI)的话题(重新)引起了人们的极大兴趣。这些应用程序的一个常见任务是对象识别/分类,其输入通常是从相机拍摄的图像,输出是对象是否存在以及对象的类别。在工业管道中,此任务可用于识别产品中可能存在的缺陷;在汽车应用中,这种任务可以用于高级驾驶辅助系统(ADAS)检测行人。当任务对安全至关重要时,如在汽车应用中,任务实现的可靠性至关重要,必须在最终部署之前进行评估。另一方面,现场可编程门阵列(FPGA)器件由于其高灵活性和不断提高的计算能力,在机器学习应用的硬件加速部分越来越受到关注。当考虑基于sram的FPGA时,即使在海平面下,由辐射粒子引起的配置存储器中的单事件扰动(SEU)也是一个主要问题。本文给出了基于Xilinx sram的FPGA上卷积神经网络(CNN)实现的故障注入结果,结果表明,尽管CNN实现中存在内置冗余,但配置内存中的一个SEU仍然会影响任务执行结果,同时还必须考虑单事件多异常(SEMU)的可能性。
{"title":"On the Reliability of Convolutional Neural Network Implementation on SRAM-based FPGA","authors":"B. Du, S. Azimi, C. D. Sio, Ludovica Bozzoli, L. Sterpone","doi":"10.1109/DFT.2019.8875362","DOIUrl":"https://doi.org/10.1109/DFT.2019.8875362","url":null,"abstract":"In recent years, topics around machine learning and artificial intelligence (AI) have (re-)gained a lot of interest due to high demand in industrial automation applications in various areas such as medical, automotive and space and the increasing computational power offered by technology advancements. One common task for these applications is object recognition/classification whose input is usually an image taken from camera and output is whether an object is present and the class of the object. In industrial pipeline, this task could be used to identify possible defects in products; in automotive application, such task could be deployed to detect pedestrians for Advanced Driver-Assistance Systems (ADAS). When the task is safety-critical as in automotive application, the reliability of the task implementation is crucial and has to be evaluated before final deployment. On the other hand, Field Programmable Gate Array (FPGA) devices are gaining increasing attention in the hardware acceleration part for machine learning applications due to their high flexibility and increasing computational power. When the SRAM-based FPGA is considered, Single Event Upset (SEU) in configuration memory induced by radiation particle is one of the major concerns even at sea level. In this paper, we present the fault injection results on a Convolutional Neural Network (CNN) implementation on Xilinx SRAM-based FPGA which demonstrate that though there exists built-in redundancy in CNN implementation one SEU in configuration memory can still impact the task execution results while the possibility of Single Event Multiple Upsets (SEMU) must also be taken into consideration.","PeriodicalId":415648,"journal":{"name":"2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129880402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1