首页 > 最新文献

2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)最新文献

英文 中文
Evaluating Read Disturb Effect on RRAM based AI Accelerator with Multilevel States and Input Voltages 基于多电平状态和输入电压的RRAM AI加速器的读干扰效应评估
J. Wen, Andrea Baroni, E. Pérez, Markus Ulbricht, C. Wenger, M. Krstic
RRAM technology is a promising candidate for implementing efficient AI accelerators with extensive multiply-accumulate operations. By scaling RRAM devices to the synaptic crossbar array, the computations can be realized in situ, avoiding frequent weights transfer between the processing units and memory. Besides, as the computations are conducted in the analog domain with high flexibility, applying multilevel input voltages to the RRAM devices with multilevel conductance states enhances the computational efficiency further. However, several non-idealities existing in emerging RRAM technology may degrade the reliability of the system. In this paper, we measured and investigated the impact of read disturb on RRAM devices with different input voltages, which incurs conductance drifts and introduces errors. The measured data are deployed to simulate the RRAM based AI inference engines with multilevel conductance states and input voltages. Device-to-device variability is also taken into consideration to assess the accuracy drop. Two convolutional neural networks, LeNet-5 and VGG-7, are benchmarked with MNIST and CIFAR-10 datasets, respectively. Our results show that mapping weights with differential pairs yields better robustness to read disturb and variability effects.
RRAM技术是实现具有广泛乘法累加运算的高效人工智能加速器的有前途的候选技术。通过将RRAM器件缩放到突触横杆阵列,可以就地实现计算,避免了处理单元和存储器之间频繁的权重传递。此外,由于计算是在模拟域中进行的,具有很高的灵活性,因此对具有多电平电导状态的RRAM器件施加多电平输入电压进一步提高了计算效率。然而,新兴RRAM技术中存在的一些非理想性可能会降低系统的可靠性。在本文中,我们测量和研究了不同输入电压下读干扰对RRAM器件的影响,它会引起电导漂移和引入误差。将测量数据用于模拟具有多电平电导状态和输入电压的基于RRAM的AI推理引擎。在评估精度下降时,还考虑了设备间的可变性。两个卷积神经网络LeNet-5和VGG-7分别使用MNIST和CIFAR-10数据集进行基准测试。我们的研究结果表明,用差分对映射权重对读取干扰和可变性效应具有更好的鲁棒性。
{"title":"Evaluating Read Disturb Effect on RRAM based AI Accelerator with Multilevel States and Input Voltages","authors":"J. Wen, Andrea Baroni, E. Pérez, Markus Ulbricht, C. Wenger, M. Krstic","doi":"10.1109/DFT56152.2022.9962345","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962345","url":null,"abstract":"RRAM technology is a promising candidate for implementing efficient AI accelerators with extensive multiply-accumulate operations. By scaling RRAM devices to the synaptic crossbar array, the computations can be realized in situ, avoiding frequent weights transfer between the processing units and memory. Besides, as the computations are conducted in the analog domain with high flexibility, applying multilevel input voltages to the RRAM devices with multilevel conductance states enhances the computational efficiency further. However, several non-idealities existing in emerging RRAM technology may degrade the reliability of the system. In this paper, we measured and investigated the impact of read disturb on RRAM devices with different input voltages, which incurs conductance drifts and introduces errors. The measured data are deployed to simulate the RRAM based AI inference engines with multilevel conductance states and input voltages. Device-to-device variability is also taken into consideration to assess the accuracy drop. Two convolutional neural networks, LeNet-5 and VGG-7, are benchmarked with MNIST and CIFAR-10 datasets, respectively. Our results show that mapping weights with differential pairs yields better robustness to read disturb and variability effects.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"181 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123220721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Operational Age Estimation of ICs using Gaussian Process Regression 用高斯过程回归估计集成电路的使用寿命
Anmol Singh Narwariya, Pabitra Das, S. Khursheed, A. Acharyya
Electronic systems life is an essential aspect of ensuring reliability and safety. An accurate age estimation could assimilate, which is helpful for any electronics system. It would also positively impact the minimisation of electronics waste and support the endeavour of green computing. In this paper, we propose a methodology for age estimation using the Gaussian Process Regression (GPR) model. Our methodology requires an RO sensor, temperature sensor, and trained GPR model for the age prediction. The Ring Oscillator (RO) output frequency relies on the trackable path, temperature, voltage and ageing. These dependencies are utilized for the training of the GPR model. We exhibit the output frequency degradation of the ring oscillator through the Synopsys PrimeSim Hspice tool with the 32nm Predictive Technology Model (PTM). We consider variations from 0 °C to 100 °C in temperature and 0. 8V to 1. 05V in the voltage. Our methodology predicts age precisely, showing average prediction accuracy in 85.35% cases with a deviation of one month for 13-stage RO and 90.42% cases in 21-stage RO. Our proposed methodology is more accurate than the state-of-the-art techniques in terms of prediction accuracy as well as age estimation deviation. The prediction accuracy improvement got 9.59% for 13-stage and 9.17% for 21-stage RO on our dataset than the state-of-the-art technique with a month deviation, respectively, as opposed to 2.4 months for the state-of-the-art method.
电子系统的寿命是保证可靠性和安全性的重要方面。准确的年龄估计可以吸收,这对任何电子系统都有帮助。这亦会对减少电子废物及支持环保电脑的努力产生积极影响。在本文中,我们提出了一种使用高斯过程回归(GPR)模型估计年龄的方法。我们的方法需要RO传感器,温度传感器和训练有素的GPR模型进行年龄预测。环形振荡器(RO)输出频率依赖于可跟踪路径、温度、电压和老化。这些依赖关系被用于GPR模型的训练。我们通过Synopsys PrimeSim Hspice工具和32nm预测技术模型(PTM)展示了环形振荡器的输出频率退化。我们考虑从0°C到100°C的温度和0。8V到1。在电压05V。我们的方法准确预测年龄,13期RO的平均预测准确率为85.35%,偏差为1个月,21期RO的平均预测准确率为90.42%。我们提出的方法在预测精度和年龄估计偏差方面比最先进的技术更准确。在我们的数据集上,13阶段和21阶段RO的预测精度分别比最先进的技术提高了9.59%和9.17%,偏差分别为一个月,而最先进的方法的预测精度为2.4个月。
{"title":"Operational Age Estimation of ICs using Gaussian Process Regression","authors":"Anmol Singh Narwariya, Pabitra Das, S. Khursheed, A. Acharyya","doi":"10.1109/DFT56152.2022.9962355","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962355","url":null,"abstract":"Electronic systems life is an essential aspect of ensuring reliability and safety. An accurate age estimation could assimilate, which is helpful for any electronics system. It would also positively impact the minimisation of electronics waste and support the endeavour of green computing. In this paper, we propose a methodology for age estimation using the Gaussian Process Regression (GPR) model. Our methodology requires an RO sensor, temperature sensor, and trained GPR model for the age prediction. The Ring Oscillator (RO) output frequency relies on the trackable path, temperature, voltage and ageing. These dependencies are utilized for the training of the GPR model. We exhibit the output frequency degradation of the ring oscillator through the Synopsys PrimeSim Hspice tool with the 32nm Predictive Technology Model (PTM). We consider variations from 0 °C to 100 °C in temperature and 0. 8V to 1. 05V in the voltage. Our methodology predicts age precisely, showing average prediction accuracy in 85.35% cases with a deviation of one month for 13-stage RO and 90.42% cases in 21-stage RO. Our proposed methodology is more accurate than the state-of-the-art techniques in terms of prediction accuracy as well as age estimation deviation. The prediction accuracy improvement got 9.59% for 13-stage and 9.17% for 21-stage RO on our dataset than the state-of-the-art technique with a month deviation, respectively, as opposed to 2.4 months for the state-of-the-art method.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133958558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Image Degradation due to Interacting Adjacent Hot Pixels 相邻热像素相互作用引起的图像退化
G. Chapman, Klinsmann J. Coelho Silva Meneses, I. Koren, Z. Koren
Hot Pixels are cosmic ray induced digital imaging sensor defects that accumulate as the camera ages at rates that are highly dependent on pixel size. We previously developed an empirical formula projecting hot pixel defect density (defects/year/mm2) growth rates via a power law, with the inverse of the pixel size raised to the power of $sim $3, multiplied by the square root of the ISO (gain). We show in this paper that this increasing defect rate results in a higher probability that two defects will occur within a 5x5 pixel square. Under these conditions, the color demosaicing and JPEG image compression algorithms required for picture creation greatly amplify the impact of these two defective pixels, spreading damage to a 16x16 pixel area and creating significant color changes resulting in a very noticeable image degradation. We develop an analytical generalized birthday problem formula in order to estimate the number of hot pixels needed to achieve a given probability of having two defective pixels within a 5x5 square. For a 20 Mpixel DSLR camera, only 128 hot pixels generate a 4.5% probability of such interacting defective pixels, or 1 in 22 cameras. For 4 micron pixels this would occur in 1.4 years at ISO 6400, and in 3.2 years at ISO 3200.
热像素是宇宙射线引起的数字成像传感器缺陷,随着相机老化,这些缺陷以高度依赖于像素大小的速率累积。我们之前开发了一个经验公式,通过幂律预测热像素缺陷密度(缺陷/年/mm2)增长率,像素尺寸的倒数提高到$sim $3的幂,乘以ISO(增益)的平方根。我们在论文中表明,这种增加的缺陷率导致在一个5x5像素的正方形内出现两个缺陷的可能性更高。在这种情况下,图像创建所需的颜色去马赛克和JPEG图像压缩算法极大地放大了这两个缺陷像素的影响,将损坏扩散到16x16像素区域,并产生显著的颜色变化,导致非常明显的图像退化。我们开发了一个解析广义生日问题公式,以估计在5x5正方形内具有两个缺陷像素的给定概率所需的热像素的数量。对于一台2000万像素的数码单反相机,只有128个热像素产生4.5%的相互作用缺陷像素的概率,即22个相机中有1个。对于4微米像素,在ISO 6400下需要1.4年,在ISO 3200下需要3.2年。
{"title":"Image Degradation due to Interacting Adjacent Hot Pixels","authors":"G. Chapman, Klinsmann J. Coelho Silva Meneses, I. Koren, Z. Koren","doi":"10.1109/DFT56152.2022.9962368","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962368","url":null,"abstract":"Hot Pixels are cosmic ray induced digital imaging sensor defects that accumulate as the camera ages at rates that are highly dependent on pixel size. We previously developed an empirical formula projecting hot pixel defect density (defects/year/mm2) growth rates via a power law, with the inverse of the pixel size raised to the power of $sim $3, multiplied by the square root of the ISO (gain). We show in this paper that this increasing defect rate results in a higher probability that two defects will occur within a 5x5 pixel square. Under these conditions, the color demosaicing and JPEG image compression algorithms required for picture creation greatly amplify the impact of these two defective pixels, spreading damage to a 16x16 pixel area and creating significant color changes resulting in a very noticeable image degradation. We develop an analytical generalized birthday problem formula in order to estimate the number of hot pixels needed to achieve a given probability of having two defective pixels within a 5x5 square. For a 20 Mpixel DSLR camera, only 128 hot pixels generate a 4.5% probability of such interacting defective pixels, or 1 in 22 cameras. For 4 micron pixels this would occur in 1.4 years at ISO 6400, and in 3.2 years at ISO 3200.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114985663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improving DNN Fault Tolerance in Semantic Segmentation Applications 改进语义分割应用中的深度神经网络容错性
Stéphane Burel, A. Evans, L. Anghel
Semantic segmentation of images is essential for autonomous driving and modern DNNs now achieve high accuracy. Automotive systems must comply with safety standards, requiring hardware fault detection. We present an analysis of the effect of faults using Google’s DeepLabV3+ network processing an industrial data-set. A new symptom-based fault detection algorithm is shown to detect >99% of critical faults with zero false positives and a compute overhead of 0.2%. Further, these faults can be masked, virtually eliminating all critical errors. To the authors’ knowledge this is the first fault tolerance study of a DNN semantic segmentation application.
图像的语义分割对于自动驾驶至关重要,现代深度神经网络已经达到了很高的精度。汽车系统必须符合安全标准,需要硬件故障检测。我们使用谷歌的DeepLabV3+网络处理工业数据集,对故障的影响进行了分析。提出了一种新的基于症状的故障检测算法,该算法可以检测出>99%的关键故障,并且没有误报,计算开销仅为0.2%。此外,这些错误可以被掩盖,实际上消除了所有关键错误。据作者所知,这是DNN语义分割应用程序的第一个容错研究。
{"title":"Improving DNN Fault Tolerance in Semantic Segmentation Applications","authors":"Stéphane Burel, A. Evans, L. Anghel","doi":"10.1109/DFT56152.2022.9962354","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962354","url":null,"abstract":"Semantic segmentation of images is essential for autonomous driving and modern DNNs now achieve high accuracy. Automotive systems must comply with safety standards, requiring hardware fault detection. We present an analysis of the effect of faults using Google’s DeepLabV3+ network processing an industrial data-set. A new symptom-based fault detection algorithm is shown to detect >99% of critical faults with zero false positives and a compute overhead of 0.2%. Further, these faults can be masked, virtually eliminating all critical errors. To the authors’ knowledge this is the first fault tolerance study of a DNN semantic segmentation application.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125363054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
CRLock: A SAT and FALL Attacks Resistant Logic Locking Method at Register Transfer Level CRLock:一种在寄存器传输级抗SAT和FALL攻击的逻辑锁定方法
Masayoshi Yoshimura, Atsuya Tsujikawa, Hiroshi Yamazaki, Toshinori Hosokawa
In recent years, to meet strict time-to-market constraints, it has become difficult for only one semiconductor design company to design a VLSI. Thus, design companies purchase IP cores from third-party IP vendors and design only the necessary parts. On the other hand, since IP cores have the disadvantage that copyright infringement can be easily performed, logic locking has to be applied to them. Functional logic locking methods using TTLock are resilient to SAT attack, however vulnerable to FALL attacks. Additionally, it is difficult to design logic locking based on TTLock at gate level. In this paper, we propose a logic locking method based on SAT attack and FALL attack resistance at register transfer level.
近年来,为了满足严格的上市时间限制,只有一家半导体设计公司设计超大规模集成电路变得越来越困难。因此,设计公司从第三方IP供应商那里购买IP核,只设计必要的部分。另一方面,由于IP核具有容易侵犯版权的缺点,因此必须对其应用逻辑锁定。使用TTLock的功能逻辑锁定方法对SAT攻击具有弹性,但容易受到FALL攻击。此外,基于TTLock的逻辑锁设计在门级是困难的。在本文中,我们提出了一种基于在寄存器传输级抵抗SAT攻击和FALL攻击的逻辑锁定方法。
{"title":"CRLock: A SAT and FALL Attacks Resistant Logic Locking Method at Register Transfer Level","authors":"Masayoshi Yoshimura, Atsuya Tsujikawa, Hiroshi Yamazaki, Toshinori Hosokawa","doi":"10.1109/DFT56152.2022.9962349","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962349","url":null,"abstract":"In recent years, to meet strict time-to-market constraints, it has become difficult for only one semiconductor design company to design a VLSI. Thus, design companies purchase IP cores from third-party IP vendors and design only the necessary parts. On the other hand, since IP cores have the disadvantage that copyright infringement can be easily performed, logic locking has to be applied to them. Functional logic locking methods using TTLock are resilient to SAT attack, however vulnerable to FALL attacks. Additionally, it is difficult to design logic locking based on TTLock at gate level. In this paper, we propose a logic locking method based on SAT attack and FALL attack resistance at register transfer level.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131012819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study and Comparison of QDI Pipeline Components’ Sensitivity to Permanent Faults QDI管道元件对永久性故障敏感性的研究与比较
Raghda El Shehaby, A. Steininger
In the presence of permanent faults, QDI circuits exhibit the beneficial property of halting their operation until a repair procedure has been conducted. The state in which the circuit resides, however, does not always remain clean, i.e., a recovery process might be needed. This depends on how the circuit reacts in these situations. In this study, we investigate the effect a permanent fault has on the different components of the pipeline, the logic function unit and the butter. Our aim is to identify the weaknesses of each component and try to enhance each one accordingly. We perform extensive fault-injection simulations on different circuits following the famous 4-phase communication protocol, while varying the logic function and butter style for comparison. Our results show that the logic function does not affect the resilience of a specific butter type, and hence we can deduce which butter should perform better for a specific application based on parameters we extract from our experiments. On a parallel note, the implementation style of the logic also has an impact on the block’s ability to hold out against faults. We investigate two of these styles.
在存在永久性故障时,QDI电路表现出停止其操作直到进行修复程序的有益特性。然而,电路所处的状态并不总是保持干净,也就是说,可能需要一个恢复过程。这取决于电路在这些情况下的反应。在本研究中,我们研究了永久故障对管道、逻辑功能单元和黄油的不同组成部分的影响。我们的目标是找出每个组件的弱点,并尝试相应地增强每个组件。我们根据著名的四相通信协议在不同的电路上进行了广泛的故障注入模拟,同时改变了逻辑功能和黄油风格进行比较。我们的结果表明,逻辑函数不会影响特定黄油类型的弹性,因此我们可以根据从实验中提取的参数推断出哪种黄油在特定应用中表现更好。同时,逻辑的实现风格也会影响块抵御故障的能力。我们研究其中的两种风格。
{"title":"Study and Comparison of QDI Pipeline Components’ Sensitivity to Permanent Faults","authors":"Raghda El Shehaby, A. Steininger","doi":"10.1109/DFT56152.2022.9962353","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962353","url":null,"abstract":"In the presence of permanent faults, QDI circuits exhibit the beneficial property of halting their operation until a repair procedure has been conducted. The state in which the circuit resides, however, does not always remain clean, i.e., a recovery process might be needed. This depends on how the circuit reacts in these situations. In this study, we investigate the effect a permanent fault has on the different components of the pipeline, the logic function unit and the butter. Our aim is to identify the weaknesses of each component and try to enhance each one accordingly. We perform extensive fault-injection simulations on different circuits following the famous 4-phase communication protocol, while varying the logic function and butter style for comparison. Our results show that the logic function does not affect the resilience of a specific butter type, and hence we can deduce which butter should perform better for a specific application based on parameters we extract from our experiments. On a parallel note, the implementation style of the logic also has an impact on the block’s ability to hold out against faults. We investigate two of these styles.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128455326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of the Effects of SEUs on Configuration Memories in FPGA Implemented QC-LDPC Decoders FPGA实现的QC-LDPC解码器中seu对组态存储器影响的评估
Zhen Gao, Ying-Sheng Cheng, P. Reviriego
LDPC codes are widely used in wireless communication systems for reliable data transmission due to their excellent error correction capabilities. SRAM-FPGAs are a popular option for the implementation of LDPC decoders due to their excellent computing capabilities and re-configurability. However, when applied in critical environments, e.g. space platforms, the SRAM-FPGA based LDPC decoders will suffer single-event upsets (SEUs) that can cause failures and disrupt communications. Therefore, analyzing the reliability of LDPC decoders to SEUs on the FPGA is important. This paper first analyzes the effects of SEUs on different parts of the FPGA implemented LDPC decoder based on the module functions, including the influence of the parallelism on the decoder reliability. Then fault injection experiments are performed to validate the conclusions of the analysis. Experiment results show that about 98% of SEUs on the configuration memories can be tolerated by the decoder itself, and the modules with more interconnections are less robust to SEUs. In addition, the reliability of LDPC decoders decreases for lower levels of parallelism due to the larger computation load of each unit. These results will be a valuable input to design efficient SEU protection schemes for LDPC decoders.
LDPC码由于具有良好的纠错能力,被广泛应用于无线通信系统中,以保证数据的可靠传输。sram - fpga由于其出色的计算能力和可重构性,是实现LDPC解码器的热门选择。然而,当应用于关键环境时,例如空间平台,基于SRAM-FPGA的LDPC解码器将遭受单事件干扰(seu),可能导致故障和中断通信。因此,分析LDPC解码器对FPGA上seu的可靠性是非常重要的。本文首先根据模块功能分析了seu对FPGA实现的LDPC译码器不同部分的影响,包括并行度对译码器可靠性的影响。通过故障注入实验验证了分析结论。实验结果表明,解码器本身可以容忍配置存储器上约98%的seu,并且互连较多的模块对seu的鲁棒性较差。此外,由于每个单元的计算负荷较大,LDPC解码器的可靠性随着并行度的降低而降低。这些结果将为LDPC解码器设计高效的SEU保护方案提供有价值的输入。
{"title":"Evaluation of the Effects of SEUs on Configuration Memories in FPGA Implemented QC-LDPC Decoders","authors":"Zhen Gao, Ying-Sheng Cheng, P. Reviriego","doi":"10.1109/DFT56152.2022.9962337","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962337","url":null,"abstract":"LDPC codes are widely used in wireless communication systems for reliable data transmission due to their excellent error correction capabilities. SRAM-FPGAs are a popular option for the implementation of LDPC decoders due to their excellent computing capabilities and re-configurability. However, when applied in critical environments, e.g. space platforms, the SRAM-FPGA based LDPC decoders will suffer single-event upsets (SEUs) that can cause failures and disrupt communications. Therefore, analyzing the reliability of LDPC decoders to SEUs on the FPGA is important. This paper first analyzes the effects of SEUs on different parts of the FPGA implemented LDPC decoder based on the module functions, including the influence of the parallelism on the decoder reliability. Then fault injection experiments are performed to validate the conclusions of the analysis. Experiment results show that about 98% of SEUs on the configuration memories can be tolerated by the decoder itself, and the modules with more interconnections are less robust to SEUs. In addition, the reliability of LDPC decoders decreases for lower levels of parallelism due to the larger computation load of each unit. These results will be a valuable input to design efficient SEU protection schemes for LDPC decoders.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"417 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122850858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Preventing Soft Errors and Hardware Trojans in RISC-V Cores 防止RISC-V内核的软错误和硬件木马
Edian B. Annink, G. Rauwerda, E. Hakkennes, A. Menicucci, Stefano Di Mascio, G. Furano, M. Ottavi
Soft errors in embedded systems’ memories like single-event upsets and multiple-bit upsets lead to data and instruction corruption. Therefore, devices deployed in harsh environments, such as space, use fault-tolerant processors or redundancy methods to ensure critical application dependability. Another rising concern in secure, critical space applications is the possible introduction of hardware Trojans in an untrusted phase of the manufacturing process. Besides environmental side-effects, an adversary that has injected a malicious mechanism e.g., in the processor or memory can trigger unwanted behavior or leak sensitive information. Techniques to prevent or mitigate hardware Trojans are important to ensure hardware security. Leveraging the openness of the RISC-V ISA, this paper introduces a novel solution to improve the security and dependability of softcores with a low area and latency overhead. The instruction validator which is the first part of this solution can effectively detect hardware Trojans and multiple-bit upsets in the instruction memory by checking instruction/address pairs using a Bloom filter probabilistic data structure. The second part of the solution is the proposal of an error correction code instruction memory using Hamming single-error correction to detect and correct single-event upsets. It has also been proven that the Hamming decoder improves the detection performance of the instruction validator.
嵌入式系统内存中的软错误,如单事件干扰和多位干扰,会导致数据和指令损坏。因此,部署在恶劣环境(如空间)中的设备需要使用容错处理器或冗余方法来确保关键应用程序的可靠性。在安全、关键的空间应用中,另一个日益引起关注的问题是,在制造过程的不可信阶段可能引入硬件木马程序。除了环境副作用之外,攻击者在处理器或内存中注入了恶意机制,可以触发不想要的行为或泄露敏感信息。防止或减轻硬件木马的技术对于确保硬件安全非常重要。利用RISC-V ISA的开放性,本文提出了一种新的解决方案,以低面积和低延迟开销来提高软核的安全性和可靠性。指令验证器是该解决方案的第一部分,通过使用Bloom过滤器概率数据结构检查指令/地址对,可以有效地检测硬件木马和指令存储器中的多位异常。解决方案的第二部分是提出了一种使用汉明单错误校正来检测和纠正单事件干扰的纠错码指令存储器。实验还证明,汉明解码器提高了指令验证器的检测性能。
{"title":"Preventing Soft Errors and Hardware Trojans in RISC-V Cores","authors":"Edian B. Annink, G. Rauwerda, E. Hakkennes, A. Menicucci, Stefano Di Mascio, G. Furano, M. Ottavi","doi":"10.1109/DFT56152.2022.9962340","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962340","url":null,"abstract":"Soft errors in embedded systems’ memories like single-event upsets and multiple-bit upsets lead to data and instruction corruption. Therefore, devices deployed in harsh environments, such as space, use fault-tolerant processors or redundancy methods to ensure critical application dependability. Another rising concern in secure, critical space applications is the possible introduction of hardware Trojans in an untrusted phase of the manufacturing process. Besides environmental side-effects, an adversary that has injected a malicious mechanism e.g., in the processor or memory can trigger unwanted behavior or leak sensitive information. Techniques to prevent or mitigate hardware Trojans are important to ensure hardware security. Leveraging the openness of the RISC-V ISA, this paper introduces a novel solution to improve the security and dependability of softcores with a low area and latency overhead. The instruction validator which is the first part of this solution can effectively detect hardware Trojans and multiple-bit upsets in the instruction memory by checking instruction/address pairs using a Bloom filter probabilistic data structure. The second part of the solution is the proposal of an error correction code instruction memory using Hamming single-error correction to detect and correct single-event upsets. It has also been proven that the Hamming decoder improves the detection performance of the instruction validator.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115793038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Cross-Correlation Approach to Detecting Issue Test Sites in Massive Parallel Testing 大规模并行测试中问题测试点的互相关检测方法
Praise O. Farayola, Isaac Bruce, Shravan K. Chaganti, Abalhassan Sheikh, S. Ravi, Degang Chen
Parallel (multi-site) testing has become one of the semiconductor industry’s standards for testing chips. The method tests multiple chips in parallel, increasing throughput and cutting test time and costs. In digital IC testing, the number of sites has currently reached the level of thousands. However, the site count is still significantly lower when accurate analog testing is required. Due to the increased complexity involved in managing multiple test sites, variations are now being observed in the measurements from site to site for analog and mixed-signal testing. Some test sites’ measurements no longer reflect the true performance of the device under test (DUT) and can lead to yield loss and possible test escapes. As it is a recent issue, very little work has been done on robust and accurate detection of issue sites. We project that it will be an important issue in the future, especially as the number of test sites in multi-site analog testing continues to increase. A method capable of effectively detecting test sites exhibiting pronounced multi-site variation is presented. The proposed method utilizes the cross-correlation similarity between the distribution of each test site and a reference distribution to detect issue sites. Boundary conditions are derived for the method using a significance level, and a site is considered an issue site if the proposed method scores for that site fall outside the derived boundary for each measured specification. The method is applied to real test data from the industry. The presented results demonstrate the effectiveness of the approach.
并行(多点)测试已成为半导体行业测试芯片的标准之一。该方法并行测试多个芯片,提高了吞吐量,减少了测试时间和成本。在数字集成电路测试中,站点数量目前已达到数千个的水平。然而,当需要精确的模拟测试时,站点计数仍然显着降低。由于管理多个测试站点的复杂性增加,现在在模拟和混合信号测试的不同站点的测量中观察到变化。一些测试点的测量不再反映被测设备(DUT)的真实性能,并可能导致产量损失和可能的测试逃逸。由于这是一个最近的问题,因此在健壮和准确地检测问题地点方面所做的工作很少。我们预计这将是未来的一个重要问题,特别是随着多站点模拟测试的测试站点数量不断增加。提出了一种能够有效检测具有明显多位点变异的试验点的方法。该方法利用各试验点分布与参考分布之间的互相关相似性来检测问题点。使用显著性水平为该方法导出边界条件,如果该站点的建议方法得分落在每个测量规范的导出边界之外,则将该站点视为问题站点。将该方法应用于实际工业试验数据。给出的结果证明了该方法的有效性。
{"title":"Cross-Correlation Approach to Detecting Issue Test Sites in Massive Parallel Testing","authors":"Praise O. Farayola, Isaac Bruce, Shravan K. Chaganti, Abalhassan Sheikh, S. Ravi, Degang Chen","doi":"10.1109/DFT56152.2022.9962367","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962367","url":null,"abstract":"Parallel (multi-site) testing has become one of the semiconductor industry’s standards for testing chips. The method tests multiple chips in parallel, increasing throughput and cutting test time and costs. In digital IC testing, the number of sites has currently reached the level of thousands. However, the site count is still significantly lower when accurate analog testing is required. Due to the increased complexity involved in managing multiple test sites, variations are now being observed in the measurements from site to site for analog and mixed-signal testing. Some test sites’ measurements no longer reflect the true performance of the device under test (DUT) and can lead to yield loss and possible test escapes. As it is a recent issue, very little work has been done on robust and accurate detection of issue sites. We project that it will be an important issue in the future, especially as the number of test sites in multi-site analog testing continues to increase. A method capable of effectively detecting test sites exhibiting pronounced multi-site variation is presented. The proposed method utilizes the cross-correlation similarity between the distribution of each test site and a reference distribution to detect issue sites. Boundary conditions are derived for the method using a significance level, and a site is considered an issue site if the proposed method scores for that site fall outside the derived boundary for each measured specification. The method is applied to real test data from the industry. The presented results demonstrate the effectiveness of the approach.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120945192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Polarity-Driven Radiation-Hardened Latch design for Single Event Upset Tolerance 一个极性驱动的辐射硬化闩锁设计,用于单事件干扰公差
Shanshan Liu, Jing Guo, Xiaochen Tang, P. Reviriego, Fabrizio Lombardi
The ability of tolerating a radiation-induced single event upset (SEU) is required for nanoscale latches in most dependable applications. This is becoming a strict requirement, because an SEU in a latch node may corrupt its outcome and then, possibly cause a system failure. Moreover, the impact of an SEU further deteriorates for latch designs at reduced CMOS nano-scaled technology because it can result in double node upset (DNU) in addition to single node upset (SNU). Existing approaches of designing a radiation-hardened latch do not achieve complete SNU/DNU tolerance at low hardware overhead. The goal of this paper is to propose a high-performance latch design for SEU tolerance. By exploiting the polarity of the upset in different types of transistors, the proposed design has a small number of sensitive nodes, so incurring in a low protection overhead. Moreover, due to its configuration, the proposed design achieves SEU tolerance at circuit-level without requiring additional layout protection. These advantages make the proposed design superior to all existing hardened latches found in the technical literature; simulation results using 65 nm CMOS technology show that the proposed design achieves a reduction in the range of 14.53% to 98.76% in hardware overhead while providing a complete SNU/DNU recovery.
在大多数可靠的应用中,容忍辐射引起的单事件扰动(SEU)的能力是纳米级锁存器所必需的。这正在成为一个严格的要求,因为锁存节点中的SEU可能会破坏其结果,然后可能导致系统故障。此外,在减小CMOS纳米级技术的锁存器设计中,SEU的影响进一步恶化,因为它可能导致单节点干扰(SNU)之外的双节点干扰(DNU)。现有的设计抗辐射锁存器的方法不能在低硬件开销下实现完全的SNU/DNU容忍度。本文的目标是提出一种高性能的锁存器设计,用于SEU容限。通过利用不同类型晶体管中扰流的极性,所提出的设计具有少量的敏感节点,因此产生较低的保护开销。此外,由于其配置,所提出的设计在电路级实现了SEU公差,而无需额外的布局保护。这些优点使得所提出的设计优于技术文献中发现的所有现有硬化闩锁;使用65纳米CMOS技术的仿真结果表明,该设计在提供完整的SNU/DNU恢复的同时,实现了14.53%至98.76%的硬件开销降低。
{"title":"A Polarity-Driven Radiation-Hardened Latch design for Single Event Upset Tolerance","authors":"Shanshan Liu, Jing Guo, Xiaochen Tang, P. Reviriego, Fabrizio Lombardi","doi":"10.1109/DFT56152.2022.9962346","DOIUrl":"https://doi.org/10.1109/DFT56152.2022.9962346","url":null,"abstract":"The ability of tolerating a radiation-induced single event upset (SEU) is required for nanoscale latches in most dependable applications. This is becoming a strict requirement, because an SEU in a latch node may corrupt its outcome and then, possibly cause a system failure. Moreover, the impact of an SEU further deteriorates for latch designs at reduced CMOS nano-scaled technology because it can result in double node upset (DNU) in addition to single node upset (SNU). Existing approaches of designing a radiation-hardened latch do not achieve complete SNU/DNU tolerance at low hardware overhead. The goal of this paper is to propose a high-performance latch design for SEU tolerance. By exploiting the polarity of the upset in different types of transistors, the proposed design has a small number of sensitive nodes, so incurring in a low protection overhead. Moreover, due to its configuration, the proposed design achieves SEU tolerance at circuit-level without requiring additional layout protection. These advantages make the proposed design superior to all existing hardened latches found in the technical literature; simulation results using 65 nm CMOS technology show that the proposed design achieves a reduction in the range of 14.53% to 98.76% in hardware overhead while providing a complete SNU/DNU recovery.","PeriodicalId":411011,"journal":{"name":"2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133362728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1