首页 > 最新文献

2023 IEEE European Test Symposium (ETS)最新文献

英文 中文
Evaluating the Prevalence of SFUs in the Reliability of GPUs 评估gpu可靠性中sfu的患病率
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174110
J. E. R. Condia, Juan-David Guerrero-Balaguera, Edward Javier Patiño Nuñez, Robert Limas Sierra, M. Reorda
1 Currently, Graphics Processing Units (GPUs) are extensively used in several safety-critical domains to support the implementation of complex operations where reliability is a major concern. Some internal cores, such as Special Function Units (SFUs), are increasingly adopted, being crucial to achieving the necessary performance in multimedia, scientific computing, and neural network training. Unfortunately, these cores are highly unexplored in terms of their impact on reliability.In this work, we evaluate the incidence of SFUs on the reliability of GPUs when affected by soft errors. First, we analyze the impact of SFU cores on the GPU’s reliability and the running workloads. We resort to applications configured to use or not the SFU cores and evaluate the effect of soft errors by using a software-based fault injection environment (NVBITFI) in an NVIDIA Ampere GPU. Then, we focus on evaluating the impact of soft errors arising in the SFUs. A fine-grain RTL evaluation determines the soft error effects on two SFUs architectures for GPUs (’fused’ and ’modular’). The experiments use an open-source GPU (FlexGripPlus) instrumented with both SFU architectures. The results suggest that workloads using SFUs are more vulnerable to faults (from 1 up to 5 orders of magnitude for the analyzed applications). Moreover, the RTL results show that modular SFUs are less vulnerable to faults (in up to 47% for the analyzed workloads) in comparison with fused SFUs (base of commercial devices), so allowing us to identify the more robust SFU architecture.
目前,图形处理单元(gpu)广泛应用于几个安全关键领域,以支持复杂操作的实现,其中可靠性是一个主要关注的问题。一些内部内核,如特殊功能单元(sfu),被越来越多地采用,对于实现多媒体、科学计算和神经网络训练所需的性能至关重要。不幸的是,就其对可靠性的影响而言,这些核心还没有得到充分的研究。在这项工作中,我们评估了sfu在受到软错误影响时对gpu可靠性的影响。首先,我们分析了SFU内核对GPU可靠性和运行工作负载的影响。我们求助于配置为使用或不使用SFU内核的应用程序,并通过在NVIDIA安培GPU中使用基于软件的故障注入环境(NVBITFI)来评估软错误的影响。然后,我们重点评估了sfu中出现的软错误的影响。细粒度RTL评估确定了gpu的两种sfu架构(“融合”和“模块化”)的软误差影响。实验使用了一个开源GPU (FlexGripPlus),该GPU配备了两种SFU架构。结果表明,使用sfu的工作负载更容易受到故障的影响(对于所分析的应用程序,从1到5个数量级)。此外,RTL结果表明,与融合的SFU(商业设备的基础)相比,模块化SFU更不容易受到故障的影响(在分析的工作负载中高达47%),因此允许我们识别更健壮的SFU架构。
{"title":"Evaluating the Prevalence of SFUs in the Reliability of GPUs","authors":"J. E. R. Condia, Juan-David Guerrero-Balaguera, Edward Javier Patiño Nuñez, Robert Limas Sierra, M. Reorda","doi":"10.1109/ETS56758.2023.10174110","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174110","url":null,"abstract":"1 Currently, Graphics Processing Units (GPUs) are extensively used in several safety-critical domains to support the implementation of complex operations where reliability is a major concern. Some internal cores, such as Special Function Units (SFUs), are increasingly adopted, being crucial to achieving the necessary performance in multimedia, scientific computing, and neural network training. Unfortunately, these cores are highly unexplored in terms of their impact on reliability.In this work, we evaluate the incidence of SFUs on the reliability of GPUs when affected by soft errors. First, we analyze the impact of SFU cores on the GPU’s reliability and the running workloads. We resort to applications configured to use or not the SFU cores and evaluate the effect of soft errors by using a software-based fault injection environment (NVBITFI) in an NVIDIA Ampere GPU. Then, we focus on evaluating the impact of soft errors arising in the SFUs. A fine-grain RTL evaluation determines the soft error effects on two SFUs architectures for GPUs (’fused’ and ’modular’). The experiments use an open-source GPU (FlexGripPlus) instrumented with both SFU architectures. The results suggest that workloads using SFUs are more vulnerable to faults (from 1 up to 5 orders of magnitude for the analyzed applications). Moreover, the RTL results show that modular SFUs are less vulnerable to faults (in up to 47% for the analyzed workloads) in comparison with fused SFUs (base of commercial devices), so allowing us to identify the more robust SFU architecture.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121349018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ETS 2023 Foreword
Pub Date : 2023-05-22 DOI: 10.1109/ets56758.2023.10174207
{"title":"ETS 2023 Foreword","authors":"","doi":"10.1109/ets56758.2023.10174207","DOIUrl":"https://doi.org/10.1109/ets56758.2023.10174207","url":null,"abstract":"","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128782051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximate Communication: Balancing Performance, Power, Reliability, and Safety 近似通信:平衡性能、功率、可靠性和安全性
Pub Date : 2023-05-22 DOI: 10.1109/ets56758.2023.10174057
Abdalrhman Badran, Somayeh Sadeghi Kohan, Jan Dennis Reimer, S. Hellebrand
Interconnect is essential for the performance, power consumption, and safety of modern digital circuits. In the context of approximate computing, various methods have been proposed to improve the system performance by decreasing the amount of transmitted data or reducing power consumption through reduced switching activity on the interconnects. However, their impact on the reliability and safety of interconnect has not yet been evaluated. In this work, the effects of approximate communication on the reliability and safety of interconnects in digital circuits are assessed. The results show that while these methods increase performance, they can also harm the reliability and mission time of interconnects. We propose some modifications to address the safety and reliability issues when using approximate communication. Our results show that these modifications can increase the mission time, and establish a proper balance between performance, power consumption, and safety.
互连对于现代数字电路的性能、功耗和安全性至关重要。在近似计算的背景下,已经提出了各种方法来通过减少传输数据量或通过减少互连上的切换活动来降低功耗来提高系统性能。然而,它们对互连的可靠性和安全性的影响尚未得到评估。在这项工作中,评估了近似通信对数字电路互连可靠性和安全性的影响。结果表明,这些方法在提高互连性能的同时,也会损害互连的可靠性和任务时间。我们提出了一些修改,以解决使用近似通信时的安全性和可靠性问题。我们的研究结果表明,这些改进可以增加任务时间,并在性能,功耗和安全性之间建立适当的平衡。
{"title":"Approximate Communication: Balancing Performance, Power, Reliability, and Safety","authors":"Abdalrhman Badran, Somayeh Sadeghi Kohan, Jan Dennis Reimer, S. Hellebrand","doi":"10.1109/ets56758.2023.10174057","DOIUrl":"https://doi.org/10.1109/ets56758.2023.10174057","url":null,"abstract":"Interconnect is essential for the performance, power consumption, and safety of modern digital circuits. In the context of approximate computing, various methods have been proposed to improve the system performance by decreasing the amount of transmitted data or reducing power consumption through reduced switching activity on the interconnects. However, their impact on the reliability and safety of interconnect has not yet been evaluated. In this work, the effects of approximate communication on the reliability and safety of interconnects in digital circuits are assessed. The results show that while these methods increase performance, they can also harm the reliability and mission time of interconnects. We propose some modifications to address the safety and reliability issues when using approximate communication. Our results show that these modifications can increase the mission time, and establish a proper balance between performance, power consumption, and safety.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129418622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Electrical Behavior of Core Interconnects for System-Level Crosstalk Prediction 用于系统级串扰预测的核心互连电行为学习
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10173987
Katayoon Basharkhah, Raheleh Sadat Mirhashemi, Nooshin Nosrati, Mohammad-Javad Zare, Z. Navabi
Efficient distribution of tasks in an SOC between various components of an embedded system affect rate of data exchange between cores and obviously the number and fanout of interconnecting cores. Data rate and interconnect fanouts depend on post-layout wire characteristics that, in the worst-case situation, must be evaluated for abovementioned system level decisions. In this work we are making provisions for avoiding this large gap between high-level decision making and low-level physical properties. IP-core interconnects can be fully characterized by post layout information of the IP-core, load properties, and the number of destination cores they are driving. This information can be back-annotated into abstract system-level interconnect models to be used by core integrators for design space exploration (DSE). Fanout and/or frequency of operation of an IP-core can be decided by this DSE environment. In this work, we propose a machine-learning based methodology that uses signoff parasitic information and the actual wire data to generate the dataset and train a model. The model was evaluated in fast high-level SystemC environment for two RISC-V based processors in two SoCs. The models were 26 times faster than the low-level simulations with a crosstalk fault coverage of 1.5% error.
在嵌入式系统的SOC中,任务在各个组件之间的有效分配会影响内核之间的数据交换速率,并明显影响互连内核的数量和扇出。数据速率和互连扇出取决于布线后的电线特性,在最坏的情况下,必须对上述系统级决策进行评估。在这项工作中,我们正在为避免高层决策和低层物理性质之间的巨大差距做准备。ip核互连可以通过ip核的后布局信息、负载属性和它们正在驱动的目标核的数量来充分表征。这些信息可以反向注释到抽象的系统级互连模型中,供核心集成商用于设计空间探索(DSE)。扇出和/或ip核的操作频率可以由这个DSE环境决定。在这项工作中,我们提出了一种基于机器学习的方法,该方法使用签名寄生信息和实际线数据来生成数据集并训练模型。该模型在两个soc的两个RISC-V处理器的快速高级SystemC环境中进行了评估。模型速度是低阶模拟的26倍,串扰故障覆盖率为1.5%。
{"title":"Learning Electrical Behavior of Core Interconnects for System-Level Crosstalk Prediction","authors":"Katayoon Basharkhah, Raheleh Sadat Mirhashemi, Nooshin Nosrati, Mohammad-Javad Zare, Z. Navabi","doi":"10.1109/ETS56758.2023.10173987","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10173987","url":null,"abstract":"Efficient distribution of tasks in an SOC between various components of an embedded system affect rate of data exchange between cores and obviously the number and fanout of interconnecting cores. Data rate and interconnect fanouts depend on post-layout wire characteristics that, in the worst-case situation, must be evaluated for abovementioned system level decisions. In this work we are making provisions for avoiding this large gap between high-level decision making and low-level physical properties. IP-core interconnects can be fully characterized by post layout information of the IP-core, load properties, and the number of destination cores they are driving. This information can be back-annotated into abstract system-level interconnect models to be used by core integrators for design space exploration (DSE). Fanout and/or frequency of operation of an IP-core can be decided by this DSE environment. In this work, we propose a machine-learning based methodology that uses signoff parasitic information and the actual wire data to generate the dataset and train a model. The model was evaluated in fast high-level SystemC environment for two RISC-V based processors in two SoCs. The models were 26 times faster than the low-level simulations with a crosstalk fault coverage of 1.5% error.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130787792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Counterfeit Detection by Semiconductor Process Technology Inspection 半导体制程技术检测的防伪方法
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174131
Matthias Ludwig, A. Bette, Bernhard Lippmann, G. Sigl
With world-wide distributed semiconductor supply chains and a scarcity of microelectronic products, counterfeit devices are gaining momentum. Sourcing products from trusted providers are the theoretical remedy, yet practice shows the reality. Forged electronics are entering the supply chain at a high rate and pose a threat to safety, reliability, and security. Academia and industry have established various pre- or post-production measures to effectively address this issue partially. Yet, several inadequately covered aspects of the field require improvements. First, this work introduces a rating scheme to enable the effective comparison between anti-counterfeiting methods. Recently published methods are compared using this scheme. Second, a novel, generic, generally applicable prover-verifier attestation framework for post-production anti-counterfeiting methods is established. Third, the work implements a new anti-counterfeit method. By introducing technological individual features, the method incorporates technology intrinsic features of the front-end semiconductor manufacturing process as technology distinctive characteristic. Profile parameters are extracted through pattern recognition and statistical methods which are compared to the expected technologies through distance metrics, allowing an assertion of device authenticity. Finally, the versatility of the method is experimentally validated through real samples. Overall, an accuracy of 100% is reported for seven samples which are checked for authenticity.
随着全球半导体供应链的分布和微电子产品的稀缺,假冒设备的势头正在增强。从可信的供应商那里采购产品是理论上的补救措施,但实践表明现实。伪造电子产品正以很高的速度进入供应链,并对安全性、可靠性和安全性构成威胁。学术界和工业界已经制定了各种制作前或制作后的措施来有效地解决这一问题。然而,该领域的几个未充分涵盖的方面需要改进。首先,本文引入了一种评级方案,使防伪方法之间能够进行有效的比较。使用该方案对最近发表的方法进行了比较。其次,建立了一种新的、通用的、普遍适用的证明者-验证者认证框架,用于后期防伪方法。第三,本工作实现了一种新的防伪方法。该方法通过引入技术个体特征,将半导体前端制造工艺的技术内在特征作为技术显著特征。通过模式识别和统计方法提取配置文件参数,通过距离度量将其与预期技术进行比较,从而可以断言设备的真实性。最后,通过实际样品验证了该方法的通用性。总体而言,对7个样品进行真实性检查,报告的准确性为100%。
{"title":"Counterfeit Detection by Semiconductor Process Technology Inspection","authors":"Matthias Ludwig, A. Bette, Bernhard Lippmann, G. Sigl","doi":"10.1109/ETS56758.2023.10174131","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174131","url":null,"abstract":"With world-wide distributed semiconductor supply chains and a scarcity of microelectronic products, counterfeit devices are gaining momentum. Sourcing products from trusted providers are the theoretical remedy, yet practice shows the reality. Forged electronics are entering the supply chain at a high rate and pose a threat to safety, reliability, and security. Academia and industry have established various pre- or post-production measures to effectively address this issue partially. Yet, several inadequately covered aspects of the field require improvements. First, this work introduces a rating scheme to enable the effective comparison between anti-counterfeiting methods. Recently published methods are compared using this scheme. Second, a novel, generic, generally applicable prover-verifier attestation framework for post-production anti-counterfeiting methods is established. Third, the work implements a new anti-counterfeit method. By introducing technological individual features, the method incorporates technology intrinsic features of the front-end semiconductor manufacturing process as technology distinctive characteristic. Profile parameters are extracted through pattern recognition and statistical methods which are compared to the expected technologies through distance metrics, allowing an assertion of device authenticity. Finally, the versatility of the method is experimentally validated through real samples. Overall, an accuracy of 100% is reported for seven samples which are checked for authenticity.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114008548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Semi-Supervised Deep Learning for Microcontroller Performance Screening 微控制器性能筛选的半监督深度学习
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174083
N. Bellarmino, R. Cantoro, M. Huch, T. Kilian, Ulf Schlichtmann, Giovanni Squillero
In safety-critical applications, microcontrollers must satisfy strict quality constraints and performances in terms of Fmax (the maximum operating frequency). Data extracted from on-chip ring oscillators (ROs) can model the Fmax of integrated circuits using machine learning models. Those models are suitable for the performance screening process. Acquiring data from the ROs is a fast process that leads to many unlabeled data. Contrarily, the labeling phase (i.e., acquiring Fmax) is a time-consuming and costly task, that leads to a small set of labeled data. This paper presents deep-learning-based methodologies to cope with the low number of labeled data in microcontroller performance screening. We propose a method that takes advantage of the high number of unlabeled samples in a semi-supervised learning fashion. We derive deep feature extractor models that project data into higher dimensional spaces and use the data feature embedding to face the performance prediction problem with simple linear regression. Experiments showed that the proposed models outperformed state-of-the-art methodologies in terms of prediction error and permitted us to use a significantly smaller number of devices to be characterized, thus reducing the time needed to build ML models by a factor of six with respect to baseline approaches.
在安全关键应用中,微控制器必须满足严格的质量约束和Fmax(最大工作频率)方面的性能。从片上环形振荡器(ROs)中提取的数据可以使用机器学习模型对集成电路的Fmax进行建模。这些模型适用于性能筛选过程。从ro获取数据是一个快速的过程,会导致许多未标记的数据。相反,标记阶段(即获取Fmax)是一项耗时且昂贵的任务,这导致标记数据集很小。本文提出了基于深度学习的方法来处理微控制器性能筛选中标记数据数量少的问题。我们提出了一种半监督学习方式利用大量未标记样本的方法。我们推导了深度特征提取器模型,将数据投影到高维空间,并使用数据特征嵌入来解决简单线性回归的性能预测问题。实验表明,所提出的模型在预测误差方面优于最先进的方法,并允许我们使用显着较少数量的设备进行表征,从而将构建ML模型所需的时间减少了六倍。
{"title":"Semi-Supervised Deep Learning for Microcontroller Performance Screening","authors":"N. Bellarmino, R. Cantoro, M. Huch, T. Kilian, Ulf Schlichtmann, Giovanni Squillero","doi":"10.1109/ETS56758.2023.10174083","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174083","url":null,"abstract":"In safety-critical applications, microcontrollers must satisfy strict quality constraints and performances in terms of Fmax (the maximum operating frequency). Data extracted from on-chip ring oscillators (ROs) can model the Fmax of integrated circuits using machine learning models. Those models are suitable for the performance screening process. Acquiring data from the ROs is a fast process that leads to many unlabeled data. Contrarily, the labeling phase (i.e., acquiring Fmax) is a time-consuming and costly task, that leads to a small set of labeled data. This paper presents deep-learning-based methodologies to cope with the low number of labeled data in microcontroller performance screening. We propose a method that takes advantage of the high number of unlabeled samples in a semi-supervised learning fashion. We derive deep feature extractor models that project data into higher dimensional spaces and use the data feature embedding to face the performance prediction problem with simple linear regression. Experiments showed that the proposed models outperformed state-of-the-art methodologies in terms of prediction error and permitted us to use a significantly smaller number of devices to be characterized, thus reducing the time needed to build ML models by a factor of six with respect to baseline approaches.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114081138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Physical-aware Interconnect Testing and Repairing of Chiplets 小芯片的物理感知互连测试与修复
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174179
C. Cui, Tuanhui Xu, Haitao Fu, Junlin Huang
As the interconnect density of chiplets increases rapidly, some physics related defects appeared, such as coupling defects, etc. These defects are hard to detect with ordinary pseudo-random sequence patterns, some special test patterns are needed. Besides, the chip warpage caused by the thinning of 3D chips manufacturing and the uneven stress around TSVs or micro bumps will bring clustered faults of interconnections. For these defects, the repair rate of conventional interconnect redundancy method will be decreased. This paper proposes a physical-aware interconnect testing and repairing method of chiplets, using specific test patterns and clustered faults redundancy circuits to improve the interconnect test coverage and repair rate of chiplets. We also propose automatic repair circuits and the repair data synchronization scheme between multiple dies, so that the calculating and programming of repair data do not need to rely on the ATE programming, and the synchronization of the repair data between multiple dies can be done by hardware circuits automatically, which ensure the interconnection correctly after repairing.
随着晶片互连密度的快速增加,出现了一些物理缺陷,如耦合缺陷等。这些缺陷很难用普通的伪随机序列模式检测出来,需要一些特殊的测试模式。此外,由于三维芯片制造变薄导致的芯片翘曲以及tsv周围应力不均或微凸起会导致互连的聚集性故障。对于这些缺陷,传统互连冗余方法的修复率会降低。本文提出了一种基于物理感知的小芯片互连测试与修复方法,采用特定的测试模式和聚类故障冗余电路,提高了小芯片互连测试覆盖率和修复率。提出了自动修模电路和多模间修模数据同步方案,使修模数据的计算和编程不需要依赖于ATE编程,多模间修模数据的同步可以由硬件电路自动完成,保证修模后的正确互连。
{"title":"Physical-aware Interconnect Testing and Repairing of Chiplets","authors":"C. Cui, Tuanhui Xu, Haitao Fu, Junlin Huang","doi":"10.1109/ETS56758.2023.10174179","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174179","url":null,"abstract":"As the interconnect density of chiplets increases rapidly, some physics related defects appeared, such as coupling defects, etc. These defects are hard to detect with ordinary pseudo-random sequence patterns, some special test patterns are needed. Besides, the chip warpage caused by the thinning of 3D chips manufacturing and the uneven stress around TSVs or micro bumps will bring clustered faults of interconnections. For these defects, the repair rate of conventional interconnect redundancy method will be decreased. This paper proposes a physical-aware interconnect testing and repairing method of chiplets, using specific test patterns and clustered faults redundancy circuits to improve the interconnect test coverage and repair rate of chiplets. We also propose automatic repair circuits and the repair data synchronization scheme between multiple dies, so that the calculating and programming of repair data do not need to rely on the ATE programming, and the synchronization of the repair data between multiple dies can be done by hardware circuits automatically, which ensure the interconnection correctly after repairing.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121830751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High Throughput and Energy Efficient SHA-2 ASIC Design for Continuous Integrity Checking Applications 用于持续完整性检查应用的高通量和节能SHA-2 ASIC设计
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174095
Asimina Koutra, V. Tenentes
High throughput and energy efficient integrated cryptographic hash primitives are important for the continuous integrity checking and tampering detection in secure access management mechanisms of on-chip instrumentation, such as the IJTAG. However, previous SHA-256 cores focus only on throughput. In this paper, we synthesize with a 32 nm CMOS Technology SHA-256 cores that can be integrated in ASICs, and we present insights on their achieved throughput and energy efficiency. Moreover, we present a novel clock-gated design for reducing dynamic power dissipation of SHA-256 cores; and a novel Multi-Vt design for reducing static power dissipation of SHA-256 cores. The proposed designs can achieve upto 25.9% improvement of the energy efficiency of existing SHA-256 designs, without impacting their performed throughput. To the best of our knowledge, this is the first work that applies low power design techniques on SHA-256 cores.
高吞吐量和高能效的集成加密哈希原语对于片上仪器(如IJTAG)的安全访问管理机制中的连续完整性检查和篡改检测非常重要。然而,以前的SHA-256内核只关注吞吐量。在本文中,我们合成了可集成在asic中的32 nm CMOS技术SHA-256内核,并对其实现的吞吐量和能效提出了见解。此外,我们提出了一种新的时钟门控设计,以降低SHA-256内核的动态功耗;以及一种新颖的Multi-Vt设计,用于降低SHA-256内核的静态功耗。所提出的设计可以在不影响其执行吞吐量的情况下,将现有SHA-256设计的能源效率提高25.9%。据我们所知,这是第一个在SHA-256内核上应用低功耗设计技术的工作。
{"title":"High Throughput and Energy Efficient SHA-2 ASIC Design for Continuous Integrity Checking Applications","authors":"Asimina Koutra, V. Tenentes","doi":"10.1109/ETS56758.2023.10174095","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174095","url":null,"abstract":"High throughput and energy efficient integrated cryptographic hash primitives are important for the continuous integrity checking and tampering detection in secure access management mechanisms of on-chip instrumentation, such as the IJTAG. However, previous SHA-256 cores focus only on throughput. In this paper, we synthesize with a 32 nm CMOS Technology SHA-256 cores that can be integrated in ASICs, and we present insights on their achieved throughput and energy efficiency. Moreover, we present a novel clock-gated design for reducing dynamic power dissipation of SHA-256 cores; and a novel Multi-Vt design for reducing static power dissipation of SHA-256 cores. The proposed designs can achieve upto 25.9% improvement of the energy efficiency of existing SHA-256 designs, without impacting their performed throughput. To the best of our knowledge, this is the first work that applies low power design techniques on SHA-256 cores.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130237177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Fault-Tolerance for Memristive Neuromorphic Fabric Based on Local Approximation 基于局部逼近的记忆神经形态网络在线容错
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174237
Soyed Tuhin Ahmed, R. Rakhmatullin, M. Tahoori
Neural networks (NNs) are a widely-used problem-solving tool, but their high computational and power consumption makes them expensive. Computation-in-Memory (CiM) architecture, which uses resistive non-volatile memories, is a promising solution due to its high energy efficiency. However, manufacturing defects and in-field faults can reduce the reliability and inference accuracy of CiM-implemented neural networks. Existing sophisticated fault detection and tolerance techniques require long downtime for testing and repair. In certain applications, e.g., "always on" NN applications, such downtime may not be acceptable. Thus, in this paper, a low-cost online fault tolerance technique based on local approximations is proposed to ensure continuous neural network operation with acceptable accuracy. Our approach reduces hardware overhead by up to 99.37% compared to conventional redundancy-based approaches while still achieving accuracy within 2% of the trained NNs.
神经网络(nn)是一种广泛使用的解决问题的工具,但其高计算和功耗使其昂贵。内存计算(CiM)架构使用电阻式非易失性存储器,由于其高能效而成为一种很有前途的解决方案。然而,制造缺陷和现场故障会降低基于cim的神经网络的可靠性和推理精度。现有复杂的故障检测和容限技术需要长时间的停机时间进行测试和修复。在某些应用中,例如,“始终在线”的神经网络应用,这样的停机时间可能是不可接受的。为此,本文提出了一种基于局部逼近的低成本在线容错技术,以保证神经网络以可接受的精度连续运行。与传统的基于冗余的方法相比,我们的方法减少了高达99.37%的硬件开销,同时仍然在训练好的神经网络的2%以内实现准确率。
{"title":"Online Fault-Tolerance for Memristive Neuromorphic Fabric Based on Local Approximation","authors":"Soyed Tuhin Ahmed, R. Rakhmatullin, M. Tahoori","doi":"10.1109/ETS56758.2023.10174237","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174237","url":null,"abstract":"Neural networks (NNs) are a widely-used problem-solving tool, but their high computational and power consumption makes them expensive. Computation-in-Memory (CiM) architecture, which uses resistive non-volatile memories, is a promising solution due to its high energy efficiency. However, manufacturing defects and in-field faults can reduce the reliability and inference accuracy of CiM-implemented neural networks. Existing sophisticated fault detection and tolerance techniques require long downtime for testing and repair. In certain applications, e.g., \"always on\" NN applications, such downtime may not be acceptable. Thus, in this paper, a low-cost online fault tolerance technique based on local approximations is proposed to ensure continuous neural network operation with acceptable accuracy. Our approach reduces hardware overhead by up to 99.37% compared to conventional redundancy-based approaches while still achieving accuracy within 2% of the trained NNs.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130141564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Data Background-Based Test Development for All Interconnect and Contact Defects in RRAMs rram中所有互连和接触缺陷的数据背景测试开发
Pub Date : 2023-05-22 DOI: 10.1109/ETS56758.2023.10174106
Hanzhi Xun, M. Fieback, S. Yuan, Ziwei Zhang, M. Taouil, S. Hamdioui
Resistive Random Access Memory (RRAM) is a potential technology to replace conventional memories by providing low power consumption and high-density storage. As various manufacturing vendors make significant efforts to push it to high-volume production and commercialization, high-quality and efficient test solutions are of great importance. This paper analyzes interconnect and contact defects in RRAMs, while considering the impact of the memory Data Background (DB), and proposes test solutions. The complete interconnect and contact defect space in a layout-independent RRAM design is defined. Exhaustive defect injection and circuit simulation are performed in a systematic manner to derive appropriate fault models, not only for single-cell and two-cell coupling faults, but also for multi-cell coupling faults where the DBs are important. The results show the existence of unique 3-cell and 4-cell coupling faults due to e.g., the sneak path in the array induced by defects. These unique faults cannot be detected with traditional RRAM test solutions. Therefore, the paper introduces a test generation method that takes into account the DB, which is able to efficiently detect all these faults; hence, further improving the fault/defect coverage in RRAMs.
电阻式随机存取存储器(RRAM)具有低功耗和高密度存储的特点,是一种有潜力取代传统存储器的技术。随着各种制造厂商努力将其推向大批量生产和商业化,高质量和高效的测试解决方案非常重要。分析了rram的互连和接触缺陷,同时考虑了内存数据背景(DB)的影响,并提出了测试解决方案。定义了与布局无关的RRAM设计中的完整互连和接触缺陷空间。穷尽缺陷注入和电路仿真以系统的方式推导出合适的故障模型,不仅适用于单单元和双单元耦合故障,而且适用于db很重要的多单元耦合故障。结果表明,由于缺陷引起的阵列内的潜行路径等原因,阵列存在独特的3-cell和4-cell耦合故障。这些独特的故障无法通过传统的RRAM测试解决方案检测到。因此,本文引入了一种考虑DB的测试生成方法,能够有效地检测出所有这些故障;因此,进一步提高rram中的故障/缺陷覆盖率。
{"title":"Data Background-Based Test Development for All Interconnect and Contact Defects in RRAMs","authors":"Hanzhi Xun, M. Fieback, S. Yuan, Ziwei Zhang, M. Taouil, S. Hamdioui","doi":"10.1109/ETS56758.2023.10174106","DOIUrl":"https://doi.org/10.1109/ETS56758.2023.10174106","url":null,"abstract":"Resistive Random Access Memory (RRAM) is a potential technology to replace conventional memories by providing low power consumption and high-density storage. As various manufacturing vendors make significant efforts to push it to high-volume production and commercialization, high-quality and efficient test solutions are of great importance. This paper analyzes interconnect and contact defects in RRAMs, while considering the impact of the memory Data Background (DB), and proposes test solutions. The complete interconnect and contact defect space in a layout-independent RRAM design is defined. Exhaustive defect injection and circuit simulation are performed in a systematic manner to derive appropriate fault models, not only for single-cell and two-cell coupling faults, but also for multi-cell coupling faults where the DBs are important. The results show the existence of unique 3-cell and 4-cell coupling faults due to e.g., the sneak path in the array induced by defects. These unique faults cannot be detected with traditional RRAM test solutions. Therefore, the paper introduces a test generation method that takes into account the DB, which is able to efficiently detect all these faults; hence, further improving the fault/defect coverage in RRAMs.","PeriodicalId":211522,"journal":{"name":"2023 IEEE European Test Symposium (ETS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125330838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2023 IEEE European Test Symposium (ETS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1