Mojtaba Ebrahimi, Nour Sayed, Maryam Rashvand, M. Tahoori
Fault injection acceleration by architectural importance sampling
2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), 2015-10-04
DOI: 10.1109/CODESISSS.2015.7331384 (https://doi.org/10.1109/CODESISSS.2015.7331384)
Citations: 25
Abstract
Radiation-induced soft errors are a major reliability concern in advanced technology nodes. The de facto approach to evaluating soft error vulnerability is a costly fault injection campaign. Because some errors reside in the system state for a long time, an injected error may have to be traced for millions of cycles. However, only a very small portion of injected errors lead to a failure, so many simulation cycles are wasted on errors that are masked and cause no failure. In this paper, we present an importance sampling technique based on Architecturally Correct Execution (ACE) analysis that identifies the non-vulnerable time intervals in memory arrays and avoids unnecessary fault injections, speeding up the soft error vulnerability evaluation process without sacrificing accuracy. Our analysis reveals that this approach significantly expedites our architecture-level fault injection technique (by 13X on average).
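The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a simplified ACE rule for a single memory entry: an interval that ends in a read is vulnerable, while an interval that ends in a write (or in the end of the run) is masked. All names (`ace_intervals`, `sample_injection`, `vulnerable_fraction`, `estimate_failure_rate`) and the access-trace format are hypothetical.

```python
import random


def ace_intervals(accesses):
    """Return vulnerable (start, end) cycle intervals for one memory entry.

    accesses: list of (cycle, kind) sorted by cycle, kind is "R" or "W".
    Assumption: an interval ending in a read is vulnerable; an interval
    ending in a write is masked because the value is overwritten.
    """
    intervals = []
    prev_cycle = 0
    for cycle, kind in accesses:
        if kind == "R" and cycle > prev_cycle:
            intervals.append((prev_cycle, cycle))
        prev_cycle = cycle
    return intervals


def sample_injection(intervals_by_entry, rng):
    """Pick an (entry, cycle) pair uniformly over vulnerable entry-cycles only."""
    flat = [(e, s, t) for e, ivs in intervals_by_entry.items() for s, t in ivs]
    weights = [t - s for _, s, t in flat]  # longer intervals are picked more often
    entry, start, end = rng.choices(flat, weights=weights, k=1)[0]
    return entry, rng.randrange(start, end)


def vulnerable_fraction(intervals_by_entry, num_entries, total_cycles):
    """Share of the whole (entry, cycle) fault space that is vulnerable."""
    vulnerable = sum(t - s for ivs in intervals_by_entry.values() for s, t in ivs)
    return vulnerable / (num_entries * total_cycles)


def estimate_failure_rate(failures, injections, fraction):
    """Rescale the failure rate observed inside vulnerable intervals.

    Assumption: every fault outside the vulnerable intervals is masked, so
    the rate over the full fault space is the observed rate times `fraction`.
    """
    return fraction * failures / injections


# Hypothetical access trace for one entry over a 100-cycle run.
trace = [(10, "W"), (30, "R"), (50, "R"), (80, "W"), (90, "R")]
intervals = {"entry0": ace_intervals(trace)}
fraction = vulnerable_fraction(intervals, num_entries=1, total_cycles=100)
entry, cycle = sample_injection(intervals, random.Random(0))
```

Under these assumptions, injections are drawn only from intervals that can still propagate to a read, and the observed failure rate is rescaled by the vulnerable share of the fault space so that the estimate refers to the full space. The accuracy of such a scheme depends on the interval analysis never classifying a vulnerable interval as masked.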