On the Soundness of Silence: Investigating Silent Failures Using Fault Injection Experiments

2014 Tenth European Dependable Computing Conference. Pub Date: 2014-05-13. DOI: 10.1109/EDCC.2014.16

E. V. D. Kouwe, Cristiano Giuffrida, A. Tanenbaum


Citations: 6

Abstract

Fault injection campaigns have been used extensively to characterize the behavior of systems under errors. Traditional characterization studies, however, focus only on analyzing fail-stop behavior, incorrect test results, and other obvious failures observed during the experiment. More research is needed to evaluate the impact of silent failures, a relevant and insidious class of real-world failures, and to do so in a fully automated way in a fault injection setting. This paper presents a new methodology to identify fault injection-induced silent failures and assess their impact in a fully automated way. Drawing inspiration from system call-based anomaly detection, we compare faulty and fault-free execution runs and pinpoint behavioral differences that result in externally visible changes (not reported to the user) to detect silent failures. Our investigation across several different programs demonstrates that the impact of silent failures is relevant, consistent with field data, and should be carefully considered to avoid compromising the soundness of fault injection results.
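The comparison of faulty and fault-free runs described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Run` record, the `EXTERNALLY_VISIBLE` set, and the `classify` function are hypothetical names introduced here, and the sketch assumes that system call traces have already been recorded and normalized (for example, with process IDs and timestamps removed).

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumption: a small, illustrative set of system calls whose effects
# are visible outside the process. A real study would need a fuller list.
EXTERNALLY_VISIBLE = {"write", "unlink", "rename", "sendto", "mkdir"}

# A recorded system call: (name, arguments). Hypothetical trace format.
Syscall = Tuple[str, tuple]


@dataclass
class Run:
    """One execution of the program under test (hypothetical record)."""
    trace: List[Syscall]   # recorded system calls, in order
    exit_status: int       # process exit status
    tests_passed: bool     # outcome reported by the test workload


def visible(trace: List[Syscall]) -> List[Syscall]:
    """Keep only the calls that change externally visible state."""
    return [call for call in trace if call[0] in EXTERNALLY_VISIBLE]


def classify(golden: Run, faulty: Run) -> str:
    """Compare a fault-injected run against the fault-free (golden) run."""
    # Failures that the run itself reports are not silent.
    if faulty.exit_status != 0 or not faulty.tests_passed:
        return "detected failure"
    # The run claims success but its externally visible behavior differs.
    if visible(faulty.trace) != visible(golden.trace):
        return "silent failure"
    return "no visible effect"


golden = Run([("open", ("out.txt",)), ("write", (3, "hello")), ("close", (3,))], 0, True)
faulty = Run([("open", ("out.txt",)), ("write", (3, "hellp")), ("close", (3,))], 0, True)
crashed = Run([("open", ("out.txt",))], 139, False)

print(classify(golden, faulty))   # the write payload differs, yet the run reports success
print(classify(golden, crashed))  # non-zero exit status is an obvious failure
print(classify(golden, golden))   # identical visible behavior
```

The exact-equality comparison is a simplification chosen for brevity. Programs with nondeterministic but harmless behavior would need a more tolerant comparison than this sketch provides.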

