
2014 Tenth European Dependable Computing Conference: Latest Papers

On a Modeling Approach to Analyze Resilience of a Smart Grid Infrastructure
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.34
S. Chiaradonna, F. Giandomenico, N. Murru
The evolution of electrical grids, both in terms of enhanced ICT functionalities to improve efficiency, reliability and economics, as well as the increasing penetration of renewable redistributed energy resources, results in a more sophisticated electrical infrastructure which poses new challenges from several perspectives, including resilience and quality of service analysis. In addition, the presence of interdependencies, which more and more characterize critical infrastructures (including the power sector), exacerbates the need for advanced analysis approaches, to be possibly employed since the early phases of the system design, to identify vulnerabilities and appropriate countermeasures. In this paper, we outline an approach to model and analyze smart grids and discuss the major challenges to be addressed in stochastic model-based analysis to account for the peculiarities of the involved system elements. Representation of dynamic and flexible behavior of generators and loads, as well as representation of the complex ICT control functions required to preserve and/or re-establish electrical equilibrium in presence of changes need to be faced to assess suitable indicators of the resilience and quality of service of the smart grid.
Citations: 14
A Framework for Creating Binary Rewriting Tools (Short Paper)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.14
Jason Hiser, A. Nguyen-Tuong, M. Co, Benjamin D. Rodes, M. Hall, Clark L. Coleman, J. Knight, J. Davidson
The design of many binary-level rewriting and instrumentation systems is based on specifying and implementing an application-programmer interface (API) to provide the necessary functionality for implementing various instrumentation tools. Users implement instrumentation tools by writing code that uses the functionality provided by the implementation of the API. This paper describes a novel approach for realizing powerful binary level instrumentation systems that is based on a simple, low-level language, called SPRI. SPRI is a simple language for specifying edits to apply to a binary. Its simplicity makes SPRI an ideal target language for binary analysis and instrumentation tools. A SPRI specification can then be applied to a binary statically or dynamically to insert the desired instrumentation. To demonstrate the advantages, flexibility, and power of this approach, the paper presents an exemplar instrumentation system that uses a software dynamic translator to apply SPRI-specified edits to a binary, along with several case studies of how the system has been used.
Citations: 7
Application of Statistical Testing to the Data Processing and Control System for the Dungeness B Nuclear Power Plant (Practical Experience Report)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.38
Helen Gough, S. Kuball
This experience report describes the successful application of statistical software testing to a large-scale real-world equipment replacement project. The Data Processing System at Dungeness B nuclear power station is obsolescent and needs to be replaced by a new system involving different technology. The new Data Processing and Control System (DPCS) involves a large amount of software code, with the potential for software failures, and therefore confidence needs to be established in the dependability of the new software-based system. Statistical software testing (ST) was used as one technique to do this. The aim was to support the claim that, with high confidence, the higher-integrity parts of the replacement software have a failure probability better than 10^-2 failures per demand (fpd). This paper describes how this has been achieved by careful design of the statistical tests and by the development of a large number of supporting tools and procedures. A total of 395 statistical tests with no failures was achieved, providing at least 98% confidence that the safety claim has been met.
Citations: 0
Binary-Level Fault Injection for AUTOSAR Systems (Short Paper)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.21
M. Islam, Nithilan Meenakshi Karunakaran, Johan Haraldsson, Fredrik Bernin, J. Karlsson
The continuously growing complexity of automotive Electrical and/or Electronic (E/E) systems has prompted major manufacturers (OEMs) and suppliers to embrace the AUTOSAR (Automotive Open System Architecture) standard. Furthermore, the recent introduction of ISO 26262, a functional safety standard for road vehicles, has confronted the automotive industry with rigorous requirements. This has become even more challenging as AUTOSAR strongly promotes COTS (Commercial Off-The-Shelf) software for which access to source code is either restricted or non-existent. This paper contributes a method and prototype tool for binary-level fault injection (BLFI) for AUTOSAR-based systems. The proposed BLFI tool can assist in achieving functional safety by evaluating the robustness of software at any AUTOSAR layer even though only binaries are available. The proposed technique is quite generic and is evaluated with a proof-of-concept implementation of an AUTOSAR-based application.
Citations: 10
On the Soundness of Silence: Investigating Silent Failures Using Fault Injection Experiments
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.16
E. V. D. Kouwe, Cristiano Giuffrida, A. Tanenbaum
Fault injection campaigns have been used extensively to characterize the behavior of systems under errors. Traditional characterization studies, however, focus only on analyzing fail-stop behavior, incorrect test results, and other obvious failures observed during the experiment. More research is needed to evaluate the impact of silent failures (a relevant and insidious class of real-world failures) and to do so in a fully automated way in a fault injection setting. This paper presents a new methodology to identify fault injection-induced silent failures and assess their impact in a fully automated way. Drawing inspiration from system call-based anomaly detection, we compare faulty and fault-free execution runs and pinpoint behavioral differences that result in externally visible changes (not reported to the user) to detect silent failures. Our investigation across several different programs demonstrates that the impact of silent failures is relevant, consistent with field data, and should be carefully considered to avoid compromising the soundness of fault injection results.
Citations: 6
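The comparison of faulty and fault-free runs described in the silent-failure abstract above can be illustrated with a minimal sketch. It assumes each run has already been reduced to a list of externally visible events (for example, formatted system calls); the `Run` record, the three outcome labels, and the event strings are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Run:
    trace: list[str]      # externally visible events, e.g. formatted system calls
    exit_code: int
    reported_error: bool  # True if the program or test harness reported a failure


def classify(golden: Run, faulty: Run) -> str:
    """Label a fault injection run by comparing it with the fault-free run."""
    if faulty.exit_code != golden.exit_code or faulty.reported_error:
        return "detected"
    if faulty.trace == golden.trace:
        return "no effect"
    # Visible behavior changed, but nothing was reported to the user.
    return "silent failure"


def differences(golden: Run, faulty: Run) -> list[tuple[str, list[str], list[str]]]:
    """Return the non-matching trace segments as (kind, golden_part, faulty_part)."""
    matcher = SequenceMatcher(a=golden.trace, b=faulty.trace, autojunk=False)
    return [
        (tag, golden.trace[i1:i2], faulty.trace[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


if __name__ == "__main__":
    golden = Run(["open(out)", "write(hello)", "close(out)"], 0, False)
    faulty = Run(["open(out)", "write(hellp)", "close(out)"], 0, False)
    print(classify(golden, faulty))
    print(differences(golden, faulty))
```

A real campaign would also need to filter out benign nondeterminism (timestamps, process IDs) before comparing traces; that step is omitted here.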
From Resilient Computing Architectural Concepts to Wireless Sensor Network-Based Applications (Short Paper)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.23
M. Stoicescu, J. Fabre, Matthieu Roy, Animesh Pathak
Resilient computing is defined as the ability of a system to remain dependable when facing changes. To mitigate faults at runtime, dependable systems employ fault tolerance mechanisms such as replication techniques. These mechanisms have to be systematically and rigorously applied in order to guarantee the conformance between the application runtime behavior and its dependability requirements. To this end, we propose architectural concepts for developing resilient computing systems using component-based middleware. The development process of adaptive fault tolerance (AFT) is briefly presented. We then illustrate the benefits of AFT on a Wireless Sensor Network-based application for parking management.
Citations: 0
Gaining Confidence on Dependability Benchmarks' Conclusions through "Back-to-Back" Testing (Practical Experience Report)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.20
Miquel Martínez, D. Andrés, Juan-Carlos Ruiz-Garcia
The main goal of any benchmark is to guide decisions through system ranking, but surprisingly little research has so far focused on providing means to gain confidence in the analysis carried out with benchmark results. Including a back-to-back testing approach in the benchmark analysis process, to compare conclusions and gain confidence in the final adopted choices, seems a convenient way to cope with this challenge. The proposal is to look for the coherence of rankings issued from the application of independent multiple-criteria decision making (MCDM) techniques on results. Although any MCDM method can potentially be used, this paper reports our experience using the Logic Score of Preferences (LSP) and the Analytic Hierarchy Process (AHP). Discrepancies in the provided rankings invalidate conclusions and must be tracked to discover incoherences and correct the related analysis errors. Once the rankings are coherent, the underlying analysis is too, thus increasing our confidence in the supplied conclusions.
Citations: 9
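The coherence check between two independently produced rankings, as described in the back-to-back testing abstract above, can be sketched as a search for pairs of systems that the two methods order differently. The score dictionaries stand in for the outputs of two MCDM methods such as LSP and AHP; the values and function names are invented for illustration, and the paper's own comparison procedure may differ.

```python
from itertools import combinations


def ranking(scores: dict[str, float]) -> list[str]:
    """Systems ordered from best to worst score."""
    return sorted(scores, key=scores.get, reverse=True)


def discordant_pairs(scores_a: dict[str, float],
                     scores_b: dict[str, float]) -> list[tuple[str, str]]:
    """Pairs of systems that the two methods rank in opposite order."""
    pairs = []
    for x, y in combinations(sorted(scores_a), 2):
        diff_a = scores_a[x] - scores_a[y]
        diff_b = scores_b[x] - scores_b[y]
        if diff_a * diff_b < 0:  # opposite signs: the methods disagree on x vs y
            pairs.append((x, y))
    return pairs


if __name__ == "__main__":
    # Hypothetical global scores for three benchmarked systems.
    method_one = {"A": 0.8, "B": 0.6, "C": 0.4}
    method_two = {"A": 0.5, "B": 0.2, "C": 0.3}
    print(ranking(method_one), ranking(method_two))
    print(discordant_pairs(method_one, method_two))
```

An empty result means the two rankings are coherent; any reported pair points to the part of the analysis that should be re-examined.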
Increasing Dependability of Component-Based Software Systems by Online Failure Prediction (Short Paper)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.28
Teerat Pitakrat, A. Hoorn, Lars Grunske
Online failure prediction for large-scale software systems is a challenging task. One reason is the complex structure of many (partially interdependent) hardware and software components. State-of-the-art approaches use separate prediction models for parameters of interest or a monolithic prediction model which includes different parameters of all components. However, they have problems when dealing with evolving systems. In this paper, we present our preliminary research work on online failure prediction targeting large-scale component-based software systems. For the prediction, three complementary types of models are used: (i) an architectural model captures relevant properties of hardware and software components as well as dependencies among them, (ii) for each component, a prediction model captures the current state of the component and predicts independent component failures in the future, (iii) a system-level prediction model represents the current state of the system and, using the component-level prediction models and information on dependencies, allows failures to be predicted and the impacts of architectural system changes to be analyzed for proactive failure management.
Citations: 11
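One simple way to picture how component-level predictions and dependency information could be combined at system level, as outlined in the online failure prediction abstract above, is to propagate failure probabilities along the dependency graph. The sketch assumes an acyclic graph, statistically independent component failures, and that a component fails whenever anything it depends on fails. These are illustrative simplifications, not the model proposed in the paper, and all names are hypothetical.

```python
from typing import Optional


def propagated_failure_probability(
    component: str,
    own_prob: dict[str, float],
    depends_on: dict[str, list[str]],
    cache: Optional[dict[str, float]] = None,
) -> float:
    """Probability that `component` fails, either on its own or because
    something it depends on fails (assumes an acyclic dependency graph)."""
    if cache is None:
        cache = {}
    if component in cache:
        return cache[component]
    survive = 1.0 - own_prob[component]
    for dependency in depends_on.get(component, []):
        survive *= 1.0 - propagated_failure_probability(
            dependency, own_prob, depends_on, cache
        )
    cache[component] = 1.0 - survive
    return cache[component]


if __name__ == "__main__":
    # Hypothetical component-level predictions and architecture.
    own = {"db": 0.1, "app": 0.2, "web": 0.0}
    deps = {"app": ["db"], "web": ["app"]}
    print(propagated_failure_probability("web", own, deps))
```

Because shared dependencies are treated as independent, the result is only an approximation when two components rely on the same underlying service.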
An Autonomic Implementation of Reliable Broadcast Based on Dynamic Spanning Trees
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.31
L. A. Rodrigues, L. Arantes, E. P. Duarte
Reliable broadcast is a basic building block of dependable distributed systems that allows the dissemination of messages to all processes, guaranteeing that either none or all correct processes deliver the message, despite the occurrence of failures. In this work we introduce an autonomic strategy to implement reliable broadcast. Processes self-organize on an overlay based on a hypercube-like topology. The overlay is maintained by the execution of an underlying monitoring algorithm that guarantees several logarithmic properties even in the presence of process failures. The reliable broadcast strategy employs spanning trees that are dynamically built and embedded in the overlay. Nodes fail by crashing, crashes are permanent, and the broadcast strategy works correctly for an arbitrary number of process crashes. After a process crashes, the reconstruction of the spanning trees is transparent and therefore does not induce performance degradation. Besides the formal specification, we also present experimental results obtained with simulation.
Citations: 13
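To make the idea of a spanning tree embedded in a hypercube overlay concrete, the sketch below builds the classic binomial broadcast tree on a complete hypercube with no failed nodes. It covers only this fault-free special case; the monitoring algorithm and the dynamic rebuilding after crashes described in the abstract above are not modeled, and the function names are illustrative.

```python
def children(node: int, root: int, dim: int) -> list[int]:
    """Children of `node` in the binomial spanning tree rooted at `root`
    of a dim-dimensional hypercube (all nodes assumed correct)."""
    relative = node ^ root               # position relative to the root
    first_free = relative.bit_length()   # dimensions above the highest set bit
    return [(relative | (1 << k)) ^ root for k in range(first_free, dim)]


def broadcast(root: int, dim: int) -> dict[int, int]:
    """Simulate a broadcast; return the hop count at which each node is reached."""
    hops = {root: 0}
    frontier = [root]
    while frontier:
        next_frontier = []
        for node in frontier:
            for child in children(node, root, dim):
                hops[child] = hops[node] + 1
                next_frontier.append(child)
        frontier = next_frontier
    return hops


if __name__ == "__main__":
    reached = broadcast(root=0, dim=3)
    print(sorted(reached.items()))
```

Every one of the 2^dim nodes receives the message exactly once, and no node is more than dim hops from the root, which is the kind of logarithmic property the abstract refers to.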
Mining Invariants from SaaS Application Logs (Practical Experience Report)
Pub Date : 2014-05-13 DOI: 10.1109/EDCC.2014.18
S. Sarkar, R. Ganesan, M. Cinque, Flavio Frattini, S. Russo, Agostino Savignano
The increasing popularity of Software as a Service (SaaS) stresses the need for solutions to predict failures and avoid service interruptions, which invariably result in SLA violations and severe loss of revenue. A promising approach to continuously monitor the correct functioning of the system is to check the execution conformance to a set of invariants, i.e., properties that must hold when the system is deemed to run correctly. In this paper we propose a framework and a tool to automatically discover invariants from application logs and to detect their violation online. The framework has been applied to 9 months of log events from a real-world SaaS application. Results show that the proposed tool is able to automatically select 12 invariants with a stringent goodness-of-fit criterion out of more than 500 potential relationships. We also show the usefulness of our approach to detect runtime issues from logs in the form of violations of selected invariants, corresponding to silent errors that usually go unnoticed by the system maintenance personnel, even if they could represent symptoms of upcoming service failures.
Citations: 11