2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)最新文献

英文中文

Considering Reliability of Deep Learning Function to Boost Data Suitability and Anomaly Detection 考虑深度学习函数的可靠性以提高数据适用性和异常检测

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00081

Lydia Gauerhof, Yuki Hagiwara, Christoph Schorn, M. Trapp

The increased demand of Deep Neural Networks (DNNs) in safety-critical systems, such as autonomous vehicles, leads to increasing importance of training data suitability. Firstly, we focus on how to extract the relevant data content for ensuring DNN reliability. Then, we identify error categories and propose mitigation measures with emphasis on data suitability. Despite all efforts to boost data suitability, not all possible variations of a real application can be identified. Hence, we analyse the case of unknown out-of-distribution data. In this case, we suggest to complement data suitability with online anomaly detection using FACER that supervises the behaviour of the DNN.

随着自动驾驶汽车等安全关键系统对深度神经网络(dnn)需求的增加，训练数据的适用性变得越来越重要。首先，我们重点研究了如何提取相关的数据内容以保证深度神经网络的可靠性。然后，我们识别错误类别并提出缓解措施，重点是数据适用性。尽管所有的努力都在提高数据的适用性，但并不是一个实际应用程序的所有可能的变化都能被识别出来。因此，我们分析了未知分布外数据的情况。在这种情况下，我们建议使用监督DNN行为的FACER在线异常检测来补充数据适用性。

引用次数: 1

WoSAR 2020 Workshop Keynotes WoSAR 2020研讨会主题演讲

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/issrew51248.2020.00018

引用次数: 0

CRESCO Framework and Checker: Automatic generation of Reflective UML State Machine’s C++ Code and Checker CRESCO框架和检查器:反射UML状态机的c++代码和检查器的自动生成

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00032

Miren Illarramendi Rezabal, L. Elorza, F. Larrinaga, Goiuria Sagardui Mendieta

Software Systems are becoming increasingly complex leading to new Validation & Verification challenges. Model checking and testing techniques are used at development time while runtime verification aims to verify that a system satisfies a given property at runtime. This second technique complements the first one. This paper presents a tool that enables the developers to generate automatically reflective UML State Machine controllers and the Runtime Safety Properties Checker (RSPC) which checks a component-based software system’s safety properties defined at design phase. We address embedded systems whose software components are designed by Unified Modelling Language-State Machines (UML-SM) and their internal information can be observed in terms of model elements at runtime. RESCO (REflective State Machines-based observable software COmponents) framework, generates software components that provide this runtime observability. The checker uses software components’ internal status information to check system level safety properties. The checker detects when a system safety property is violated and starts a safe adaptation process to prevent the hazardous scenario. Thus, as demonstrated in the evaluated experiment but not shown in the paper due to the space limitation, the safety of the system is enhanced.

软件系统变得越来越复杂，导致新的验证和验证挑战。模型检查和测试技术在开发时使用，而运行时验证的目的是在运行时验证系统是否满足给定的属性。第二种技术是对第一种技术的补充。本文提出了一种工具，使开发人员能够自动生成反映的UML状态机控制器和运行时安全属性检查器(RSPC)， RSPC检查在设计阶段定义的基于组件的软件系统的安全属性。我们讨论的嵌入式系统的软件组件是由统一建模语言状态机(UML-SM)设计的，它们的内部信息可以在运行时根据模型元素观察到。RESCO(基于反射状态机的可观察软件组件)框架，生成提供这种运行时可观察性的软件组件。检查器使用软件组件的内部状态信息来检查系统级的安全属性。检查器检测何时违反了系统安全属性，并启动安全适应过程以防止危险情况的发生。因此，正如评估实验所证明的那样，系统的安全性得到了提高，但由于篇幅限制，本文未作说明。

{"title":"CRESCO Framework and Checker: Automatic generation of Reflective UML State Machine’s C++ Code and Checker","authors":"Miren Illarramendi Rezabal, L. Elorza, F. Larrinaga, Goiuria Sagardui Mendieta","doi":"10.1109/ISSREW51248.2020.00032","DOIUrl":"https://doi.org/10.1109/ISSREW51248.2020.00032","url":null,"abstract":"Software Systems are becoming increasingly complex leading to new Validation & Verification challenges. Model checking and testing techniques are used at development time while runtime verification aims to verify that a system satisfies a given property at runtime. This second technique complements the first one. This paper presents a tool that enables the developers to generate automatically reflective UML State Machine controllers and the Runtime Safety Properties Checker (RSPC) which checks a component-based software system’s safety properties defined at design phase. We address embedded systems whose software components are designed by Unified Modelling Language-State Machines (UML-SM) and their internal information can be observed in terms of model elements at runtime. RESCO (REflective State Machines-based observable software COmponents) framework, generates software components that provide this runtime observability. The checker uses software components’ internal status information to check system level safety properties. The checker detects when a system safety property is violated and starts a safe adaptation process to prevent the hazardous scenario. Thus, as demonstrated in the evaluated experiment but not shown in the paper due to the space limitation, the safety of the system is enhanced.","PeriodicalId":202247,"journal":{"name":"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121345305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Declarative Dashboard Generation 声明式仪表板生成

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00075

Alessandro Tundo, Chiara Castelnovo, M. Mobilio, O. Riganelli, L. Mariani

Systems of systems are highly dynamic software systems that require flexible monitoring solutions to be observed and controlled. Indeed, operators have to frequently adapt the set of collected indicators according to changing circumstances, to visualize the behavior of the monitored systems and timely take actions, if needed. Unfortunately, dashboard systems are still quite cumbersome to conFigure and adapt to a changing set of indicators that must be visualized.This paper reports our initial effort towards the definition of an automatic dashboard generation process that exploits meta-model layouts to create a full dashboard from a set of indicators selected by operators.

系统的系统是高度动态的软件系统，需要灵活的监控解决方案来观察和控制。事实上，作业者必须根据不断变化的环境，经常调整收集到的指标集，以可视化被监测系统的行为，并在必要时及时采取行动。不幸的是，仪表板系统在配置和适应一组必须可视化的不断变化的指标方面仍然相当麻烦。本文报告了我们对自动仪表板生成过程定义的初步努力，该过程利用元模型布局从操作员选择的一组指标中创建完整的仪表板。

引用次数: 1

Message from the WoSoCer 2020 Workshop Chairs 2020世界足球锦标赛研讨会主席致辞

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/issrew51248.2020.00022

引用次数: 0

Safety-Critical Software - Quantification of Test Results 安全关键软件-测试结果的量化

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00089

Johan Sundell, K. Lundqvist, H. Forsberg

Safety-critical software systems have traditionally been found in few domains, e.g., aerospace, nuclear and medical. As technology advances and software capability increases, such systems can be found in more and more applications, e.g., selfdriving cars, autonomous trains. This development will dramatically increase the operational exposure of such systems. All safety-critical applications need to meet exceptionally stringent criteria in terms of dependability. Proving compliance is a challenge for the industry and there is a lack of accepted methods to determine the status of safety-critical software. The regulatory bodies often require a certain amount of testing to be performed but do not, for software systems, require evidence of a given failure rate. This paper addresses quantification of test results. It examines both theoretical and practical aspects. The contribution of this paper is an equation that estimates the remaining undetected faults in the software system after testing. The equation considers partial test coverage. The theoretical results are validated with results from a large industry study (commercial military software). Additionally, the industry results are used to analyze the concept of entropy also known as Shannon information, which is shown to describe the knowledge gained from a test effort.

安全关键型软件系统传统上只存在于少数领域，例如航空航天、核和医疗。随着技术的进步和软件能力的提高，这种系统可以在越来越多的应用中找到，例如自动驾驶汽车，自动驾驶火车。这一发展将大大增加这类系统的作战风险。所有安全关键型应用程序都需要在可靠性方面满足非常严格的标准。证明合规性对行业来说是一个挑战，而且缺乏公认的方法来确定安全关键软件的状态。监管机构通常要求执行一定数量的测试，但对于软件系统，不要求给定故障率的证据。本文讨论了测试结果的量化。它考察了理论和实践两个方面。本文的贡献是一个方程，用于估计测试后软件系统中剩余未检测到的故障。这个等式考虑了部分测试覆盖率。理论结果与大型工业研究(商业军事软件)的结果进行了验证。此外，行业结果用于分析熵的概念，也称为香农信息，它用于描述从测试工作中获得的知识。

{"title":"Safety-Critical Software - Quantification of Test Results","authors":"Johan Sundell, K. Lundqvist, H. Forsberg","doi":"10.1109/ISSREW51248.2020.00089","DOIUrl":"https://doi.org/10.1109/ISSREW51248.2020.00089","url":null,"abstract":"Safety-critical software systems have traditionally been found in few domains, e.g., aerospace, nuclear and medical. As technology advances and software capability increases, such systems can be found in more and more applications, e.g., selfdriving cars, autonomous trains. This development will dramatically increase the operational exposure of such systems. All safety-critical applications need to meet exceptionally stringent criteria in terms of dependability. Proving compliance is a challenge for the industry and there is a lack of accepted methods to determine the status of safety-critical software. The regulatory bodies often require a certain amount of testing to be performed but do not, for software systems, require evidence of a given failure rate. This paper addresses quantification of test results. It examines both theoretical and practical aspects. The contribution of this paper is an equation that estimates the remaining undetected faults in the software system after testing. The equation considers partial test coverage. The theoretical results are validated with results from a large industry study (commercial military software). Additionally, the industry results are used to analyze the concept of entropy also known as Shannon information, which is shown to describe the knowledge gained from a test effort.","PeriodicalId":202247,"journal":{"name":"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130258457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Challenges Faced with Application Performance Monitoring (APM) when Migrating to the Cloud 迁移到云时应用程序性能监控(APM)面临的挑战

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00046

V. Debroy, Alireza Mansoori, James Haleblian, Mark Wilkens

Application Performance Management (APM) is an important software engineering process that is critical to properly assessing the reliability of software in its targeted environment. Of late, migrating applications to the cloud has become increasingly popular for many reasons such as reduced operational costs, scalability, and increased IT productivity. While much has been said about how to migrate applications, relatively less has been said on how to appropriately monitor them in the cloud, which presents its own unique challenges. This article outlines some of the technical challenges that we have faced at AT&T when migrating to the cloud, with a focus on APM, and aims to stimulate further industrial-academic research and collaboration in this area.

应用程序性能管理(APM)是一个重要的软件工程过程，对于在目标环境中正确评估软件的可靠性至关重要。最近，由于降低运营成本、可伸缩性和提高IT生产力等原因，将应用程序迁移到云已变得越来越流行。虽然关于如何迁移应用程序的讨论很多，但关于如何在云中适当地监视应用程序的讨论相对较少，这有其独特的挑战。本文概述了我们在AT&T迁移到云时所面临的一些技术挑战，重点是APM，旨在促进该领域进一步的工业-学术研究和合作。

引用次数: 0

Root cause prediction based on bug reports 基于bug报告的根本原因预测

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00067

Thomas Hirsch, Birgit Hofer

This paper proposes a supervised machine learning approach for predicting the root cause of a given bug report. Knowing the root cause of a bug can help developers in the debugging process—either directly or indirectly by choosing proper tool support for the debugging task. We mined 54755 closed bug reports from the issue trackers of 103 GitHub projects and applied a set of heuristics to create a benchmark consisting of 10459 reports. A subset was manually classified into three groups (semantic, memory, and concurrency) based on the bugs’ root causes. Since the types of root cause are not equally distributed, a combination of keyword search and random selection was applied. Our data set for the machine learning approach consists of 369 bug reports (122 concurrency, 121 memory, and 126 semantic bugs). The bug reports are used as input to a natural language processing algorithm. We evaluated the performance of several classifiers for predicting the root causes for the given bug reports. Linear Support Vector machines achieved the highest mean precision (0.74) and recall (0.72) scores. The created bug data set and classification are publicly available.

本文提出了一种有监督的机器学习方法来预测给定bug报告的根本原因。了解bug的根本原因可以帮助开发人员在调试过程中直接或间接地为调试任务选择合适的工具支持。我们从103个GitHub项目的问题跟踪器中挖掘了54755个已关闭的bug报告，并应用了一组启发式方法来创建一个由10459个报告组成的基准。根据错误的根本原因，将一个子集手动分为三组(语义组、内存组和并发组)。由于根本原因的类型不是均匀分布的，所以采用了关键词搜索和随机选择相结合的方法。我们的机器学习方法的数据集由369个错误报告组成(122个并发错误，121个内存错误和126个语义错误)。错误报告被用作自然语言处理算法的输入。我们评估了几个分类器的性能，以预测给定bug报告的根本原因。线性支持向量机获得了最高的平均精度(0.74)和召回率(0.72)分数。所创建的错误数据集和分类是公开的。

{"title":"Root cause prediction based on bug reports","authors":"Thomas Hirsch, Birgit Hofer","doi":"10.1109/ISSREW51248.2020.00067","DOIUrl":"https://doi.org/10.1109/ISSREW51248.2020.00067","url":null,"abstract":"This paper proposes a supervised machine learning approach for predicting the root cause of a given bug report. Knowing the root cause of a bug can help developers in the debugging process—either directly or indirectly by choosing proper tool support for the debugging task. We mined 54755 closed bug reports from the issue trackers of 103 GitHub projects and applied a set of heuristics to create a benchmark consisting of 10459 reports. A subset was manually classified into three groups (semantic, memory, and concurrency) based on the bugs’ root causes. Since the types of root cause are not equally distributed, a combination of keyword search and random selection was applied. Our data set for the machine learning approach consists of 369 bug reports (122 concurrency, 121 memory, and 126 semantic bugs). The bug reports are used as input to a natural language processing algorithm. We evaluated the performance of several classifiers for predicting the root causes for the given bug reports. Linear Support Vector machines achieved the highest mean precision (0.74) and recall (0.72) scores. The created bug data set and classification are publicly available.","PeriodicalId":202247,"journal":{"name":"2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128557325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Systematic Software Testing of Critical Embedded Digital Devices in Nuclear Power Applications 核电应用中关键嵌入式数字器件的系统软件测试

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00042

A. Jayakumar, S. Gautham, D. R. Kuhn, B. Simon, Aidan G. Collins, Thomas Dirsch, R. Kacker, C. Elks

While design assurance and testing methods for safety-critical systems have been widely researched and studied for years across a number of industry domains, there are few efforts reported in the literature on the actual application of software testing methods to nuclear power digital I&C systems or devices. We see this as a gap in the knowledge basis. The motivation for this research was to investigate the efficacy and challenges that arise when planning, automating and conducting systematic software testing on actual real-time embedded digital devices. In this paper, we present results on the application of a systematic testing methodology called Pseudo-Exhaustive testing. The systematic testing methods were applied at the unit and module integration levels of the software. The findings suggest that Pseudo Exhaustive testing supported by automated testing technology is an effective approach to testing real-time embedded digital devices in critical nuclear applications.

虽然安全关键系统的设计保证和测试方法已经在许多行业领域进行了多年的广泛研究和研究，但文献中很少报道软件测试方法在核电数字I&C系统或设备中的实际应用。我们认为这是知识基础上的差距。这项研究的动机是调查在实际的实时嵌入式数字设备上规划、自动化和进行系统软件测试时出现的功效和挑战。在本文中，我们提出了一种称为伪穷举测试的系统测试方法的应用结果。系统的测试方法应用于软件的单元和模块集成层面。研究结果表明，自动化测试技术支持的伪穷举测试是关键核应用中实时嵌入式数字设备测试的有效方法。

引用次数: 2

Reliability Evaluation of ML systems, the oracle problem 机器学习系统的可靠性评估，oracle问题

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

Pub Date : 2020-10-01 DOI: 10.1109/ISSREW51248.2020.00050

Antonio Guerriero

The growing adoption of machine learning (ML) in safety-critical contexts makes reliability evaluation of ML systems a crucial task. Although testing represents one of the most used practices to evaluate the reliability of “traditional” systems, just few techniques can be used to evaluate ML-systems’ reliability due to the oracle problem. In this paper, I present a test oracle surrogate able to automatically classify tests’ outcome to obtain feedback about tests whose expected output is unknown. For this purpose, various sources of knowledge are considered to evaluate the outcome of each test. The aim is to exploit this test oracle surrogate to apply classical testing strategies to perform reliability assessment of ML systems. Some preliminary experiments have been performed considering a Convolutional Neural Network (CNN) and exploiting the well known MNIST dataset. These results promise that the presented technique can be effectively used to evaluate the reliability of ML systems.

在安全关键环境中越来越多地采用机器学习(ML)使得机器学习系统的可靠性评估成为一项至关重要的任务。尽管测试代表了评估“传统”系统可靠性的最常用实践之一，但是由于oracle问题，只有少数技术可以用于评估ml系统的可靠性。在本文中，我提出了一个测试oracle代理，它能够自动对测试的结果进行分类，以获得关于预期输出未知的测试的反馈。为此，我们考虑了各种知识来源来评估每个测试的结果。目的是利用这个测试oracle代理来应用经典的测试策略来执行ML系统的可靠性评估。一些初步的实验已经进行了考虑卷积神经网络(CNN)和利用著名的MNIST数据集。这些结果表明，所提出的技术可以有效地用于评估机器学习系统的可靠性。

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀