Combinatorial methods for testing Internet of Things smart home systems
Bernhard Garn, Dominik Schreiber, D. Simos, D. R. Kuhn, J. Voas, R. Kacker
In this paper, we report on applying combinatorial testing to Internet of Things (IoT) home automation hub systems. We detail how to create a dedicated input parameter model of an IoT home automation hub system for use with combinatorial test case generation strategies. Further, we developed an automated test execution framework and two test oracles for evaluation purposes. We applied and evaluated our proposed methodological approach on a real-world IoT system and analysed the results obtained from various combinatorial test sets with different properties, generated from the derived input model. Additionally, we compared these results against a random testing approach. Our empirical testing evaluations revealed multiple errors in the tested devices and also showed that all considered approaches performed nearly equally well.
{"title":"Combinatorial methods for testing Internet of Things smart home systems","authors":"Bernhard Garn, Dominik Schreiber, D. Simos, D. R. Kuhn, J. Voas, R. Kacker","doi":"10.1002/stvr.1805","DOIUrl":"https://doi.org/10.1002/stvr.1805","url":null,"abstract":"In this paper, we report on applying combinatorial testing to Internet of Things (IoT) home automation hub systems. We detail how to create a dedicated input parameter model of an IoT home automation hub system for use with combinatorial test case generation strategies. Further, we developed an automated test execution framework and two test oracles for evaluation purposes. We applied and evaluated our proposed methodological approach to a real‐world IoT system and analysed the obtained results of various combinatorial test sets with different properties generated based on the derived input model. Additionally, we compare these results to a random testing approach. Our empirical testing evaluations revealed multiple errors in the tested devices and also showed that all considered approaches performed nearly equally well.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"32 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79475363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A mapping study on mutation testing for mobile applications
Henrique Neves da Silva, Jackson A. Prado Lima, S. Vergilio, A. T. Endo
The use of mutation testing for mobile applications (apps for short) is still a challenge. Mobile apps are usually event-driven, encompass graphical user interfaces (GUIs) and run in a complex execution environment. Consequently, they require mutation operators that describe app-specific faults, and automating phases of the mutation process, such as mutant execution and analysis, is not an easy task. To encourage research addressing such challenges, this paper presents results from a mapping study on mutation testing for mobile apps. Following a systematic plan, we found 16 primary studies that were analysed according to three aspects: (i) trends and statistics about the field; (ii) study characteristics such as focus, proposed operators and automated support for the mutation testing phases; and (iii) evaluation aspects. The great majority of studies (98%) have been published in the last 3 years. The most addressed language is Java, and Android is the only operating system considered. Mutation operators of GUI and configuration types are prevalent among the 138 operators found. Most studies implement a supporting tool, but few tools support mutant execution and analysis. The evaluations conducted by the studies include apps mainly from the finance and utility domains. Nevertheless, there is a lack of benchmarks and more rigorous experiments. Future research should address other specific fault types, languages and operating systems; it should also offer support for mutant execution and analysis and reduce the cost and limitations of mutation testing in the mobile context.
{"title":"A mapping study on mutation testing for mobile applications","authors":"Henrique Neves da Silva, Jackson A. Prado Lima, S. Vergilio, A. T. Endo","doi":"10.1002/stvr.1801","DOIUrl":"https://doi.org/10.1002/stvr.1801","url":null,"abstract":"The use of mutation testing for mobile applications (apps for short) is still a challenge. Mobile apps are usually event‐driven and encompass graphical user interfaces (GUIs) and a complex execution environment. Then, they require mutant operators to describe specific apps faults, and the automation of the mutation process phases like execution and analysis of the mutants is not an easy task. To encourage research addressing such challenges, this paper presents results from a mapping study on mutation testing for mobile apps. Following a systematic plan, we found 16 primary studies that were analysed according to three aspects: (i) trends and statistics about the field; (ii) study characteristics such as focus, proposed operators and automated support for the mutation testing phases; and (iii) evaluation aspects. The great majority of studies (98%) have been published in the last 3 years. The most addressed language is Java, and Android is the only operating system considered. Mutant operators of GUI and configuration types are prevalent in a total of 138 operators found. Most studies implement a supporting tool, but few tools support mutant execution and analysis. The evaluation conducted by the studies includes apps mainly from the finance and utility domain. Nevertheless, there is a lack of benchmarks and more rigorous experiments. Future research should address other specific types of faults, languages, and operating systems. They should offer support for mutant execution and analysis, as well as to reduce the mutation testing cost and limitations in the mobile context.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"31 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85928361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
HOTFUZ: Cost-effective higher-order mutation-based fault localization
Jong-In Jang, Duksan Ryu, Jong-Chan Baik
Fault localization techniques are used to deduce the exact source of a failure from a set of failure indications while debugging software, and they play a crucial role in improving software quality. Mutation-based fault localization (MBFL) techniques have been proposed to localize faults at a finer granularity and with higher accuracy than traditional fault localization techniques. Despite their effectiveness, the immense cost of mutation analysis hinders MBFL's practical application in industry. Various mutation alternative strategies are used to lower the cost of MBFL, but they sacrifice the accuracy of localization results. Higher-order mutation testing was proposed to search for valuable mutants that drive testing harder and reduce the overall test effort. However, to the best of our knowledge, higher-order mutants (HOMs) have never been used to address the cost problem of MBFL. This paper proposes a novel, cost-effective MBFL technique called HOTFUZ (Higher-Order muTation-based FaUlt localiZation) that employs HOMs to reduce cost while minimizing the accuracy degradation. HOTFUZ combines mutants of a program under test into HOMs, decreasing the number of mutants by more than half, depending on the order of the HOMs. An experimental study is conducted using 65 real-world faults of CoREBench to assess the proposed approach's cost-effectiveness. The experimental results show that HOTFUZ outperforms the extant mutation alternative strategies by localizing faults more accurately using the same number of executed mutants. HOTFUZ has three main benefits over existing mutant reduction techniques for MBFL: (a) it keeps the advantage of using the whole set of mutation operators; (b) it does not discard generated mutants randomly for the sake of efficiency; and, finally, (c) it significantly decreases the proportion of equivalent mutants.
{"title":"HOTFUZ: Cost‐effective higher‐order mutation‐based fault localization","authors":"Jong-In Jang, Duksan Ryu, Jong-Chan Baik","doi":"10.1002/stvr.1802","DOIUrl":"https://doi.org/10.1002/stvr.1802","url":null,"abstract":"Fault localization techniques are used to deduce the exact source of a failure from a set of failure indications while debugging software and play a crucial role in improving software quality. Mutation‐based fault localization (MBFL) techniques are proposed to localize faults at a finer granularity and with higher accuracy than traditional fault localization techniques. Despite the technique's effectiveness, the immense cost of mutation analysis hinders MBFL's practical application in the industry. Various mutation alternative strategies are utilized to lower the cost of MBFL, but they sacrifice the accuracy of localization results. Higher‐order mutation testing was proposed to search for valuable mutants that drive testing harder and reduce the overall test effort. However, higher‐order mutants (HOMs) never have been used to address the cost problem of MBFL to the extent of our knowledge. This paper proposes a novel, cost‐effective MBFL technique called HOTFUZ, Higher‐Order muTation‐based FaUlt localiZation, that employs HOMs to reduce the cost while minimizing the accuracy degradation. HOTFUZ combines mutants of a program under test into HOMs to decrease the number of mutants by more than half, depending on the order of HOMs. An experimental study is conducted using 65 real‐world faults of CoREBench to assess the proposed approach's cost‐effectiveness. The experimental results show that HOTFUZ outperforms the extant mutation alternative strategies by localizing faults more accurately using the same number of mutants executed. HOTFUZ has three main benefits over existing mutant reduction techniques for MBFL: (a) It keeps the advantage of using the whole set of mutation operators; (b) it does not discard generated mutants randomly for the sake of efficiency; and, finally, (c) it significantly decreases the proportion of equivalent mutants.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"70 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82148627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Model checking, testing and debugging
R. Hierons, Tao Xie
This issue contains four papers. The first paper focuses on model checking, the second and third papers focus on testing and the last paper focuses on debugging. The first paper, ‘Model checking C++ programs’ by Felipe R. Monteiro, Mikhail R. Gadelha and Lucas C. Cordeiro, is motivated by memory safety issues and how these have proved to be a source of security vulnerabilities. The authors devised a novel bounded model checking approach. The first step was to encode a number of C++ features in a decidable fragment of first-order logic. SMT solvers were then used to carry out verification. In experiments, the proposed approach was found to outperform state-of-the-art verifiers. The prototype tool also found arithmetic-overflow errors in a commercial application. (Recommended by Professor Pretschner) The second paper, ‘GPU acceleration of finite state machine input execution: Improving scale and performance’, by Vanya Yaneva, Ajitha Rajan and Christophe Dubach, looks at the problem of executing a large number of tests on a finite state machine (FSM). The motivation for this work is model validation. The approach devised uses GPUs to allow multiple tests to be run in parallel. The authors built on their previous work, which showed how FSM execution can be performed on a GPU, by addressing a number of limitations. In particular, the authors addressed the data transfer overhead, and they also performed experiments with FSMs that were too large to fit into GPU memory. In the experiments, the novel optimisations led to further improvements, with the GPU being over four times faster, on average, than a 16-core CPU. (Recommended by Professor Pretschner) The third paper, ‘Survey on test case generation, selection and prioritization for cyber-physical systems’, by Zahra Sadri-Moshkenani, Justin Bradley and Gregg Rothermel, presents a survey of approaches that generate, select or prioritise test cases for cyber-physical systems. The authors identified 34 related papers (26 papers on test generation, 6 papers on test selection and 7 papers on test prioritisation) and classified them according to 8 properties distilled by the authors from past experience. From the survey results, the authors identified a number of open challenges. To address some of these challenges, existing approaches may be adapted or new approaches may be developed. (Recommended by Professor Phil McMinn) The fourth paper, ‘Effective fault localization and context-aware debugging for concurrent programs’, by Justin Chu, Tingting Yu, Jane Huffman Hayes, Xue Han and Yu Zhao, presents Coadec, an approach for automatically generating interthread control flow paths to diagnose concurrency bugs. Coadec consists of two phases: concurrency fault localization and context-aware debugging. The authors evaluated Coadec on 10 real-world multithreaded Java applications and showed that Coadec outperforms state-of-the-art approaches for localising concurrency faults and that Coadec's context debugging can help developers understand concurrency faults by inspecting a small percentage of code. (Recommended by Marc Roper)
{"title":"Model checking, testing and debugging","authors":"R. Hierons, Tao Xie","doi":"10.1002/stvr.1803","DOIUrl":"https://doi.org/10.1002/stvr.1803","url":null,"abstract":"This issue contains four papers. The first paper focuses on model checking, the second and third papers focus on testing and the last paper focuses on debugging. The first paper, ‘Model checking C++ programs’ by Felipe R. Monteiro, Mikhail R. Gadelha and Lucas C. Cordeiro, is motivated by memory safety issues and how these have proved to be a source of security vulnerabilities. The authors devised a novel bounded model checking approach. The first step was to encode a number of C++ features in a decidable fragment of first-order logic. SMT solvers were then used to carry out verification. In experiments, the proposed approach was found to outperform state-of-the-art verifiers. The prototype tool also found arithmeticoverflow errors in a commercial application. (Recommended by Professor Pretscher) The second paper, ‘GPU acceleration of finite state machine input execution: Improving scale and performance’, by Vanya Yaneva, Ajitha Rajan and Christophe Dubach looks at the problem of executing a large number of tests on a finite state machine (FSM). The motivation for this work is model validation. The approach devised uses GPUs to allow multiple tests to be run in parallel. The authors built on their previous work, which showed how FSM execution can be performed on a GPU, by addressing a number of limitations. In particular, the authors addressed the data transfer overhead and they also performed experiments with FSMs that were too large to fit into GPU memory. In the experiments, the novel optimisations led to further improvements, with the GPU being over four times faster, on average, than a 16-core CPU. (Recommended by Professor Pretscher) The third paper, ‘Survey on test case generation, selection and prioritization for cyber-physical systems’, by Zahra Sadri-Moshkenani, Justin Bradley and Gregg Rothermel, presents a survey of approaches that generate, select or prioritise test cases for cyber-physical systems. The authors identified 34 related papers (26 papers on test generation, 6 papers on test selection and 7 papers on test prioritisation) and classified them according to 8 properties distilled by the authors from past experience. From the survey results, the authors identified a number of open challenges. To address some of these challenges, existing approaches may be adapted or new approaches may be developed. (Recommended by Professor Phil McMinn) The fourth paper, ‘Effective fault localization and context-aware debugging for concurrent programs’, by Justin Chu, Tingting Yu, Jane Huffman Hayes, Xue Han and Yu Zhao, presents Coadec, an approach for automatically generating interthread control flow paths to diagnose concurrency bugs. Coadec consists of two phases: concurrency fault localization and context-aware debugging. 
The authors evaluated Coadec on 10 real-world multithreaded Java applications and showed that Coadec outperforms state-of-the-art approaches for localising concurrency faults and that Coadec’s context debugging can help ","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"12 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88219290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Property generation/verification and empirical studies
R. Hierons, Tao Xie
This issue contains four papers. The first and third papers focus on property generation and property verification, respectively, while the second and fourth papers focus on empirical studies of a fault prediction algorithm and test flakiness, respectively. The first paper, “Documentation-based functional constraint generation for library methods,” by Renhe Jiang, Zhengzhao Chen, Yu Pei, Minxue Pan, Tian Zhang, and Xuandong Li, proposes DOC2SMT, an approach that generates functional constraints for library methods based on their documentation. DOC2SMT first translates a method’s documentation into candidate constraint clauses, which are then filtered based on static and dynamic validations. The experimental results show the effectiveness and efficiency of DOC2SMT and also show the benefits of the generated constraints for symbolic-execution-based test generation (recommended by Peter Müller). The second paper, “An empirical study of Linespots: A novel past-fault algorithm,” by Maximilian Scholz and Richard Torkar, proposes a new fault prediction algorithm called Linespots. The authors focus on fault prediction based on past faults and refine a previous algorithm (Bugspots). Interestingly, they used a different granularity, line as opposed to file, and this necessitated the development of a benchmark set of experimental subjects. In experiments, Linespots was found to outperform Bugspots (recommended by Xiaoyin Wang). The third paper, “Integrating pattern matching and abstract interpretation for verifying cautions of microcontrollers,” by Thuy Nguyen, Takashi Tomita, Junpei Endo, and Toshiaki Aoki, proposes a semi-automatic approach for verifying cautions, which are hardware-dependent properties described in microcontroller hardware manuals. For this approach, the authors integrate pattern matching and abstract interpretation, two static program analysis techniques. The experimental results show the feasibility and applicability of the approach (recommended by Marcio Delamaro). The fourth paper, “Empirical analysis of practitioners’ perceptions of test flakiness factors,” by Azeem Ahmad, Ola Leifler, and Kristian Sandahl, concerns flaky tests. A flaky test is one where different executions of the same test can lead to different outcomes/verdicts. The authors explore developer perceptions regarding factors that affect flakiness, concentrating on developers of closed-source software. They also examine two test suites and identify the test smells that lead to flakiness (recommended by Mike Papadakis).
{"title":"Property generation/verification and empirical studies","authors":"R. Hierons, Tao Xie","doi":"10.1002/stvr.1800","DOIUrl":"https://doi.org/10.1002/stvr.1800","url":null,"abstract":"This issue contains four papers. The first and third papers focus on property generation and property verification, respectively, while the second and fourth papers focus on empirical studies of a fault prediction algorithm and test flakiness, respectively. The first paper, “Documentation-based functional constraint generation for library methods,” by Renhe Jiang, Zhengzhao Chen, Yu Pei, Minxue Pan, Tian Zhang, and Xuandong Li, proposes DOC2SMT, an approach that generates functional constraints for library methods based on their documentations. DOC2SMT first translates a method’s documentation into candidate constraint clauses, which are then filtered based on static and dynamic validations. The experimental results show the effectiveness and efficiency of DOC2SMT and also show the benefits of the generated constraints for symbolic-execution-based test generation (recommended by Peter Müller). The second paper, “An empirical study of Linespots: A novel past-fault algorithm,” by Maximilian Scholz and Richard Torkar, proposes a new fault prediction algorithm called Linespots. The authors focus on fault prediction based on past faults and refine a previous algorithm (Bugspots). Interestingly, they used a different granularity: line as opposed to file, and this necessitated the development of a benchmark set of experimental subjects. In experiments, Linespots was found to outperform Bugspots (recommended by Xiaoyin Wang). The third paper, “Integrating pattern matching and abstract interpretation for verifying cautions of microcontrollers,” by Thuy Nguyen, Takashi Tomita, Junpei Endo, and Toshiaki Aoki, proposes a semi-automatic approach for verifying cautions, which are hardware-dependent properties described in microcontrollers hardware manuals. For this approach, the authors integrate pattern matching and abstract interpretation, two static program analysis techniques. The experimental results show the feasibility and applicability of the approach (recommended by Marcio Delamaro). The fourth paper, “Empirical analysis of practitioners’ perceptions of test flakiness factors,” by Azeem Ahmad, Ola Leifler, and Kristian Sandahl, concerns flaky tests. A flaky test is one where different executions with the same test can lead to different outcomes/verdicts. The authors explore developer perception regarding factors that affect flakiness, concentrating on developers of closed-source software. They also examine two test suites and identify the test smells that lead to flakiness (recommended by Mike Papadakis).","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"21 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83656010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Effective fault localization and context-aware debugging for concurrent programs
J. Chu, Tingting Yu, J. Hayes, Xue Han, Yu Zhao
Concurrent programs are difficult to debug because concurrency faults usually occur under specific inputs and thread interleavings. Fault localization techniques for sequential programs are often ineffective because the root causes of concurrency faults involve memory accesses across multiple threads rather than single statements. Previous research has proposed techniques that analyse passing and failing executions obtained from running a set of test cases in order to identify faulty memory access patterns. However, stand-alone access patterns do not provide enough contextual information, such as the path leading to the failure, for developers to understand the bug. We present an approach, Coadec, that automatically generates interthread control flow paths linking the memory access patterns that occur most frequently in failing executions, to better diagnose concurrency bugs. Coadec consists of two phases. In the first phase, we use feature selection techniques from machine learning to localize suspicious memory access patterns based on failing and passing executions; the patterns with maximum feature diversity information point to the most suspicious pattern. We then apply a data mining technique to identify the memory access patterns that occurred most frequently in the failing executions. Finally, Coadec identifies faulty program paths by connecting the frequent patterns with the suspicious pattern. We also evaluate the effectiveness of fault localization using test suites generated from different test adequacy criteria. We evaluated Coadec on 10 real-world multithreaded Java applications. Results indicate that Coadec outperforms state-of-the-art approaches for localizing concurrency faults and that Coadec's context debugging can help developers understand concurrency faults by inspecting a small percentage of code.
{"title":"Effective fault localization and context‐aware debugging for concurrent programs","authors":"J. Chu, Tingting Yu, J. Hayes, Xue Han, Yu Zhao","doi":"10.1002/stvr.1797","DOIUrl":"https://doi.org/10.1002/stvr.1797","url":null,"abstract":"Concurrent programs are difficult to debug because concurrency faults usually occur under specific inputs and thread interleavings. Fault localization techniques for sequential programs are often ineffective because the root causes of concurrency faults involve memory accesses across multiple threads rather than single statements. Previous research has proposed techniques to analyse passing and failing executions obtained from running a set of test cases for identifying faulty memory access patterns. However, stand‐alone access patterns do not provide enough contextual information, such as the path leading to the failure, for developers to understand the bug. We present an approach, Coadec, to automatically generate interthread control flow paths that can link memory access patterns that occurred most frequently in the failing executions to better diagnose concurrency bugs. Coadec consists of two phases. In the first phase, we use feature selection techniques from machine learning to localize suspicious memory access patterns based on failing and passing executions. The patterns with maximum feature diversity information can point to the most suspicious pattern. We then apply a data mining technique and identify the memory access patterns that occurred most frequently in the failing executions. Finally, Coadec identifies faulty program paths by connecting both the frequent patterns and the suspicious pattern. We also evaluate the effectiveness of fault localization using test suites generated from different test adequacy criteria. We introduce and have evaluated Coadec on 10 real‐world multithreaded Java applications. Results indicate that Coadec outperforms state‐of‐the‐art approaches for localizing concurrency faults and that Coadec's context debugging can help developers understand concurrency fault by inspecting a small percentage of code.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"46 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80645927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Adaptive or embedded software testing and mutation testing
R. Hierons, Tao Xie
This issue contains four papers. The first paper provides a survey of work on testing adaptive and context-aware systems, while the second one concerns testing embedded systems. The remaining two papers explore particular problems associated with an area well known to most STVR readers: mutation testing. The first paper, ‘Testing of adaptive and context-aware systems: Approaches and challenges’, by Bento R. Siqueira, Fabiano C. Ferrari, Kathiani E. Souza, Valter V. Camargo and Rogério de Lemos, introduces a systematic literature review and a thematic analysis of studies to characterize the state of the art in testing adaptive systems (ASs) and context-aware systems (CASs) and discuss approaches, challenges, observed trends and research limitations and directions. The authors discover recurring research concerns related to AS and CAS testing (such as generation of test cases and built-in tests), recurring testing challenges (such as context monitoring and runtime decisions), some trends (such as model-based testing and hybrid techniques) and some little-investigated issues (such as uncertainty and prediction of changes). (Recommended by T.Y. Chen) The second paper, ‘Remote embedded devices test framework on the cloud’, by Il-Seok (Benjamin) Choi and Chang-Sung Jeong, introduces a remote embedded device test framework on the cloud named RED-TFC, whose reliability test manager component can automatically perform various tests for evaluating the reliability and performance of distributed shared devices by utilizing the cloud concept. RED-TFC includes two major techniques: the adaptive sample scale for reliability test (ASRT) and the mass sample reliability test (MSRT). The authors analyse two Android smartphone models that include many embedded components and show that RED-TFC can help detect a high number of reliability problems in smartphones. (Recommended by Tanja Vos) The third paper, ‘Analysing the combination of cost reduction techniques in Android mutation testing’, by Macario Polo-Usaola and Isyed Rodríguez-Trujillo, concerns the use of mutation testing when testing mobile apps. As the authors note, when testing an app, one typically deploys the app and its mutants on mobile devices or executes them on an emulator. Doing so increases the test execution time. Naturally, it can also significantly increase the cost of mutation testing, especially when there are many mutants. The authors investigate several techniques that have been devised for reducing execution time in mutation testing and produce a mathematical model with the aim of predicting the time taken when some combination of these techniques is used. (Recommended by Mike Papadakis) The final paper is ‘An ensemble-based predictive mutation testing approach that considers impact of unreached mutants’ by Alireza Aghamohammadi and Seyed-Hassan Mirian-Hosseinabadi. This paper also concerns both mutation testing and prediction. However, the authors look at a different prediction problem: that of predicting whether a mutant will be killed. The authors note that previous work did not consider the impact of unreached mutants: mutants whose mutation point is not covered by any of the test cases used. It has been argued that, since many mutation tools exclude unreachable mutants, such mutants should also be removed from any empirical evaluation. The authors report the results of replicating a previous study while also removing unreached mutants, finding that the resulting performance of the prediction technique is far lower than previously reported. They then propose an alternative prediction model that is shown to be effective when unreached mutants are removed. (Recommended by Tanja Vos)
{"title":"Adaptive or embedded software testing and mutation testing","authors":"R. Hierons, Tao Xie","doi":"10.1002/stvr.1798","DOIUrl":"https://doi.org/10.1002/stvr.1798","url":null,"abstract":"This issue contains four papers. The first paper provides a survey of work on testing adaptive and context-aware systems, while the second one concerns testing embedded systems. The remaining two papers explore particular problems associated with an area well known to most STVR readers: mutation testing. The first paper, ‘Testing of adaptive and context-aware systems: Approaches and challenges’, by Bento R. Siqueira, Fabiano C. Ferrari, Kathiani E. Souza, Valter V. Camargo and Rogério de Lemos, introduces a systematic literature review and a thematic analysis of studies to characterize the state of the art in testing adaptive systems (ASs) and context-aware systems (CASs) and discuss approaches, challenges, observed trends and research limitations and directions. The authors discover recurring research concerns related to AS and CAS testing (such as generation of test cases and built-in tests), recurring testing challenges (such as context monitoring and runtime decisions), some trends (such as model-based testing and hybrid techniques) and some little investigated issues (such as uncertainty and prediction of changes). (Recommended by T.Y. Chen) The second paper, ‘Remote embedded devices test framework on the cloud’, by Il-Seok (Benjamin) Choi and Chang-Sung Jeong, introduces a remote embedded device test framework on the cloud named RED-TFC, whose reliability test manager component can automatically perform various tests for evaluating reliability and performance of distributed shared devices by utilizing the cloud concept. RED-TFC includes two major techniques: the adaptive sample scale for reliability test (ASRT) and the mass sample reliability test (MSRT). The authors analyse two Android smartphone models that include many embedded components and show that RED-TFC can help detect a high number of reliability problems in smartphones. (Recommended by Tanja Vos) The third paper, ‘Analysing the combination of cost reduction techniques in Android mutation testing’, by Macario Polo-Usaola and Isyed Rodríguez-Trujillo, concerns the use of mutation testing when testing mobile apps. As the authors note, when testing an app, one typically deploys the app and its mutants on mobile devices or executes them on an emulator. Doing so increases the test execution time. Naturally, it can also significantly increase the cost of mutation testing, especially when there are many mutants. The authors investigate several techniques that have been devised for reducing execution time in mutation testing and produce a mathematical model with the aim of predicting the time taken when some combination of these techniques is used. (Recommended by Mike Papadakis) The final paper is ‘An ensemble-based predictive mutation testing approach that considers impact of unreached mutants’ by Alireza Aghamohammadi and Seyed-Hassan Mirian-Hosseinabadi. This paper also concerns both mutation testing and prediction. 
However, the authors look at a different prediction problem: that of","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"113 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84905434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GPU acceleration of finite state machine input execution: Improving scale and performance
Vanya Yaneva, A. Rajan, Christophe Dubach
Model-based development is a popular development approach in which software is implemented and verified based on a model of the required system. Finite state machines (FSMs) are widely used as models for systems in several domains. Validating that a model accurately represents the required behaviour involves the generation and execution of a large number of input sequences, which is often an expensive and time-consuming process. In this paper, we speed up the execution of input sequences for FSM validation by leveraging the high degree of parallelism of modern graphics processing units (GPUs) to automatically execute FSM input sequences in parallel on the GPU threads. We expand our existing work by providing techniques that improve the performance and scalability of this approach. We conduct an extensive empirical evaluation using 15 large FSMs from the networking domain and measure GPU speed-up over a 16-core CPU, taking into account total GPU time, which includes both data transfer and kernel execution time. We found that GPUs execute FSM input sequences up to 9.28× faster than a 16-core CPU, with an average speed-up of 4.53× across all subjects. Our optimizations achieve an average improvement of 58.95% over our existing work in terms of speed-up, and they scale to large FSMs with over 2K states and 500K transitions. We also found that techniques aimed at reducing the number of required input sequences for large FSMs with high density were ineffective when applied to all-transition-pair coverage, thus emphasizing the need for approaches like ours that speed up input execution.
{"title":"GPU acceleration of finite state machine input execution: Improving scale and performance","authors":"Vanya Yaneva, A. Rajan, Christophe Dubach","doi":"10.1002/stvr.1796","DOIUrl":"https://doi.org/10.1002/stvr.1796","url":null,"abstract":"Model‐based development is a popular development approach in which software is implemented and verified based on a model of the required system. Finite state machines (FSMs) are widely used as models for systems in several domains. Validating that a model accurately represents the required behaviour involves the generation and execution of a large number of input sequences, which is often an expensive and time‐consuming process. In this paper, we speed up the execution of input sequences for FSM validation, by leveraging the high degree of parallelism of modern graphics processing units (GPUs) for the automatic execution of FSM input sequences in parallel on the GPU threads. We expand our existing work by providing techniques that improve the performance and scalability of this approach. We conduct extensive empirical evaluation using 15 large FSMs from the networking domain and measure GPU speed‐up over a 16‐core CPU, taking into account total GPU time, which includes both data transfer and kernel execution time. We found that GPUs execute FSM input sequences up to 9.28× faster than a 16‐core CPU, with an average speed‐up of 4.53× across all subjects. Our optimizations achieve an average improvement over existing work of 58.95% for speed‐up and scalability to large FSMs with over 2K states and 500K transitions. We also found that techniques aimed at reducing the number of required input sequences for large FSMs with high density were ineffective when applied to all‐transition pair coverage, thus emphasizing the need for approaches like ours that speed up input execution.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"9 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90833737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Metamorphic relation prioritization for effective regression testing
Madhusudan Srinivasan, Upulee Kanewala
Metamorphic testing (MT) is widely used for testing programs that face the oracle problem. It uses a set of metamorphic relations (MRs), which are relations among multiple inputs and their corresponding outputs, to determine whether the program under test is faulty. Typically, MRs vary in their ability to detect faults in the program under test, and some MRs tend to detect the same set of faults. In this paper, we propose approaches to prioritize MRs to improve the efficiency and effectiveness of MT for regression testing. We present two MR prioritization approaches: (i) fault-based and (ii) coverage-based. To evaluate these approaches, we conduct experiments on three complex open-source software systems. Our results show that our MR prioritization approaches significantly outperform the current practice of executing the source and follow-up test cases of the MRs in an ad hoc manner, in terms of fault detection effectiveness. Further, fault-based MR prioritization reduces the number of source and follow-up test cases that need to be executed, as well as the average time taken to detect a fault, saving time and cost during the testing process.
{"title":"Metamorphic relation prioritization for effective regression testing","authors":"Madhusudan Srinivasan, Upulee Kanewala","doi":"10.1002/stvr.1807","DOIUrl":"https://doi.org/10.1002/stvr.1807","url":null,"abstract":"Metamorphic testing (MT) is widely used for testing programs that face the oracle problem. It uses a set of metamorphic relations (MRs), which are relations among multiple inputs and their corresponding outputs to determine whether the program under test is faulty. Typically, MRs vary in their ability to detect faults in the program under test, and some MRs tend to detect the same set of faults. In this paper, we propose approaches to prioritize MRs to improve the efficiency and effectiveness of MT for regression testing. We present two MR prioritization approaches: (i) fault‐based and (ii) coverage‐based. To evaluate these MR prioritization approaches, we conduct experiments on three complex open‐source software systems. Our results show that the MR prioritization approaches developed by us significantly outperform the current practice of executing the source and follow‐up test cases of the MRs in an ad‐hoc manner in terms of fault detection effectiveness. Further, fault‐based MR prioritization leads to reducing the number of source and follow‐up test cases that needs to be executed as well as reducing the average time taken to detect a fault, which would result in saving time and cost during the testing process.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"42 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73771014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Survey on test case generation, selection and prioritization for cyber-physical systems
Zahra Sadri-Moshkenani, Justin Bradley, G. Rothermel
A cyber‐physical system (CPS) is a collection of computing devices that communicate with each other, operate in the target environment via actuators and interact with the physical world through sensors in a feedback loop. CPSs need to be safe and reliable and function in accordance with their requirements. Testing, focusing on a CPS model and/or its code, is the primary approach used by engineers to achieve this. Generating, selecting and prioritizing test cases that can reveal faults in CPSs, from the wide range of possible input values and stimuli that affect their operation, are of central importance in this process. To date, however, in our search of the literature, we have found no comprehensive survey of research on test case generation, selection and prioritization for CPSs. In this article, therefore, we report the results of a survey of approaches for generating, selecting and prioritizing test cases for CPSs; the results illustrate the progress that has been made on these approaches to date, the properties that characterize the approaches and the challenges that remain open in these areas of research.
{"title":"Survey on test case generation, selection and prioritization for cyber‐physical systems","authors":"Zahra Sadri‐Moshkenani, Justin Bradley, G. Rothermel","doi":"10.1002/stvr.1794","DOIUrl":"https://doi.org/10.1002/stvr.1794","url":null,"abstract":"A cyber‐physical system (CPS) is a collection of computing devices that communicate with each other, operate in the target environment via actuators and interact with the physical world through sensors in a feedback loop. CPSs need to be safe and reliable and function in accordance with their requirements. Testing, focusing on a CPS model and/or its code, is the primary approach used by engineers to achieve this. Generating, selecting and prioritizing test cases that can reveal faults in CPSs, from the wide range of possible input values and stimuli that affect their operation, are of central importance in this process. To date, however, in our search of the literature, we have found no comprehensive survey of research on test case generation, selection and prioritization for CPSs. In this article, therefore, we report the results of a survey of approaches for generating, selecting and prioritizing test cases for CPSs; the results illustrate the progress that has been made on these approaches to date, the properties that characterize the approaches and the challenges that remain open in these areas of research.","PeriodicalId":49506,"journal":{"name":"Software Testing Verification & Reliability","volume":"3 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84500814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}