首页 > 最新文献

Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation最新文献

英文 中文
Model-based testing of stochastic systems with IOCO theory 基于IOCO理论的随机系统模型测试
M. Gerhold, M. Stoelinga
We present essential concepts of a model-based testing framework for probabilistic systems with continuous time. Markov automata are used as an underlying model. Key result of the work is the solid core of a probabilistic test theory, that incorporates real-time stochastic behaviour. We connect ioco theory and hypothesis testing to infer about trace probabilities. We show that our conformance relation conservatively extends ioco and discuss the meaning of quiescence in the presence of exponentially distributed time delays.
提出了一种基于模型的连续时间概率系统测试框架的基本概念。使用马尔可夫自动机作为底层模型。这项工作的关键成果是概率测试理论的坚实核心,它包含了实时随机行为。我们将ioco理论和假设检验联系起来,推断出跟踪概率。我们证明了我们的一致性关系保守地扩展了ioco,并讨论了存在指数分布时滞时的静态的意义。
{"title":"Model-based testing of stochastic systems with IOCO theory","authors":"M. Gerhold, M. Stoelinga","doi":"10.1145/2994291.2994298","DOIUrl":"https://doi.org/10.1145/2994291.2994298","url":null,"abstract":"We present essential concepts of a model-based testing framework for probabilistic systems with continuous time. Markov automata are used as an underlying model. Key result of the work is the solid core of a probabilistic test theory, that incorporates real-time stochastic behaviour. We connect ioco theory and hypothesis testing to infer about trace probabilities. We show that our conformance relation conservatively extends ioco and discuss the meaning of quiescence in the presence of exponentially distributed time delays.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124602782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The complementary aspect of automatically and manually generated test case sets 自动和手动生成的测试用例集的互补方面
A. Vincenzi, T. Bachiega, Daniel G. de Oliveira, S. Souza, J. Maldonado
The test is a mandatory activity for software quality assurance. The knowledge about the software under testing is necessary to generate high-quality test cases, but to execute more than 80% of its source code is not an easy task, and demands an in-depth knowledge of the business rules it implements. In this article, we investigate the adequacy, effectiveness, and cost of manually generated test sets versus automatically generated test sets for Java programs. We observed that, in general, manual test sets determine higher statement coverage and mutation score than automatically generated test sets. But one interesting aspect recognized is that the automatically generated test sets are complementary to the manual test set. When we combined manual with automated test sets, the resultant test sets overcame in more that 10%, on average, statement coverage and mutation score when compared to the rates of manual test set, keeping a reasonable cost. Therefore, we advocate that we should concentrate the use of manually generated test sets on testing essential and critical parts of the software.
测试是软件质量保证的强制性活动。关于被测软件的知识对于生成高质量的测试用例是必要的,但是要执行80%以上的源代码并不是一项容易的任务,并且需要对其实现的业务规则有深入的了解。在本文中,我们将研究为Java程序手动生成测试集与自动生成测试集的充分性、有效性和成本。我们观察到,通常,手动测试集比自动生成的测试集确定更高的语句覆盖率和突变分数。但是认识到的一个有趣的方面是,自动生成的测试集是对手动测试集的补充。当我们将手动测试集与自动化测试集结合起来时,与手动测试集的比率相比,结果测试集平均克服了超过10%的语句覆盖率和突变分数,保持了合理的成本。因此,我们主张我们应该集中使用手动生成的测试集来测试软件的基本和关键部分。
{"title":"The complementary aspect of automatically and manually generated test case sets","authors":"A. Vincenzi, T. Bachiega, Daniel G. de Oliveira, S. Souza, J. Maldonado","doi":"10.1145/2994291.2994295","DOIUrl":"https://doi.org/10.1145/2994291.2994295","url":null,"abstract":"The test is a mandatory activity for software quality assurance. The knowledge about the software under testing is necessary to generate high-quality test cases, but to execute more than 80% of its source code is not an easy task, and demands an in-depth knowledge of the business rules it implements. In this article, we investigate the adequacy, effectiveness, and cost of manually generated test sets versus automatically generated test sets for Java programs. We observed that, in general, manual test sets determine higher statement coverage and mutation score than automatically generated test sets. But one interesting aspect recognized is that the automatically generated test sets are complementary to the manual test set. When we combined manual with automated test sets, the resultant test sets overcame in more that 10%, on average, statement coverage and mutation score when compared to the rates of manual test set, keeping a reasonable cost. Therefore, we advocate that we should concentrate the use of manually generated test sets on testing essential and critical parts of the software.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133048665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Automated workflow regression testing for multi-tenant SaaS: integrated support in self-service configuration dashboard 多租户SaaS的自动化工作流回归测试:自助配置仪表板中的集成支持
Majid Makki, D. Landuyt, W. Joosen
Single-instance multi-tenant SaaS applications allow tenant administrators to (extensively) customize the application according to the requirements of their organizations. In the specific case of workflow-driven applications, the SaaS provider may offer a set of pre-defined workflow activities and leave their composition to the tenant administrators. In such cases, the tenant administrator can instantiate new variants of the application without deploying new software. This effectively makes these tenant administrators part of the DevOps team, and in turn creates the need for the SaaS provider to provide them with Quality Assurance tool support. One such tool is a regression testing framework that allows them to make sure that a new version of a workflow can behave similarly as to a successful execution of a previous version. This paper highlights the potential and discusses the inherent challenges of running regression tests on workflows in the production environment of a multi-tenant SaaS application and outlines a solution in terms of architecture and automation techniques for mocking and regression detection under control of tenant administrators.
单实例多租户SaaS应用程序允许租户管理员根据其组织的需求(广泛地)定制应用程序。在工作流驱动的应用程序的特定情况下,SaaS提供商可能会提供一组预定义的工作流活动,并将其组合留给租户管理员。在这种情况下,租户管理员可以实例化应用程序的新变体,而无需部署新软件。这有效地使这些租户管理员成为DevOps团队的一部分,反过来又需要SaaS提供商为他们提供质量保证工具支持。一个这样的工具是回归测试框架,它允许他们确保工作流的新版本可以像成功执行前一个版本一样运行。本文强调了在多租户SaaS应用程序的生产环境中对工作流运行回归测试的潜力,并讨论了其固有的挑战,并概述了在租户管理员控制下用于模拟和回归检测的体系结构和自动化技术方面的解决方案。
{"title":"Automated workflow regression testing for multi-tenant SaaS: integrated support in self-service configuration dashboard","authors":"Majid Makki, D. Landuyt, W. Joosen","doi":"10.1145/2994291.2994302","DOIUrl":"https://doi.org/10.1145/2994291.2994302","url":null,"abstract":"Single-instance multi-tenant SaaS applications allow tenant administrators to (extensively) customize the application according to the requirements of their organizations. In the specific case of workflow-driven applications, the SaaS provider may offer a set of pre-defined workflow activities and leave their composition to the tenant administrators. In such cases, the tenant administrator can instantiate new variants of the application without deploying new software. This effectively makes these tenant administrators part of the DevOps team, and in turn creates the need for the SaaS provider to provide them with Quality Assurance tool support. One such tool is a regression testing framework that allows them to make sure that a new version of a workflow can behave similarly as to a successful execution of a previous version. This paper highlights the potential and discusses the inherent challenges of running regression tests on workflows in the production environment of a multi-tenant SaaS application and outlines a solution in terms of architecture and automation techniques for mocking and regression detection under control of tenant administrators.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127647075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Mitigating (and exploiting) test reduction slippage 减轻(并利用)测试减少滑动
J. Holmes, Alex Groce, Mohammad Amin Alipour
Reducing the size of tests, typically by delta debugging or a related algorithm, is a critical component of effective automated testing and debugging. Automatically generated or user-submitted tests are often far longer than required, full of unnecessary components that make debugging difficult. Test reduction algorithms automatically remove components of such tests, while preserving the property that the test fails. Unfortunately, reduction can sometimes transform a failing test that detects a subtle, critical, and previously unknown fault into a test that detects a trivial-to-find, unimportant, and already known fault. When reducing a test detecting fault(s) F produces a test that does not detect the same F, this is known as slippage. In the case where an interesting fault slips to an uninteresting fault, slippage is a problem, and must be avoided. However, slippage can also be beneficial, when a long test can be reduced to detect a fault that has not otherwise been detected (including by the original test). While traditional delta debugging only produces one reduced test, the concept of slippage suggests an alternative approach, where the output of reduction is a set of reduced tests, in order to avoid problematic slippage and induce beneficial slippage. In this paper, we present preliminary efforts to understand slippage, and compare two approaches to slippage mitigation.
减少测试的大小,通常通过增量调试或相关的算法,是有效的自动化测试和调试的关键组成部分。自动生成或用户提交的测试通常比所需的长得多,充满了不必要的组件,使调试变得困难。测试缩减算法自动删除此类测试的组件,同时保留测试失败的属性。不幸的是,简化有时会将检测到细微的、关键的和先前未知的错误的失败测试转换为检测到微不足道的、不重要的和已知的错误的测试。当减少一个检测故障F的测试时,产生一个不检测相同F的测试,这被称为滑动。当一个有趣的断层向另一个不有趣的断层滑动时,滑移是一个问题,必须避免。然而,滑移也可能是有益的,当一个长时间的测试可以减少,以检测一个没有被检测到的故障(包括原始测试)。虽然传统的增量调试只产生一个减少的测试,但滑动的概念提出了一种替代方法,其中减少的输出是一组减少的测试,以避免有问题的滑动并诱导有益的滑动。在本文中,我们提出了初步的努力,以了解滑移,并比较两种方法的滑移缓解。
{"title":"Mitigating (and exploiting) test reduction slippage","authors":"J. Holmes, Alex Groce, Mohammad Amin Alipour","doi":"10.1145/2994291.2994301","DOIUrl":"https://doi.org/10.1145/2994291.2994301","url":null,"abstract":"Reducing the size of tests, typically by delta debugging or a related algorithm, is a critical component of effective automated testing and debugging. Automatically generated or user-submitted tests are often far longer than required, full of unnecessary components that make debugging difficult. Test reduction algorithms automatically remove components of such tests, while preserving the property that the test fails. Unfortunately, reduction can sometimes transform a failing test that detects a subtle, critical, and previously unknown fault into a test that detects a trivial-to-find, unimportant, and already known fault. When reducing a test detecting fault(s) F produces a test that does not detect the same F, this is known as slippage. In the case where an interesting fault slips to an uninteresting fault, slippage is a problem, and must be avoided. However, slippage can also be beneficial, when a long test can be reduced to detect a fault that has not otherwise been detected (including by the original test). While traditional delta debugging only produces one reduced test, the concept of slippage suggests an alternative approach, where the output of reduction is a set of reduced tests, in order to avoid problematic slippage and induce beneficial slippage. In this paper, we present preliminary efforts to understand slippage, and compare two approaches to slippage mitigation.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126534724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Towards an MDE-based approach to test entity reconciliation applications 迈向基于mde的方法来测试实体协调应用程序
J. G. Enríquez, Raquel Blanco, Francisco José Domínguez Mayo, J. Tuya, María José Escalona Cuaresma
The management of large volumes of data has given rise to significant challenges to the entity reconciliation problem (which refers to combining data from different sources for a unified vision) due to the fact that the data are becoming more unstructured, unclean and incomplete, need to be more linked, etc. Testing the applications that implement the entity reconciliation problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. In this paper, we present a first approach, based on MDE, which allows the creation of test models for the integration testing of entity reconciliation applications.
由于数据变得越来越非结构化、不干净和不完整,需要更多的链接等事实,对大量数据的管理对实体协调问题(指将来自不同来源的数据组合起来以实现统一的愿景)提出了重大挑战。测试实现实体协调问题的应用程序对于确保协调过程的正确性和协调数据的质量至关重要。在本文中,我们提出了第一种基于MDE的方法,它允许为实体协调应用程序的集成测试创建测试模型。
{"title":"Towards an MDE-based approach to test entity reconciliation applications","authors":"J. G. Enríquez, Raquel Blanco, Francisco José Domínguez Mayo, J. Tuya, María José Escalona Cuaresma","doi":"10.1145/2994291.2994303","DOIUrl":"https://doi.org/10.1145/2994291.2994303","url":null,"abstract":"The management of large volumes of data has given rise to significant challenges to the entity reconciliation problem (which refers to combining data from different sources for a unified vision) due to the fact that the data are becoming more unstructured, unclean and incomplete, need to be more linked, etc. Testing the applications that implement the entity reconciliation problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. In this paper, we present a first approach, based on MDE, which allows the creation of test models for the integration testing of entity reconciliation applications.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"218 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124318776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
PredSym: estimating software testing budget for a bug-free release PredSym:为一个没有bug的版本估算软件测试预算
Arnamoy Bhattacharyya, Timur Malgazhdarov
Symbolic execution tools are widely used during a software testing phase for finding hidden bugs and software vulnerabilities. Accurately predicting the time required by a symbolic execution tool to explore a chosen code coverage helps in planning the budget required in the testing phase. In this work, we present an automatic tool, PredSym, that uses static program features to predict the coverage explored by a symbolic execution tool - KLEE, for a given time budget and to predict the time required to explore a given coverage. PredSym uses LASSO regression to build a model that does not suffer from overfitting and can predict both the coverage and the time with a worst error of 10% on unseen datapoints. PredSym also gives code improvement suggestions based on a heuristic for improving the coverage generated by KLEE.
符号执行工具在软件测试阶段被广泛用于发现隐藏的错误和软件漏洞。准确地预测符号执行工具探索选定的代码覆盖率所需的时间,有助于规划测试阶段所需的预算。在这项工作中,我们提出了一个自动工具PredSym,它使用静态程序特征来预测符号执行工具(KLEE)在给定时间预算下探索的覆盖范围,并预测探索给定覆盖范围所需的时间。PredSym使用LASSO回归来建立一个模型,该模型不会受到过拟合的影响,并且可以在未见过的数据点上以10%的最大误差预测覆盖率和时间。PredSym还提供了基于启发式方法的代码改进建议,以改进由KLEE生成的覆盖率。
{"title":"PredSym: estimating software testing budget for a bug-free release","authors":"Arnamoy Bhattacharyya, Timur Malgazhdarov","doi":"10.1145/2994291.2994294","DOIUrl":"https://doi.org/10.1145/2994291.2994294","url":null,"abstract":"Symbolic execution tools are widely used during a software testing phase for finding hidden bugs and software vulnerabilities. Accurately predicting the time required by a symbolic execution tool to explore a chosen code coverage helps in planning the budget required in the testing phase. In this work, we present an automatic tool, PredSym, that uses static program features to predict the coverage explored by a symbolic execution tool - KLEE, for a given time budget and to predict the time required to explore a given coverage. PredSym uses LASSO regression to build a model that does not suffer from overfitting and can predict both the coverage and the time with a worst error of 10% on unseen datapoints. PredSym also gives code improvement suggestions based on a heuristic for improving the coverage generated by KLEE.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"363 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123559295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
EventFlowSlicer: goal based test generation for graphical user interfaces EventFlowSlicer:为图形用户界面生成基于目标的测试
Jonathan A. Saddler, Myra B. Cohen
Automated test generation techniques for graphical user interfaces include model-based approaches that generate tests from a graph or state machine model, capture-replay methods that require the user to demonstrate each test case, and pattern-based approaches that provide templates for abstract test cases. There has been little work, however, in automated goal-based testing, where the goal is a realistic user task, a function, or an abstract behavior. Recent work in human performance regression testing has shown that there is a need for generating multiple test cases that execute the same user task in different ways, however that work does not have an efficient way to generate tests and only a single type of goal has been considered. In this paper we expand the notion of goal based interface testing to generate tests for a variety of goals. We develop a direct test generation technique, EventFlowSlicer, that is more efficient than that used in human performance regression testing, reducing run times by 92.5
图形用户界面的自动化测试生成技术包括基于模型的方法(从图或状态机模型生成测试)、要求用户演示每个测试用例的捕获-重放方法,以及为抽象测试用例提供模板的基于模式的方法。然而,在基于目标的自动化测试中很少有工作,其中目标是实际的用户任务、功能或抽象行为。最近在人类性能回归测试方面的工作表明,有必要生成以不同方式执行相同用户任务的多个测试用例,但是该工作没有有效的方法来生成测试,并且只考虑了单一类型的目标。在本文中,我们扩展了基于目标的接口测试的概念,以生成针对各种目标的测试。我们开发了一种直接测试生成技术EventFlowSlicer,它比人类性能回归测试中使用的技术更有效,将运行时间减少了92.5
{"title":"EventFlowSlicer: goal based test generation for graphical user interfaces","authors":"Jonathan A. Saddler, Myra B. Cohen","doi":"10.1145/2994291.2994293","DOIUrl":"https://doi.org/10.1145/2994291.2994293","url":null,"abstract":"Automated test generation techniques for graphical user interfaces include model-based approaches that generate tests from a graph or state machine model, capture-replay methods that require the user to demonstrate each test case, and pattern-based approaches that provide templates for abstract test cases. There has been little work, however, in automated goal-based testing, where the goal is a realistic user task, a function, or an abstract behavior. Recent work in human performance regression testing has shown that there is a need for generating multiple test cases that execute the same user task in different ways, however that work does not have an efficient way to generate tests and only a single type of goal has been considered. In this paper we expand the notion of goal based interface testing to generate tests for a variety of goals. We develop a direct test generation technique, EventFlowSlicer, that is more efficient than that used in human performance regression testing, reducing run times by 92.5","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134170893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Development and maintenance efforts testing graphical user interfaces: a comparison 开发和维护工作测试图形用户界面:比较
Antonia Kresse, Peter M. Kruse
For testing of graphical user interfaces many tools exists. The aim of this work is a statement regarding the advantages and disadvantages of various testing tools with regard to their use in the economic context to be taken. It is compared, inter alia, whether there are differences in the generations of test tools in terms of finding defects and which tool has the lowest development and maintenance costs. Results show that with QF-Test test suites can be created the quickest while EggPlant has the shortest maintenance time. TestComplete performs worse in both disciplines. For test robustness, no clear picture can be drawn. The selection of a test tool is typically done once in a project at the beginning and should be considered carefully.
对于图形用户界面的测试,存在许多工具。这项工作的目的是一个关于各种测试工具的优点和缺点的声明,考虑到它们在经济背景下的使用。它是比较的,特别是,在发现缺陷方面,测试工具的世代之间是否存在差异,以及哪个工具具有最低的开发和维护成本。结果表明,使用QF-Test可以最快地创建测试套件,而茄子的维护时间最短。TestComplete在这两个方面的表现都较差。对于测试稳健性,没有清晰的图像可以绘制。测试工具的选择通常在项目开始时完成一次,应该仔细考虑。
{"title":"Development and maintenance efforts testing graphical user interfaces: a comparison","authors":"Antonia Kresse, Peter M. Kruse","doi":"10.1145/2994291.2994299","DOIUrl":"https://doi.org/10.1145/2994291.2994299","url":null,"abstract":"For testing of graphical user interfaces many tools exists. The aim of this work is a statement regarding the advantages and disadvantages of various testing tools with regard to their use in the economic context to be taken. It is compared, inter alia, whether there are differences in the generations of test tools in terms of finding defects and which tool has the lowest development and maintenance costs. Results show that with QF-Test test suites can be created the quickest while EggPlant has the shortest maintenance time. TestComplete performs worse in both disciplines. For test robustness, no clear picture can be drawn. The selection of a test tool is typically done once in a project at the beginning and should be considered carefully.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123490381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Modernizing hierarchical delta debugging 现代化的分层增量调试
Renáta Hodován, Ákos Kiss
Programmers tasked with the fixing of a bug prefer working on a minimal test case where every single bit is needed to reproduce the failure. However, cutting off the excess parts of a potentially large test case can be a tedious and time-consuming task if performed manually, which has led to the research and development of automated test case reduction techniques. The decade-old Hierarchical Delta Debugging (HDD) algorithm targets structured test inputs, parses them with the help of grammars and applies the minimizing Delta Debugging algorithm to the built trees. We have investigated this algorithm and its implementation, and propose improvements in this paper to address the found shortcomings. We argue that using extended context-free grammars with HDD is beneficial in several ways and the experimental evaluation of our modernized HDD implementation, called Picireny, supports the outlined ideas: the reduced outputs are significantly smaller (by circa 25-40%) on the investigated test cases than those produced by the reference HDD implementation using standard context-free grammars. These results, together with the technical improvements that ease the use of the modernized tool, can hopefully help spreading the adaptation of HDD in practice.
负责修复bug的程序员更喜欢在最小的测试用例上工作,其中每个比特都需要重现故障。然而,如果手动执行,切断潜在的大型测试用例的多余部分可能是一项乏味且耗时的任务,这导致了自动化测试用例缩减技术的研究和开发。已有十年历史的分层增量调试(HDD)算法以结构化测试输入为目标,在语法的帮助下对其进行解析,并将最小化增量调试算法应用于构建的树。我们研究了该算法及其实现,并在本文中提出了改进意见,以解决所发现的不足。我们认为,在HDD中使用扩展的上下文无关语法在几个方面是有益的,我们的现代化HDD实现的实验性评估,称为Picireny,支持概述的想法:在调查的测试用例中,减少的输出明显小于使用标准上下文无关语法的参考HDD实现产生的输出(大约25-40%)。这些结果,再加上简化现代化工具使用的技术改进,有望有助于在实践中推广HDD的适应性。
{"title":"Modernizing hierarchical delta debugging","authors":"Renáta Hodován, Ákos Kiss","doi":"10.1145/2994291.2994296","DOIUrl":"https://doi.org/10.1145/2994291.2994296","url":null,"abstract":"Programmers tasked with the fixing of a bug prefer working on a minimal test case where every single bit is needed to reproduce the failure. However, cutting off the excess parts of a potentially large test case can be a tedious and time-consuming task if performed manually, which has led to the research and development of automated test case reduction techniques. The decade-old Hierarchical Delta Debugging (HDD) algorithm targets structured test inputs, parses them with the help of grammars and applies the minimizing Delta Debugging algorithm to the built trees. We have investigated this algorithm and its implementation, and propose improvements in this paper to address the found shortcomings. We argue that using extended context-free grammars with HDD is beneficial in several ways and the experimental evaluation of our modernized HDD implementation, called Picireny, supports the outlined ideas: the reduced outputs are significantly smaller (by circa 25-40%) on the investigated test cases than those produced by the reference HDD implementation using standard context-free grammars. These results, together with the technical improvements that ease the use of the modernized tool, can hopefully help spreading the adaptation of HDD in practice.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115518528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Complete IOCO test cases: a case study 完整的IOCO测试用例:案例研究
S. C. Paiva, A. Simão, M. Varshosaz, M. Mousavi
Input/Output Transition Systems (IOTSs) have been widely used as test models in model-based testing. Traditionally, input output conformance testing (IOCO) has been used to generate random test cases from IOTSs. A recent test case generation method for IOTSs, called Complete IOCO, applies fault models to obtain complete test suites with guaranteed fault coverage for IOTSs. This paper measures the efficiency of Complete IOCO in comparison with the traditional IOCO test case generation implemented in the JTorX tool. To this end, we use a case study involving five specification models from the automotive and the railway domains. Faulty mutations of the specifications were produced in order to compare the efficiency of both test generation methods in killing them. The results indicate that Complete IOCO is more efficient in detecting deep faults in large state spaces while IOCO is more efficient in detecting shallow faults in small state spaces.
在基于模型的测试中,输入/输出转换系统(IOTSs)作为测试模型被广泛使用。传统上,输入输出一致性测试(IOCO)被用于从iots生成随机测试用例。最近一种用于iots的测试用例生成方法称为Complete IOCO,它应用故障模型来获得具有保证iots故障覆盖的完整测试套件。本文测量了完全IOCO的效率,并与JTorX工具中实现的传统IOCO测试用例生成进行了比较。为此,我们使用了一个案例研究,涉及来自汽车和铁路领域的五个规范模型。为了比较两种测试生成方法杀死它们的效率,产生了规格的错误突变。结果表明,完全IOCO在大状态空间检测深断层时效率更高,而在小状态空间检测浅断层时效率更高。
{"title":"Complete IOCO test cases: a case study","authors":"S. C. Paiva, A. Simão, M. Varshosaz, M. Mousavi","doi":"10.1145/2994291.2994297","DOIUrl":"https://doi.org/10.1145/2994291.2994297","url":null,"abstract":"Input/Output Transition Systems (IOTSs) have been widely used as test models in model-based testing. Traditionally, input output conformance testing (IOCO) has been used to generate random test cases from IOTSs. A recent test case generation method for IOTSs, called Complete IOCO, applies fault models to obtain complete test suites with guaranteed fault coverage for IOTSs. This paper measures the efficiency of Complete IOCO in comparison with the traditional IOCO test case generation implemented in the JTorX tool. To this end, we use a case study involving five specification models from the automotive and the railway domains. Faulty mutations of the specifications were produced in order to compare the efficiency of both test generation methods in killing them. The results indicate that Complete IOCO is more efficient in detecting deep faults in large state spaces while IOCO is more efficient in detecting shallow faults in small state spaces.","PeriodicalId":255079,"journal":{"name":"Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122549830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Proceedings of the 7th International Workshop on Automating Test Case Design, Selection, and Evaluation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1