首页 > 最新文献

2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)最新文献

英文 中文
An Experimental Evaluation of Web Mutation Operators Web突变算子的实验评价
Upsorn Praphamontripong, A. Offutt, Lin Deng, Jingjing Gu
While modern web development technologies enhance the capabilities of web applications, they introduce challenges for testers. This paper introduces, evaluates, and refines web mutation operators that target interaction faults in web applications. An experimental study is conducted on 11 subject web applications using 15 web mutation operators. The effectiveness of 12 independently developed test sets are analyzed in terms of how well they kill web mutants. Web mutation adequate tests are compared with independently created test sets to evaluate the web mutation operators. Tests designed to satisfy the web mutation testing criterion provide 100% coverage while the tests designed to satisfy traditional testing criteria provide, on average, 47%coverage. The paper also analyzes which mutants and mutation operators the traditional tests had difficulty killing. We found that some types of mutants that are not particularly hard to kill were missed by all traditional tests. Additionally, web mutation testing produces very few equivalent mutants.
虽然现代web开发技术增强了web应用程序的功能,但它们给测试人员带来了挑战。本文介绍、评估并改进了针对web应用中交互故障的web突变算子。利用15种网络变异算子对11个主题网络应用进行了实验研究。对12个独立开发的测试集的有效性进行了分析,以确定它们杀死网络突变体的效果。将Web突变充分测试与独立创建的测试集进行比较,以评估Web突变操作符。设计用于满足网络突变测试标准的测试提供100%的覆盖率,而设计用于满足传统测试标准的测试平均提供47%的覆盖率。分析了传统方法难以杀灭的突变体和突变算子。我们发现,所有传统测试都遗漏了一些不是特别难杀死的突变类型。此外,网络突变测试产生的等效突变很少。
{"title":"An Experimental Evaluation of Web Mutation Operators","authors":"Upsorn Praphamontripong, A. Offutt, Lin Deng, Jingjing Gu","doi":"10.1109/ICSTW.2016.17","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.17","url":null,"abstract":"While modern web development technologies enhance the capabilities of web applications, they introduce challenges for testers. This paper introduces, evaluates, and refines web mutation operators that target interaction faults in web applications. An experimental study is conducted on 11 subject web applications using 15 web mutation operators. The effectiveness of 12 independently developed test sets are analyzed in terms of how well they kill web mutants. Web mutation adequate tests are compared with independently created test sets to evaluate the web mutation operators. Tests designed to satisfy the web mutation testing criterion provide 100% coverage while the tests designed to satisfy traditional testing criteria provide, on average, 47%coverage. The paper also analyzes which mutants and mutation operators the traditional tests had difficulty killing. We found that some types of mutants that are not particularly hard to kill were missed by all traditional tests. Additionally, web mutation testing produces very few equivalent mutants.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129056060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Diversity-Aware Mutation Adequacy Criterion for Improving Fault Detection Capability 提高故障检测能力的多样性感知突变充分性准则
Donghwan Shin, S. Yoo, Doo-Hwan Bae
Many existing testing techniques adopt diversity as an important criterion for the selection and prioritization of tests. However, mutation adequacy has been content with simply maximizing the number of mutants that have been killed. We propose a novel mutation adequacy criterion that considers the diversity in the relationship between tests and mutants, as well as whether mutants are killed. Intuitively, the proposed criterion is based on the notion that mutants can be distinguished by the sets of tests that kill them. A test suite is deemed adequate by our criterion if the test suite distinguishes all mutants in terms of their kill patterns. Our hypothesis is that, simply by using a stronger adequacy criterion, it is possible to improve fault detection capabilities of mutation-adequate test suites. The empirical evaluation selects tests for real world applications using the proposed mutation adequacy criterion to test our hypothesis. The results show that, for real world faults, test suites adequate to our criterion can increase the fault detection success rate by up to 76.8 percentage points compared to test suites adequate to the traditional criterion.
许多现有的测试技术采用多样性作为测试选择和优先排序的重要标准。然而,突变充分性已经满足于仅仅最大化被杀死的突变体的数量。我们提出了一个新的突变充分性标准,该标准考虑了测试和突变体之间关系的多样性,以及突变体是否被杀死。直观地说,这个标准是基于这样一种观念,即可以通过杀死突变体的一系列测试来区分突变体。根据我们的标准,如果测试套件根据它们的杀死模式区分了所有的突变体,那么测试套件就被认为是足够的。我们的假设是,仅仅通过使用更强的充分性标准,就有可能提高适当突变测试套件的故障检测能力。经验评估选择测试,为现实世界的应用使用提出的突变充分性准则来检验我们的假设。结果表明,对于真实世界的故障,与传统标准的测试套件相比,适合我们标准的测试套件可以将故障检测成功率提高76.8个百分点。
{"title":"Diversity-Aware Mutation Adequacy Criterion for Improving Fault Detection Capability","authors":"Donghwan Shin, S. Yoo, Doo-Hwan Bae","doi":"10.1109/ICSTW.2016.37","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.37","url":null,"abstract":"Many existing testing techniques adopt diversity as an important criterion for the selection and prioritization of tests. However, mutation adequacy has been content with simply maximizing the number of mutants that have been killed. We propose a novel mutation adequacy criterion that considers the diversity in the relationship between tests and mutants, as well as whether mutants are killed. Intuitively, the proposed criterion is based on the notion that mutants can be distinguished by the sets of tests that kill them. A test suite is deemed adequate by our criterion if the test suite distinguishes all mutants in terms of their kill patterns. Our hypothesis is that, simply by using a stronger adequacy criterion, it is possible to improve fault detection capabilities of mutation-adequate test suites. The empirical evaluation selects tests for real world applications using the proposed mutation adequacy criterion to test our hypothesis. The results show that, for real world faults, test suites adequate to our criterion can increase the fault detection success rate by up to 76.8 percentage points compared to test suites adequate to the traditional criterion.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117030156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Combinatorial Testing: Implementations in Solutions Testing 组合测试:解决方案测试中的实现
Anwar Sherif
It is apparent that implementing Combinatorial (CT) Testing, being relatively new to many organizations, comes with both technical and non-technical challenges. This paper focuses the technical challenges and how they were addressed in implementing CT for solutions testing. The author believes that these challenges are not unique to solution testing, and that they are equally applicable to implementations of CT in general.
很明显,对于许多组织来说,实现组合(CT)测试是相对较新的,它带来了技术和非技术方面的挑战。本文的重点是技术挑战,以及如何在实施CT解决方案测试中解决这些挑战。作者认为,这些挑战并不是解决方案测试所独有的,它们同样适用于一般的CT实现。
{"title":"Combinatorial Testing: Implementations in Solutions Testing","authors":"Anwar Sherif","doi":"10.1109/ICSTW.2016.39","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.39","url":null,"abstract":"It is apparent that implementing Combinatorial (CT) Testing, being relatively new to many organizations, comes with both technical and non-technical challenges. This paper focuses the technical challenges and how they were addressed in implementing CT for solutions testing. The author believes that these challenges are not unique to solution testing, and that they are equally applicable to implementations of CT in general.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126869340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using Petri Nets to Test Concurrent Behavior of Web Applications 使用Petri网测试Web应用程序的并发行为
Sunitha Thummala, A. Offutt
Web applications are used by companies across the world to deploy their products and services. Because of the technologies used to build web applications, they are by nature concurrent, for example, multiple users can have problems when accessing the same limited resources. The combination of the stateless nature of web applications and concurrent behavior creates unique challenges. Models have traditionally been used to test specific aspects of systems. However, existing web application models do not effectively address the combination of concurrent behavior and stateless protocol. This research project is using a novel Petri net-based model for web applications. This paper defines a novel way to design model-based coverage criteria tests that address concurrent behavior involving HTTP browser-based sessions. A tool that extracts the Petri net model of a web application has been developed and used to study ten web applications totaling 17,535 lines of code. The tool was used to extract the model and generate tests, revealing 36 naturally occurring software faults that had not been found during previous testing.
世界各地的公司都使用Web应用程序来部署他们的产品和服务。由于用于构建web应用程序的技术,它们本质上是并发的,例如,多个用户在访问相同的有限资源时可能会出现问题。web应用程序的无状态特性和并发行为的结合带来了独特的挑战。传统上,模型被用来测试系统的特定方面。然而,现有的web应用程序模型并不能有效地处理并发行为和无状态协议的组合。本研究项目使用一种新颖的基于Petri网的web应用程序模型。本文定义了一种设计基于模型的覆盖标准测试的新方法,用于处理涉及基于HTTP浏览器的会话的并发行为。开发了一个提取web应用程序的Petri网模型的工具,并用于研究10个web应用程序,总计17,535行代码。该工具用于提取模型并生成测试,揭示了36个在以前的测试中未发现的自然发生的软件错误。
{"title":"Using Petri Nets to Test Concurrent Behavior of Web Applications","authors":"Sunitha Thummala, A. Offutt","doi":"10.1109/ICSTW.2016.15","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.15","url":null,"abstract":"Web applications are used by companies across the world to deploy their products and services. Because of the technologies used to build web applications, they are by nature concurrent, for example, multiple users can have problems when accessing the same limited resources. The combination of the stateless nature of web applications and concurrent behavior creates unique challenges. Models have traditionally been used to test specific aspects of systems. However, existing web application models do not effectively address the combination of concurrent behavior and stateless protocol. This research project is using a novel Petri net-based model for web applications. This paper defines a novel way to design model-based coverage criteria tests that address concurrent behavior involving HTTP browser-based sessions. A tool that extracts the Petri net model of a web application has been developed and used to study ten web applications totaling 17,535 lines of code. The tool was used to extract the model and generate tests, revealing 36 naturally occurring software faults that had not been found during previous testing.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126068787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On strong mutation and subsuming mutants 强突变体和包容突变体
B. Lindström, András Márki
Mutation analysis is a powerful technique for software testing but it is also known to be computationally expensive. The main reason for the high computational cost is that many of the mutants are redundant and thus, do not contribute to the quality of the test suite. One of the most promising approaches to avoid producing redundant mutants is to identify subsumption relations among mutants, preferably before these are generated. Such relations have for example, been identified at an operator level for mutants created by the ROR operator. This reduced set of non-redundant mutants has been used in several recent studies and is also the default option in at least one mutation testing tool that supports strong mutation. This raises questions on whether the identified subsumption relations between the mutants hold in a context of strong mutation or variants of weak mutation that require some limited error propagation (firm mutation). We have conducted an experimental study to investigate the subsumption relations in the context of strong or firm mutation. We observed that it is possible to create a test suite that is 100% adequate for the reduced set of mutants while not being 100% adequate for the complete set. This shows that the subsumption relations do not hold for strong or firm mutation. We provide several examples on this behavior and discuss the root causes. Our findings are important since strong and firm mutation both are frequently used to evaluate test suites and testing criteria. The choice of whether to use a reduced set of mutants or an entire set should however, not be made without consideration of the context in which they are used (i.e., strong, firm or weak mutation) since the subsumption relations between ROR mutants do not hold for strong or firm mutation. Just as redundant mutants can give an overestimation of the mutation score for a test suite, using the reduced set of mutants can give an underestimation if used together with strong or firm mutation. Results reported from such studies should therefore, be accompanied by information on whether the reduced or complete set of mutants was used and if the researchers used strong, firm or weak mutation.
突变分析是一种强大的软件测试技术,但众所周知,它的计算成本很高。计算成本高的主要原因是许多突变是冗余的,因此对测试套件的质量没有贡献。避免产生冗余突变体的最有希望的方法之一是识别突变体之间的包容关系,最好是在这些突变体产生之前。例如,对于由ROR操作符创建的突变,已经在操作符级别确定了这样的关系。这种减少的非冗余突变集已在最近的几项研究中使用,也是至少一种支持强突变的突变检测工具的默认选择。这就提出了一个问题,即突变体之间确定的包容关系是否适用于强突变或需要一些有限错误传播(强突变)的弱突变的变体。我们进行了一项实验研究,探讨了强突变或强突变背景下的包容关系。我们观察到,有可能创建一个测试套件,它100%适合减少的突变集,而不是100%适合完整的集合。这表明,包容关系并不适用于强突变或强突变。我们将提供有关这种行为的几个示例,并讨论其根本原因。我们的发现很重要,因为强突变和强突变都经常用于评估测试套件和测试标准。但是,如果不考虑使用突变的上下文(即强突变、强突变或弱突变),就不应该选择使用突变的约简集还是整个集,因为ROR突变之间的包容关系不适用于强突变或强突变。正如冗余突变可能会高估测试套件的突变得分一样,如果与强突变或强突变一起使用,则使用减少的突变集可能会导致低估。因此,这些研究报告的结果应该附有信息,说明是否使用了减少的或完整的突变集,以及研究人员是使用了强突变、强突变还是弱突变。
{"title":"On strong mutation and subsuming mutants","authors":"B. Lindström, András Márki","doi":"10.1109/ICSTW.2016.28","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.28","url":null,"abstract":"Mutation analysis is a powerful technique for software testing but it is also known to be computationally expensive. The main reason for the high computational cost is that many of the mutants are redundant and thus, do not contribute to the quality of the test suite. One of the most promising approaches to avoid producing redundant mutants is to identify subsumption relations among mutants, preferably before these are generated. Such relations have for example, been identified at an operator level for mutants created by the ROR operator. This reduced set of non-redundant mutants has been used in several recent studies and is also the default option in at least one mutation testing tool that supports strong mutation. This raises questions on whether the identified subsumption relations between the mutants hold in a context of strong mutation or variants of weak mutation that require some limited error propagation (firm mutation). We have conducted an experimental study to investigate the subsumption relations in the context of strong or firm mutation. We observed that it is possible to create a test suite that is 100% adequate for the reduced set of mutants while not being 100% adequate for the complete set. This shows that the subsumption relations do not hold for strong or firm mutation. We provide several examples on this behavior and discuss the root causes. Our findings are important since strong and firm mutation both are frequently used to evaluate test suites and testing criteria. The choice of whether to use a reduced set of mutants or an entire set should however, not be made without consideration of the context in which they are used (i.e., strong, firm or weak mutation) since the subsumption relations between ROR mutants do not hold for strong or firm mutation. Just as redundant mutants can give an overestimation of the mutation score for a test suite, using the reduced set of mutants can give an underestimation if used together with strong or firm mutation. Results reported from such studies should therefore, be accompanied by information on whether the reduced or complete set of mutants was used and if the researchers used strong, firm or weak mutation.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124932092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Experience Report: White Box Test Case Generation for Automotive Embedded Software 经验报告:汽车嵌入式软件白盒测试用例生成
A. Hoffmann, Jochen Quante, M. Woehrle
Testing software based on symbolic execution is a promising and well-researched approach. Recently, various dynamic symbolic execution approaches have been proposed that are of interest for industrial practice. However, the corresponding tools are typically not directly usable on model-based languages that are predominantly used in the automotive sector. To this end, we use an internal software analysis framework for model-based, embedded software that allows us to easily integrate various approaches leveraging static and dynamic symbolic execution. In our case study, we describe idiosyncrasies of automotive embedded software and show the benefits of leveraging symbolic information to generate test cases. We study and compare different techniques for generating test cases. We evaluate all techniques empirically on relevant, real-world programs from engine control.
基于符号执行的软件测试是一种很有前途的研究方法。最近,各种动态符号执行方法被提出,并引起了工业实践的兴趣。然而,相应的工具通常不能直接用于汽车领域中主要使用的基于模型的语言。为此,我们为基于模型的嵌入式软件使用内部软件分析框架,该框架允许我们轻松地集成利用静态和动态符号执行的各种方法。在我们的案例研究中,我们描述了汽车嵌入式软件的特性,并展示了利用符号信息来生成测试用例的好处。我们研究并比较了生成测试用例的不同技术。我们根据相关的、真实世界的引擎控制程序来评估所有技术。
{"title":"Experience Report: White Box Test Case Generation for Automotive Embedded Software","authors":"A. Hoffmann, Jochen Quante, M. Woehrle","doi":"10.1109/ICSTW.2016.18","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.18","url":null,"abstract":"Testing software based on symbolic execution is a promising and well-researched approach. Recently, various dynamic symbolic execution approaches have been proposed that are of interest for industrial practice. However, the corresponding tools are typically not directly usable on model-based languages that are predominantly used in the automotive sector. To this end, we use an internal software analysis framework for model-based, embedded software that allows us to easily integrate various approaches leveraging static and dynamic symbolic execution. In our case study, we describe idiosyncrasies of automotive embedded software and show the benefits of leveraging symbolic information to generate test cases. We study and compare different techniques for generating test cases. We evaluate all techniques empirically on relevant, real-world programs from engine control.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125508834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Are We There Yet? How Redundant and Equivalent Mutants Affect Determination of Test Completeness 我们到了吗?冗余和等效突变体如何影响测试完整性的确定
Bob Kurtz, P. Ammann, A. Offutt, M. Kurtz
Mutation score has long been used in research as a metric to measure the effectiveness of testing strategies. This paper presents evidence that mutation score lacks the desired accuracy to determine the completeness of a test suite due to noise introduced by the redundancy inherent in traditional mutation, and that dominator mutation score is a superior metric for this purpose. We evaluate the impact of different levels of redundant and equivalent mutants on mutation score and the ability to determine completeness in developing a mutation-adequate test suite. We conclude that, in the context of our model, redundant mutants make it very difficult to accurately assess test completeness. Equivalent mutants, on the other hand, have little effect on determining completeness. Based on this information, we suggest limits to redundancy and equivalency that mutation tools must achieve to be practical for general use in software testing.
长期以来,突变分数一直是研究中用来衡量测试策略有效性的指标。本文提出的证据表明,由于传统突变中固有的冗余引入的噪声,突变评分缺乏确定测试套件完整性所需的准确性,并且显性突变评分是用于此目的的优越度量。我们评估了不同水平的冗余和等效突变对突变评分的影响,以及在开发足够突变的测试套件时确定完整性的能力。我们的结论是,在我们模型的上下文中,冗余的突变使得准确评估测试完整性变得非常困难。另一方面,等效突变对确定完备性的影响很小。基于这些信息,我们建议对冗余和等效性的限制,突变工具必须达到在软件测试中普遍使用的实用性。
{"title":"Are We There Yet? How Redundant and Equivalent Mutants Affect Determination of Test Completeness","authors":"Bob Kurtz, P. Ammann, A. Offutt, M. Kurtz","doi":"10.1109/ICSTW.2016.41","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.41","url":null,"abstract":"Mutation score has long been used in research as a metric to measure the effectiveness of testing strategies. This paper presents evidence that mutation score lacks the desired accuracy to determine the completeness of a test suite due to noise introduced by the redundancy inherent in traditional mutation, and that dominator mutation score is a superior metric for this purpose. We evaluate the impact of different levels of redundant and equivalent mutants on mutation score and the ability to determine completeness in developing a mutation-adequate test suite. We conclude that, in the context of our model, redundant mutants make it very difficult to accurately assess test completeness. Equivalent mutants, on the other hand, have little effect on determining completeness. Based on this information, we suggest limits to redundancy and equivalency that mutation tools must achieve to be practical for general use in software testing.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121107690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Evaluating the Effectiveness of BEN in Localizing Different Types of Software Fault 评估BEN在不同类型软件故障定位中的有效性
Jaganmohan Chandrasekaran, Laleh Shikh Gholamhossein Ghandehari, Yu Lei, R. Kacker, D. R. Kuhn
Debugging or fault localization is one of the most challenging tasks during software development. Many tools have been developed to reduce the amount of effort and time software developers have to spend on fault localization. In this paper, we evaluate the effectiveness of a fault localization tool called BEN in localizing different types of software fault. Assuming that combinatorial testing has been performed on the subject program, BEN leverages the result obtained from combinatorial testing to perform fault localization. Our evaluation focuses on impact of three properties of software fault on the effectiveness of BEN. The three properties include accessibility, input value sensitivity and control flow sensitivity. A random test set-based approach is used to measure the three properties. The experimental results suggest that BEN is more effective, respectively, in localizing faults of lower accessibility, input value-insensitive faults or control flow-insensitive faults than localizing faults of higher accessibility, input value-sensitive or control flow-sensitive faults in the subject programs. The insights obtained from our evaluation can be applied to other fault localization tools that are similar to BEN, and can be used to identify opportunities for further research on combinatorial testing-based fault localization.
调试或故障定位是软件开发过程中最具挑战性的任务之一。已经开发了许多工具来减少软件开发人员在故障定位上花费的精力和时间。在本文中,我们评估了一个名为BEN的故障定位工具在定位不同类型软件故障方面的有效性。假设对主题程序进行了组合测试,BEN利用组合测试的结果进行故障定位。我们的评估侧重于软件故障的三个属性对本方法有效性的影响。这三个属性包括可访问性、输入值敏感性和控制流敏感性。使用基于随机测试集的方法来测量这三个属性。实验结果表明,相对于可达性较高的故障、输入值敏感的故障和控制流敏感的故障,本算法在可达性较低的故障、输入值不敏感的故障和控制流不敏感的故障的定位上更有效。从我们的评估中获得的见解可以应用于其他类似于BEN的故障定位工具,并可用于确定进一步研究基于组合测试的故障定位的机会。
{"title":"Evaluating the Effectiveness of BEN in Localizing Different Types of Software Fault","authors":"Jaganmohan Chandrasekaran, Laleh Shikh Gholamhossein Ghandehari, Yu Lei, R. Kacker, D. R. Kuhn","doi":"10.1109/ICSTW.2016.44","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.44","url":null,"abstract":"Debugging or fault localization is one of the most challenging tasks during software development. Many tools have been developed to reduce the amount of effort and time software developers have to spend on fault localization. In this paper, we evaluate the effectiveness of a fault localization tool called BEN in localizing different types of software fault. Assuming that combinatorial testing has been performed on the subject program, BEN leverages the result obtained from combinatorial testing to perform fault localization. Our evaluation focuses on impact of three properties of software fault on the effectiveness of BEN. The three properties include accessibility, input value sensitivity and control flow sensitivity. A random test set-based approach is used to measure the three properties. The experimental results suggest that BEN is more effective, respectively, in localizing faults of lower accessibility, input value-insensitive faults or control flow-insensitive faults than localizing faults of higher accessibility, input value-sensitive or control flow-sensitive faults in the subject programs. The insights obtained from our evaluation can be applied to other fault localization tools that are similar to BEN, and can be used to identify opportunities for further research on combinatorial testing-based fault localization.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122480332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Automated Fault-Tolerance Testing 自动化容错测试
Adithya Nagarajan, Ajay Vaddadi
Software Fault Tolerance is an ability of computer software to continue its normal operation despite the presence of system or hardware faults. Most companies are moving towards a microservices-based architecture where complex applications are developed with a suite of small services, each of which communicates using some common protocols like Hypertext Transfer Protocol (HTTP). While this architecture enables agility in software development and go-to-market, it poses a critical challenge of assessing fault tolerance and resiliency of the overall system. A failure in one of the dependent services can cause an unexpected impact on the upstream services causing severe customer facing issues. Such issues are a result of lack of resiliency in the architecture of the system. There is a need for an automated tool to be able to understand the service architecture, topology, and be able to inject faults to assess fault tolerance and resiliency of the system. In this paper, we present Screwdriver -- a new automated solution developed at Groupon to address this need.
软件容错是计算机软件在出现系统或硬件故障的情况下继续正常运行的一种能力。大多数公司正在转向基于微服务的架构,在这种架构中,复杂的应用程序是用一套小服务开发的,每个小服务都使用一些通用协议(如超文本传输协议(HTTP))进行通信。虽然这种体系结构支持软件开发和进入市场的敏捷性,但它提出了评估整个系统的容错性和弹性的关键挑战。其中一个依赖服务中的故障可能会对上游服务造成意想不到的影响,从而导致严重的客户问题。这些问题是系统架构缺乏弹性的结果。需要一种自动化工具来理解服务体系结构、拓扑结构,并能够注入故障以评估系统的容错性和弹性。在本文中,我们介绍了螺丝刀——Groupon为满足这一需求而开发的一种新的自动化解决方案。
{"title":"Automated Fault-Tolerance Testing","authors":"Adithya Nagarajan, Ajay Vaddadi","doi":"10.1109/ICSTW.2016.34","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.34","url":null,"abstract":"Software Fault Tolerance is an ability of computer software to continue its normal operation despite the presence of system or hardware faults. Most companies are moving towards a microservices-based architecture where complex applications are developed with a suite of small services, each of which communicates using some common protocols like Hypertext Transfer Protocol (HTTP). While this architecture enables agility in software development and go-to-market, it poses a critical challenge of assessing fault tolerance and resiliency of the overall system. A failure in one of the dependent services can cause an unexpected impact on the upstream services causing severe customer facing issues. Such issues are a result of lack of resiliency in the architecture of the system. There is a need for an automated tool to be able to understand the service architecture, topology, and be able to inject faults to assess fault tolerance and resiliency of the system. In this paper, we present Screwdriver -- a new automated solution developed at Groupon to address this need.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125730570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Exploring the Presence of Technical Debt in Industrial GUI-Based Testware: A Case Study 探索工业gui测试软件中技术债务的存在:一个案例研究
Emil Alégroth, Marcello Steiner, A. Martini
Technical debt (TD) is a concept used to describe a sub-optimal solution of a software artifact that negatively affects its comprehensibility, extendability and maintainability. As such, TD adversely affects the costs or quality associated with the artifact, which is also called interest. TD has through research been identified in all types of software artifacts, from architectural design to automated tests (Testware). However, research into testware technical debt (TTD) is limited and primarily focused on testing on lower level of system abstraction, i.e. unit-and integration tests, leaving a need for more TTD research on GUI-based testing. In this study we explore this gap in knowledge through an industrial case study at a Swedish avionics software development company. Four repositories are studied for the presence of TTD using expert interviews, semi-automated document analysis and automatic metric analysis. Results of the study provide initial support that the concept of TTD is applicable to GUI-based testware and show the presence of both TD items unique to GUI-based testware and items common to software. The implications of these results are that engineering best practices must be established for GUI-based testware to minimize TD interest.
技术债务(TD)是一个概念,用于描述对软件工件的可理解性、可扩展性和可维护性产生负面影响的次优解决方案。因此,TD会对与工件相关的成本或质量产生不利影响,这也被称为利益。通过研究,从架构设计到自动化测试(Testware),在所有类型的软件工件中都确定了TD。然而,对测试软件技术债务(TTD)的研究是有限的,并且主要集中在较低层次的系统抽象上的测试,即单元和集成测试,这就需要对基于gui的测试进行更多的TTD研究。在这项研究中,我们通过一个瑞典航空电子软件开发公司的工业案例研究来探讨这种知识差距。通过专家访谈、半自动文档分析和自动度量分析,研究了四个存储库是否存在TTD。研究结果初步支持了TTD的概念适用于基于gui的测试软件,并显示了基于gui的测试软件特有的TD项目和软件共有的项目的存在。这些结果的含义是,必须为基于gui的测试软件建立工程最佳实践,以最小化对TD的兴趣。
{"title":"Exploring the Presence of Technical Debt in Industrial GUI-Based Testware: A Case Study","authors":"Emil Alégroth, Marcello Steiner, A. Martini","doi":"10.1109/ICSTW.2016.47","DOIUrl":"https://doi.org/10.1109/ICSTW.2016.47","url":null,"abstract":"Technical debt (TD) is a concept used to describe a sub-optimal solution of a software artifact that negatively affects its comprehensibility, extendability and maintainability. As such, TD adversely affects the costs or quality associated with the artifact, which is also called interest. TD has through research been identified in all types of software artifacts, from architectural design to automated tests (Testware). However, research into testware technical debt (TTD) is limited and primarily focused on testing on lower level of system abstraction, i.e. unit-and integration tests, leaving a need for more TTD research on GUI-based testing. In this study we explore this gap in knowledge through an industrial case study at a Swedish avionics software development company. Four repositories are studied for the presence of TTD using expert interviews, semi-automated document analysis and automatic metric analysis. Results of the study provide initial support that the concept of TTD is applicable to GUI-based testware and show the presence of both TD items unique to GUI-based testware and items common to software. The implications of these results are that engineering best practices must be established for GUI-based testware to minimize TD interest.","PeriodicalId":335145,"journal":{"name":"2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127257167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2016 IEEE Ninth International Conference on Software Testing, Verification and Validation Workshops (ICSTW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1