An empirical analysis of flaky tests

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering Pub Date : 2014-11-11 DOI:10.1145/2635868.2635920

Q. Luo, Farah Hariri, Lamyaa Eloussi, D. Marinov

引用次数: 332

Abstract

Regression testing is a crucial part of software development. It checks that software changes do not break existing functionality. An important assumption of regression testing is that test outcomes are deterministic: an unmodified test is expected to either always pass or always fail for the same code under test. Unfortunately, in practice, some tests often called flaky tests—have non-deterministic outcomes. Such tests undermine the regression testing as they make it difficult to rely on test results. We present the first extensive study of flaky tests. We study in detail a total of 201 commits that likely fix flaky tests in 51 open-source projects. We classify the most common root causes of flaky tests, identify approaches that could manifest flaky behavior, and describe common strategies that developers use to fix flaky tests. We believe that our insights and implications can help guide future research on the important topic of (avoiding) flaky tests.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

薄片测试的实证分析

回归测试是软件开发的关键部分。它检查软件更改不会破坏现有功能。回归测试的一个重要假设是测试结果是确定的:对于相同的测试代码，一个未修改的测试要么总是通过，要么总是失败。不幸的是，在实践中，一些测试(通常称为片状测试)具有不确定的结果。这样的测试破坏了回归测试，因为它们使依赖测试结果变得困难。我们提出了第一个片状测试的广泛研究。我们详细研究了51个开源项目中总共201个可能修复不可靠测试的提交。我们对不稳定测试最常见的根本原因进行了分类，确定了可能显示不稳定行为的方法，并描述了开发人员用于修复不稳定测试的通用策略。我们相信，我们的见解和启示可以帮助指导未来关于(避免)不可靠测试的重要主题的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering

自引率

0.00%

发文量