An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems

2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE) Pub Date : 2016-02-18 DOI:10.1145/2884781.2884872

Fan Long, M. Rinard

{"title":"An Analysis of the Search Spaces for Generate and Validate Patch Generation Systems","authors":"Fan Long, M. Rinard","doi":"10.1145/2884781.2884872","DOIUrl":null,"url":null,"abstract":"We present the first systematic analysis of key characteristics of patch search spaces for automatic patch generation systems. We analyze sixteen different configurations of the patch search spaces of SPR and Prophet, two cur- rent state-of-the-art patch generation systems. The analysis shows that 1) correct patches are sparse in the search spaces (typically at most one correct patch per search space per defect), 2) incorrect patches that nevertheless pass all of the test cases in the validation test suite are typically orders of magnitude more abundant, and 3) leveraging information other than the test suite is therefore critical for enabling the system to successfully isolate correct patches.We also characterize a key tradeoff in the structure of the search spaces. Larger and richer search spaces that contain correct patches for more defects can actually cause systems to find fewer, not more, correct patches. We identify two reasons for this phenomenon: 1) increased validation times because of the presence of more candidate patches and 2) more incorrect patches that pass the test suite and block the discovery of correct patches. These fundamental properties, which are all characterized for the first time in this paper, help explain why past systems often fail to generate correct patches and help identify challenges, opportunities, and productive future directions for the field.","PeriodicalId":6485,"journal":{"name":"2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE)","volume":"535 1","pages":"702-713"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"153","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2884781.2884872","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 153

Abstract

We present the first systematic analysis of key characteristics of patch search spaces for automatic patch generation systems. We analyze sixteen different configurations of the patch search spaces of SPR and Prophet, two cur- rent state-of-the-art patch generation systems. The analysis shows that 1) correct patches are sparse in the search spaces (typically at most one correct patch per search space per defect), 2) incorrect patches that nevertheless pass all of the test cases in the validation test suite are typically orders of magnitude more abundant, and 3) leveraging information other than the test suite is therefore critical for enabling the system to successfully isolate correct patches.We also characterize a key tradeoff in the structure of the search spaces. Larger and richer search spaces that contain correct patches for more defects can actually cause systems to find fewer, not more, correct patches. We identify two reasons for this phenomenon: 1) increased validation times because of the presence of more candidate patches and 2) more incorrect patches that pass the test suite and block the discovery of correct patches. These fundamental properties, which are all characterized for the first time in this paper, help explain why past systems often fail to generate correct patches and help identify challenges, opportunities, and productive future directions for the field.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

补丁生成与验证系统的搜索空间分析

本文首次对自动补丁生成系统的补丁搜索空间的关键特征进行了系统分析。我们分析了SPR和Prophet这两种当前最先进的补丁生成系统的16种不同的补丁搜索空间配置。分析表明1)正确的补丁在搜索空间中是稀疏的(通常每个缺陷在每个搜索空间中最多只有一个正确的补丁)，2)尽管如此，在验证测试套件中通过所有测试用例的错误补丁通常要丰富得多，并且3)利用测试套件以外的信息因此对于使系统成功地分离正确的补丁是至关重要的。我们还描述了搜索空间结构中的一个关键权衡。包含针对更多缺陷的正确补丁的更大、更丰富的搜索空间实际上会导致系统找到更少、而不是更多的正确补丁。我们确定了这种现象的两个原因:1)由于存在更多候选补丁而增加了验证时间;2)通过测试套件并阻止发现正确补丁的错误补丁更多。这些基本特性在本文中首次被描述，有助于解释为什么过去的系统经常不能生成正确的补丁，并有助于识别该领域的挑战、机遇和富有成效的未来方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量

期刊最新文献

Scalable Thread Sharing Analysis Overcoming Open Source Project Entry Barriers with a Portal for Newcomers Nomen est Omen: Exploring and Exploiting Similarities between Argument and Parameter Names Reliability of Run-Time Quality-of-Service Evaluation Using Parametric Model Checking On the Techniques We Create, the Tools We Build, and Their Misalignments: A Study of KLEE