Revisiting Object Similarity-based Patch Ranking in Automated Program Repair: An Extensive Study

2022 IEEE/ACM International Workshop on Automated Program Repair (APR) Pub Date : 2022-05-01 DOI:10.1145/3524459.3527354

Ali Ghanbari

{"title":"Revisiting Object Similarity-based Patch Ranking in Automated Program Repair: An Extensive Study","authors":"Ali Ghanbari","doi":"10.1145/3524459.3527354","DOIUrl":null,"url":null,"abstract":"Test-based generate-and-validate automated program repair (APR) systems often generate plausible patches that pass the test suite without fixing the bug. So far, several approaches for automatic assessment of the APR-generated patches are proposed. Among them, dynamic patch correctness assessment relies on comparing run-time information obtained from the program before and after patching. Object similarity-based dynamic patch ranking approaches, specifically, capture system state snapshots after the impact point of patches and express behavior differences in term of object graphs similarities. Dynamic approaches rely on the assumption that, when running the originally passing test cases, the correct patches will not alter the program behavior in a significant way, but such patches will significantly change program behavior for the failing test cases. This paper presents the results of an extensive empirical study on two object similarity-based approaches, i.e., ObjSim and CIP, to rank 1,290 APR-generated patches, used in previous APR research. We found that although ObjSim outperforms CIP, in terms of the number of patches ranked in top-1 position, it still does not offer an improvement over random baseline ranking, representing the setting with no automatic patch correctness assessment in place. This observation warrants further research on the validity of the assumptions underlying these two techniques and the techniques based on similar assumptions.","PeriodicalId":131481,"journal":{"name":"2022 IEEE/ACM International Workshop on Automated Program Repair (APR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM International Workshop on Automated Program Repair (APR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3524459.3527354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Test-based generate-and-validate automated program repair (APR) systems often generate plausible patches that pass the test suite without fixing the bug. So far, several approaches for automatic assessment of the APR-generated patches are proposed. Among them, dynamic patch correctness assessment relies on comparing run-time information obtained from the program before and after patching. Object similarity-based dynamic patch ranking approaches, specifically, capture system state snapshots after the impact point of patches and express behavior differences in term of object graphs similarities. Dynamic approaches rely on the assumption that, when running the originally passing test cases, the correct patches will not alter the program behavior in a significant way, but such patches will significantly change program behavior for the failing test cases. This paper presents the results of an extensive empirical study on two object similarity-based approaches, i.e., ObjSim and CIP, to rank 1,290 APR-generated patches, used in previous APR research. We found that although ObjSim outperforms CIP, in terms of the number of patches ranked in top-1 position, it still does not offer an improvement over random baseline ranking, representing the setting with no automatic patch correctness assessment in place. This observation warrants further research on the validity of the assumptions underlying these two techniques and the techniques based on similar assumptions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于重访对象相似性的自动程序修复补丁排序:一个广泛的研究

基于测试的生成和验证自动程序修复(APR)系统通常生成可信的补丁，这些补丁通过了测试套件，而没有修复错误。到目前为止，已经提出了几种自动评估apr生成补丁的方法。其中，动态补丁正确性评估依赖于比较补丁前后从程序中获得的运行时信息。基于对象相似度的动态补丁排序方法，具体来说是捕捉补丁撞击点后的系统状态快照，用对象图相似度表示行为差异。动态方法依赖于这样的假设:当运行最初通过的测试用例时，正确的补丁不会显著地改变程序行为，但是这样的补丁将显著地改变失败测试用例的程序行为。本文采用ObjSim和CIP两种基于对象相似性的方法对1290个APR生成的补丁进行了排序，并对以往的APR研究进行了广泛的实证研究。我们发现，尽管ObjSim在排名前1位的补丁数量方面优于CIP，但它仍然没有提供随机基线排名的改进，这代表了没有自动补丁正确性评估的设置。这一观察结果值得进一步研究这两种技术背后的假设以及基于类似假设的技术的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE/ACM International Workshop on Automated Program Repair (APR)

自引率

0.00%

发文量