Framing Program Repair as Code Completion

2022 IEEE/ACM International Workshop on Automated Program Repair (APR) Pub Date : 2022-05-01 DOI:10.1145/3524459.3527347

Francisco Ribeiro, Rui Abreu, João Saraiva

{"title":"Framing Program Repair as Code Completion","authors":"Francisco Ribeiro, Rui Abreu, João Saraiva","doi":"10.1145/3524459.3527347","DOIUrl":null,"url":null,"abstract":"Many techniques have contributed to the advancement of auto-mated program repair, such as: generate and validate approaches, constraint-based solvers and even neural machine translation. Si-multaneously, artificial intelligence has allowed the creation of general-purpose pre-trained models that support several down-stream tasks. In this paper, we describe a technique that takes advantage of a generative model - CodeGPT - to automatically repair buggy programs by making use of its code completion capa-bilities. We also elaborate on where to perform code completion in a buggy line and how we circumvent the open-ended nature of code generation to appropriately fit the new code in the original pro-gram. Furthermore, we validate our approach on the ManySStuBs4J dataset containing real-world open-source projects and show that our tool is able to fix 1739 programs out of 6415 - a 27% repair rate. The repaired programs range from single-line changes to multiple line modifications. In fact, our technique is able to fix programs which were missing relatively complex expressions prior to being analyzed. In the end, we present case studies that showcase different scenarios our technique was able to handle.","PeriodicalId":131481,"journal":{"name":"2022 IEEE/ACM International Workshop on Automated Program Repair (APR)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM International Workshop on Automated Program Repair (APR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3524459.3527347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Many techniques have contributed to the advancement of auto-mated program repair, such as: generate and validate approaches, constraint-based solvers and even neural machine translation. Si-multaneously, artificial intelligence has allowed the creation of general-purpose pre-trained models that support several down-stream tasks. In this paper, we describe a technique that takes advantage of a generative model - CodeGPT - to automatically repair buggy programs by making use of its code completion capa-bilities. We also elaborate on where to perform code completion in a buggy line and how we circumvent the open-ended nature of code generation to appropriately fit the new code in the original pro-gram. Furthermore, we validate our approach on the ManySStuBs4J dataset containing real-world open-source projects and show that our tool is able to fix 1739 programs out of 6415 - a 27% repair rate. The repaired programs range from single-line changes to multiple line modifications. In fact, our technique is able to fix programs which were missing relatively complex expressions prior to being analyzed. In the end, we present case studies that showcase different scenarios our technique was able to handle.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

框架程序修复作为代码完成

许多技术为自动程序修复的进步做出了贡献，例如:生成和验证方法，基于约束的求解器，甚至神经机器翻译。同时，人工智能允许创建支持多个下游任务的通用预训练模型。在本文中，我们描述了一种利用生成模型- CodeGPT -通过利用其代码完成功能来自动修复错误程序的技术。我们还详细说明了在有bug的行中执行代码补全的位置，以及如何规避代码生成的开放式特性，以适当地适应原始程序中的新代码。此外，我们在包含真实开源项目的ManySStuBs4J数据集上验证了我们的方法，并显示我们的工具能够修复6415个程序中的1739个——修复率为27%。修复的程序范围从单行更改到多行修改。事实上，我们的技术能够修复那些在分析之前缺少相对复杂表达式的程序。最后，我们提供了案例研究，展示了我们的技术能够处理的不同场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE/ACM International Workshop on Automated Program Repair (APR)

自引率

0.00%

发文量