A Comparison of Large Language Models and Genetic Programming for Program Synthesis

IF 11.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Evolutionary Computation Pub Date : 2024-06-06 DOI:10.1109/TEVC.2024.3410873

Dominik Sobania;Justyna Petke;Martin Briesch;Franz Rothlauf

引用次数: 0

Abstract

Large language models have recently become known for their ability to generate computer programs, especially through tools, such as GitHub Copilot, a domain where genetic programming (GP) has been very successful so far. Although they require different inputs (free-text versus input/output examples) their goal is the same—program synthesis. Therefore, in this work, we compare how well GitHub Copilot and GP perform on common program synthesis benchmark problems. We study the structure and diversity of the generated programs by using well-known software metrics. We find that GitHub Copilot and GP solve a similar number of benchmark problems (85.2% versus 77.8%, respectively). We find that GitHub Copilot generated smaller and less complex programs as GP, while GP is able to find new and unique problem solving strategies. This increase in diversity of solutions comes at a cost. When analyzing the success rates for 100 runs per problem, GitHub Copilot outperforms GP on over 50% of the problems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于程序合成的大型语言模型与遗传编程的比较

大型语言模型最近因其生成计算机程序的能力而闻名，特别是通过工具，如GitHub Copilot，遗传编程（GP）到目前为止已经非常成功。尽管它们需要不同的输入（自由文本与输入/输出示例），但它们的目标是相同的程序合成。因此，在这项工作中，我们比较了GitHub Copilot和GP在常见程序合成基准问题上的表现。我们通过使用知名的软件度量来研究生成程序的结构和多样性。我们发现，GitHub Copilot和GP解决的基准问题数量相似（分别为85.2%和77.8%）。我们发现GitHub Copilot生成的程序更小、更不复杂，而GP能够找到新的、独特的解决问题的策略。解决方案多样性的增加是有代价的。当分析每个问题100次运行的成功率时，GitHub Copilot在超过50%的问题上优于GP。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Evolutionary Computation 工程技术-计算机：理论方法

CiteScore

21.90

自引率

9.80%

发文量

196

审稿时长

3.6 months

期刊介绍： The IEEE Transactions on Evolutionary Computation is published by the IEEE Computational Intelligence Society on behalf of 13 societies: Circuits and Systems; Computer; Control Systems; Engineering in Medicine and Biology; Industrial Electronics; Industry Applications; Lasers and Electro-Optics; Oceanic Engineering; Power Engineering; Robotics and Automation; Signal Processing; Social Implications of Technology; and Systems, Man, and Cybernetics. The journal publishes original papers in evolutionary computation and related areas such as nature-inspired algorithms, population-based methods, optimization, and hybrid systems. It welcomes both purely theoretical papers and application papers that provide general insights into these areas of computation.