{"title":"Unveiling the Role of GPT-4 in Solving LeetCode Programming Problems","authors":"Sarthak Vishnu, Sahil, Naman Garg","doi":"10.1002/cae.22815","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The landscape of programming education is undergoing a transformative shift in the era of AI and machine learning. This research delves into the role of GPT-4, a state-of-the-art language model, in solving intermediate-level programming problems, focusing on the renowned LeetCode platform. For this, the work employs different programming problems from two LeetCode contests, providing a comprehensive evaluation of GPT-4's capabilities. The results reveal intriguing patterns in the model's behavior. Initial attempts, when provided with all inputs simultaneously, exhibit high accuracy, but subsequent attempts show consistent fluctuations, rarely surpassing the accuracy of the first attempt. Upon closer examination, a distinct pattern emerges in GPT-4's problem-solving approach, where the model iteratively refines its solutions, incorporating corrections in subsequent attempts. However, the lack of a historical context for past attempts raises questions about the model's attention span and its ability to rectify mistakes. Notably, GPT-4 consistently fails on the same test case with the same generated output, suggesting a potential limitation in addressing specific challenges. But, on leveraging human assistance to AI tools, the observations and patterns from the incorrect codes can be drawn and required adjustments to rectify the codes can be made. A direct result of this is observed in the increased success rate in problem-solving by students, rising from 68% in the moderate learning stage to 92% in the advanced learning stage. Hence, the presented work proposes a human-supervised methodology to leverage the AI-assisted code generation and employs that in improving the effectiveness of AI-assisted teaching–learning process.</p>\n </div>","PeriodicalId":50643,"journal":{"name":"Computer Applications in Engineering Education","volume":"33 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Applications in Engineering Education","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cae.22815","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The landscape of programming education is undergoing a transformative shift in the era of AI and machine learning. This research delves into the role of GPT-4, a state-of-the-art language model, in solving intermediate-level programming problems, focusing on the renowned LeetCode platform. For this, the work employs different programming problems from two LeetCode contests, providing a comprehensive evaluation of GPT-4's capabilities. The results reveal intriguing patterns in the model's behavior. Initial attempts, when provided with all inputs simultaneously, exhibit high accuracy, but subsequent attempts show consistent fluctuations, rarely surpassing the accuracy of the first attempt. Upon closer examination, a distinct pattern emerges in GPT-4's problem-solving approach, where the model iteratively refines its solutions, incorporating corrections in subsequent attempts. However, the lack of a historical context for past attempts raises questions about the model's attention span and its ability to rectify mistakes. Notably, GPT-4 consistently fails on the same test case with the same generated output, suggesting a potential limitation in addressing specific challenges. But, on leveraging human assistance to AI tools, the observations and patterns from the incorrect codes can be drawn and required adjustments to rectify the codes can be made. A direct result of this is observed in the increased success rate in problem-solving by students, rising from 68% in the moderate learning stage to 92% in the advanced learning stage. Hence, the presented work proposes a human-supervised methodology to leverage the AI-assisted code generation and employs that in improving the effectiveness of AI-assisted teaching–learning process.
期刊介绍:
Computer Applications in Engineering Education provides a forum for publishing peer-reviewed timely information on the innovative uses of computers, Internet, and software tools in engineering education. Besides new courses and software tools, the CAE journal covers areas that support the integration of technology-based modules in the engineering curriculum and promotes discussion of the assessment and dissemination issues associated with these new implementation methods.