Predicting the First Response Latency of Maintainers and Contributors in Pull Requests

IF 6.5 1区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING IEEE Transactions on Software Engineering Pub Date : 2024-08-13 DOI:10.1109/TSE.2024.3443741

SayedHassan Khatoonabadi;Ahmad Abdellatif;Diego Elias Costa;Emad Shihab

{"title":"Predicting the First Response Latency of Maintainers and Contributors in Pull Requests","authors":"SayedHassan Khatoonabadi;Ahmad Abdellatif;Diego Elias Costa;Emad Shihab","doi":"10.1109/TSE.2024.3443741","DOIUrl":null,"url":null,"abstract":"The success of a Pull Request (PR) depends on the responsiveness of the maintainers and the contributor during the review process. Being aware of the expected waiting times can lead to better interactions and managed expectations for both the maintainers and the contributor. In this paper, we propose a machine-learning approach to predict the first response latency of the maintainers following the submission of a PR, and the first response latency of the contributor after receiving the first response from the maintainers. We curate a dataset of 20 large and popular open-source projects on GitHub and extract 21 features to characterize projects, contributors, PRs, and review processes. Using these features, we then evaluate seven types of classifiers to identify the best-performing models. We also conduct permutation feature importance and SHAP analyses to understand the importance and the impact of different features on the predicted response latencies. We find that our CatBoost models are the most effective for predicting the first response latencies of both maintainers and contributors. Compared to a dummy classifier that always returns the majority class, these models achieved an average improvement of 29% in AUC-ROC and 51% in AUC-PR for maintainers, as well as 39% in AUC-ROC and 89% in AUC-PR for contributors across the studied projects. The results indicate that our models can aptly predict the first response latencies using the selected features. We also observe that PRs submitted earlier in the week, containing an average number of commits, and with concise descriptions are more likely to receive faster first responses from the maintainers. Similarly, PRs with a lower first response latency from maintainers, that received the first response of maintainers earlier in the week, and containing an average number of commits tend to receive faster first responses from the contributors. Additionally, contributors with a higher acceptance rate and a history of timely responses in the project are likely to both obtain and provide faster first responses. Moreover, we show the effectiveness of our approach in a cross-project setting. Finally, we discuss key guidelines for maintainers, contributors, and researchers to help facilitate the PR review process.","PeriodicalId":13324,"journal":{"name":"IEEE Transactions on Software Engineering","volume":"50 10","pages":"2529-2543"},"PeriodicalIF":6.5000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Software Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10636098/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

The success of a Pull Request (PR) depends on the responsiveness of the maintainers and the contributor during the review process. Being aware of the expected waiting times can lead to better interactions and managed expectations for both the maintainers and the contributor. In this paper, we propose a machine-learning approach to predict the first response latency of the maintainers following the submission of a PR, and the first response latency of the contributor after receiving the first response from the maintainers. We curate a dataset of 20 large and popular open-source projects on GitHub and extract 21 features to characterize projects, contributors, PRs, and review processes. Using these features, we then evaluate seven types of classifiers to identify the best-performing models. We also conduct permutation feature importance and SHAP analyses to understand the importance and the impact of different features on the predicted response latencies. We find that our CatBoost models are the most effective for predicting the first response latencies of both maintainers and contributors. Compared to a dummy classifier that always returns the majority class, these models achieved an average improvement of 29% in AUC-ROC and 51% in AUC-PR for maintainers, as well as 39% in AUC-ROC and 89% in AUC-PR for contributors across the studied projects. The results indicate that our models can aptly predict the first response latencies using the selected features. We also observe that PRs submitted earlier in the week, containing an average number of commits, and with concise descriptions are more likely to receive faster first responses from the maintainers. Similarly, PRs with a lower first response latency from maintainers, that received the first response of maintainers earlier in the week, and containing an average number of commits tend to receive faster first responses from the contributors. Additionally, contributors with a higher acceptance rate and a history of timely responses in the project are likely to both obtain and provide faster first responses. Moreover, we show the effectiveness of our approach in a cross-project setting. Finally, we discuss key guidelines for maintainers, contributors, and researchers to help facilitate the PR review process.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预测拉取请求中维护者和贡献者的首次响应延迟

拉取请求（PR）的成功与否取决于维护者和贡献者在审核过程中的响应速度。了解预期的等待时间可以让维护者和贡献者进行更好的互动，并对预期进行管理。在本文中，我们提出了一种机器学习方法来预测维护者在提交 PR 后的首次响应延迟，以及贡献者在收到维护者首次响应后的首次响应延迟。我们整理了 GitHub 上 20 个大型流行开源项目的数据集，并提取了 21 个特征来描述项目、贡献者、PR 和审查流程。利用这些特征，我们对七种分类器进行了评估，以确定表现最佳的模型。我们还进行了排列特征重要性和 SHAP 分析，以了解不同特征对预测响应延迟的重要性和影响。我们发现 CatBoost 模型在预测维护者和贡献者的首次响应延迟方面最为有效。与总是返回多数类的虚拟分类器相比，在所研究的项目中，这些模型对维护者的 AUC-ROC 和 AUC-PR 平均分别提高了 29% 和 51%，对贡献者的 AUC-ROC 和 AUC-PR 平均分别提高了 39% 和 89%。结果表明，我们的模型可以利用所选特征准确预测首次响应延迟。我们还观察到，在一周内较早提交、包含平均提交次数和简洁描述的 PR 更有可能得到维护者更快的首次响应。同样，维护者首次响应延迟较低、在一周内较早收到维护者首次响应、包含平均提交次数的 PR 往往能更快地收到贡献者的首次响应。此外，在项目中接受率较高且有及时响应历史的贡献者也有可能获得并提供更快的首次响应。此外，我们还展示了我们的方法在跨项目环境中的有效性。最后，我们讨论了维护者、贡献者和研究人员的关键指南，以帮助促进公关审查过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Software Engineering 工程技术-工程：电子与电气

CiteScore

9.70

自引率

10.80%

发文量

724

审稿时长

6 months

期刊介绍： IEEE Transactions on Software Engineering seeks contributions comprising well-defined theoretical results and empirical studies with potential impacts on software construction, analysis, or management. The scope of this Transactions extends from fundamental mechanisms to the development of principles and their application in specific environments. Specific topic areas include: a) Development and maintenance methods and models: Techniques and principles for specifying, designing, and implementing software systems, encompassing notations and process models. b) Assessment methods: Software tests, validation, reliability models, test and diagnosis procedures, software redundancy, design for error control, and measurements and evaluation of process and product aspects. c) Software project management: Productivity factors, cost models, schedule and organizational issues, and standards. d) Tools and environments: Specific tools, integrated tool environments, associated architectures, databases, and parallel and distributed processing issues. e) System issues: Hardware-software trade-offs. f) State-of-the-art surveys: Syntheses and comprehensive reviews of the historical development within specific areas of interest.