CodeS: Towards Code Model Generalization Under Distribution Shift
Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon
2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)
DOI: 10.1109/ICSE-NIER58687.2023.00007
Abstract
Distribution shift has long been a challenge for the reliable deployment of deep learning (DL) models due to unexpected accuracy degradation. Although DL is becoming a driving force for large-scale source code analysis in the big-code era, limited progress has been made on distribution shift analysis and benchmarking for source code tasks. To fill this gap, this paper proposes CodeS, a distribution shift benchmark dataset for source code learning. Specifically, CodeS supports two programming languages (Java and Python) and five shift types (task, programmer, time-stamp, token, and concrete syntax tree). Extensive experiments based on CodeS reveal that 1) out-of-distribution detectors from other domains (e.g., computer vision) do not generalize to source code, 2) all code classification models suffer from distribution shifts, 3) representation-based shifts have a greater impact on the models than the other shift types, and 4) pretrained bimodal models are relatively more resistant to distribution shifts.
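To make the shift construction concrete, the sketch below illustrates one of the five shift types, the time-stamp shift: older samples form the in-distribution (ID) set and newer samples form the out-of-distribution (OOD) set. This is a minimal illustration, not the CodeS implementation; the CodeSample class, the timestamp_split helper, and the example labels are hypothetical.

```python
# Minimal sketch of a time-stamp distribution shift split (hypothetical,
# not the CodeS code): samples authored before a cutoff date are treated
# as in-distribution, later samples as out-of-distribution.
from dataclasses import dataclass
from datetime import date

@dataclass
class CodeSample:
    source: str      # raw source code
    label: str       # classification label for the code task
    committed: date  # timestamp used to induce the shift

def timestamp_split(samples, cutoff):
    """Split samples into (ID, OOD) sets around `cutoff`."""
    id_set = [s for s in samples if s.committed < cutoff]
    ood_set = [s for s in samples if s.committed >= cutoff]
    return id_set, ood_set

# Usage: train and validate a code model on `id_set`, then measure
# accuracy degradation and OOD-detector performance on `ood_set`.
samples = [
    CodeSample("def add(a, b): return a + b", "arithmetic", date(2018, 3, 1)),
    CodeSample("def fmt(s): return s.strip()", "string", date(2022, 7, 9)),
]
id_set, ood_set = timestamp_split(samples, cutoff=date(2020, 1, 1))
print(len(id_set), len(ood_set))  # -> 1 1
```

The other shift types in the paper (task, programmer, token, and concrete syntax tree) would partition the data by a different attribute in the same fashion, with the latter two operating on learned or syntactic representations rather than metadata.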