{"title":"Code Search Method Based on Multimodal Representation","authors":"Xiao Chen, Junhua Wu","doi":"10.1109/QRS-C57518.2022.00078","DOIUrl":null,"url":null,"abstract":"Developers tend to search and reuse code snippets from large-scale corpora while implementing some of the features that existed in development. This will improve the efficiency of development. Code search is to search for semantically relevant code snippets based on a given natural language query. In existing methods, the semantic similarity between code and query is quantified as their distance in the shared vector space. To improve the vector space and map the code vector and query vector into a shared vector space so that the semantically similar code-query pairs are close to each other, we propose a code search method with multimodal representations. It can better enhance the semantic relationship between code snippets and queries. Experiments on Java datasets show that the multimodal representation model MulCS improves the quality of code search. MulCS outperforms several existing advanced models in several performance metrics.","PeriodicalId":183728,"journal":{"name":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QRS-C57518.2022.00078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Developers tend to search and reuse code snippets from large-scale corpora while implementing some of the features that existed in development. This will improve the efficiency of development. Code search is to search for semantically relevant code snippets based on a given natural language query. In existing methods, the semantic similarity between code and query is quantified as their distance in the shared vector space. To improve the vector space and map the code vector and query vector into a shared vector space so that the semantically similar code-query pairs are close to each other, we propose a code search method with multimodal representations. It can better enhance the semantic relationship between code snippets and queries. Experiments on Java datasets show that the multimodal representation model MulCS improves the quality of code search. MulCS outperforms several existing advanced models in several performance metrics.