Using Multinomial Naive Bayes Machine Learning Method To Classify, Detect, And Recognize Programming Language Source Code

A. Odeh, Munther Odeh, Nada Odeh
{"title":"Using Multinomial Naive Bayes Machine Learning Method To Classify, Detect, And Recognize Programming Language Source Code","authors":"A. Odeh, Munther Odeh, Nada Odeh","doi":"10.1109/ACIT57182.2022.9994117","DOIUrl":null,"url":null,"abstract":"Processing programming languages are very similar to processing natural languages, especially high-level languages such as Python, Java, C#, C, C++, and others. Therefore, the natural language processing concepts can be applied as one of the most important branches of artificial intelligence in detecting, recognizing, and classification scripts written in different programming languages. The programming language script classification can be counted as a classical machine learning problem. This research aims to present a model using Multinomial Naïve Bayes (MNB) artificial intelligence algorithm to identify and classify the programming language used in writing the source code file provided as an input for the proposed model. A set of categorized files containing source codes will be used in training the proposed model, and then the model will be able to automatically detect and classify a new script into one of the already trained categories. The machine learning method called NB Multinomial will be used to implement this matter. This work is very important for Mufti-programming language editors such as Visual Studio Code, Notepad+, and others, where the user can paste the source code, and the editor will recognize the programming language automatically.","PeriodicalId":256713,"journal":{"name":"2022 International Arab Conference on Information Technology (ACIT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Arab Conference on Information Technology (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT57182.2022.9994117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Processing programming languages are very similar to processing natural languages, especially high-level languages such as Python, Java, C#, C, C++, and others. Therefore, the natural language processing concepts can be applied as one of the most important branches of artificial intelligence in detecting, recognizing, and classification scripts written in different programming languages. The programming language script classification can be counted as a classical machine learning problem. This research aims to present a model using Multinomial Naïve Bayes (MNB) artificial intelligence algorithm to identify and classify the programming language used in writing the source code file provided as an input for the proposed model. A set of categorized files containing source codes will be used in training the proposed model, and then the model will be able to automatically detect and classify a new script into one of the already trained categories. The machine learning method called NB Multinomial will be used to implement this matter. This work is very important for Mufti-programming language editors such as Visual Studio Code, Notepad+, and others, where the user can paste the source code, and the editor will recognize the programming language automatically.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用多项朴素贝叶斯机器学习方法分类,检测和识别编程语言源代码
处理编程语言非常类似于处理自然语言,特别是高级语言,如Python、Java、c#、C、c++等。因此,自然语言处理概念可以作为人工智能最重要的分支之一,用于检测、识别和分类用不同编程语言编写的脚本。编程语言脚本分类可以算作一个经典的机器学习问题。本研究旨在提出一个使用Multinomial Naïve Bayes (MNB)人工智能算法的模型,以识别和分类编写源代码文件时使用的编程语言,并将其作为所提议模型的输入。一组包含源代码的分类文件将用于训练提议的模型,然后模型将能够自动检测并将新脚本分类到已经训练的类别之一中。我们将使用名为NB多项式的机器学习方法来实现这个问题。这项工作对于多编程语言编辑器(如Visual Studio Code、Notepad+等)非常重要,用户可以粘贴源代码,编辑器会自动识别编程语言。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Engaging Online Learning at AAU and Its Impact on Students' Performance during COVID-19 Utilizing Drones in Long-Term Construction Projects: the Impact on Cash Flow Management Electronic Monitoring in the Criminal Justice System of the UAE & France The Past, Present, and Future of Robotic Systems in Automotive Applications Multi-Objectives AI-based Optimization Model for Microplastics- free Water Bottles
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1