A Comparative Analysis of Machine Learning Techniques for Classification and Detection of Malware

2020 21st International Arab Conference on Information Technology (ACIT) Pub Date : 2020-11-28 DOI:10.1109/ACIT50332.2020.9300081

Maryam Al-Janabi, A. Altamimi

{"title":"A Comparative Analysis of Machine Learning Techniques for Classification and Detection of Malware","authors":"Maryam Al-Janabi, A. Altamimi","doi":"10.1109/ACIT50332.2020.9300081","DOIUrl":null,"url":null,"abstract":"Malicious software, commonly known as malware, is one of the most harmful threats developed by cyber attackers to intentionally cause damage or gaining access to computer systems. Malware has evolved over the years and comes in all shapes with different types and functions depending on the goals of the developer. Virus, Spyware, Bots, and Ransomware are just some examples of malware. While those described above found themselves causing issues by accident, however, they all share one thing in common, harming the system. As a response, many infection treatments and detecting methods have been proposed. The signature-based methods are currently utilized to delete malware; however, these methods cannot provide accurate detection of zero-day attacks and polymorphic viruses. Contrarily, the use of machine learning-based detection has been recognized as one of the most modern and notable methods. Specifically, these methods can be categorized based on their analysis technique into static, dynamic, or hybrid. The purpose of this work was to provide a survey that determines the best features extraction and classification methods that result in the best accuracy in detecting malware. Moreover, a review of representable research papers in this topic is represented with a detailed tabular comparison between them based on their accuracy in detecting malware. Among these methods, the J48 algorithm and Hybrid analysis outperformed the others with the accuracy of 100% in detecting malware in the Windows system. On the other hand, the same accuracy has been achieved in the Android system when employing the Decision Tree algorithm through Dynamic analysis. We believe that this study performs a base for further research in the field of malware analysis with machine learning methods.","PeriodicalId":193891,"journal":{"name":"2020 21st International Arab Conference on Information Technology (ACIT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 21st International Arab Conference on Information Technology (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT50332.2020.9300081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Malicious software, commonly known as malware, is one of the most harmful threats developed by cyber attackers to intentionally cause damage or gaining access to computer systems. Malware has evolved over the years and comes in all shapes with different types and functions depending on the goals of the developer. Virus, Spyware, Bots, and Ransomware are just some examples of malware. While those described above found themselves causing issues by accident, however, they all share one thing in common, harming the system. As a response, many infection treatments and detecting methods have been proposed. The signature-based methods are currently utilized to delete malware; however, these methods cannot provide accurate detection of zero-day attacks and polymorphic viruses. Contrarily, the use of machine learning-based detection has been recognized as one of the most modern and notable methods. Specifically, these methods can be categorized based on their analysis technique into static, dynamic, or hybrid. The purpose of this work was to provide a survey that determines the best features extraction and classification methods that result in the best accuracy in detecting malware. Moreover, a review of representable research papers in this topic is represented with a detailed tabular comparison between them based on their accuracy in detecting malware. Among these methods, the J48 algorithm and Hybrid analysis outperformed the others with the accuracy of 100% in detecting malware in the Windows system. On the other hand, the same accuracy has been achieved in the Android system when employing the Decision Tree algorithm through Dynamic analysis. We believe that this study performs a base for further research in the field of malware analysis with machine learning methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

恶意软件分类与检测的机器学习技术比较分析

恶意软件，通常被称为恶意软件，是网络攻击者开发的最有害的威胁之一，旨在故意造成损害或获取计算机系统的访问权限。恶意软件已经发展了多年，并且根据开发人员的目标，具有不同类型和功能的各种形状。病毒、间谍软件、机器人和勒索软件只是恶意软件的一些例子。虽然上面描述的这些问题都是偶然造成的，但是它们都有一个共同点，那就是损害系统。作为回应，许多感染治疗和检测方法被提出。基于签名的方法目前被用于删除恶意软件;然而，这些方法不能提供零日攻击和多态病毒的准确检测。相反，使用基于机器学习的检测已被认为是最现代和最显著的方法之一。具体来说，这些方法可以根据其分析技术分为静态、动态或混合。这项工作的目的是提供一项调查，以确定最佳特征提取和分类方法，从而在检测恶意软件时达到最佳准确性。此外，回顾了该主题的代表性研究论文，并根据它们在检测恶意软件方面的准确性对它们进行了详细的表格比较。其中，J48算法和Hybrid分析方法在Windows系统下检测恶意软件的准确率达到100%，优于其他方法。另一方面，通过动态分析，采用决策树算法在Android系统中也达到了同样的精度。我们相信这项研究为进一步研究机器学习方法在恶意软件分析领域奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 21st International Arab Conference on Information Technology (ACIT)

自引率

0.00%

发文量

期刊最新文献

Wireless Sensor Network MAC Energy - efficiency Protocols: A Survey Keystroke Identifier Using Fuzzy Logic to Increase Password Security A seq2seq Neural Network based Conversational Agent for Gulf Arabic Dialect Machine Learning and Soft Robotics Studying and Analyzing the Fog-based Internet of Robotic Things