Efficacy of Machine Learning-Based Classifiers for Binary and Multi-Class Network Intrusion Detection

2021 IEEE International Conference on Automatic Control & Intelligent Systems (I2CACIS) Pub Date : 2021-06-26 DOI:10.1109/I2CACIS52118.2021.9495877

Toya Acharya, Ishan Khatri, A. Annamalai, M. Chouikha

{"title":"Efficacy of Machine Learning-Based Classifiers for Binary and Multi-Class Network Intrusion Detection","authors":"Toya Acharya, Ishan Khatri, A. Annamalai, M. Chouikha","doi":"10.1109/I2CACIS52118.2021.9495877","DOIUrl":null,"url":null,"abstract":"The internet-based services undoubtedly led the worldwide revolution with exponential growth, but security breaches resulting personal digital asset losses which need for a comprehensive cybersecurity solution. Traditionally, signature-based network intrusion detection is employed to capture attributes of normal and abnormal traffics in a network, but it fails to detect the zero-day attack. The machine learning-based approach is attractive among various known NIDS methods to circumvent the shortcoming because machine learning based approach can efficiently analyze the big network traffic data and efficiently detect the zero-day attack. The imbalanced NIDS dataset does not provide better performance on practical implementation scenarios. Reducing the number of target classes into a new target class creates a balanced NIDS and improved classifier performance. In this paper, we present the efficacy of several machine learning algorithms, including Random forest (RF), J48, Naïve Bayes, Bayesian Network, Bagging, AdaBoost, and Support Vector Machine (SVM) using network logs traffic (KDD99, UNSW-NB15, and CIC-IDS2017) using WEKA. This paper examined the impact of changing the number of output classes of the publicly available network intrusion datasets on sensitivity (True Positive Rate), False Positive Rate (FPR), Area under the ROC curve (AUC) and incorrectly identified percentage. Interestingly, the efficiency of these classifiers has increased, adding strongly correlated features to the target classes. The experimented results reveal that the machine learning classifiers performance improved when the number of target classes decreased. The addition of a highly correlated feature to the output class increases the performance of the classifiers.","PeriodicalId":210770,"journal":{"name":"2021 IEEE International Conference on Automatic Control & Intelligent Systems (I2CACIS)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Automatic Control & Intelligent Systems (I2CACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2CACIS52118.2021.9495877","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The internet-based services undoubtedly led the worldwide revolution with exponential growth, but security breaches resulting personal digital asset losses which need for a comprehensive cybersecurity solution. Traditionally, signature-based network intrusion detection is employed to capture attributes of normal and abnormal traffics in a network, but it fails to detect the zero-day attack. The machine learning-based approach is attractive among various known NIDS methods to circumvent the shortcoming because machine learning based approach can efficiently analyze the big network traffic data and efficiently detect the zero-day attack. The imbalanced NIDS dataset does not provide better performance on practical implementation scenarios. Reducing the number of target classes into a new target class creates a balanced NIDS and improved classifier performance. In this paper, we present the efficacy of several machine learning algorithms, including Random forest (RF), J48, Naïve Bayes, Bayesian Network, Bagging, AdaBoost, and Support Vector Machine (SVM) using network logs traffic (KDD99, UNSW-NB15, and CIC-IDS2017) using WEKA. This paper examined the impact of changing the number of output classes of the publicly available network intrusion datasets on sensitivity (True Positive Rate), False Positive Rate (FPR), Area under the ROC curve (AUC) and incorrectly identified percentage. Interestingly, the efficiency of these classifiers has increased, adding strongly correlated features to the target classes. The experimented results reveal that the machine learning classifiers performance improved when the number of target classes decreased. The addition of a highly correlated feature to the output class increases the performance of the classifiers.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于机器学习的分类器在二元和多类网络入侵检测中的有效性

基于互联网的服务无疑以指数级增长引领了全球革命，但安全漏洞导致个人数字资产损失，这需要一个全面的网络安全解决方案。传统的基于签名的网络入侵检测主要用于捕获网络中正常流量和异常流量的属性，但无法检测到零日攻击。基于机器学习的方法可以有效地分析大网络流量数据并有效地检测零日攻击，因此在各种已知的NIDS方法中具有很大的吸引力。不平衡的NIDS数据集在实际实现场景中不能提供更好的性能。将目标类的数量减少到一个新的目标类可以创建一个平衡的NIDS并提高分类器的性能。在本文中，我们展示了几种机器学习算法的有效性，包括随机森林(RF)， J48, Naïve贝叶斯，贝叶斯网络，Bagging, AdaBoost和支持向量机(SVM)，使用WEKA使用网络日志流量(KDD99, UNSW-NB15和ics - ids2017)。本文研究了改变公开可用的网络入侵数据集的输出类别数量对灵敏度(真阳性率)、假阳性率(FPR)、ROC曲线下面积(AUC)和错误识别百分比的影响。有趣的是，这些分类器的效率提高了，向目标类添加了强相关的特征。实验结果表明，当目标类别数量减少时，机器学习分类器的性能有所提高。向输出类添加高度相关的特征可以提高分类器的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE International Conference on Automatic Control & Intelligent Systems (I2CACIS)

自引率

0.00%

发文量