GAN based Augmentation for Improving Anomaly Detection Accuracy in Host-based Intrusion Detection Systems

Kangseok Kim
{"title":"GAN based Augmentation for Improving Anomaly Detection Accuracy in Host-based Intrusion Detection Systems","authors":"Kangseok Kim","doi":"10.37624/IJERT/13.11.2020.3987-3996","DOIUrl":null,"url":null,"abstract":"This study proposes a methodology for anomaly detection in HIDS using supervised and semi-supervised anomaly detection approaches by applying GAN (Generative Adversarial Network) based data augmentation. An anomaly-based intrusion detection system detects abnormal patterns based on deviations from expected normal behaviors; however, such a system has a low detection rate. Also a detection accuracy may vary depending on whether abnormal samples are used during learning. Moreover, it may vary according to the degree of class imbalance that means the imbalance of data class distributions. To avoid the problem and to enhance the low predictive accuracy, it might need to augment minority datasets through the creation of new samples. Therefore, recently, some of existing studies have involved the development of intrusion detection models using machine/deep learning algorithms to overcome the limitations of existing anomaly-based intrusion detection methodologies and to avoid class imbalance problems. In a similar vein, this study proposes a method for improving classification performance of normal and abnormal data in anomaly-based intrusion detection systems by applying data augmentation using GAN. To verify the effectiveness of the proposed anomaly detection method, we use the ADFA-LD Dataset which consists of system call traces for attacks on the latest operating systems. Experiments were performed using SVM (Support Vector Machine) and CNN (Convolution Neural Network) for classification, and GAN and SMOTE for data augmentation, respectively. The experimental results indicated that GAN based approach provides a slightly more reliable way of working with data augmentation than SMOTE. In addition, it was confirmed based on the experimental results that the classification performance can be improved as the number of samples belonging to each imbalanced class increases.","PeriodicalId":14123,"journal":{"name":"International journal of engineering research and technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of engineering research and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37624/IJERT/13.11.2020.3987-3996","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

This study proposes a methodology for anomaly detection in HIDS using supervised and semi-supervised anomaly detection approaches by applying GAN (Generative Adversarial Network) based data augmentation. An anomaly-based intrusion detection system detects abnormal patterns based on deviations from expected normal behaviors; however, such a system has a low detection rate. Also a detection accuracy may vary depending on whether abnormal samples are used during learning. Moreover, it may vary according to the degree of class imbalance that means the imbalance of data class distributions. To avoid the problem and to enhance the low predictive accuracy, it might need to augment minority datasets through the creation of new samples. Therefore, recently, some of existing studies have involved the development of intrusion detection models using machine/deep learning algorithms to overcome the limitations of existing anomaly-based intrusion detection methodologies and to avoid class imbalance problems. In a similar vein, this study proposes a method for improving classification performance of normal and abnormal data in anomaly-based intrusion detection systems by applying data augmentation using GAN. To verify the effectiveness of the proposed anomaly detection method, we use the ADFA-LD Dataset which consists of system call traces for attacks on the latest operating systems. Experiments were performed using SVM (Support Vector Machine) and CNN (Convolution Neural Network) for classification, and GAN and SMOTE for data augmentation, respectively. The experimental results indicated that GAN based approach provides a slightly more reliable way of working with data augmentation than SMOTE. In addition, it was confirmed based on the experimental results that the classification performance can be improved as the number of samples belonging to each imbalanced class increases.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于GAN增强的主机入侵检测系统异常检测精度提高
本研究提出了一种基于GAN(生成对抗网络)数据增强的HIDS异常检测方法,该方法使用监督和半监督异常检测方法。一种基于异常的入侵检测系统,基于对预期正常行为的偏离来检测异常模式;但是,这种系统的检出率很低。此外,检测精度可能因学习过程中是否使用异常样本而异。此外,它可能根据类不平衡的程度而变化,这意味着数据类分布的不平衡。为了避免这个问题并提高低预测精度,它可能需要通过创建新样本来增加少数数据集。因此,最近已有的一些研究涉及使用机器/深度学习算法开发入侵检测模型,以克服现有基于异常的入侵检测方法的局限性,并避免类不平衡问题。同样,本研究提出了一种方法,通过使用GAN应用数据增强来提高基于异常的入侵检测系统中正常和异常数据的分类性能。为了验证所提出的异常检测方法的有效性,我们使用了由最新操作系统攻击的系统调用跟踪组成的ADFA-LD数据集。实验分别使用支持向量机(SVM)和卷积神经网络(CNN)进行分类,使用GAN和SMOTE进行数据增强。实验结果表明,基于GAN的方法提供了一种比SMOTE更可靠的数据增强方法。此外,根据实验结果证实,随着属于每个不平衡类的样本数量的增加,分类性能可以得到提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
New Equations for Rate of Energy Dissipation of a Stepped Spillway with Slope less than Critical and Specific Step Height Blockchain-Based Secure Smart Health IoT solution Using RBAC Architecture Fatigue life assessment of high-speed train’s bogie frame due to dynamic loads under the influence of wheel flat Luenberger Observer-Based Speed Sensor Fault Detection: real time implementation to DC Motors Ultra-High-Performance Concrete (UHPC) - Applications Worldwide: A State-of-the-Art Review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1