Constructing important features from massive network traffic for lightweight intrusion detection

IET Inf. Secur. Pub Date : 2015-10-12 DOI:10.1049/iet-ifs.2014.0353

Wei Wang, Yongzhong He, Jiqiang Liu, Sylvain Gombault

{"title":"Constructing important features from massive network traffic for lightweight intrusion detection","authors":"Wei Wang, Yongzhong He, Jiqiang Liu, Sylvain Gombault","doi":"10.1049/iet-ifs.2014.0353","DOIUrl":null,"url":null,"abstract":"Efficiently processing massive data is a big issue in high-speed network intrusion detection, as network traffic has become increasingly large and complex. In this work, instead of constructing a large number of features from massive network traffic, the authors aim to select the most important features and use them to detect intrusions in a fast and effective manner. The authors first employed several techniques, that is, information gain (IG), wrapper with Bayesian networks (BN) and Decision trees (C4.5), to select important subsets of features for network intrusion detection based on KDD'99 data. The authors then validate the feature selection schemes in a real network test bed to detect distributed denial-of-service attacks. The feature selection schemes are extensively evaluated based on the two data sets. The empirical results demonstrate that with only the most important 10 features selected from all the original 41 features, the attack detection accuracy almost remains the same or even becomes better based on both BN and C4.5 classifiers. Constructing fewer features can also improve the efficiency of network intrusion detection.","PeriodicalId":13305,"journal":{"name":"IET Inf. Secur.","volume":"61 1","pages":"374-379"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"46","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Inf. Secur.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/iet-ifs.2014.0353","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 46

Abstract

Efficiently processing massive data is a big issue in high-speed network intrusion detection, as network traffic has become increasingly large and complex. In this work, instead of constructing a large number of features from massive network traffic, the authors aim to select the most important features and use them to detect intrusions in a fast and effective manner. The authors first employed several techniques, that is, information gain (IG), wrapper with Bayesian networks (BN) and Decision trees (C4.5), to select important subsets of features for network intrusion detection based on KDD'99 data. The authors then validate the feature selection schemes in a real network test bed to detect distributed denial-of-service attacks. The feature selection schemes are extensively evaluated based on the two data sets. The empirical results demonstrate that with only the most important 10 features selected from all the original 41 features, the attack detection accuracy almost remains the same or even becomes better based on both BN and C4.5 classifiers. Constructing fewer features can also improve the efficiency of network intrusion detection.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从海量网络流量中构建用于轻量级入侵检测的重要特征

随着网络流量的日益庞大和复杂，如何高效地处理海量数据是高速网络入侵检测的一大难题。在这项工作中，作者的目标不是从大量的网络流量中构建大量的特征，而是选择最重要的特征并使用它们来快速有效地检测入侵。作者首先采用了几种技术，即信息增益(IG)，贝叶斯网络包装(BN)和决策树(C4.5)，以选择基于KDD'99数据的网络入侵检测的重要特征子集。并在一个真实的网络测试平台上验证了特征选择方案对分布式拒绝服务攻击的检测效果。基于这两个数据集对特征选择方案进行了广泛的评估。实证结果表明，在原始41个特征中只选择最重要的10个特征时，基于BN和C4.5分类器的攻击检测准确率几乎保持不变，甚至有所提高。构造更少的特征也可以提高网络入侵检测的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IET Inf. Secur.

自引率

0.00%

发文量

期刊最新文献

Revisit Two Memoryless State-Recovery Cryptanalysis Methods on A5/1 Improved Lattice-Based Mix-Nets for Electronic Voting Adaptive and survivable trust management for Internet of Things systems Comment on 'Targeted Ciphers for Format-Preserving Encryption' from Selected Areas in Cryptography 2018 Time-specific encrypted range query with minimum leakage disclosure