CENTIME: A Direct Comprehensive Traffic Features Extraction for Encrypted Traffic Classification

Maonan Wang, K. Zheng, Xinyi Ning, Yanqing Yang, Xiujuan Wang
{"title":"CENTIME: A Direct Comprehensive Traffic Features Extraction for Encrypted Traffic Classification","authors":"Maonan Wang, K. Zheng, Xinyi Ning, Yanqing Yang, Xiujuan Wang","doi":"10.1109/ICCCS52626.2021.9449280","DOIUrl":null,"url":null,"abstract":"With the rapid development of the network, encrypted traffic classification plays a vital role in guaranteeing the quality of network services and ensuring the security of the network. Recent studies show that machine learning approaches based on statistical features and raw traffic sessions are effective for this task. However, the performance of the statistical-based approaches largely depends on the quality of the features. Experts need to design different features for different encrypted traffic classification tasks, which is time-consuming. Meanwhile, the raw traffic-based approach needs to uniformize the traffic size; this will cause the loss of information about the overall structure of the network traffic; for example, we do not know the time from the first packet to the last packet in a session. This paper proposes the CENTIME, which can extract comprehensive information based on ResNet and AutoEncoder to identify encrypted traffic. ResNet is used to extract information from uniformized traffic, and AutoEncoder is used to encode statistical features. The statistical features are used to compensate for the information loss caused by traffic uniformization. They only need to be designed once rather than be designed separately for different tasks. Moreover, the pooling layers are removed, and 1D convolution layers are used to help CENTIME make more effective use of raw traffic information. We evaluate the CENTIME on the public dataset “ISCX VPN-nonVPN”, and the results demonstrate the CENTIME outperforms the state-of-the-art encrypted traffic classification methods. More importantly, comprehensive traffic features generated in the CENTIME can represent different classes of traffic well.","PeriodicalId":376290,"journal":{"name":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS52626.2021.9449280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

With the rapid development of the network, encrypted traffic classification plays a vital role in guaranteeing the quality of network services and ensuring the security of the network. Recent studies show that machine learning approaches based on statistical features and raw traffic sessions are effective for this task. However, the performance of the statistical-based approaches largely depends on the quality of the features. Experts need to design different features for different encrypted traffic classification tasks, which is time-consuming. Meanwhile, the raw traffic-based approach needs to uniformize the traffic size; this will cause the loss of information about the overall structure of the network traffic; for example, we do not know the time from the first packet to the last packet in a session. This paper proposes the CENTIME, which can extract comprehensive information based on ResNet and AutoEncoder to identify encrypted traffic. ResNet is used to extract information from uniformized traffic, and AutoEncoder is used to encode statistical features. The statistical features are used to compensate for the information loss caused by traffic uniformization. They only need to be designed once rather than be designed separately for different tasks. Moreover, the pooling layers are removed, and 1D convolution layers are used to help CENTIME make more effective use of raw traffic information. We evaluate the CENTIME on the public dataset “ISCX VPN-nonVPN”, and the results demonstrate the CENTIME outperforms the state-of-the-art encrypted traffic classification methods. More importantly, comprehensive traffic features generated in the CENTIME can represent different classes of traffic well.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CENTIME:一种用于加密流量分类的直接综合流量特征提取方法
随着网络的飞速发展,加密流分类对保证网络服务质量和网络安全起着至关重要的作用。最近的研究表明,基于统计特征和原始流量会话的机器学习方法对于这项任务是有效的。然而,基于统计的方法的性能在很大程度上取决于特征的质量。专家需要针对不同的加密流分类任务设计不同的特征,这是非常耗时的。同时,基于原始流量的方法需要统一流量大小;这将导致有关网络流量整体结构的信息丢失;例如,我们不知道会话中从第一个数据包到最后一个数据包的时间。本文提出了基于ResNet和AutoEncoder的CENTIME算法,它可以提取综合信息来识别加密流量。使用ResNet从统一流量中提取信息,使用AutoEncoder对统计特征进行编码。统计特征用来弥补流量统一带来的信息丢失。它们只需要设计一次,而不是为不同的任务单独设计。此外,删除了池化层,并使用1D卷积层来帮助CENTIME更有效地利用原始交通信息。我们在公共数据集“ISCX vpn -非vpn”上对CENTIME进行了评估,结果表明CENTIME优于最先进的加密流量分类方法。更重要的是,CENTIME生成的综合流量特征可以很好地表示不同类别的流量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Method of Measuring Data Fusion Based on EMBET Real Time Noise Power Estimation for Single Carrier Frequency Domain Equalization The CPDA Detector for the MIMO OCDM System A Cooperative Search Algorithm Based on Improved Particle Swarm Optimization Decision for UAV Swarm A Network Topology Awareness Based Probabilistic Broadcast Protocol for Data Transmission in Mobile Ad Hoc Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1