{"title":"A comparative study of deep learning and decision tree based ensemble learning algorithms for network traffic identification","authors":"Nedeljko Nikolić, S. Tomovic, I. Radusinović","doi":"10.5937/telfor2202061n","DOIUrl":null,"url":null,"abstract":"In this paper, we apply Deep Learning (DL) and decision-tree-based ensemble learning algorithms to classify network traffic by application. Various Deep Learning (DL) models for network traffic identification have been presented, implemented and compared, including 1D convolutional, stacked autoencoder, multi-layer perceptron, and combination of the aforementioned. Then the results of DL models have been compared to those obtained with two popular ensemble learning models based on decision trees-Random Forest and XGBoost. To train and test the classification models, a dataset containing both encrypted and unencrypted traffic has been collected in a real network, under normal operating conditions, and pre-processed in a way that ensures non-biased results. The classification uncertainties of the models have been also quantified on publicly available ISCX VPN-nonVPN dataset. The models have been compared in terms of precision, recall, F1 score and accuracy, for different levels of complexity and training dataset sizes. The evaluation results indicate that the decision-tree ensemble learning algorithms provide more accurate results and outperform the DL algorithms. The performance gap reduces with the dataset complexity.","PeriodicalId":37719,"journal":{"name":"Telfor Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Telfor Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5937/telfor2202061n","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we apply Deep Learning (DL) and decision-tree-based ensemble learning algorithms to classify network traffic by application. Various Deep Learning (DL) models for network traffic identification have been presented, implemented and compared, including 1D convolutional, stacked autoencoder, multi-layer perceptron, and combination of the aforementioned. Then the results of DL models have been compared to those obtained with two popular ensemble learning models based on decision trees-Random Forest and XGBoost. To train and test the classification models, a dataset containing both encrypted and unencrypted traffic has been collected in a real network, under normal operating conditions, and pre-processed in a way that ensures non-biased results. The classification uncertainties of the models have been also quantified on publicly available ISCX VPN-nonVPN dataset. The models have been compared in terms of precision, recall, F1 score and accuracy, for different levels of complexity and training dataset sizes. The evaluation results indicate that the decision-tree ensemble learning algorithms provide more accurate results and outperform the DL algorithms. The performance gap reduces with the dataset complexity.
期刊介绍:
The TELFOR Journal is an open access international scientific journal publishing improved and extended versions of the selected best papers initially reported at the annual TELFOR Conference (www.telfor.rs), papers invited by the Editorial Board, and papers submitted by authors themselves for publishing. All papers are subject to reviewing. The TELFOR Journal is published in the English language, with both electronic and printed versions. Being an IEEE co-supported publication, it will follow all the IEEE rules and procedures. The TELFOR Journal covers all the essential branches of modern telecommunications and information technology: Telecommunications Policy and Services, Telecommunications Networks, Radio Communications, Communications Systems, Signal Processing, Optical Communications, Applied Electromagnetics, Applied Electronics, Multimedia, Software Tools and Applications, as well as other fields related to ICT. This large spectrum of topics accounts for the rapid convergence through telecommunications of the underlying technologies towards the information and knowledge society. The Journal provides a medium for exchanging research results and technological achievements accomplished by the scientific community from academia and industry.