{"title":"Performance evaluation of feature selection and tree-based algorithms for traffic classification","authors":"Ons Aouedi, Kandaraj Piamrat, B. Parrein","doi":"10.1109/ICCWorkshops50388.2021.9473580","DOIUrl":null,"url":null,"abstract":"The rapid development of smart devices triggers a surge in new traffic and applications. Thus, network traffic classification has become a challenge in modern communications and may be applied to a various range of applications ranging from QoS provisioning to security-related applications. Developing Machine Learning (ML) methods, which can successfully distinguish network applications from each other, is one of the most important tasks. Since ML algorithms are as good as the quality of data, feature selection has become a crucial step in the ML process. Therefore, selecting effective and relevant features for traffic analysis is also another essential issue. In this paper, we are interested in identifying the most relevant features to characterize network traffic. Empirical results indicate that significant input feature selection is important to classify network traffic. Then, a comparative analysis of various Decision Tree-based models (both traditional and recent algorithms) has been conducted with feature selection methods in terms of accuracy, training, and classification time.","PeriodicalId":127186,"journal":{"name":"2021 IEEE International Conference on Communications Workshops (ICC Workshops)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Communications Workshops (ICC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWorkshops50388.2021.9473580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
The rapid development of smart devices triggers a surge in new traffic and applications. Thus, network traffic classification has become a challenge in modern communications and may be applied to a various range of applications ranging from QoS provisioning to security-related applications. Developing Machine Learning (ML) methods, which can successfully distinguish network applications from each other, is one of the most important tasks. Since ML algorithms are as good as the quality of data, feature selection has become a crucial step in the ML process. Therefore, selecting effective and relevant features for traffic analysis is also another essential issue. In this paper, we are interested in identifying the most relevant features to characterize network traffic. Empirical results indicate that significant input feature selection is important to classify network traffic. Then, a comparative analysis of various Decision Tree-based models (both traditional and recent algorithms) has been conducted with feature selection methods in terms of accuracy, training, and classification time.