{"title":"Research on elephant flow detection method based on information entropy and improved random forest","authors":"Xiaohong Hu, Xianghua Miao, Meiyu Yuan","doi":"10.1117/12.2636382","DOIUrl":null,"url":null,"abstract":"In the increasingly complex network environment, various attacks emerge one after another. Being able to accurately detect elephant flow and mouse flow plays an extremely important role in defending against large-scale network attacks. Aiming at some shortcomings of current methods for detecting elephant flow and mouse flow, this paper proposes a detection method based on information entropy and improved random forest. After preprocessing the data, first calculate the data feature score with information entropy, screen out the truly valuable features according to the score, then put them into the improved random forest classifier, and finally get the detection results. In this paper, the grid search algorithm is used to optimize the tree and depth of random forest tree, so that the detection results can be obtained quickly and accurately. Experiments show that due to the significant difference between the data characteristics of elephant flow and mouse flow, this method can effectively identify elephant flow and mouse flow. The accuracy rate is 97.93%, the precision rate is 99.99%, the recall rate is 97.91%, and the F1-score is 98.94%, which is improved compared with other algorithms.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":"183 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Algorithms, Microchips and Network Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2636382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the increasingly complex network environment, various attacks emerge one after another. Being able to accurately detect elephant flow and mouse flow plays an extremely important role in defending against large-scale network attacks. Aiming at some shortcomings of current methods for detecting elephant flow and mouse flow, this paper proposes a detection method based on information entropy and improved random forest. After preprocessing the data, first calculate the data feature score with information entropy, screen out the truly valuable features according to the score, then put them into the improved random forest classifier, and finally get the detection results. In this paper, the grid search algorithm is used to optimize the tree and depth of random forest tree, so that the detection results can be obtained quickly and accurately. Experiments show that due to the significant difference between the data characteristics of elephant flow and mouse flow, this method can effectively identify elephant flow and mouse flow. The accuracy rate is 97.93%, the precision rate is 99.99%, the recall rate is 97.91%, and the F1-score is 98.94%, which is improved compared with other algorithms.