{"title":"基于随机森林参数估计的入侵检测系统改进","authors":"A. N. Iman, T. Ahmad","doi":"10.1109/ICoSTA48221.2020.1570609975","DOIUrl":null,"url":null,"abstract":"To overcome the security problem of computer networks, the Intrusion Detection System (IDS) is developed. It is intended to identify an attack. Various types of IDS are built according to the environment: signature-based and anomaly-based. This second type of IDS can identify attacks that have not been known. In this case, machine learning is a possible method to develop an IDS model, which comprises many processes, including feature selection. The Boruta Algorithm is a feature selection method that is good enough to apply to machine learning. However, in its application on the NSL-KDD dataset, this algorithm has an infinite loop problem. This paper presents the analysis and estimation of random forest parameters, precisely the depth and number of trees; additionally, the use of entropy and Gini index as z-score in the Boruta Algorithm is considered. The experimental result shows that the proposed method can prevent the infinite loop, which indirectly improves the performance of the existing algorithm.","PeriodicalId":375166,"journal":{"name":"2020 International Conference on Smart Technology and Applications (ICoSTA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Improving Intrusion Detection System by Estimating Parameters of Random Forest in Boruta\",\"authors\":\"A. N. Iman, T. Ahmad\",\"doi\":\"10.1109/ICoSTA48221.2020.1570609975\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To overcome the security problem of computer networks, the Intrusion Detection System (IDS) is developed. It is intended to identify an attack. Various types of IDS are built according to the environment: signature-based and anomaly-based. This second type of IDS can identify attacks that have not been known. In this case, machine learning is a possible method to develop an IDS model, which comprises many processes, including feature selection. The Boruta Algorithm is a feature selection method that is good enough to apply to machine learning. However, in its application on the NSL-KDD dataset, this algorithm has an infinite loop problem. This paper presents the analysis and estimation of random forest parameters, precisely the depth and number of trees; additionally, the use of entropy and Gini index as z-score in the Boruta Algorithm is considered. The experimental result shows that the proposed method can prevent the infinite loop, which indirectly improves the performance of the existing algorithm.\",\"PeriodicalId\":375166,\"journal\":{\"name\":\"2020 International Conference on Smart Technology and Applications (ICoSTA)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Smart Technology and Applications (ICoSTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICoSTA48221.2020.1570609975\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Smart Technology and Applications (ICoSTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoSTA48221.2020.1570609975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Intrusion Detection System by Estimating Parameters of Random Forest in Boruta
To overcome the security problem of computer networks, the Intrusion Detection System (IDS) is developed. It is intended to identify an attack. Various types of IDS are built according to the environment: signature-based and anomaly-based. This second type of IDS can identify attacks that have not been known. In this case, machine learning is a possible method to develop an IDS model, which comprises many processes, including feature selection. The Boruta Algorithm is a feature selection method that is good enough to apply to machine learning. However, in its application on the NSL-KDD dataset, this algorithm has an infinite loop problem. This paper presents the analysis and estimation of random forest parameters, precisely the depth and number of trees; additionally, the use of entropy and Gini index as z-score in the Boruta Algorithm is considered. The experimental result shows that the proposed method can prevent the infinite loop, which indirectly improves the performance of the existing algorithm.