{"title":"gan生成的网络流量可以用来训练流量异常分类器吗?","authors":"Pasquale A. T. Zingo, A. Novocin","doi":"10.1109/IEMCON51383.2020.9284901","DOIUrl":null,"url":null,"abstract":"Recent attempts to introduce the Generative Adversarial Network (GAN) to the computer network traffic domain have shown promise, including several frameworks which generate realistic traffic. This paper presents the ‘GAN vs Real (GvR) score’, a task-based metric which quantifies how well a traffic GAN generator informs a classifier compared to the original data. This metric is derived from the ‘Train-on-Synthetic, Test-on-Real’ (TSTR) method, with the added step of comparing the TSTR accuracy to the performance of the same classifier trained on real data and tested on real data. We use this framework to evaluate the B-WGAN-GP model for generating NetFlow traffic records using several stock classifiers. Using GvR we conclude that it is possible to train accurate traffic anomaly detectors with GAN-generated network traffic data.","PeriodicalId":6871,"journal":{"name":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"56 1","pages":"0540-0545"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Can GAN-Generated Network Traffic be used to Train Traffic Anomaly Classifiers?\",\"authors\":\"Pasquale A. T. Zingo, A. Novocin\",\"doi\":\"10.1109/IEMCON51383.2020.9284901\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent attempts to introduce the Generative Adversarial Network (GAN) to the computer network traffic domain have shown promise, including several frameworks which generate realistic traffic. This paper presents the ‘GAN vs Real (GvR) score’, a task-based metric which quantifies how well a traffic GAN generator informs a classifier compared to the original data. This metric is derived from the ‘Train-on-Synthetic, Test-on-Real’ (TSTR) method, with the added step of comparing the TSTR accuracy to the performance of the same classifier trained on real data and tested on real data. We use this framework to evaluate the B-WGAN-GP model for generating NetFlow traffic records using several stock classifiers. Using GvR we conclude that it is possible to train accurate traffic anomaly detectors with GAN-generated network traffic data.\",\"PeriodicalId\":6871,\"journal\":{\"name\":\"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)\",\"volume\":\"56 1\",\"pages\":\"0540-0545\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IEMCON51383.2020.9284901\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON51383.2020.9284901","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
最近将生成对抗网络(GAN)引入计算机网络流量领域的尝试显示出了希望,包括几个生成现实流量的框架。本文提出了“GAN vs Real (GvR)分数”,这是一种基于任务的度量,用于量化与原始数据相比,交通GAN生成器通知分类器的程度。这个度量来源于“合成训练,真实测试”(TSTR)方法,增加了将TSTR精度与在真实数据上训练和测试的同一分类器的性能进行比较的步骤。我们使用该框架来评估B-WGAN-GP模型,该模型使用几个存量分类器生成NetFlow流量记录。使用GvR,我们得出结论,可以用gan生成的网络流量数据训练准确的流量异常检测器。
Can GAN-Generated Network Traffic be used to Train Traffic Anomaly Classifiers?
Recent attempts to introduce the Generative Adversarial Network (GAN) to the computer network traffic domain have shown promise, including several frameworks which generate realistic traffic. This paper presents the ‘GAN vs Real (GvR) score’, a task-based metric which quantifies how well a traffic GAN generator informs a classifier compared to the original data. This metric is derived from the ‘Train-on-Synthetic, Test-on-Real’ (TSTR) method, with the added step of comparing the TSTR accuracy to the performance of the same classifier trained on real data and tested on real data. We use this framework to evaluate the B-WGAN-GP model for generating NetFlow traffic records using several stock classifiers. Using GvR we conclude that it is possible to train accurate traffic anomaly detectors with GAN-generated network traffic data.