{"title":"基于PSO-SVM的异常大数据剔除研究","authors":"Haiting Cui","doi":"10.1109/IAEAC.2018.8577474","DOIUrl":null,"url":null,"abstract":"In order to improve detection rate and reduce missing detection rate and false detection rate of big data, an abnormal large data elimination method based on PSO-SVM is proposed. Big data is chosen as a set, proximity of which is measured, according to fuzzy sets in fuzzy theory to measure data’ similarity degree. In order to determine redundant data and judge whether big data is abnormal, using support vector machine to train each particle and get fitness function through measuring the proximity between data by a constructed function, and then eliminating abnormal big data through the sliding window. Taking KDD99 big data as object, simulation experiment has higher detection rate and low false detection rate based on PSO-SVM method.","PeriodicalId":6573,"journal":{"name":"2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","volume":"17 10","pages":"2460-2463"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Research on Eliminating Abnormal Big Data based on PSO-SVM\",\"authors\":\"Haiting Cui\",\"doi\":\"10.1109/IAEAC.2018.8577474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve detection rate and reduce missing detection rate and false detection rate of big data, an abnormal large data elimination method based on PSO-SVM is proposed. Big data is chosen as a set, proximity of which is measured, according to fuzzy sets in fuzzy theory to measure data’ similarity degree. In order to determine redundant data and judge whether big data is abnormal, using support vector machine to train each particle and get fitness function through measuring the proximity between data by a constructed function, and then eliminating abnormal big data through the sliding window. Taking KDD99 big data as object, simulation experiment has higher detection rate and low false detection rate based on PSO-SVM method.\",\"PeriodicalId\":6573,\"journal\":{\"name\":\"2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"volume\":\"17 10\",\"pages\":\"2460-2463\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC.2018.8577474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC.2018.8577474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Eliminating Abnormal Big Data based on PSO-SVM
In order to improve detection rate and reduce missing detection rate and false detection rate of big data, an abnormal large data elimination method based on PSO-SVM is proposed. Big data is chosen as a set, proximity of which is measured, according to fuzzy sets in fuzzy theory to measure data’ similarity degree. In order to determine redundant data and judge whether big data is abnormal, using support vector machine to train each particle and get fitness function through measuring the proximity between data by a constructed function, and then eliminating abnormal big data through the sliding window. Taking KDD99 big data as object, simulation experiment has higher detection rate and low false detection rate based on PSO-SVM method.