Anwesha Mal, A. Sabitha, Abhay Bansal, B. White, L. Cottrell
{"title":"PingER网络数据的分析与聚类","authors":"Anwesha Mal, A. Sabitha, Abhay Bansal, B. White, L. Cottrell","doi":"10.1109/CONFLUENCE.2016.7508127","DOIUrl":null,"url":null,"abstract":"The PingER project was started by the SLAC National Accelerator Laboratory, Stanford, California for the purpose of monitoring end to end network performance. For the last eighteen years PingER has generated an enormous amount of data that has been stored in space separated files. However due to the difficulties faced in retrieving data efficiently, it has been proposed that all the data be put into the form of RDF triples. Interpreting and analyzing such large volumes of data becomes a primary concern. By making using of clustering algorithms new and interesting patterns can be observed in the data sets. Outlier analysis can be performed giving insight to the exceptions occurring in the dataset and analyzing the probable causes of such. Patterns could be observed based on the country to which the data belongs and comparisons can be drawn between the patterns between the different countries.","PeriodicalId":299044,"journal":{"name":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Analysis and clustering of PingER network data\",\"authors\":\"Anwesha Mal, A. Sabitha, Abhay Bansal, B. White, L. Cottrell\",\"doi\":\"10.1109/CONFLUENCE.2016.7508127\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The PingER project was started by the SLAC National Accelerator Laboratory, Stanford, California for the purpose of monitoring end to end network performance. For the last eighteen years PingER has generated an enormous amount of data that has been stored in space separated files. However due to the difficulties faced in retrieving data efficiently, it has been proposed that all the data be put into the form of RDF triples. Interpreting and analyzing such large volumes of data becomes a primary concern. By making using of clustering algorithms new and interesting patterns can be observed in the data sets. Outlier analysis can be performed giving insight to the exceptions occurring in the dataset and analyzing the probable causes of such. Patterns could be observed based on the country to which the data belongs and comparisons can be drawn between the patterns between the different countries.\",\"PeriodicalId\":299044,\"journal\":{\"name\":\"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONFLUENCE.2016.7508127\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONFLUENCE.2016.7508127","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The PingER project was started by the SLAC National Accelerator Laboratory, Stanford, California for the purpose of monitoring end to end network performance. For the last eighteen years PingER has generated an enormous amount of data that has been stored in space separated files. However due to the difficulties faced in retrieving data efficiently, it has been proposed that all the data be put into the form of RDF triples. Interpreting and analyzing such large volumes of data becomes a primary concern. By making using of clustering algorithms new and interesting patterns can be observed in the data sets. Outlier analysis can be performed giving insight to the exceptions occurring in the dataset and analyzing the probable causes of such. Patterns could be observed based on the country to which the data belongs and comparisons can be drawn between the patterns between the different countries.