G. Privitera, G. Ghidini, S. P. Emmons, David Levine, P. Bellavista, Jeffrey O. Smith
{"title":"软实时GPRS流量分析商业M2M通信使用spark","authors":"G. Privitera, G. Ghidini, S. P. Emmons, David Levine, P. Bellavista, Jeffrey O. Smith","doi":"10.1109/SMARTCOMP.2014.7043833","DOIUrl":null,"url":null,"abstract":"Commercial applications of wireless sensor networks, also known as machine-to-machine (M2M) communications, feature hundreds of thousands or even millions of devices. These M2M applications often rely on cellular networks like GSM that were not designed with such use cases in mind. Based on our first-hand experience at a large provider of M2M communications solutions, there is a need for soft real-time traffic analytics solutions to help engineers monitor and manage the millions of devices deployed in these M2M applications. We present a solution for soft real-time GPRS traffic analytics built on Apache Spark, a framework for distributed in-memory computing. The proposed solution captures GPRS traffic, processes it, and decorates it with details about the devices, networks, and M2M applications. It then computes a whole array of statistics that are presented in charts and maps on a live Web application dashboard, or may be fed to other systems for data mining. In a series of experiments, previously captured GPRS traffic from real-life commercial M2M applications is played back to the traffic analytics solution at different rates, and is processed on clusters of varying size. Results show that our solution handles GPRS traffic rates of 3,333 packets/sec, which are 2X the rates of an M2M application with close to one million devices, with a latency below one minute on a Spark cluster with four m1.large slave instances in Amazon EC2 at a cost of $7,665/year. These costs can be reduced to approx. $700/year by bidding on SPOT instances.","PeriodicalId":169858,"journal":{"name":"2014 International Conference on Smart Computing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Soft real-time GPRS traffic analytics for commercial M2M communications using spark\",\"authors\":\"G. Privitera, G. Ghidini, S. P. Emmons, David Levine, P. Bellavista, Jeffrey O. Smith\",\"doi\":\"10.1109/SMARTCOMP.2014.7043833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Commercial applications of wireless sensor networks, also known as machine-to-machine (M2M) communications, feature hundreds of thousands or even millions of devices. These M2M applications often rely on cellular networks like GSM that were not designed with such use cases in mind. Based on our first-hand experience at a large provider of M2M communications solutions, there is a need for soft real-time traffic analytics solutions to help engineers monitor and manage the millions of devices deployed in these M2M applications. We present a solution for soft real-time GPRS traffic analytics built on Apache Spark, a framework for distributed in-memory computing. The proposed solution captures GPRS traffic, processes it, and decorates it with details about the devices, networks, and M2M applications. It then computes a whole array of statistics that are presented in charts and maps on a live Web application dashboard, or may be fed to other systems for data mining. In a series of experiments, previously captured GPRS traffic from real-life commercial M2M applications is played back to the traffic analytics solution at different rates, and is processed on clusters of varying size. Results show that our solution handles GPRS traffic rates of 3,333 packets/sec, which are 2X the rates of an M2M application with close to one million devices, with a latency below one minute on a Spark cluster with four m1.large slave instances in Amazon EC2 at a cost of $7,665/year. These costs can be reduced to approx. $700/year by bidding on SPOT instances.\",\"PeriodicalId\":169858,\"journal\":{\"name\":\"2014 International Conference on Smart Computing\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Smart Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SMARTCOMP.2014.7043833\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Smart Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMARTCOMP.2014.7043833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Soft real-time GPRS traffic analytics for commercial M2M communications using spark
Commercial applications of wireless sensor networks, also known as machine-to-machine (M2M) communications, feature hundreds of thousands or even millions of devices. These M2M applications often rely on cellular networks like GSM that were not designed with such use cases in mind. Based on our first-hand experience at a large provider of M2M communications solutions, there is a need for soft real-time traffic analytics solutions to help engineers monitor and manage the millions of devices deployed in these M2M applications. We present a solution for soft real-time GPRS traffic analytics built on Apache Spark, a framework for distributed in-memory computing. The proposed solution captures GPRS traffic, processes it, and decorates it with details about the devices, networks, and M2M applications. It then computes a whole array of statistics that are presented in charts and maps on a live Web application dashboard, or may be fed to other systems for data mining. In a series of experiments, previously captured GPRS traffic from real-life commercial M2M applications is played back to the traffic analytics solution at different rates, and is processed on clusters of varying size. Results show that our solution handles GPRS traffic rates of 3,333 packets/sec, which are 2X the rates of an M2M application with close to one million devices, with a latency below one minute on a Spark cluster with four m1.large slave instances in Amazon EC2 at a cost of $7,665/year. These costs can be reduced to approx. $700/year by bidding on SPOT instances.