{"title":"Space Reduction for Extreme Aggregation of Data Stream over Time-Based Sliding Window","authors":"Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao","doi":"10.1109/CLOUD.2012.80","DOIUrl":null,"url":null,"abstract":"Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.","PeriodicalId":214084,"journal":{"name":"2012 IEEE Fifth International Conference on Cloud Computing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Fifth International Conference on Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLOUD.2012.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.