Interval-based Queries over Lossy IoT Event Streams

Nimrod Busany, Han van der Aa, Arik Senderovich, A. Gal, M. Weidlich
{"title":"Interval-based Queries over Lossy IoT Event Streams","authors":"Nimrod Busany, Han van der Aa, Arik Senderovich, A. Gal, M. Weidlich","doi":"10.1145/3385191","DOIUrl":null,"url":null,"abstract":"Recognising patterns that correlate multiple events over time becomes increasingly important in applications that exploit the Internet of Things, reaching from urban transportation through surveillance monitoring to business workflows. In many real-world scenarios, however, timestamps of events may be erroneously recorded, and events may be dropped from a stream due to network failures or load shedding policies. In this work, we present SimpMatch, a novel simplex-based algorithm for probabilistic evaluation of event queries using constraints over event orderings in a stream. Our approach avoids learning probability distributions for time-points or occurrence intervals. Instead, we employ the abstraction of segmented intervals and compute the probability of a sequence of such segments using the notion of order statistics. The algorithm runs in linear time to the number of lost events and shows high accuracy, yielding exact results if event generation is based on a Poisson process and providing a good approximation otherwise. We demonstrate empirically that SimpMatch enables efficient and effective reasoning over event streams, outperforming state-of-the-art methods for probabilistic evaluation of event queries by up to two orders of magnitude.","PeriodicalId":93404,"journal":{"name":"ACM/IMS transactions on data science","volume":"31 1","pages":"1 - 27"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IMS transactions on data science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3385191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Recognising patterns that correlate multiple events over time becomes increasingly important in applications that exploit the Internet of Things, reaching from urban transportation through surveillance monitoring to business workflows. In many real-world scenarios, however, timestamps of events may be erroneously recorded, and events may be dropped from a stream due to network failures or load shedding policies. In this work, we present SimpMatch, a novel simplex-based algorithm for probabilistic evaluation of event queries using constraints over event orderings in a stream. Our approach avoids learning probability distributions for time-points or occurrence intervals. Instead, we employ the abstraction of segmented intervals and compute the probability of a sequence of such segments using the notion of order statistics. The algorithm runs in linear time to the number of lost events and shows high accuracy, yielding exact results if event generation is based on a Poisson process and providing a good approximation otherwise. We demonstrate empirically that SimpMatch enables efficient and effective reasoning over event streams, outperforming state-of-the-art methods for probabilistic evaluation of event queries by up to two orders of magnitude.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于时间间隔的有损物联网事件流查询
在利用物联网的应用中,从城市交通到监视监控再到业务工作流,识别多个事件随时间的关联模式变得越来越重要。然而,在许多实际场景中,事件的时间戳可能会被错误地记录,并且由于网络故障或负载减少策略,事件可能会从流中删除。在这项工作中,我们提出了SimpMatch,这是一种新的基于simplex的算法,用于使用流中事件顺序的约束对事件查询进行概率评估。我们的方法避免了学习时间点或发生区间的概率分布。相反,我们采用分段区间的抽象,并使用顺序统计的概念计算这些分段序列的概率。该算法以丢失事件的数量为线性时间运行,并且显示出很高的准确性,如果事件生成基于泊松过程,则产生精确的结果,否则提供良好的近似。我们通过经验证明,SimpMatch能够对事件流进行高效和有效的推理,在事件查询的概率评估方面,其性能比最先进的方法高出两个数量级。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Recent Developments in Privacy-Preserving Mining of Clinical Data. PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning A Survey on the Role of Centrality as Seed Nodes for Information Propagation in Large Scale Network DataStorm: Coupled, Continuous Simulations for Complex Urban Environments TabReformer: Unsupervised Representation Learning for Erroneous Data Detection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1