A Scalable Framework for Accelerating Situation Prediction over Spatio-temporal Event Streams
A. Kammoun, Tanguy Raynaud, Syed Gillani, K. Singh, J. Fayolle, F. Laforest
Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems (DEBS '18), June 25, 2018. DOI: https://doi.org/10.1145/3210284.3220508

This paper presents a generic solution to the spatiotemporal prediction problem posed by the DEBS Grand Challenge 2018. Our solution employs an efficient multi-dimensional index to store the training and historical data. As new batches of events arrive, we query the indexing structure to determine the closest points of interest. Based on these points, we select the ones with the highest overall score and predict the destination and arrival time of the vessel in question. Our solution does not rely on existing machine learning techniques and provides a novel view of the prediction problem in a streaming setting: the prediction is based not just on recent data, but on all useful historical data.
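The abstract's pipeline (spatial index, nearest-neighbor lookup, score-based destination pick) can be sketched as follows. This is an illustrative toy, not the authors' implementation: the grid-bucket index, the inverse-distance scoring, and all names are assumptions.

```python
from collections import defaultdict
from math import hypot

class GridIndex:
    """Toy 2-D grid index over historical points; each stored point
    carries the destination label of the trip it belongs to."""
    def __init__(self, cell=0.5):
        self.cell = cell
        self.buckets = defaultdict(list)  # (ix, iy) -> [(x, y, dest)]

    def _key(self, x, y):
        return (int(x // self.cell), int(y // self.cell))

    def insert(self, x, y, dest):
        self.buckets[self._key(x, y)].append((x, y, dest))

    def nearby(self, x, y):
        # Yield all points in the query cell and its 8 neighbors.
        ix, iy = self._key(x, y)
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                yield from self.buckets.get((ix + dx, iy + dy), [])

def predict_destination(index, x, y):
    """Score candidate destinations by inverse distance of nearby
    historical points; return the highest-scoring one."""
    scores = defaultdict(float)
    for px, py, dest in index.nearby(x, y):
        scores[dest] += 1.0 / (1.0 + hypot(px - x, py - y))
    return max(scores, key=scores.get) if scores else None
```

A query near dense historical traffic bound for port A would thus return A, since its accumulated score dominates.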
Retro-λ
Dominik Meißner, Benjamin Erb, Frank Kargl, Matthias Tichy
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3210285

State changes over time are an inherent characteristic of stateful applications. So far, there have been almost no attempts to make the past history of an application programmatically accessible or even modifiable, primarily due to the complexity of temporal changes and a difficult alignment with prevalent programming primitives and persistence strategies. Retroactive computing, however, enables powerful capabilities, including computations and predictions on alternate application timelines, post-hoc bug fixes, and retroactive state exploration. We propose an event-driven programming model, oriented towards serverless computing, that applies retroaction to the event sourcing paradigm. Our model is deliberately restrictive, which keeps the complexity of retroactive operations in check. We introduce retro-λ, a runtime platform that implements the model and provides retroactive capabilities to its applications. While retro-λ shows only negligible performance overhead compared to similar solutions when running regular applications, it enables its users to execute retroactive computations on application histories as part of its programming model.
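The core idea of applying retroaction to event sourcing can be illustrated in a few lines: state is never stored directly, only derived by replaying an event log, so amending history and replaying yields an alternate timeline. This is a minimal sketch of the paradigm, not retro-λ's actual API; all names are invented.

```python
# Minimal event-sourcing sketch with one retroactive operation.

def replay(events, reducer, initial):
    """Derive current state by folding the reducer over the time-ordered log."""
    state = initial
    for event in sorted(events, key=lambda e: e["ts"]):
        state = reducer(state, event)
    return state

def inject_retroactively(events, new_event):
    """Retroactive write: amend history; the next replay yields the
    alternate timeline implied by the inserted event."""
    return events + [new_event]

def balance_reducer(state, event):
    return state + event["amount"]

log = [{"ts": 1, "amount": 100}, {"ts": 3, "amount": -30}]
assert replay(log, balance_reducer, 0) == 70

# Post-hoc bug fix: a deposit at ts=2 was missed; inject it and re-derive.
log = inject_retroactively(log, {"ts": 2, "amount": 50})
assert replay(log, balance_reducer, 0) == 120
```

The model's restrictiveness mentioned in the abstract corresponds to keeping reducers pure, so that replay over an amended log is always well defined.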
Buffering Strategies for Large-Scale Data-Acquisition Systems
Alejandro Santos
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3219500

Data acquisition systems for particle physics experiments produce vast amounts of data, and storing all of it is often infeasible because the storage requirements would be enormous. For this reason, an on-line filtering system selects the relevant pieces of information according to the goals of the experiment before sending them to permanent storage. While data is being analyzed, it is temporarily held in a large high-speed buffering system. Data production follows a cycle, with long periods of many hours during which the experiment produces no data, and the input rate fluctuates even while data is being produced. This offers the possibility of over-provisioning the buffering system and trading processing power for storage space, using the buffer as storage for periods of many days. In this work, a model was created to study the behavior of aspects of the ATLAS data acquisition system, specifically the buffering system for the on-line filter.
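The trade-off the abstract describes (bursty input absorbed by an over-provisioned buffer with a fixed filter drain rate) can be seen in a back-of-the-envelope simulation. This is a hedged sketch with invented parameters, not the paper's model of ATLAS.

```python
# Toy buffer-occupancy simulation: bursts of input, constant drain by
# the on-line filter, finite capacity. All numbers are illustrative.

def simulate_buffer(input_volumes, drain_rate, capacity):
    """input_volumes: data arriving per time step.
    Returns (occupancy trace, total volume dropped for lack of space)."""
    occupancy, trace, dropped = 0.0, [], 0.0
    for arriving in input_volumes:
        occupancy = occupancy + arriving - drain_rate
        if occupancy > capacity:
            dropped += occupancy - capacity  # overflow is lost
            occupancy = capacity
        occupancy = max(occupancy, 0.0)      # buffer cannot go negative
        trace.append(occupancy)
    return trace, dropped
```

Running a burst of two steps at volume 10 followed by two idle steps, with drain rate 5 and capacity 8, shows the buffer filling during the burst and draining during the idle period; a larger capacity would have avoided the drop entirely, which is the over-provisioning argument.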
Distributed and Dynamic Clustering For News Events
Vinay Setty
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3219774

News is now consumed primarily online, which has resulted in a large volume of news from varied outlets. Consequently, news aggregators, which process millions of articles each day, have become popular for clustering, ranking and personalizing news. In addition, since news articles stream in constantly, a scalable event-based system is needed to facilitate news mining in an online fashion. To address these challenges, we propose a distributed framework that processes news articles and clusters them to support a range of news mining tasks. The core of our system is a novel, scalable distributed clustering algorithm based on Locality Sensitive Hashing that is robust to outliers and noise. We also propose an online version of the clustering algorithm to maintain the news event clusters dynamically. We implement the proposed solution on Apache Spark. Using a collection of over 8 million news articles, we show that our approach outperforms widely used clustering techniques such as K-Means in both run time and clustering quality.
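The LSH ingredient can be sketched with MinHash banding: articles whose signatures agree on at least one band fall into the same bucket and become cluster candidates, without pairwise comparison of all documents. This is a generic MinHash/LSH illustration, not the paper's algorithm; parameters and names are assumptions.

```python
import random
from collections import defaultdict

# MinHash-LSH candidate generation over token sets. Note: Python's str
# hash is randomized per process, so signatures are only stable within
# a single run -- fine for an in-memory sketch.
random.seed(7)
N_HASH, BANDS = 16, 4
ROWS = N_HASH // BANDS
MASKS = [random.getrandbits(64) for _ in range(N_HASH)]

def minhash(tokens):
    """One min value per hash function (XOR-mask variants of hash())."""
    return tuple(min(hash(t) ^ m for t in tokens) for m in MASKS)

def lsh_buckets(docs):
    """docs: {doc_id: token set}. Returns candidate groups: doc ids that
    collide on at least one signature band."""
    buckets = defaultdict(set)
    for doc_id, tokens in docs.items():
        sig = minhash(tokens)
        for b in range(BANDS):
            band = sig[b * ROWS:(b + 1) * ROWS]
            buckets[(b, band)].add(doc_id)
    return [ids for ids in buckets.values() if len(ids) > 1]
```

Banding is what makes this robust to noise: two articles need only one of the four bands to match, so small wording differences rarely separate near-duplicate reports of the same event.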
Scalable Maritime Traffic Map Inference and Real-time Prediction of Vessels' Future Locations on Apache Spark
Rim Moussa
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3220506

In this paper, we propose scalable algorithms to, first, infer a map of vessels' trajectories and, second, predict the future locations of a vessel at sea. Our system is based on Apache Spark, a fast and scalable engine for large-scale data processing. The training dataset is event-based: each event gives the GPS position of a vessel at a timestamp. We propose and implement a workflow that computes trip patterns, with the GPS locations of each trip summarized using geohashing, an efficient encoding of a geographic location into a short string of letters and digits. To answer prediction queries efficiently, we propose (i) a geohash positional index that maps each geohash to a list of pairs (trip-pattern identifier, offset of the geohash within the pattern's geohash sequence), (ii) a departure-port index that maps each departure port to a list of trip-pattern identifiers, and (iii) a pairwise geohash-sequence alignment that scores the similarity of two geohash sequences using the queen-move spatial neighborhood.
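Geohashing, the summarization step the abstract relies on, is a standard algorithm: alternate bisecting longitude and latitude, emit one bit per decision, and pack every five bits into a base-32 character. A minimal encoder (standard algorithm, not code from the paper):

```python
# Standard geohash alphabet (base32 without a, i, l, o).
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"

def geohash_encode(lat, lon, precision=11):
    """Encode a lat/lon pair as a geohash string of `precision` chars."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    code, bits, bit_count, even = [], 0, 0, True
    while len(code) < precision:
        if even:  # even-numbered bits bisect longitude
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits, lon_lo = bits * 2 + 1, mid
            else:
                bits, lon_hi = bits * 2, mid
        else:     # odd-numbered bits bisect latitude
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits, lat_lo = bits * 2 + 1, mid
            else:
                bits, lat_hi = bits * 2, mid
        even = not even
        bit_count += 1
        if bit_count == 5:  # five bits -> one base32 character
            code.append(BASE32[bits])
            bits, bit_count = 0, 0
    return "".join(code)
```

The prefix property is what makes the paper's positional index work: nearby locations share geohash prefixes, so truncating a geohash coarsens the cell.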
BeaConvey
Chen Chen, Y. Tock, Sarunas Girdzijauskas
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3210287

Distributed pub/sub systems must make principal design choices with regard to overlay topologies and routing protocols. It is challenging to tackle both aspects together, and most existing work considers only one. We argue that both problems must be addressed simultaneously, since only the right combination of the two can deliver efficient internet-scale pub/sub. The traditional design space spans from structured, data-oblivious overlays employing greedy routing strategies all the way to unstructured, data-driven overlays using naive broadcast-based routing. The two ends of the spectrum come at unacceptable prices: the former often exerts considerable overhead on each node for forwarding irrelevant messages, while the latter is difficult to scale due to prohibitive latencies stemming from unbounded node degrees and network diameters. To achieve the best of both worlds, we propose BeaConvey, a distributed pub/sub system for federated environments. First, we define the small-world and interest-close overlay (SWICO), which embraces both small-world properties and pub/sub semantics. To construct a SWICO, we devise a greedy heuristic that assigns small-world identifiers and fingers in a centralized manner. Second, we develop a family of peer-to-peer pub/sub routing protocols that leverage such SWICOs. Empirical evaluation shows that BeaConvey achieves substantial improvements in routing overhead and propagation delay; for instance, its routing overhead is only 20% to 40% of the state of the art. This improvement is consistent across a variety of pub/sub workloads, and BeaConvey obtains such adaptability by optimizing both overlay and routing, which complement each other in different situations. Under one Facebook workload with a skewed distribution, 78% of the improvement is attributed to a better overlay; under another, non-skewed workload, more advanced routing contributes 95% of the cost reduction.
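The small-world ingredient that SWICOs build on can be illustrated with greedy forwarding on a ring overlay with exponentially spaced fingers: each hop at least halves the remaining distance, bounding path length logarithmically. This is a generic Chord-style illustration, not BeaConvey's protocol; the ring size and finger rule are assumptions.

```python
# Greedy routing on a 64-node identifier ring with long-range fingers.
RING = 64

def fingers(node):
    """Pointers at exponentially increasing clockwise distances."""
    return [(node + 2 ** i) % RING for i in range(6)]

def clockwise_dist(a, b):
    return (b - a) % RING

def greedy_route(src, dst):
    """Hop to the finger that makes the most progress toward dst
    without overshooting it; the distance-1 finger guarantees progress."""
    path = [src]
    while path[-1] != dst:
        here = path[-1]
        nxt = min(
            fingers(here),
            key=lambda f: clockwise_dist(f, dst)
            if clockwise_dist(here, f) <= clockwise_dist(here, dst)
            else RING,  # penalize fingers that overshoot
        )
        path.append(nxt)
    return path
```

A SWICO additionally biases which fingers a node gets so that they point at nodes with similar subscriptions, which is what cuts the forwarding of irrelevant messages.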
Ubiquitous Artificial Intelligence and Dynamic Data Streams
A. Bifet, J. Read
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3214345

Artificial Intelligence is leading to ubiquitous sources of Big Data arriving at high velocity and in real time. To deal with this effectively, we need to adapt to changes in the distribution of the data being produced, using a minimum amount of time and memory. In this paper, we detail modern applications in this context, discuss state-of-the-art methodologies for mining data streams in real time, and survey the open source tools available for machine learning and data mining in this challenging setting.
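The constraint the abstract names (adapting to distribution change under bounded time and memory) is often met with sliding-window estimators. A deliberately tiny illustration of the principle, not any specific method from the paper:

```python
from collections import deque

class WindowedMean:
    """Bounded-memory stream statistic: a fixed-size window forgets old
    data, so the estimate tracks the current distribution after drift."""
    def __init__(self, size=100):
        self.window = deque(maxlen=size)  # old items evicted automatically

    def update(self, x):
        self.window.append(x)
        return sum(self.window) / len(self.window)
```

After a drift from values near 1 to values near 9, the estimate converges to 9 within one window length, whereas a running mean over the full history would lag indefinitely.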
Bayesian Estimation of Vessel Destination and Arrival Times
Hyungkun Jung, Kang-Woo Lee, Joong-Hyun Choi, Eun-Sun Cho
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3220501

Predicting the destination port and arrival time of a vessel is challenging, even with a tremendous amount of trace data available. Our goal for this challenge is to build a solution that accurately predicts the destination port and arrival time of a given vessel using Bayesian inference and heuristics.
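The Bayesian idea can be sketched as a posterior over destination ports given the cells a vessel has traversed, built from historical trip counts. This is a hypothetical count-based model for illustration (uniform prior, Laplace smoothing, naive per-cell independence), not the authors' exact formulation.

```python
from collections import Counter, defaultdict

class DestinationModel:
    """P(port | observed route cells) from historical trip counts."""
    def __init__(self):
        self.counts = defaultdict(Counter)  # cell -> Counter of ports
        self.ports = set()

    def train(self, cells, port):
        """Record one historical trip: its traversed cells and destination."""
        self.ports.add(port)
        for cell in cells:
            self.counts[cell][port] += 1

    def posterior(self, observed_cells):
        """Uniform prior, Laplace-smoothed per-cell likelihoods, normalized."""
        post = {p: 1.0 for p in self.ports}
        for cell in observed_cells:
            c = self.counts[cell]
            total = sum(c.values()) + len(self.ports)  # +1 smoothing mass
            for p in post:
                post[p] *= (c[p] + 1) / total
        z = sum(post.values())
        return {p: v / z for p, v in post.items()}
```

The smoothing keeps cells never seen on a historical route to some port from zeroing out that port, which matters with sparse trace data.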
Optimization Strategies for Integration Pattern Compositions
Daniel Ritter, Norman May, F. Forsberg, S. Rinderle-Ma
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3210295

Enterprise Application Integration is the centerpiece of current on-premise, cloud and device integration scenarios. We describe optimization strategies that help reduce model complexity and improve process execution using design-time techniques. To achieve this, we formalize compositions of Enterprise Integration Patterns based on their characteristics and propose a realization of the optimization strategies using graph rewriting. The framework is successfully evaluated on a real-world catalog of pattern compositions containing over 900 integration scenarios.
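A flavor of such a design-time rewrite, in miniature: collapsing chains of adjacent filter steps in a pattern composition into one node, reducing model complexity without changing behavior. The rule and representation are invented for illustration and are not from the paper's formalization.

```python
def fuse_adjacent(nodes, fusable=("filter",)):
    """One toy rewrite rule over a linear pattern composition.
    nodes: ordered pipeline of (kind, label) steps. Consecutive steps of a
    fusable kind are merged into a single step."""
    out = []
    for kind, label in nodes:
        if out and kind in fusable and out[-1][0] == kind:
            # Merge into the preceding step of the same kind.
            out[-1] = (kind, out[-1][1] + "+" + label)
        else:
            out.append((kind, label))
    return out
```

On a real pattern graph this becomes a graph-rewriting rule with a match condition (two adjacent filter nodes) and a replacement (one fused node), applied until a fixpoint.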
Vessel Trajectory Prediction using Sequence-to-Sequence Models over Spatial Grid
Duc-Duy Nguyen, Chan Le Van, M. Ali
DEBS '18, June 25, 2018. DOI: https://doi.org/10.1145/3210284.3219775

In this paper, we propose a neural-network-based system to predict vessels' trajectories, including destination port and estimated arrival time. The system is designed to address the DEBS Grand Challenge 2018, which provides a set of data streams containing time-ordered vessel information and coordinates. Our goal is to design a system that accurately predicts the future trajectory, destination port and arrival time of a vessel. Our solution is based on a sequence-to-sequence model over a spatial grid: we divide the sea area into a grid and represent a vessel's recent trajectory as a sequence of cell codes, from which we extract its movement tendency. The extracted movement tendency allows us to predict future movements up to the destination. We built our solution on a distributed architecture and applied load-balancing techniques to achieve maximum performance and scalability. We also designed an interactive user interface that shows the real-time trajectories of vessels together with their predicted destinations and arrival times.
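The preprocessing step the abstract describes, turning a raw GPS track into the token sequence a sequence-to-sequence model consumes, can be sketched directly. The cell size and the duplicate-dropping choice are illustrative assumptions, not values from the paper.

```python
def to_cell_codes(track, cell_deg=0.1):
    """Map a GPS track [(lat, lon), ...] onto spatial-grid cell codes,
    dropping consecutive duplicates so the sequence records cell
    transitions -- the movement tendency -- rather than dwell time."""
    codes = []
    for lat, lon in track:
        code = (int(lat // cell_deg), int(lon // cell_deg))
        if not codes or codes[-1] != code:
            codes.append(code)
    return codes
```

Each cell code then plays the role of a vocabulary token: an encoder reads the recent code sequence and a decoder emits the predicted future codes up to the destination cell.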