Compact Data Structures for Temporal Graphs

2013 Data Compression Conference Pub Date : 2013-03-20 DOI:10.1109/DCC.2013.59

Guillermo de Bernardo, N. Brisaboa, Diego Caro, Michael A. Rodriguez

{"title":"Compact Data Structures for Temporal Graphs","authors":"Guillermo de Bernardo, N. Brisaboa, Diego Caro, Michael A. Rodriguez","doi":"10.1109/DCC.2013.59","DOIUrl":null,"url":null,"abstract":"Summary form only given. In this paper we propose three compact data structures to answer queries on temporal graphs. We define a temporal graph as a graph whose edges appear or disappear along time. Possible queries are related to adjacency along time, for example, to get the neighbors of a node at a given time point or interval. A naive representation consists of a time-ordered sequence of graphs, each of them valid at a particular time instant. The main issue of this representation is the unnecessary use of space if many nodes and their connections remain unchanged during a long period of time. The work in this paper proposes to store only what changes at each time instant. The ttk2-tree is conceptually a dynamic k2-tree in which each leaf and internal node contains a change list of time instants when its bit value has changed. All the change lists are stored consecutively in a dynamic sequence. During query processing, the change lists are used to expand only valid regions in the dynamic k2-tree. It supports updates of the current or past states of the graph. The ltg-index is a set of snapshots and logs of changes between consecutive snapshots. The structure keeps a log for each node, storing the edge and the time where a change has been produced. To retrieve direct neighbors of a node, the previous snapshot is queried, and then the log is traversed adding or removing edges to the result. The differential k2-tree stores snapshots of some time instants in k2-trees. For the other time instants, a k2-tree is also built, but these are differential (they store the edges that differ from the last snapshot). To perform a query it accesses the k2-tree of the given time and the previous full snapshot. The edges that appear in exactly one of these two k2-trees will be the final results. We test our proposals using synthetic and real datasets. Our results show that the ltg-index obtains the smallest space in general. We also measure times for direct and reverse neighbor queries in a time instant or a time interval. For all these queries, the times of our best proposal range from tens of μs to several ms, depending on the size of the dataset and the number of results returned. The ltg-index is the fastest for direct queries (almost as fast as accessing a snapshot), but it is 5-20 times slower in reverse queries. The differential k2-tree is very fast in time instant queries, but slower in time interval queries. The ttk2-tree obtains similar times for direct and reverse queries and different time intervals, being the fastest in some reverse interval queries. It has also the advantage of being dynamic.","PeriodicalId":388717,"journal":{"name":"2013 Data Compression Conference","volume":"159 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Data Compression Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.2013.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 20

Abstract

Summary form only given. In this paper we propose three compact data structures to answer queries on temporal graphs. We define a temporal graph as a graph whose edges appear or disappear along time. Possible queries are related to adjacency along time, for example, to get the neighbors of a node at a given time point or interval. A naive representation consists of a time-ordered sequence of graphs, each of them valid at a particular time instant. The main issue of this representation is the unnecessary use of space if many nodes and their connections remain unchanged during a long period of time. The work in this paper proposes to store only what changes at each time instant. The ttk2-tree is conceptually a dynamic k2-tree in which each leaf and internal node contains a change list of time instants when its bit value has changed. All the change lists are stored consecutively in a dynamic sequence. During query processing, the change lists are used to expand only valid regions in the dynamic k2-tree. It supports updates of the current or past states of the graph. The ltg-index is a set of snapshots and logs of changes between consecutive snapshots. The structure keeps a log for each node, storing the edge and the time where a change has been produced. To retrieve direct neighbors of a node, the previous snapshot is queried, and then the log is traversed adding or removing edges to the result. The differential k2-tree stores snapshots of some time instants in k2-trees. For the other time instants, a k2-tree is also built, but these are differential (they store the edges that differ from the last snapshot). To perform a query it accesses the k2-tree of the given time and the previous full snapshot. The edges that appear in exactly one of these two k2-trees will be the final results. We test our proposals using synthetic and real datasets. Our results show that the ltg-index obtains the smallest space in general. We also measure times for direct and reverse neighbor queries in a time instant or a time interval. For all these queries, the times of our best proposal range from tens of μs to several ms, depending on the size of the dataset and the number of results returned. The ltg-index is the fastest for direct queries (almost as fast as accessing a snapshot), but it is 5-20 times slower in reverse queries. The differential k2-tree is very fast in time instant queries, but slower in time interval queries. The ttk2-tree obtains similar times for direct and reverse queries and different time intervals, being the fastest in some reverse interval queries. It has also the advantage of being dynamic.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

时态图的紧凑数据结构

只提供摘要形式。在本文中，我们提出了三种紧凑的数据结构来回答时间图的查询。我们将时间图定义为其边随时间出现或消失的图。可能的查询与时间上的邻接性有关，例如，在给定的时间点或间隔上获取节点的邻居。朴素表示由时间顺序的图序列组成，其中每个图在特定的时间瞬间有效。这种表示的主要问题是，如果许多节点及其连接在很长一段时间内保持不变，则会不必要地使用空间。本文的工作建议只存储每个时刻的变化。ttk2-tree在概念上是一个动态的k2-tree，其中每个叶子和内部节点都包含其位值发生变化时的时间瞬间的变化列表。所有更改列表以动态顺序连续存储。在查询处理期间，更改列表仅用于扩展动态k2树中的有效区域。它支持对图的当前或过去状态进行更新。ltg-index是一组快照和连续快照之间的更改日志。该结构为每个节点保留日志，存储产生更改的边缘和时间。要检索节点的直接邻居，首先查询前一个快照，然后遍历日志，在结果中添加或删除边。差分k2-tree在k2-tree中存储了一些时间瞬间的快照。对于其他时间瞬间，也构建了一个k2树，但这些是不同的(它们存储与上次快照不同的边)。要执行查询，它将访问给定时间的k2树和上一个完整快照。在这两棵k2树中恰好出现的边就是最终结果。我们使用合成和真实的数据集来测试我们的建议。我们的结果表明，ltg-index在一般情况下获得最小的空间。我们还测量在一个时间瞬间或一个时间间隔内直接和反向邻居查询的时间。对于所有这些查询，我们的最佳建议的时间范围从几十μs到几ms，这取决于数据集的大小和返回的结果数量。ltg-index对于直接查询是最快的(几乎和访问快照一样快)，但是对于反向查询要慢5-20倍。微分k2树在即时查询中非常快，但在时间间隔查询中较慢。ttk2-tree对于直接查询和反向查询以及不同的时间间隔获得相似的时间，在某些反向间隔查询中是最快的。它还具有动态的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助