{"title":"Distributed Algorithm for Relationship Queries on Large Graphs","authors":"P. Agarwal, Maya Ramanath, Gautam M. Shroff","doi":"10.1145/2809948.2809949","DOIUrl":null,"url":null,"abstract":"Massive-sized graph-structured data is now ubiquitous, e.g., social networks, databases, knowledge-bases, web-graphs, etc. An important class of queries on graph-structured data is \"relationship queries\". Essentially, given a set of entities (corresponding to nodes in the graph), finding a ranked list of interesting interconnections among them. While this problem has been studied for many years, the solutions proposed in the literature so far focus on the non-distributed setting. Clearly, such solutions will not scale with large graphs having billions of nodes and edges that are becoming commonplace. In this paper, we present an algorithm for keyword search on large graphs, which is based on the distributed parallel processing paradigm. We also analyze why our algorithm generates optimal answers. Finally, we report on preliminary empirical results of relationship queries on a subset of the Linked-Open Data graph.","PeriodicalId":142249,"journal":{"name":"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2809948.2809949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Massive-sized graph-structured data is now ubiquitous, e.g., social networks, databases, knowledge-bases, web-graphs, etc. An important class of queries on graph-structured data is "relationship queries". Essentially, given a set of entities (corresponding to nodes in the graph), finding a ranked list of interesting interconnections among them. While this problem has been studied for many years, the solutions proposed in the literature so far focus on the non-distributed setting. Clearly, such solutions will not scale with large graphs having billions of nodes and edges that are becoming commonplace. In this paper, we present an algorithm for keyword search on large graphs, which is based on the distributed parallel processing paradigm. We also analyze why our algorithm generates optimal answers. Finally, we report on preliminary empirical results of relationship queries on a subset of the Linked-Open Data graph.