首页 > 最新文献

Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)最新文献

英文 中文
Going with the Flow: Real-Time Max-Flow on Asynchronous Dynamic Graphs 随波逐流:异步动态图的实时最大流量
Juntong Luo, Scott Sallinen, M. Ripeanu
Processing graphs that evolve over time has seen renewed attention. Processing solutions on dynamic graphs (often dubbed "graph streaming" solutions) aim to maintain the state for a graph query as the graph evolves over time, and to timely offer a solution (approximate, or precise) when requested by the user. In this space, and in the context of shared-nothing platforms, solutions have been proposed only for relatively simple problems (e.g., BFS, SSSP, PageRank), and some are limited to incremental-only evolutions traces. Support for more complex problems remains rather unexplored. To close this gap, we present a solution for the maximum flow problem that supports both add and delete events. We build this solution on top of an event-based abstraction. Integral to this abstraction is that events tied to both graph topology changes and algorithmic maintenance are processed asynchronously, concurrently, and autonomously (i.e., without shared state). We show that our implementation provides favourable time-to-solution and scales well by evaluating it on a real-world dynamic graph with 80 million edges. We compare its performance with snapshot-based solutions both internally (with our own implementation of a shared-nothing static algorithm) and externally (with Galois, a popular shared-memory framework for static graphs).
随着时间的推移,对图形的处理重新引起了人们的关注。动态图的处理解决方案(通常被称为“图流”解决方案)旨在维护图查询的状态,因为图随着时间的推移而演变,并在用户请求时及时提供解决方案(近似或精确)。在这个领域,在无共享平台的背景下,只针对相对简单的问题(例如,BFS、SSSP、PageRank)提出了解决方案,有些解决方案仅限于增量式的进化轨迹。对更复杂问题的支持仍然相当未被探索。为了缩小这一差距,我们提出了一个支持添加和删除事件的最大流问题的解决方案。我们在基于事件的抽象之上构建此解决方案。这个抽象的整体是,与图拓扑变化和算法维护相关的事件被异步、并发和自主地处理(即,没有共享状态)。我们证明了我们的实现提供了有利的时间到解决方案,并通过在具有8000万条边的现实世界动态图上进行评估来很好地扩展。我们将其性能与基于快照的解决方案进行了内部(使用我们自己的无共享静态算法实现)和外部(使用Galois,一种流行的用于静态图形的共享内存框架)的比较。
{"title":"Going with the Flow: Real-Time Max-Flow on Asynchronous Dynamic Graphs","authors":"Juntong Luo, Scott Sallinen, M. Ripeanu","doi":"10.1145/3594778.3594882","DOIUrl":"https://doi.org/10.1145/3594778.3594882","url":null,"abstract":"Processing graphs that evolve over time has seen renewed attention. Processing solutions on dynamic graphs (often dubbed \"graph streaming\" solutions) aim to maintain the state for a graph query as the graph evolves over time, and to timely offer a solution (approximate, or precise) when requested by the user. In this space, and in the context of shared-nothing platforms, solutions have been proposed only for relatively simple problems (e.g., BFS, SSSP, PageRank), and some are limited to incremental-only evolutions traces. Support for more complex problems remains rather unexplored. To close this gap, we present a solution for the maximum flow problem that supports both add and delete events. We build this solution on top of an event-based abstraction. Integral to this abstraction is that events tied to both graph topology changes and algorithmic maintenance are processed asynchronously, concurrently, and autonomously (i.e., without shared state). We show that our implementation provides favourable time-to-solution and scales well by evaluating it on a real-world dynamic graph with 80 million edges. We compare its performance with snapshot-based solutions both internally (with our own implementation of a shared-nothing static algorithm) and externally (with Galois, a popular shared-memory framework for static graphs).","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115563400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fast Synthetic Data-Aware Log Generation for Temporal Declarative Models 用于时态声明模型的快速综合数据感知日志生成
Giacomo Bergami
Business Process Management algorithms are heavily limited by suboptimal algorithmic implementations that cannot leverage state-of-the-art algorithms in the field of relational and graph databases. The recent interest in this discipline for various IT sectors (cyber-security, Industry 4.0, and e-Health) calls for defining new algorithms improving the performance of existing ones. This paper focuses on generating several traces collected in a log from declarative temporal models by pre-emptively representing those as a specific type of finite state automaton: we show that this task boils down to a single-source multi-target graph traversal on such automaton where both the number of distinct paths to be visited as well as their length are bounded. This paper presents a novel algorithm running in polynomial time over the size of the declarative model represented as a graph and the desired log's size. The final experiments show that the resulting algorithm outperforms the state-of-the-art data-aware and dataless sequence generations in business process management.
业务流程管理算法受到次优算法实现的严重限制,这些算法无法利用关系数据库和图形数据库领域中最先进的算法。最近,各个IT部门(网络安全、工业4.0和电子健康)对这一学科的兴趣要求定义新的算法来改进现有算法的性能。本文着重于通过将声明性时间模型先发制人地表示为特定类型的有限状态自动机来生成日志中收集的几个轨迹:我们表明,该任务归结为在这种自动机上的单源多目标图遍历,其中要访问的不同路径的数量及其长度都是有限的。本文提出了一种新的算法,在以图表示的声明性模型的大小和所需日志的大小的多项式时间内运行。最后的实验表明,所得到的算法在业务流程管理中优于最先进的数据感知和无数据序列生成。
{"title":"Fast Synthetic Data-Aware Log Generation for Temporal Declarative Models","authors":"Giacomo Bergami","doi":"10.1145/3594778.3594881","DOIUrl":"https://doi.org/10.1145/3594778.3594881","url":null,"abstract":"Business Process Management algorithms are heavily limited by suboptimal algorithmic implementations that cannot leverage state-of-the-art algorithms in the field of relational and graph databases. The recent interest in this discipline for various IT sectors (cyber-security, Industry 4.0, and e-Health) calls for defining new algorithms improving the performance of existing ones. This paper focuses on generating several traces collected in a log from declarative temporal models by pre-emptively representing those as a specific type of finite state automaton: we show that this task boils down to a single-source multi-target graph traversal on such automaton where both the number of distinct paths to be visited as well as their length are bounded. This paper presents a novel algorithm running in polynomial time over the size of the declarative model represented as a graph and the desired log's size. The final experiments show that the resulting algorithm outperforms the state-of-the-art data-aware and dataless sequence generations in business process management.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"62 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114060256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Demonstration of Interpretability Methods for Graph Neural Networks 图神经网络可解释性方法的演示
Ehsan Bonabi Mobaraki, Arijit Khan
Graph neural networks (GNNs) are widely used in many downstream applications, such as graphs and nodes classification, entity resolution, link prediction, and question answering. Several interpretability methods for GNNs have been proposed recently. However, since they have not been thoroughly compared with each other, their trade-offs and efficiency in the context of underlying GNNs and downstream applications are unclear. To support more research in this domain, we develop an end-to-end interactive tool, named gInterpreter, by re-implementing 15 recent GNN interpretability methods in a common environment on top of a number of state-of-the-art GNNs employed for different downstream tasks. This paper demonstrates gInterpreter with an interactive performance profiling of 15 recent GNN inter-pretability methods, aiming to explain the complex deep learning pipelines over graph-structured data.
图神经网络(gnn)广泛应用于图和节点分类、实体解析、链路预测和问题回答等下游应用。最近提出了几种gnn的可解释性方法。然而,由于它们之间尚未进行彻底的比较,因此它们在潜在gnn和下游应用背景下的权衡和效率尚不清楚。为了支持该领域的更多研究,我们开发了一个端到端交互工具,名为gInterpreter,通过在用于不同下游任务的许多最先进的GNN之上的公共环境中重新实现15种最新的GNN可解释性方法。本文通过15种最新GNN可解释性方法的交互式性能分析来演示gInterpreter,旨在解释图结构数据上复杂的深度学习管道。
{"title":"A Demonstration of Interpretability Methods for Graph Neural Networks","authors":"Ehsan Bonabi Mobaraki, Arijit Khan","doi":"10.1145/3594778.3594880","DOIUrl":"https://doi.org/10.1145/3594778.3594880","url":null,"abstract":"Graph neural networks (GNNs) are widely used in many downstream applications, such as graphs and nodes classification, entity resolution, link prediction, and question answering. Several interpretability methods for GNNs have been proposed recently. However, since they have not been thoroughly compared with each other, their trade-offs and efficiency in the context of underlying GNNs and downstream applications are unclear. To support more research in this domain, we develop an end-to-end interactive tool, named gInterpreter, by re-implementing 15 recent GNN interpretability methods in a common environment on top of a number of state-of-the-art GNNs employed for different downstream tasks. This paper demonstrates gInterpreter with an interactive performance profiling of 15 recent GNN inter-pretability methods, aiming to explain the complex deep learning pipelines over graph-structured data.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"901 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130757907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Commercial Side of Graph Analytics: Big Uses, Big Mistakes, Big Opportunities 图表分析的商业面:大用途,大错误,大机会
A. Hodler
Connectivity is the cornerstone of our contemporary world, permeating various sectors like retail, communications, biology, and finance. Although this inherent interconnectedness holds substantial meaning and predictive power, harnessing it for practical use in the business realm often proves challenging. In this presentation, we will delve into the commercial applications of graph analytics, highlighting both common pitfalls to avoid and promising opportunities to explore. To begin, we will explore the prevalent use cases of graph analytics, encompassing areas such as fraud detection, supply chain optimization, data management, and recommendations. We'll also shed light on why many teams tend to deploy only a limited set of graph algorithms. Additionally, we will examine how the COVID-19 pandemic has impacted the utilization of graphs in business settings. Next, we will venture into the major mistakes that businesses often make when implementing graph analytics. These blunders range from technical hurdles like scalability issues and handling tricky data types to human challenges such as fostering a graph-thinking mindset and avoiding excessive perfectionism. Moreover, you will gain quick tips to help teams secure funding for graph projects. Lastly, we will delve into some of the most significant prospects within the commercial space. We will address enduring challenges, such as transforming business data into a graph format and ensuring interoperability with production processes. We will also dedicate time to exploring the rising interest in combining graphs with AI systems, particularly the recent buzz surrounding combining graphs with generative AI. While this particular trend garners attention, we will look at other promising opportunities that it may overshadow. By the end of this talk, you will have gained a comprehensive understanding of the practical applications of graph analytics in business contexts. Furthermore, you'll gain valuable knowledge about pitfalls to avoid, strategies for securing funding, and a forward-looking perspective on emerging possibilities in this dynamic field.
互联互通是当今世界的基石,已渗透到零售、通信、生物、金融等各个领域。尽管这种内在的互联性具有实质性的意义和预测能力,但将其用于商业领域的实际应用通常是具有挑战性的。在本次演讲中,我们将深入探讨图形分析的商业应用,强调要避免的常见陷阱和有希望探索的机会。首先,我们将探讨图形分析的流行用例,包括欺诈检测、供应链优化、数据管理和建议等领域。我们还将阐明为什么许多团队倾向于只部署一组有限的图算法。此外,我们将研究COVID-19大流行如何影响商业环境中图形的使用。接下来,我们将探讨企业在实施图形分析时经常犯的主要错误。这些错误包括技术障碍,如可扩展性问题和处理棘手的数据类型,以及人类挑战,如培养图形思维心态和避免过度的完美主义。此外,您将获得快速提示,以帮助团队获得图形项目的资金。最后,我们将深入探讨商业领域中一些最重要的前景。我们将解决长期存在的挑战,例如将业务数据转换为图形格式,并确保与生产流程的互操作性。我们还将花时间探讨将图与人工智能系统相结合的兴趣,特别是最近围绕将图与生成式人工智能相结合的嗡嗡声。虽然这种特殊的趋势引起了人们的注意,但我们将关注它可能掩盖的其他有希望的机会。在本讲座结束时,您将对图形分析在商业环境中的实际应用有一个全面的了解。此外,您将获得有关避免陷阱的宝贵知识、获得资金的策略以及对这个动态领域中出现的可能性的前瞻性观点。
{"title":"The Commercial Side of Graph Analytics: Big Uses, Big Mistakes, Big Opportunities","authors":"A. Hodler","doi":"10.1145/3594778.3596883","DOIUrl":"https://doi.org/10.1145/3594778.3596883","url":null,"abstract":"Connectivity is the cornerstone of our contemporary world, permeating various sectors like retail, communications, biology, and finance. Although this inherent interconnectedness holds substantial meaning and predictive power, harnessing it for practical use in the business realm often proves challenging. In this presentation, we will delve into the commercial applications of graph analytics, highlighting both common pitfalls to avoid and promising opportunities to explore. To begin, we will explore the prevalent use cases of graph analytics, encompassing areas such as fraud detection, supply chain optimization, data management, and recommendations. We'll also shed light on why many teams tend to deploy only a limited set of graph algorithms. Additionally, we will examine how the COVID-19 pandemic has impacted the utilization of graphs in business settings. Next, we will venture into the major mistakes that businesses often make when implementing graph analytics. These blunders range from technical hurdles like scalability issues and handling tricky data types to human challenges such as fostering a graph-thinking mindset and avoiding excessive perfectionism. Moreover, you will gain quick tips to help teams secure funding for graph projects. Lastly, we will delve into some of the most significant prospects within the commercial space. We will address enduring challenges, such as transforming business data into a graph format and ensuring interoperability with production processes. We will also dedicate time to exploring the rising interest in combining graphs with AI systems, particularly the recent buzz surrounding combining graphs with generative AI. While this particular trend garners attention, we will look at other promising opportunities that it may overshadow. By the end of this talk, you will have gained a comprehensive understanding of the practical applications of graph analytics in business contexts. Furthermore, you'll gain valuable knowledge about pitfalls to avoid, strategies for securing funding, and a forward-looking perspective on emerging possibilities in this dynamic field.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115997436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Better Distributed Graph Query Planning With Scouting Queries 更好的分布式图查询规划与侦察查询
T. Faltín, Vasileios Trigonakis, Ayoub Berdai, Luigi Fusco, Călin Iorgulescu, Sungpack Hong, Hassan Chafi
Query planning is essential for graph query execution performance. In distributed graph processing, data partitioning and messaging significantly influence performance. However, these aspects are difficult to model analytically, which makes query planning especially challenging. This paper introduces scouting queries, a lightweight mechanism to gather runtime information about different query plans, which can then be used to choose the "best" plan. In a pipelined, depth-first-oriented graph processing engine, scouting queries typically execute for a brief amount of time with negligible overhead. Partial results can be reused to avoid redundant work. We evaluate scouting queries and show that they bring speedups of up to 8.7× for heavy queries, while adding low overhead for those queries that do not benefit.
查询规划对于图查询执行性能至关重要。在分布式图处理中,数据分区和消息传递对性能影响很大。然而,很难对这些方面进行分析建模,这使得查询规划特别具有挑战性。本文介绍了侦察查询,这是一种轻量级机制,用于收集关于不同查询计划的运行时信息,然后可以使用这些信息来选择“最佳”计划。在流水线的、面向深度优先的图形处理引擎中,侦察查询通常执行的时间很短,开销可以忽略不计。可以重用部分结果以避免冗余工作。我们对侦察查询进行了评估,结果表明,它们为重查询带来了高达8.7倍的速度提升,同时为那些没有好处的查询增加了较低的开销。
{"title":"Better Distributed Graph Query Planning With Scouting Queries","authors":"T. Faltín, Vasileios Trigonakis, Ayoub Berdai, Luigi Fusco, Călin Iorgulescu, Sungpack Hong, Hassan Chafi","doi":"10.1145/3594778.3594884","DOIUrl":"https://doi.org/10.1145/3594778.3594884","url":null,"abstract":"Query planning is essential for graph query execution performance. In distributed graph processing, data partitioning and messaging significantly influence performance. However, these aspects are difficult to model analytically, which makes query planning especially challenging. This paper introduces scouting queries, a lightweight mechanism to gather runtime information about different query plans, which can then be used to choose the \"best\" plan. In a pipelined, depth-first-oriented graph processing engine, scouting queries typically execute for a brief amount of time with negligible overhead. Partial results can be reused to avoid redundant work. We evaluate scouting queries and show that they bring speedups of up to 8.7× for heavy queries, while adding low overhead for those queries that do not benefit.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"17 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113957008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Future-Time Temporal Path Queries 未来时间时间路径查询
Christos Gkartzios, E. Pitoura
Most previous research considers processing queries on the current or previous states of a graph. In this paper, we propose processing future-time graph queries, i.e., predicting the output of a query on some future state of the graph. To process future-time queries, we present a generic approach that exploits a predictive model that provides oracles about the future state of the graph. We focus on future-time shortest path queries that given a temporal graph and two nodes return the shortest path between them at some future time. We present two algorithms each invoking a different type of oracle: (a) a link prediction oracle that given two nodes returns the probability of an edge between them, and (b) a connection prediction oracle that given a node u and a future time instance t returns the node υ that u will connect to at t. Finally, we present experimental results using off-the-shelf prediction models that provide such oracles.
大多数先前的研究都考虑对图的当前或以前的状态处理查询。在本文中,我们提出了处理未来时间图查询,即预测在图的某些未来状态下查询的输出。为了处理未来的查询,我们提出了一种利用预测模型的通用方法,该模型提供了关于图的未来状态的预言。我们关注的是未来时间最短路径查询,它给出一个时间图,两个节点返回它们之间在未来某个时间的最短路径。我们提出了两种算法,每种算法调用不同类型的oracle:(a)给定两个节点返回它们之间边的概率的链接预测oracle,以及(b)给定节点u和未来时间实例t的连接预测oracle,返回u将在t连接到的节点υ。最后,我们使用现成的预测模型提供了实验结果。
{"title":"Future-Time Temporal Path Queries","authors":"Christos Gkartzios, E. Pitoura","doi":"10.1145/3594778.3594879","DOIUrl":"https://doi.org/10.1145/3594778.3594879","url":null,"abstract":"Most previous research considers processing queries on the current or previous states of a graph. In this paper, we propose processing future-time graph queries, i.e., predicting the output of a query on some future state of the graph. To process future-time queries, we present a generic approach that exploits a predictive model that provides oracles about the future state of the graph. We focus on future-time shortest path queries that given a temporal graph and two nodes return the shortest path between them at some future time. We present two algorithms each invoking a different type of oracle: (a) a link prediction oracle that given two nodes returns the probability of an edge between them, and (b) a connection prediction oracle that given a node u and a future time instance t returns the node υ that u will connect to at t. Finally, we present experimental results using off-the-shelf prediction models that provide such oracles.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114419525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EAGER: Explainable Question Answering Using Knowledge Graphs EAGER:使用知识图谱进行可解释的问题回答
Andrew Chai, Alireza Vezvaei, Lukasz Golab, M. Kargar, D. Srivastava, Jaroslaw Szlichta, Morteza Zihayat
We present EAGER: a tool for answering questions expressed in natural language. Core to EAGER is a modular pipeline for generating a knowledge graph from raw text without human intervention. Notably, EAGER uses the knowledge graph to answer questions and to explain the reasoning behind the derivation of answers. Our demonstration will showcase both the automated knowledge graph generation pipeline and the explainable question answering functionality. Lastly, we outline open problems and directions for future work.
我们提出EAGER:一个回答用自然语言表达的问题的工具。EAGER的核心是一个模块化管道,用于在没有人工干预的情况下从原始文本生成知识图。值得注意的是,EAGER使用知识图来回答问题并解释推导答案背后的推理。我们的演示将展示自动知识图生成管道和可解释的问答功能。最后,我们概述了有待解决的问题和未来工作的方向。
{"title":"EAGER: Explainable Question Answering Using Knowledge Graphs","authors":"Andrew Chai, Alireza Vezvaei, Lukasz Golab, M. Kargar, D. Srivastava, Jaroslaw Szlichta, Morteza Zihayat","doi":"10.1145/3594778.3594877","DOIUrl":"https://doi.org/10.1145/3594778.3594877","url":null,"abstract":"We present EAGER: a tool for answering questions expressed in natural language. Core to EAGER is a modular pipeline for generating a knowledge graph from raw text without human intervention. Notably, EAGER uses the knowledge graph to answer questions and to explain the reasoning behind the derivation of answers. Our demonstration will showcase both the automated knowledge graph generation pipeline and the explainable question answering functionality. Lastly, we outline open problems and directions for future work.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122536547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph Feature Management: Impact, Challenges and Opportunities 图表特征管理:影响、挑战和机遇
James Cheng
Graph features are crucial to many applications such as recommender systems and risk management systems. The process to obtain useful graph features involves ingesting data from various upstream data sources, defining the desired graph features for the required applications, constructing a feature engineering workflow to compute the features, and storing and managing the resulting features for downstream tasks (e.g., graph AI and graph BI) and for future reuse. To the majority of users, especially SMEs and non-tech companies, this process poses daunting challenges as it requires users to not only learn various methods (e.g., graph analytical algorithms, non-GNN graph embeddings, GNNs) to define graph features and program their computation, but also learn many infrastructures (e.g., upstream databases, downstream ML systems, graph analytics systems) to compute, manage and use the graph features in production. These challenges have significantly restricted the wider applications of graph technologies such as graph AI and graph BI currently in industry. The current solution provided by major graph database vendors (e.g., Amazon Neptune, Neo4j, Tiger-Graph) is to connect various upstream and downstream systems to their own graph database, which is used to compute and manage graph features. However, such a solution ties users to a specific graph infrastructure that may not be the preferred infrastructure and may even require them to re-develop their applications on a new infrastructure. In addition, a specific graph database or infrastructure often does not have the best performance for all workloads and certainly does not support the computation of all types of graph features. As a result, the existing solution limits users' flexibility in choosing their own infrastructure and their productivity in developing their applications. In Part 1 of this talk, I will introduce various types of graph features and their applications. Then I will present some trends in using graph databases for graph feature computation and management, analyze the limitations of the existing methods, and identify the requirements of a graph feature management solution that is practical and highly usable to average users. In Part 2 of this talk, I will introduce our ongoing project that aims at providing a highly usable graph feature platform. Our solution decouples graph feature logic specification and management (i.e., how features are defined, coded and managed) from the generation and execution of the workflow for feature computation (i.e., execution plan generation and the actual execution), so that users can flexibly select different infrastructures suitable for the computation of specific types of graph features. It also manages the upstream, downstream and feature engineering and serving infrastructures, so as to free users from tedious tasks associated with deploying infrastructures and connecting them in a feature engineering dataflow. Thus, users can focus on creating
图形特性对于许多应用程序(如推荐系统和风险管理系统)至关重要。获取有用的图形特征的过程包括从各种上游数据源摄取数据,为所需的应用定义所需的图形特征,构建特征工程工作流来计算特征,并为下游任务(例如,图形AI和图形BI)和未来重用存储和管理结果特征。对于大多数用户,特别是中小企业和非科技公司来说,这个过程带来了艰巨的挑战,因为它要求用户不仅要学习各种方法(例如,图分析算法,非gnn图嵌入,gnn)来定义图特征并对其计算进行编程,还需要学习许多基础设施(例如,上游数据库,下游ML系统,图分析系统)来计算,管理和使用生产中的图特征。这些挑战极大地限制了图形技术(如图形人工智能和图形商业智能)在工业上的广泛应用。目前主要的图数据库供应商(例如Amazon Neptune, Neo4j, Tiger-Graph)提供的解决方案是将各种上下游系统连接到他们自己的图数据库,该数据库用于计算和管理图特征。然而,这样的解决方案将用户绑定到特定的图形基础设施上,而这些基础设施可能不是首选的基础设施,甚至可能要求用户在新的基础设施上重新开发应用程序。此外,特定的图形数据库或基础设施通常不会对所有工作负载具有最佳性能,并且肯定不支持所有类型的图形特征的计算。因此,现有的解决方案限制了用户选择自己的基础设施的灵活性和开发应用程序的生产力。在本讲座的第1部分,我将介绍各种类型的图形特征及其应用。然后,我将介绍使用图数据库进行图特征计算和管理的一些趋势,分析现有方法的局限性,并确定一种实用且对普通用户高可用性的图特征管理解决方案的需求。在本演讲的第2部分,我将介绍我们正在进行的项目,旨在提供一个高度可用的图形特性平台。我们的解决方案将图特征逻辑规范和管理(即特征如何定义、编码和管理)与特征计算工作流的生成和执行(即执行计划的生成和实际执行)解耦,以便用户可以灵活地选择适合特定类型图特征计算的不同基础设施。它还管理上游、下游、特征工程和服务基础设施,从而将用户从部署基础设施和在特征工程数据流中连接它们的繁琐任务中解放出来。因此,用户可以专注于创建和交付创新的功能工作流逻辑。最后,我还将强调图形特征管理的一些可能的未来方向。
{"title":"Graph Feature Management: Impact, Challenges and Opportunities","authors":"James Cheng","doi":"10.1145/3594778.3596882","DOIUrl":"https://doi.org/10.1145/3594778.3596882","url":null,"abstract":"Graph features are crucial to many applications such as recommender systems and risk management systems. The process to obtain useful graph features involves ingesting data from various upstream data sources, defining the desired graph features for the required applications, constructing a feature engineering workflow to compute the features, and storing and managing the resulting features for downstream tasks (e.g., graph AI and graph BI) and for future reuse. To the majority of users, especially SMEs and non-tech companies, this process poses daunting challenges as it requires users to not only learn various methods (e.g., graph analytical algorithms, non-GNN graph embeddings, GNNs) to define graph features and program their computation, but also learn many infrastructures (e.g., upstream databases, downstream ML systems, graph analytics systems) to compute, manage and use the graph features in production. These challenges have significantly restricted the wider applications of graph technologies such as graph AI and graph BI currently in industry. The current solution provided by major graph database vendors (e.g., Amazon Neptune, Neo4j, Tiger-Graph) is to connect various upstream and downstream systems to their own graph database, which is used to compute and manage graph features. However, such a solution ties users to a specific graph infrastructure that may not be the preferred infrastructure and may even require them to re-develop their applications on a new infrastructure. In addition, a specific graph database or infrastructure often does not have the best performance for all workloads and certainly does not support the computation of all types of graph features. As a result, the existing solution limits users' flexibility in choosing their own infrastructure and their productivity in developing their applications. In Part 1 of this talk, I will introduce various types of graph features and their applications. Then I will present some trends in using graph databases for graph feature computation and management, analyze the limitations of the existing methods, and identify the requirements of a graph feature management solution that is practical and highly usable to average users. In Part 2 of this talk, I will introduce our ongoing project that aims at providing a highly usable graph feature platform. Our solution decouples graph feature logic specification and management (i.e., how features are defined, coded and managed) from the generation and execution of the workflow for feature computation (i.e., execution plan generation and the actual execution), so that users can flexibly select different infrastructures suitable for the computation of specific types of graph features. It also manages the upstream, downstream and feature engineering and serving infrastructures, so as to free users from tedious tasks associated with deploying infrastructures and connecting them in a feature engineering dataflow. Thus, users can focus on creating","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130600300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Graph Neural Networks using Exact Compression 使用精确压缩学习图神经网络
Jeroen Bollen, Jasper Steegmans, Jan Van den Bussche, Stijn Vansummeren
Graph Neural Networks (GNNs) are a form of deep learning that enable a wide range of machine learning applications on graph-structured data. The learning of GNNs, however, is known to pose challenges for memory-constrained devices such as GPUs. In this paper, we study exact compression as a way to reduce the memory requirements of learning GNNs on large graphs. In particular, we adopt a formal approach to compression and propose a methodology that transforms GNN learning problems into provably equivalent compressed GNN learning problems. In a preliminary experimental evaluation, we give insights into the compression ratios that can be obtained on real-world graphs and apply our methodology to an existing GNN benchmark.
图神经网络(gnn)是深度学习的一种形式,可以在图结构数据上实现广泛的机器学习应用。然而,众所周知,gnn的学习对gpu等内存受限的设备构成了挑战。在本文中,我们研究精确压缩作为一种减少在大图上学习gnn的内存需求的方法。特别是,我们采用了一种形式化的压缩方法,并提出了一种将GNN学习问题转换为可证明等效压缩GNN学习问题的方法。在初步的实验评估中,我们深入了解了可以在现实世界的图形上获得的压缩比,并将我们的方法应用于现有的GNN基准。
{"title":"Learning Graph Neural Networks using Exact Compression","authors":"Jeroen Bollen, Jasper Steegmans, Jan Van den Bussche, Stijn Vansummeren","doi":"10.1145/3594778.3594878","DOIUrl":"https://doi.org/10.1145/3594778.3594878","url":null,"abstract":"Graph Neural Networks (GNNs) are a form of deep learning that enable a wide range of machine learning applications on graph-structured data. The learning of GNNs, however, is known to pose challenges for memory-constrained devices such as GPUs. In this paper, we study exact compression as a way to reduce the memory requirements of learning GNNs on large graphs. In particular, we adopt a formal approach to compression and propose a methodology that transforms GNN learning problems into provably equivalent compressed GNN learning problems. In a preliminary experimental evaluation, we give insights into the compression ratios that can be obtained on real-world graphs and apply our methodology to an existing GNN benchmark.","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130611363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA) 第六届图数据管理经验与系统(等级)与网络数据分析(NDA)联合研讨会论文集
{"title":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","authors":"","doi":"10.1145/3594778","DOIUrl":"https://doi.org/10.1145/3594778","url":null,"abstract":"","PeriodicalId":371215,"journal":{"name":"Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115373235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1