首页 > 最新文献

Operating Systems Review (ACM)最新文献

英文 中文
Taking the Pulse of Financial Activities with Online Graph Processing 用在线图形处理把握金融活动的脉搏
Q3 Computer Science Pub Date : 2021-06-04 DOI: 10.1145/3469379.3469389
Xiaowei Zhu, Zhisong Fu, Zhenxuan Pan, Jin Jiang, Chuntao Hong, Yongchao Liu, Yang Fang, Wenguang Chen, Changhua He
Graph processing has been widely adopted in various financial scenarios at Ant Group to detect malicious and prohibited user behaviors. The low latency requirement under big data volume and high throughput raises rigorous challenges for efficient online graph processing. This paper gives a brief introduction of our encountered issues, the current solutions, and some future directions we are exploring.
图处理在蚂蚁集团的各种金融场景中被广泛采用,用于检测恶意和被禁止的用户行为。大数据量和高吞吐量下的低延迟要求对高效的在线图形处理提出了严峻的挑战。本文简要介绍了我们遇到的问题,目前的解决方案,以及我们正在探索的一些未来的方向。
{"title":"Taking the Pulse of Financial Activities with Online Graph Processing","authors":"Xiaowei Zhu, Zhisong Fu, Zhenxuan Pan, Jin Jiang, Chuntao Hong, Yongchao Liu, Yang Fang, Wenguang Chen, Changhua He","doi":"10.1145/3469379.3469389","DOIUrl":"https://doi.org/10.1145/3469379.3469389","url":null,"abstract":"Graph processing has been widely adopted in various financial scenarios at Ant Group to detect malicious and prohibited user behaviors. The low latency requirement under big data volume and high throughput raises rigorous challenges for efficient online graph processing. This paper gives a brief introduction of our encountered issues, the current solutions, and some future directions we are exploring.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"84 - 87"},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469389","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47808632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Deeper Dive into Pattern-Aware Subgraph Exploration with PEREGRINE 使用PEREGRINE深入研究模式感知子图探索
Q3 Computer Science Pub Date : 2021-06-04 DOI: 10.1145/3469379.3469381
Kasra Jamshidi, Keval Vora
Graph mining workloads aim to extract structural properties of a graph by exploring its subgraph structures. PEREGRINE is a general-purpose graph mining system that provides a generic runtime to efficiently explore subgraph structures of interest and perform various graph mining analyses. It takes a 'pattern-aware' approach by incorporating a pattern-based programming model along with efficient pattern matching strategies. The programming model enables easier expression of complex graph mining use cases and enables PEREGRINE to extract the semantics of patterns. By analyzing the patterns, PEREGRINE generates efficient exploration plans which it uses to guide its subgraph exploration. In this paper, we present an in-depth view of the patternanalysis techniques powering the matching engine of PEREGRINE. Beyond the theoretical foundations from prior research, we expose opportunities based on how the exploration plans are evaluated, and develop key techniques for computation reuse, enumeration depth reduction, and branch elimination. Our experiments show the importance of patternawareness for scalable and performant graph mining where the presented new techniques speed up the performance by up to two orders of magnitude on top of the benefits achieved from the prior theoretical foundations that generate the initial exploration plans.
图挖掘工作负载旨在通过探索图的子图结构来提取图的结构属性。PEREGRINE是一个通用的图挖掘系统,它提供了一个通用的运行时来有效地探索感兴趣的子图结构并执行各种图挖掘分析。它采用“模式感知”方法,将基于模式的编程模型与高效的模式匹配策略结合在一起。编程模型使复杂的图挖掘用例更容易表达,并使PEREGRINE能够提取模式的语义。PEREGRINE通过对模式的分析,生成有效的勘探计划,并以此指导子图的勘探。在本文中,我们深入地介绍了为PEREGRINE匹配引擎提供动力的模式分析技术。除了先前研究的理论基础之外,我们还根据如何评估勘探计划揭示了机会,并开发了计算重用、枚举深度减少和分支消除的关键技术。我们的实验显示了模式感知对于可扩展和高性能图挖掘的重要性,其中所提出的新技术在产生初始勘探计划的先前理论基础所获得的好处之上,将性能提高了两个数量级。
{"title":"A Deeper Dive into Pattern-Aware Subgraph Exploration with PEREGRINE","authors":"Kasra Jamshidi, Keval Vora","doi":"10.1145/3469379.3469381","DOIUrl":"https://doi.org/10.1145/3469379.3469381","url":null,"abstract":"Graph mining workloads aim to extract structural properties of a graph by exploring its subgraph structures. PEREGRINE is a general-purpose graph mining system that provides a generic runtime to efficiently explore subgraph structures of interest and perform various graph mining analyses. It takes a 'pattern-aware' approach by incorporating a pattern-based programming model along with efficient pattern matching strategies. The programming model enables easier expression of complex graph mining use cases and enables PEREGRINE to extract the semantics of patterns. By analyzing the patterns, PEREGRINE generates efficient exploration plans which it uses to guide its subgraph exploration. In this paper, we present an in-depth view of the patternanalysis techniques powering the matching engine of PEREGRINE. Beyond the theoretical foundations from prior research, we expose opportunities based on how the exploration plans are evaluated, and develop key techniques for computation reuse, enumeration depth reduction, and branch elimination. Our experiments show the importance of patternawareness for scalable and performant graph mining where the presented new techniques speed up the performance by up to two orders of magnitude on top of the benefits achieved from the prior theoretical foundations that generate the initial exploration plans.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"1 - 10"},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469381","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43971497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Towards Next-Generation Cybersecurity with Graph AI 用图形人工智能实现下一代网络安全
Q3 Computer Science Pub Date : 2021-06-04 DOI: 10.1145/3469379.3469386
Benjamin Bowman, H. H. Huang
Cybersecurity professionals are inundated with large amounts of data, and require intelligent algorithms capable of distinguishing vulnerable from patched, normal from anomalous, and malicious from benign. Unfortunately, not all machine learning (ML) and artificial intelligence (AI) algorithms are created equal, and in this position paper we posit that a new breed of ML, specifically graph-based machine learning (Graph AI), is poised to make a significant impact in this domain. We will discuss the primary differentiators between traditional ML and graph ML, and provide reasons and justifications for why the latter is well-suited to many aspects of cybersecurity. We will present several example applications and result of graph ML in cybersecurity, followed by a discussion of the challenges that lie ahead.
网络安全专业人员被大量数据淹没,需要能够区分漏洞与修补、正常与异常、恶意与良性的智能算法。不幸的是,并非所有的机器学习(ML)和人工智能(AI)算法都是平等的,在这篇立场论文中,我们假设一种新的机器学习,特别是基于图的机器学习。我们将讨论传统ML和图ML之间的主要区别,并提供后者非常适合网络安全的许多方面的原因和理由。我们将介绍图ML在网络安全中的几个示例应用和结果,然后讨论未来的挑战。
{"title":"Towards Next-Generation Cybersecurity with Graph AI","authors":"Benjamin Bowman, H. H. Huang","doi":"10.1145/3469379.3469386","DOIUrl":"https://doi.org/10.1145/3469379.3469386","url":null,"abstract":"Cybersecurity professionals are inundated with large amounts of data, and require intelligent algorithms capable of distinguishing vulnerable from patched, normal from anomalous, and malicious from benign. Unfortunately, not all machine learning (ML) and artificial intelligence (AI) algorithms are created equal, and in this position paper we posit that a new breed of ML, specifically graph-based machine learning (Graph AI), is poised to make a significant impact in this domain. We will discuss the primary differentiators between traditional ML and graph ML, and provide reasons and justifications for why the latter is well-suited to many aspects of cybersecurity. We will present several example applications and result of graph ML in cybersecurity, followed by a discussion of the challenges that lie ahead.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"61 - 67"},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469386","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47663855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
GraphZero GraphZero
Q3 Computer Science Pub Date : 2021-06-02 DOI: 10.1145/3469379.3469383
Daniel Mawhirter, Sam Reinehr, Connor Holmes, Tongping Liu, Bo Wu
Subgraph matching is a fundamental task in many applications which identifies all the embeddings of a query pattern in an input graph. Compilation-based subgraph matching systems generate specialized implementations for the provided patterns and often substantially outperform other systems. However, the generated code causes significant computation redundancy and the compilation process incurs too much overhead to be used online, both due to the inherent symmetry in the structure of the query pattern. In this paper, we propose an optimizing query compiler, named GraphZero, to completely address these limitations through symmetry breaking based on group theory. GraphZero implements three novel techniques. First, its schedule explorer efficiently prunes the schedule space without missing any high-performance schedule. Second, it automatically generates and enforces a set of restrictions to eliminate computation redundancy. Third, it generalizes orientation, a surprisingly effective optimization that was only used for clique patterns, to apply to arbitrary patterns. Evaluation on multiple query patterns shows that GraphZero outperforms two state-of-the-art compilation and non-compilation based systems by up to 40X and 2654X, respectively.
子图匹配是许多应用程序中的一项基本任务,它识别输入图中查询模式的所有嵌入。基于编译的子图匹配系统为所提供的模式生成专门的实现,并且通常显著优于其他系统。然而,由于查询模式结构中固有的对称性,生成的代码会导致显著的计算冗余,编译过程会产生太多的开销,无法在线使用。在本文中,我们提出了一个名为GraphZero的优化查询编译器,通过基于群论的对称性破坏来完全解决这些限制。GraphZero实现了三种新颖的技术。首先,它的时间表资源管理器有效地修剪时间表空间,而不会错过任何高性能的时间表。其次,它自动生成并强制执行一组限制,以消除计算冗余。第三,它将定向推广到任意模式,这是一种仅用于集团模式的令人惊讶的有效优化。对多个查询模式的评估表明,GraphZero的性能分别比两个最先进的基于编译和非编译的系统高出40X和2654X。
{"title":"GraphZero","authors":"Daniel Mawhirter, Sam Reinehr, Connor Holmes, Tongping Liu, Bo Wu","doi":"10.1145/3469379.3469383","DOIUrl":"https://doi.org/10.1145/3469379.3469383","url":null,"abstract":"Subgraph matching is a fundamental task in many applications which identifies all the embeddings of a query pattern in an input graph. Compilation-based subgraph matching systems generate specialized implementations for the provided patterns and often substantially outperform other systems. However, the generated code causes significant computation redundancy and the compilation process incurs too much overhead to be used online, both due to the inherent symmetry in the structure of the query pattern. In this paper, we propose an optimizing query compiler, named GraphZero, to completely address these limitations through symmetry breaking based on group theory. GraphZero implements three novel techniques. First, its schedule explorer efficiently prunes the schedule space without missing any high-performance schedule. Second, it automatically generates and enforces a set of restrictions to eliminate computation redundancy. Third, it generalizes orientation, a surprisingly effective optimization that was only used for clique patterns, to apply to arbitrary patterns. Evaluation on multiple query patterns shows that GraphZero outperforms two state-of-the-art compilation and non-compilation based systems by up to 40X and 2654X, respectively.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"21 - 37"},"PeriodicalIF":0.0,"publicationDate":"2021-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469383","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49370241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Predicting file lifetimes for data placement in multi-tiered storage systems for HPC 预测HPC多层存储系统中数据放置的文件生命周期
Q3 Computer Science Pub Date : 2021-06-02 DOI: 10.1145/3469379.3469392
Luis Thomas, Sebastien Gougeaud, S. Rubini, Philippe Deniel, Jalil Boukhobza
The emergence of Exascale machines in HPC will have the foreseen consequence of putting more pressure on the storage systems in place, not only in terms of capacity but also bandwidth and latency. With limited budget we cannot imagine using only storage class memory, which leads to the use of a heterogeneous tiered storage hierarchy. In order to make the most efficient use of the high performance tier in this storage hierarchy, we need to be able to place user data on the right tier and at the right time. In this paper, we assume a 2-tier storage hierarchy with a high performance tier and a high capacity archival tier. Files are placed on the high performance tier at creation time and moved to capacity tier once their lifetime expires (that is once they are no more accessed). The main contribution of this paper lies in the design of a file lifetime prediction model solely based on its path based on the use of Convolutional Neural Network. Results show that our solution strikes a good trade-off between accuracy and under-estimation. Compared to previous work, our model made it possible to reach an accuracy close to previous work (around 98.60% compared to 98.84%) while reducing the underestimations by almost 10x to reach 2.21% (compared to 21.86%). The reduction in underestimations is crucial as it avoids misplacing files in the capacity tier while they are still in use.
在高性能计算领域,百亿亿级计算机的出现将会给现有的存储系统带来更大的压力,不仅是在容量方面,还有带宽和延迟方面。由于预算有限,我们无法想象只使用存储类内存,这导致使用异构分层存储层次结构。为了最有效地利用此存储层次结构中的高性能层,我们需要能够在正确的时间将用户数据放在正确的层上。在本文中,我们假设一个包含高性能层和高容量归档层的两层存储结构。文件在创建时被放置在高性能层,并在其生命周期到期后(即不再被访问时)移动到容量层。本文的主要贡献在于利用卷积神经网络设计了一个仅基于路径的文件寿命预测模型。结果表明,我们的解决方案在准确性和低估之间取得了很好的平衡。与以前的工作相比,我们的模型可以达到接近以前工作的精度(大约98.60%,而不是98.84%),同时将低估率减少了近10倍,达到2.21%(而不是21.86%)。减少低估是至关重要的,因为它可以避免在容量层中错误地放置仍在使用的文件。
{"title":"Predicting file lifetimes for data placement in multi-tiered storage systems for HPC","authors":"Luis Thomas, Sebastien Gougeaud, S. Rubini, Philippe Deniel, Jalil Boukhobza","doi":"10.1145/3469379.3469392","DOIUrl":"https://doi.org/10.1145/3469379.3469392","url":null,"abstract":"The emergence of Exascale machines in HPC will have the foreseen consequence of putting more pressure on the storage systems in place, not only in terms of capacity but also bandwidth and latency. With limited budget we cannot imagine using only storage class memory, which leads to the use of a heterogeneous tiered storage hierarchy. In order to make the most efficient use of the high performance tier in this storage hierarchy, we need to be able to place user data on the right tier and at the right time. In this paper, we assume a 2-tier storage hierarchy with a high performance tier and a high capacity archival tier. Files are placed on the high performance tier at creation time and moved to capacity tier once their lifetime expires (that is once they are no more accessed). The main contribution of this paper lies in the design of a file lifetime prediction model solely based on its path based on the use of Convolutional Neural Network. Results show that our solution strikes a good trade-off between accuracy and under-estimation. Compared to previous work, our model made it possible to reach an accuracy close to previous work (around 98.60% compared to 98.84%) while reducing the underestimations by almost 10x to reach 2.21% (compared to 21.86%). The reduction in underestimations is crucial as it avoids misplacing files in the capacity tier while they are still in use.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"99 - 107"},"PeriodicalIF":0.0,"publicationDate":"2021-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469392","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64044506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Scalable Graph Neural Network Training 可伸缩图神经网络训练
Q3 Computer Science Pub Date : 2021-05-05 DOI: 10.1145/3469379.3469387
M. Serafini, Hui Guan
Graph Neural Networks (GNNs) are a new and increasingly popular family of deep neural network architectures to perform learning on graphs. Training them efficiently is challenging due to the irregular nature of graph data. The problem becomes even more challenging when scaling to large graphs that exceed the capacity of single devices. Standard approaches to distributed DNN training, like data and model parallelism, do not directly apply to GNNs. Instead, two different approaches have emerged in the literature: whole-graph and sample-based training. In this paper, we review and compare the two approaches. Scalability is challenging with both approaches, but we make a case that research should focus on sample-based training since it is a more promising approach. Finally, we review recent systems supporting sample-based training.
图神经网络(GNN)是一种新的、越来越流行的深度神经网络架构家族,用于对图进行学习。由于图形数据的不规则性,有效地训练它们是具有挑战性的。当扩展到超过单个设备容量的大型图形时,这个问题变得更加具有挑战性。分布式DNN训练的标准方法,如数据和模型并行性,并不直接适用于GNN。相反,文献中出现了两种不同的方法:全图和基于样本的训练。在本文中,我们回顾并比较了这两种方法。这两种方法的可扩展性都很有挑战性,但我们认为研究应该集中在基于样本的训练上,因为这是一种更有前景的方法。最后,我们回顾了最近支持基于样本的训练的系统。
{"title":"Scalable Graph Neural Network Training","authors":"M. Serafini, Hui Guan","doi":"10.1145/3469379.3469387","DOIUrl":"https://doi.org/10.1145/3469379.3469387","url":null,"abstract":"Graph Neural Networks (GNNs) are a new and increasingly popular family of deep neural network architectures to perform learning on graphs. Training them efficiently is challenging due to the irregular nature of graph data. The problem becomes even more challenging when scaling to large graphs that exceed the capacity of single devices. Standard approaches to distributed DNN training, like data and model parallelism, do not directly apply to GNNs. Instead, two different approaches have emerged in the literature: whole-graph and sample-based training. In this paper, we review and compare the two approaches. Scalability is challenging with both approaches, but we make a case that research should focus on sample-based training since it is a more promising approach. Finally, we review recent systems supporting sample-based training.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"68 - 76"},"PeriodicalIF":0.0,"publicationDate":"2021-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469387","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48753432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
EZIOTracer EZIOTracer
Q3 Computer Science Pub Date : 2021-04-26 DOI: 10.1145/3469379.3469391
Mohammed Islam Naas, François Trahay, Alexis Colin, Pierre Olivier, S. Rubini, Frank Singhoff, Jalil Boukhobza
Tracing is a popular method for evaluating, investigating, and modeling the performance of today's storage systems. Tracing has become crucial with the increase in complexity of modern storage applications/systems, that are manipulating an ever-increasing amount of data and are subject to extreme performance requirements. There exists many tracing tools focusing either on the user-level or the kernel-level, however we observe the lack of a unified tracer targeting both levels: this prevents a comprehensive understanding of modern applications' storage performance profiles. In this paper, we present EZIOTracer, a unified I/O tracer for both (Linux) kernel and user spaces, targeting data intensive applications. EZIOTracer is composed of a userland as well as a kernel space tracer, complemented with a trace analysis framework able to merge the output of the two tracers, and in particular to relate user-level events to kernel-level ones, and vice-versa. On the kernel side, EZIOTracer relies on eBPF to offer safe, low-overhead, low memory footprint, and flexible tracing capabilities. We demonstrate using FIO benchmark the ability of EZIOTracer to track down I/O performance issues by relating events recorded at both the kernel and user levels. We show that this can be achieved with a relatively low overhead that ranges from 2% to 26% depending on the I/O intensity.
跟踪是当今存储系统性能评估、调查和建模的一种流行方法。随着现代存储应用程序/系统复杂性的增加,跟踪变得至关重要,这些应用程序/系统要处理不断增加的数据量,并受到极端性能要求的约束。有许多跟踪工具关注用户级或内核级,但是我们发现缺乏针对这两个级别的统一跟踪程序:这阻碍了对现代应用程序存储性能配置文件的全面理解。在本文中,我们提出EZIOTracer,一个统一的(Linux)内核和用户空间的I/O跟踪器,目标是数据密集型应用程序。EZIOTracer由用户空间和内核空间跟踪器组成,并辅以跟踪分析框架,可以合并两个跟踪器的输出,特别是将用户级事件与内核级事件联系起来,反之亦然。在内核方面,EZIOTracer依靠eBPF提供安全、低开销、低内存占用和灵活的跟踪功能。我们使用FIO基准测试演示了EZIOTracer通过在内核和用户级别记录相关事件来跟踪I/O性能问题的能力。我们表明,这可以通过相对较低的开销(根据I/O强度从2%到26%)来实现。
{"title":"EZIOTracer","authors":"Mohammed Islam Naas, François Trahay, Alexis Colin, Pierre Olivier, S. Rubini, Frank Singhoff, Jalil Boukhobza","doi":"10.1145/3469379.3469391","DOIUrl":"https://doi.org/10.1145/3469379.3469391","url":null,"abstract":"Tracing is a popular method for evaluating, investigating, and modeling the performance of today's storage systems. Tracing has become crucial with the increase in complexity of modern storage applications/systems, that are manipulating an ever-increasing amount of data and are subject to extreme performance requirements. There exists many tracing tools focusing either on the user-level or the kernel-level, however we observe the lack of a unified tracer targeting both levels: this prevents a comprehensive understanding of modern applications' storage performance profiles. In this paper, we present EZIOTracer, a unified I/O tracer for both (Linux) kernel and user spaces, targeting data intensive applications. EZIOTracer is composed of a userland as well as a kernel space tracer, complemented with a trace analysis framework able to merge the output of the two tracers, and in particular to relate user-level events to kernel-level ones, and vice-versa. On the kernel side, EZIOTracer relies on eBPF to offer safe, low-overhead, low memory footprint, and flexible tracing capabilities. We demonstrate using FIO benchmark the ability of EZIOTracer to track down I/O performance issues by relating events recorded at both the kernel and user levels. We show that this can be achieved with a relatively low overhead that ranges from 2% to 26% depending on the I/O intensity.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"88 - 98"},"PeriodicalIF":0.0,"publicationDate":"2021-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469391","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48149376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
GeoGraph: A Framework for Graph Processing on Geometric Data 地理:几何数据的图形处理框架
Q3 Computer Science Pub Date : 2021-01-01 DOI: 10.1145/3469379.3469384
Yiqiu Wang, Shangdi Yu, Laxman Dhulipala, Yan Gu, Julian Shun
In many applications of graph processing, the input data is often generated from an underlying geometric point data set. However, existing high-performance graph processing frameworks assume that the input data is given as a graph. Therefore, to use these frameworks, the user must write or use external programs based on computational geometry algorithms to convert their point data set to a graph, which requires more programming effort and can also lead to performance degradation. In this paper, we present our ongoing work on the GeoGraph framework for shared-memory multicore machines, which seamlessly supports routines for parallel geometric graph construction and parallel graph processing within the same environment. GeoGraph supports graph construction based on k-nearest neighbors, Delaunay triangulation, and β-skeleton graphs. It can then pass these generated graphs to over 25 graph algorithms. GeoGraph contains highperformance parallel primitives and algorithms implemented in C++, and includes a Python interface. We present four examples of using GeoGraph, and some experimental results showing good parallel speedups and improvements over the Higra library. We conclude with a vision of future directions for research in bridging graph and geometric data processing.
在图形处理的许多应用中,输入数据通常是从底层的几何点数据集生成的。然而,现有的高性能图处理框架假设输入数据是作为图给出的。因此,要使用这些框架,用户必须编写或使用基于计算几何算法的外部程序来将其点数据集转换为图形,这需要更多的编程工作,还可能导致性能下降。在本文中,我们介绍了我们正在进行的用于共享内存多核机器的地理框架的工作,该框架无缝地支持在同一环境中并行几何图形构建和并行图形处理的例程。GeoGraph支持基于k近邻、Delaunay三角剖分和β-骨架图的图构建。然后,它可以将这些生成的图形传递给超过25个图形算法。GeoGraph包含用c++实现的高性能并行原语和算法,并包含一个Python接口。我们给出了使用GeoGraph的四个示例,以及一些实验结果,显示了相对于Higra库的良好并行加速和改进。最后,我们展望了桥接图和几何数据处理的未来研究方向。
{"title":"GeoGraph: A Framework for Graph Processing on Geometric Data","authors":"Yiqiu Wang, Shangdi Yu, Laxman Dhulipala, Yan Gu, Julian Shun","doi":"10.1145/3469379.3469384","DOIUrl":"https://doi.org/10.1145/3469379.3469384","url":null,"abstract":"In many applications of graph processing, the input data is often generated from an underlying geometric point data set. However, existing high-performance graph processing frameworks assume that the input data is given as a graph. Therefore, to use these frameworks, the user must write or use external programs based on computational geometry algorithms to convert their point data set to a graph, which requires more programming effort and can also lead to performance degradation. In this paper, we present our ongoing work on the GeoGraph framework for shared-memory multicore machines, which seamlessly supports routines for parallel geometric graph construction and parallel graph processing within the same environment. GeoGraph supports graph construction based on k-nearest neighbors, Delaunay triangulation, and β-skeleton graphs. It can then pass these generated graphs to over 25 graph algorithms. GeoGraph contains highperformance parallel primitives and algorithms implemented in C++, and includes a Python interface. We present four examples of using GeoGraph, and some experimental results showing good parallel speedups and improvements over the Higra library. We conclude with a vision of future directions for research in bridging graph and geometric data processing.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"38-46"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469384","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64044488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Wukong: A Distributed Framework for Fast and Concurrent Graph Querying Wukong:一个用于快速并发图查询的分布式框架
Q3 Computer Science Pub Date : 2021-01-01 DOI: 10.1145/3469379.3469388
Rong Chen, Haibo Chen
{"title":"Wukong: A Distributed Framework for Fast and Concurrent Graph Querying","authors":"Rong Chen, Haibo Chen","doi":"10.1145/3469379.3469388","DOIUrl":"https://doi.org/10.1145/3469379.3469388","url":null,"abstract":"","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"77-83"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469388","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64044500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
VRGQ: Evaluating a Stream of Iterative Graph Queries via Value Reuse VRGQ:通过值重用评估迭代图查询流
Q3 Computer Science Pub Date : 2021-01-01 DOI: 10.1145/3469379.3469382
Xiaolin Jiang, Chengshuo Xu, Rajiv Gupta
While much of the research on graph analytics over large power-law graphs has focused on developing algorithms for evaluating a single global graph query, in practice we may be faced with a stream of queries. We observe that, due to their global nature, vertex specific graph queries present an opportunity for sharing work across queries. To take advantage of this opportunity, we have developed the VRGQ framework that accelerates the evaluation of a stream of queries via coarsegrained value reuse. In particular, the results of queries for a small set of source vertices are reused to speedup all future queries. We present a two step algorithm that in its first step initializes the query result based upon value reuse and then in the second step iteratively evaluates the query to convergence. The reused results for a small number of queries are held in a reuse table. Our experiments with best reuse configurations on four power law graphs and thousands of graph queries of five kinds yielded average speedups of 143×, 13.2×, 6.89×, 1.43×, and 1.18×.
虽然对大型幂律图的图分析的大部分研究都集中在开发评估单个全局图查询的算法上,但在实践中,我们可能会面临一系列查询。我们观察到,由于它们的全局性质,特定于顶点的图查询提供了跨查询共享工作的机会。为了利用这个机会,我们开发了VRGQ框架,它通过粗粒度的值重用来加速查询流的计算。特别是,对一小部分源顶点的查询结果将被重用,以加快未来所有查询的速度。提出了一种两步算法,第一步基于值重用初始化查询结果,第二步迭代计算查询收敛。少数查询的重用结果保存在重用表中。我们在四种幂律图和五种数千种图查询上使用最佳重用配置进行的实验产生的平均速度分别为143×、13.2×、6.89×、1.43×和1.18×。
{"title":"VRGQ: Evaluating a Stream of Iterative Graph Queries via Value Reuse","authors":"Xiaolin Jiang, Chengshuo Xu, Rajiv Gupta","doi":"10.1145/3469379.3469382","DOIUrl":"https://doi.org/10.1145/3469379.3469382","url":null,"abstract":"While much of the research on graph analytics over large power-law graphs has focused on developing algorithms for evaluating a single global graph query, in practice we may be faced with a stream of queries. We observe that, due to their global nature, vertex specific graph queries present an opportunity for sharing work across queries. To take advantage of this opportunity, we have developed the VRGQ framework that accelerates the evaluation of a stream of queries via coarsegrained value reuse. In particular, the results of queries for a small set of source vertices are reused to speedup all future queries. We present a two step algorithm that in its first step initializes the query result based upon value reuse and then in the second step iteratively evaluates the query to convergence. The reused results for a small number of queries are held in a reuse table. Our experiments with best reuse configurations on four power law graphs and thousands of graph queries of five kinds yielded average speedups of 143×, 13.2×, 6.89×, 1.43×, and 1.18×.","PeriodicalId":38935,"journal":{"name":"Operating Systems Review (ACM)","volume":"55 1","pages":"11-20"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3469379.3469382","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64044485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Operating Systems Review (ACM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1