{"title":"Session details: Technical Session I","authors":"A. Sim","doi":"10.1145/3341229","DOIUrl":"https://doi.org/10.1145/3341229","url":null,"abstract":"","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131207226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Performance and Security Challenges in Science Workflows. D. Ghosal. DOI: 10.1145/3322798.3329260
Scientific workflows are complex, often generating large amounts of data that must be processed in multiple stages. The data, often generated at remote locations, must be transferred from the source and between distributed HPC nodes interconnected by high-speed networks that also carry other background traffic. Increasingly, many of these scientific workflows require processing to be completed within a deadline, which in turn imposes a deadline on the network data transfer. A recent example of a deadline-driven workflow occurred when the LIGO and Virgo detectors observed a gravitational-wave signal associated with the merger of two neutron stars. The merger, which produced a kilonova, occurred in a galaxy 130 million light-years from Earth in the southern constellation of Hydra. The data from this initial observation had to be processed in a timely manner and sent to astronomers around the world so that they could aim their instruments at the right section of the sky to image the source of the signal.
{"title":"Performance and Security Challenges in Science Workflows","authors":"D. Ghosal","doi":"10.1145/3322798.3329260","DOIUrl":"https://doi.org/10.1145/3322798.3329260","url":null,"abstract":"Scientific workflows are complex, often generating large amounts of data that need to be processed in multiple stages. The data often generated at remote locations must be transferred from the source and between the distributed HPC nodes interconnected by high-speed networks that carry other background traffic. Increasingly, many of these scientific workflows require processing to be completed within a deadline, which, in turn, imposes deadline on the network data transfer. A recent example of a deadline-driven workflow occurred when LIGO and Virgo detectors observed a gravitational wave signal associated with the merger of two neutron stars. The merger, known as a kilonova, occurred in a galaxy 130 million light-years from Earth in the southern constellation of Hydra. The data from this initial observation had to be processed in a timely manner and sent to astronomers around the world so that they could aim their instruments to the right section of the sky to image the source of the signal.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133677096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Software Defined Network Design for Analyzing Streaming Data in Transit. Y. Liu, D. Katramatos. DOI: 10.1145/3322798.3329257
Year after year, network traffic keeps reaching new highs as unprecedented volumes of data flow between an ever-increasing number of sources and destinations. As part of the Analysis on the Wire project, we aim to develop a network-centric approach to streaming data processing that facilitates scientific data analysis and reduces the overhead of sending big data to a data center. This work discusses our design for using programmable network devices to augment network capabilities for streaming data processing, including efforts in progress at Brookhaven National Laboratory and the challenges faced to date.
Real-time Multi-process Tracing Decoder Architecture. Youngsoo Kim, Jonghyun Kim, Ikkyun Kim, Hyunchul Kim. DOI: 10.1145/3322798.3329253
Tracing is a form of logging that records the execution information of programs. Because a large amount of data must be created and decoded in real time, tracers composed mainly of dedicated hardware are widely used. Intel® PT records all information related to software execution from each hardware thread. Once execution of the traced software completes, the exact program flow can be reconstructed from the recorded trace data. A hardware trace program can be integrated into the operating system, but on Windows the kernel source is not disclosed, so tight integration cannot be achieved. Moreover, in a Windows environment the tracer can follow only a single process and provides no way to trace multiple process streams. In this paper, we propose a way of extending the PT trace program to overcome this shortcoming by supporting multi-process stream tracing in the Windows environment.
{"title":"Real-time Multi-process Tracing Decoder Architecture","authors":"Youngsoo Kim, Jonghyun Kim, Ikkyun Kim, Hyunchul Kim","doi":"10.1145/3322798.3329253","DOIUrl":"https://doi.org/10.1145/3322798.3329253","url":null,"abstract":"Tracing is a form of logging by recording the execution information of programs. Since a large amount of data must be created and decoded in real time, a tracer composed mainly of dedicated hardware is widely used. Intel® PT records all information related to software execution from each hardware thread. When the execution of the corresponding software is completed, the accurate program flow can be indicated through the recorded trace data. The hardware trace program can be integrated into the operating system, but in the case of the Windows system, the kernel is not disclosed so tight integration is not achieved. Also, in a Windows environment, it can only trace a single process and do not provide a way to trace multiple process streams. In this paper, we propose a way of extending the PT trace program in order to overcome this shortcoming by supporting multi-process stream tracing in Windows environment.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125390637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Time Series Analysis for Efficient Sample Transfers. Hemanta Sapkota, Bahadir A. Pehlivan, Engin Arslan. DOI: 10.1145/3322798.3329256
Real-time transfer optimization approaches offer promising solutions, as they can discover the optimal transfer configuration at runtime without requiring upfront work or making assumptions about the underlying system architecture. On the other hand, existing implementations suffer from slow convergence because they run many sample transfers with suboptimal configurations. In this work, we evaluate time-series models to minimize the impact of sample transfers with suboptimal configurations by shortening the transfer duration without degrading accuracy. Results gathered in various networks with a rich set of transfer configurations indicate that, in most cases, an autoregressive model can accurately estimate sample transfer throughput in less than 5 seconds, up to a 4x improvement over the state-of-the-art solution. We also found that while most common transfer applications report transfer throughput at most once per second, decreasing the reporting interval is key to further reducing the impact of sample transfers by determining their performance quickly.
{"title":"Time Series Analysis for Efficient Sample Transfers","authors":"Hemanta Sapkota, Bahadir A. Pehlivan, Engin Arslan","doi":"10.1145/3322798.3329256","DOIUrl":"https://doi.org/10.1145/3322798.3329256","url":null,"abstract":"Real-time transfer optimization approaches offer promising solutions as they can discover optimal transfer configuration in the runtime without requiring an upfront work or making assumptions about underlying system architectures. On the other hand, existing implementations suffer from slow convergence speed due to running many sample transfers with suboptimal configurations. In this work, we evaluate time-series models to minimize the impact of sample transfers with suboptimal configurations by shortening the transfer duration without degrading the accuracy. The results gathered in various networks with rich set of transfer configurations indicate that, in most cases, Autoregressive model can accurately estimate sample transfer throughput in less than 5 seconds which is up-to 4x improvement over the state-of-the-art solution. We also realized that while the most common transfer applications report transfer throughput at most once a second, decreasing the reporting interval is the key to further reduce the impact of sample transfers by quickly determining their performance.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"390 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116523647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Technical Session II","authors":"A. Lazar","doi":"10.1145/3341230","DOIUrl":"https://doi.org/10.1145/3341230","url":null,"abstract":"","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127670279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Understanding Parallel I/O Performance Trends Under Various HPC Configurations. Hanul Sung, Jiwoo Bang, A. Sim, Kesheng Wu, Hyeonsang Eom. DOI: 10.1145/3322798.3329258
In high-performance computing (HPC) environments, an appropriate amount of hardware resources must be used to achieve the best parallel I/O performance. For this reason, HPC users are given tunable parameters that change the HPC configuration and control the amount of resources used. However, some users are not well aware of the relationship between parallel I/O performance and HPC configuration, and thus fail to make use of these parameters. Even users who know the relationship have to run an application under every parameter combination to find the best-performing setting, because each application shows different performance trends under different configurations. This paper analyzes I/O performance trends so that HPC users can find the best configurations with minimal effort. We divide parallel I/O into independent and collective I/O and measure throughput under various configurations using a synthetic workload, the IOR benchmark. Through this analysis, we show that parallel I/O performance is determined by the trade-off between the gain from the parallelism of additional OSTs (Lustre object storage targets) and the loss from contention for shared resources, and that this trend differs depending on the I/O characteristic. Our evaluation shows that HPC applications exhibit performance trends similar to those in our analysis.
{"title":"Understanding Parallel I/O Performance Trends Under Various HPC Configurations","authors":"Hanul Sung, Jiwoo Bang, A. Sim, Kesheng Wu, Hyeonsang Eom","doi":"10.1145/3322798.3329258","DOIUrl":"https://doi.org/10.1145/3322798.3329258","url":null,"abstract":"In high-performance computing (HPC) environments, an appropriate amount of hardware resources must be used for the best parallel I/O performance. For this reason, HPC users are provided with tunable parameters to change the HPC configurations, which control the amounts of resources. However, some users are not well aware of a relationship between the parallel I/O performance and the HPC configuration, and they thus fail to utilize these parameters. Even if users who know the relationship, they have to run an application under every parameter combination to find the setting for the best performance, because each application shows different performance trends under different configurations. The paper shows the result of analyzing the I/O performance trends for HPC users to find the best configurations with minimal efforts. We divide the parallel I/O characteristic into independent and collective I/Os and measure the I/O throughput under various configurations by using synthetic workload, IOR benchmark. Through the analysis, we have figured out that the parallel I/O performance is determined by the trade-off between the gain from the parallelism of increased OSTs and the loss from the contention for shared resources. Also, this performance trend differs depending on the I/O characteristic. Our evaluation shows that HPC applications also have similar performance trends as our analysis.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124799861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Performance Prediction for Data Transfers in LCLS Workflow. Mengtian Jin, Youkow Homma, A. Sim, W. Kroeger, Kesheng Wu. DOI: 10.1145/3322798.3329254
In this work, we study the use of decision-tree-based models to predict transfer rates in different parts of the data pipeline that sends experiment data from the Linac Coherent Light Source (LCLS) at SLAC National Accelerator Laboratory (SLAC) to the National Energy Research Scientific Computing Center (NERSC). The system monitoring the data pipeline collects a number of characteristics, such as the file size, source file system, and start time, all of which are known at the start of a file transfer. However, these static variables do not capture dynamic information such as the current state of the network. We explore a number of ways to capture the state of the network and other dynamic information, and find that adding these dynamic features to the static ones improves transfer performance predictions by up to 10-15%. We additionally study several well-known decision-tree-based models and find that gradient tree boosting performs best overall.
{"title":"Performance Prediction for Data Transfers in LCLS Workflow","authors":"Mengtian Jin, Youkow Homma, A. Sim, W. Kroeger, Kesheng Wu","doi":"10.1145/3322798.3329254","DOIUrl":"https://doi.org/10.1145/3322798.3329254","url":null,"abstract":"In this work, we study the use of decision tree-based models to predict the transfer rates in different parts of the data pipeline that sends experiment data from Linac Coherent Light Source (LCLS) at SLAC National Accelerator Laboratory (SLAC) to National Energy Research Scientific Computing Center (NERSC). The system monitoring the data pipeline collects a number of characteristics such as the file size, source file system, start time and so on, all of which are known at the start of the file transfer. However, these static variables do not capture the dynamic information such as current state of the networking system. In this work, we explore a number of different ways to capture the state of the network and other dynamic information. We find that in addition to using static features, using these dynamic features can improve the transfer performance predictions by up to 10-15%. We additionally study a couple of different well-known decision-tree based models and find that Gradient-Tree Boosting algorithm performs better overall.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126720202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automatic Detection of Network Traffic Anomalies and Changes. Astha Syal, A. Lazar, Jinoh Kim, A. Sim, Kesheng Wu. DOI: 10.1145/3322798.3329255
Accurately predicting network behavior benefits TCP congestion control and can help improve routing, network resource allocation, and network design. The task is challenging because many factors affect network traffic, such as the number of network sessions and synthetic reordering, and there are many ways to measure network state, such as the number of retransmissions per flow and packet duplication. For this work, we use a set of passive TCP flow measurements collected on multiple data transfer nodes (DTNs) at a major computing center. To assist network operations, we propose detecting abnormally slow network transfers in real time. The proposed system breaks the network monitoring logs into fixed-size chunks and employs a state-of-the-art classifier to identify the slow time windows. The method is validated on large real-world datasets collected from several DTNs and quickly detects long intervals of low-performing network transfers that require attention from network engineers.
{"title":"Automatic Detection of Network Traffic Anomalies and Changes","authors":"Astha Syal, A. Lazar, Jinoh Kim, A. Sim, Kesheng Wu","doi":"10.1145/3322798.3329255","DOIUrl":"https://doi.org/10.1145/3322798.3329255","url":null,"abstract":"Accurately predicting network behavior is beneficial for TCP congestion control, and can help improve routing, allocating network resources, and optimizing network designs.This task is challenging because many factors could affect network traffic, such as the number of network sessions and synthetic reordering. There are also many ways to measure the network state, such as the number of retransmissions per flow and packet duplication. For this work, we use a set of passive TCP flow measurements collected at a major computer center on multiple data transfer nodes (DTN). To assist the operations of the computer network, we propose to detect abnormally slow network transfers in real-time. The proposed system breaks the network monitoring logs into fixed-size chunks and employs a state of art classifier to identify the slow time windows. This method will be validated on real large datasets collected from several DTNs. The proposed method is able to generate models to quickly detect large intervals of low performing network transfers, which require attention from network engineers.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"419 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132554581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Similarity-based Compression with Multidimensional Pattern Matching. Olivia Del Guercio, Rafael Orozco, A. Sim, Kesheng Wu. DOI: 10.1145/3322798.3329252
Sensors typically record their measurements with more precision than the accuracy of the sensing technique, so experimental and observational data often contain noise that appears random and cannot easily be compressed. This noise increases storage requirements as well as computation time for analyses. In this work, we describe a line of research to develop data reduction techniques that preserve key features while reducing storage requirements. Our core observation is that the noise in such cases can be characterized by a small number of patterns based on statistical similarity. In earlier tests, this approach reduced storage requirements by over 100-fold for one-dimensional sequences. Here, we explore a set of different similarity measures for multidimensional sequences. In tests with standard quality measures such as the Peak Signal-to-Noise Ratio (PSNR), the new compression methods reduce storage requirements over 100-fold while maintaining relatively low PSNR error. We therefore believe this is an effective strategy for constructing data reduction techniques.
{"title":"Similarity-based Compression with Multidimensional Pattern Matching","authors":"Olivia Del Guercio, Rafael Orozco, A. Sim, Kesheng Wu","doi":"10.1145/3322798.3329252","DOIUrl":"https://doi.org/10.1145/3322798.3329252","url":null,"abstract":"Sensors typically record their measurements using more precision than the accuracy of the sensing techniques. Thus, experimental and observational data often contain noise that appears random and cannot be easily compressed. This noise increases storage requirement as well as computation time for analyses. In this work, we describe a line of research to develop data reduction techniques that preserve the key features while reducing the storage requirement. Our core observation is that the noise in such cases could be characterized by a small number of patterns based on statistical similarity. In earlier tests, this approach was shown to reduce the storage requirement by over 100-fold for one-dimensional sequences. In this work, we explore a set of different similarity measures for multidimensional sequences. During our tests with standard quality measures such as Peak Signal to Noise Ratio (PSNR), we observe that the new compression methods reduce the storage requirements over 100- fold while maintaining relatively low errors in PSNR. Thus, we believe that this is an effective strategy to construct data reduction techniques.","PeriodicalId":365009,"journal":{"name":"Proceedings of the ACM Workshop on Systems and Network Telemetry and Analytics","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130238167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}