Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers最新文献

英文中文

Asynchronous Abstract Machines: Anti-noise System Software for Many-core Processors 异步抽象机:多核处理器的抗噪声系统软件

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

Pub Date : 2019-06-17 DOI: 10.1145/3322789.3328744

Sebastian Maier, Timo Hönig, Peter Wägemann, Wolfgang Schröder-Preikschat

Today's systems offer an increasing number of processor cores, however, the chance to operate them efficiently by dedicating cores to specific tasks is often missed. Instead, mixed workloads are processed by each core which leads to system noise (i.e., interferences, scheduling overheads) and yields subpar performance, only. We therefore propose a system design based on Asynchronous Abstract Machines (AAMs). An AAM features a light-weight scheduler and is dedicated to a specific group of tasks with common characteristics (i.e., shared code and data). It offers an asynchronous, task-based interface for efficient interaction between AAMs. Just like applications are built from AAMs, even the OS itself consists of AAMs that are interfaced by applications via asynchronous messages instead of synchronous system calls. A dedicated OS component, which is aware of all AAMs in the system, is responsible for dynamic and exclusive allocation of cores to AAMs depending on their current workload. Thus, cores rarely switch between heterogeneous workloads of different AAMs. And, at the same time, frequent switches between homogeneous tasks become fast, local operations of an AAM, which do not involve the OS kernel. In this paper, we describe shortcomings of existing operating systems, our new system design concept, and present evaluation results of our prototype implementation.

今天的系统提供了越来越多的处理器核心，然而，通过将核心专用于特定任务来有效地运行它们的机会经常被错过。相反，混合工作负载由每个核心处理，这只会导致系统噪声(即干扰、调度开销)并产生低于标准的性能。因此，我们提出了一种基于异步抽象机的系统设计。AAM的特点是轻量级调度器，专用于具有共同特征(即共享代码和数据)的特定任务组。它为aam之间的高效交互提供了一个异步的、基于任务的接口。就像应用程序是由aam构建的一样，甚至操作系统本身也由aam组成，应用程序通过异步消息而不是同步系统调用与aam进行接口。一个专用的操作系统组件(它知道系统中的所有aam)负责根据aam当前的工作负载动态地、排他地将内核分配给aam。因此，内核很少在不同aam的异构工作负载之间切换。与此同时，同构任务之间的频繁切换成为AAM的快速本地操作，不涉及OS内核。在本文中，我们描述了现有操作系统的缺点，我们的新系统设计概念，以及我们的原型实现的评估结果。

{"title":"Asynchronous Abstract Machines: Anti-noise System Software for Many-core Processors","authors":"Sebastian Maier, Timo Hönig, Peter Wägemann, Wolfgang Schröder-Preikschat","doi":"10.1145/3322789.3328744","DOIUrl":"https://doi.org/10.1145/3322789.3328744","url":null,"abstract":"Today's systems offer an increasing number of processor cores, however, the chance to operate them efficiently by dedicating cores to specific tasks is often missed. Instead, mixed workloads are processed by each core which leads to system noise (i.e., interferences, scheduling overheads) and yields subpar performance, only. We therefore propose a system design based on Asynchronous Abstract Machines (AAMs). An AAM features a light-weight scheduler and is dedicated to a specific group of tasks with common characteristics (i.e., shared code and data). It offers an asynchronous, task-based interface for efficient interaction between AAMs. Just like applications are built from AAMs, even the OS itself consists of AAMs that are interfaced by applications via asynchronous messages instead of synchronous system calls. A dedicated OS component, which is aware of all AAMs in the system, is responsible for dynamic and exclusive allocation of cores to AAMs depending on their current workload. Thus, cores rarely switch between heterogeneous workloads of different AAMs. And, at the same time, frequent switches between homogeneous tasks become fast, local operations of an AAM, which do not involve the OS kernel. In this paper, we describe shortcomings of existing operating systems, our new system design concept, and present evaluation results of our prototype implementation.","PeriodicalId":365438,"journal":{"name":"Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126057180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

HPC System Software Enhanced by Source Code Analysis 源代码分析增强的HPC系统软件

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

Pub Date : 2019-06-17 DOI: 10.1145/3322789.3328741

Jidong Zhai

Building efficient and scalable system software, especially performance analysis and monitoring, for large-scale systems, is increasingly important both for the developers of parallel applications and the designers of next-generation HPC systems. However, conventional performance tools suffer from significant time/space overhead due to the ever-increasing problem size and system scale. For instance, memory monitoring is of critical use in understanding applications and evaluating systems. Due to the dynamic nature in programs' memory accesses, common practice today leaves large amounts of address examination and data recording at runtime, at the cost of substantial performance overhead. On the other hand, the cost of source code analysis is independent of the problem size and system scale, making it very appealing for large-scale performance analysis. Inspired by this observation, we have designed a series of light-weight system software for HPC systems, such as a memory access monitoring tool, a performance variance detection tool , and a communication trace compression tool. In this talk, I will share our experience on building these tools through combining static analysis and runtime analysis and also point out main challenges in this direction.

构建高效、可扩展的系统软件，特别是大型系统的性能分析和监控，对于并行应用程序的开发人员和下一代高性能计算系统的设计人员来说变得越来越重要。然而，由于不断增加的问题大小和系统规模，传统的性能工具承受着巨大的时间/空间开销。例如，内存监控在理解应用程序和评估系统方面至关重要。由于程序内存访问的动态性，目前的常见做法是在运行时留下大量的地址检查和数据记录，这是以大量的性能开销为代价的。另一方面，源代码分析的成本与问题大小和系统规模无关，这使得它对大规模性能分析非常有吸引力。受此启发，我们为HPC系统设计了一系列轻量级系统软件，如内存访问监控工具、性能差异检测工具和通信跟踪压缩工具。在这次演讲中，我将分享我们通过结合静态分析和运行时分析来构建这些工具的经验，并指出在这个方向上的主要挑战。

{"title":"HPC System Software Enhanced by Source Code Analysis","authors":"Jidong Zhai","doi":"10.1145/3322789.3328741","DOIUrl":"https://doi.org/10.1145/3322789.3328741","url":null,"abstract":"Building efficient and scalable system software, especially performance analysis and monitoring, for large-scale systems, is increasingly important both for the developers of parallel applications and the designers of next-generation HPC systems. However, conventional performance tools suffer from significant time/space overhead due to the ever-increasing problem size and system scale. For instance, memory monitoring is of critical use in understanding applications and evaluating systems. Due to the dynamic nature in programs' memory accesses, common practice today leaves large amounts of address examination and data recording at runtime, at the cost of substantial performance overhead. On the other hand, the cost of source code analysis is independent of the problem size and system scale, making it very appealing for large-scale performance analysis. Inspired by this observation, we have designed a series of light-weight system software for HPC systems, such as a memory access monitoring tool, a performance variance detection tool , and a communication trace compression tool. In this talk, I will share our experience on building these tools through combining static analysis and runtime analysis and also point out main challenges in this direction.","PeriodicalId":365438,"journal":{"name":"Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126107855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Effect of System Utilization on Application Performance Variability 系统利用率对应用程序性能可变性的影响

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

Pub Date : 2019-06-17 DOI: 10.1145/3322789.3328743

Boyang Li, Sudheer Chunduri, K. Harms, Yuping Fan, Z. Lan

Application performance variability caused by network contention is a major issue on dragonfly based systems. This work-in-progress study makes two contributions. First, we analyze real workload logs and conduct application experiments on the production system Theta at Argonne to evaluate application performance variability. We find a strong correlation between system utilization and performance variability where a high system utilization (e.g., above 95%) can cause up to 21% degradation in application performance. Next, driven by this key finding, we investigate a scheduling policy to mitigate workload interference by leveraging the fact that production systems often exhibit diurnal utilization behavior and not all users are in a hurry for job completion. Preliminary results show that this scheduling design is capable of improving system productivity (measured by scheduling makespan) as well as improving user-level scheduling metrics such as user wait time and job slowdown.

在基于蜻蜓的系统中，由网络争用引起的应用程序性能变化是一个主要问题。这项正在进行的研究有两个贡献。首先，我们分析真实的工作负载日志，并在Argonne的生产系统Theta上进行应用程序实验，以评估应用程序性能的可变性。我们发现系统利用率和性能可变性之间存在很强的相关性，其中高系统利用率(例如，高于95%)可能导致应用程序性能下降高达21%。接下来，在这一关键发现的推动下，我们研究了一种调度策略，通过利用生产系统经常表现出每日使用行为以及并非所有用户都急于完成任务的事实来减轻工作负载干扰。初步结果表明，这种调度设计能够提高系统生产率(通过调度makespan来衡量)，并改善用户级调度指标，如用户等待时间和作业速度。

引用次数: 15

Towards a Practical Ecosystem of Specialized OS Kernels 迈向专业操作系统内核的实用生态系统

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

Pub Date : 2019-06-17 DOI: 10.1145/3322789.3328742

Conghao Liu, Kyle C. Hale

Specialized operating systems have enjoyed a recent revival driven both by a pressing need to rethink the system software stack in several domains and by the convenience and flexibility that on-demand infrastructure and virtual execution environments offer. Several barriers exist which curtail the widespread adoption of such highly specialized systems, but perhaps the most consequential of them is that these systems are simply difficult to use. In this paper we discuss the challenges faced by specialized OSes, both for HPC and more broadly, and argue that what is needed to make them practically useful is a reasonable development and deployment model that will form the foundation for a kernel ecosystem that allows intrepid developers to discover, experiment with, contribute to, and write programs for available kernel frameworks while safely ignoring complexities such as provisioning, deployment, cross-compilation, and interface compatibility. We argue that such an ecosystem would allow more developers of highly tuned applications to reap the performance benefits of specialized kernels.

由于迫切需要重新考虑几个领域的系统软件堆栈，以及按需基础设施和虚拟执行环境提供的便利性和灵活性，专用操作系统最近得到了复兴。存在一些阻碍这种高度专门化系统广泛采用的障碍，但其中最重要的可能是这些系统很难使用。在本文中，我们讨论了专门的操作系统所面临的挑战，无论是对于HPC还是更广泛的领域，并认为使它们实际有用所需要的是一个合理的开发和部署模型，它将形成一个内核生态系统的基础，使勇敢的开发人员能够发现、实验、贡献和编写可用内核框架的程序，同时安全地忽略诸如配置、部署、交叉编译等复杂性。以及接口兼容性。我们认为，这样的生态系统将允许更多高度调优应用程序的开发人员获得专用内核的性能优势。

引用次数: 2

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers 第九届超级计算机运行时和操作系统国际研讨会论文集

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

Pub Date : 1900-01-01 DOI: 10.1145/3322789

引用次数: 0

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀