首页 > 最新文献

Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science最新文献

英文 中文
Identifying Anomalous File Transfer Events in LCLS Workflow 识别LCLS工作流中的异常文件传输事件
Mengying Yang, Xinyu Liu, W. Kroeger, A. Sim, Kesheng Wu
This short paper reports our on-going work to study and identify anomalous file transfers for a large scientific facility known as Linac Coherent Light Source (LCLS). We identify the anomalies based on the statistical models extracted from the recent observations of the file transfer events. This data-driven approach could be used in different use cases to identify unusual events. More specifically, we propose two different identification strategies based on the different properties of the observed file transfers. Because these methods capture key aspects of the two different segments of the data transfer pipeline, they are able to make accurate identifications for their respective workflow components. The current anomaly detection algorithms only make use of the file sizes as the primary feature. We anticipate that integrating more information will improve the prediction accuracy. Additional work is planned to validate the identification algorithms on more data and in different use cases.
这篇简短的论文报告了我们正在进行的研究和识别大型科学设施的异常文件传输,称为直线加速器相干光源(LCLS)。我们根据从最近的文件传输事件观察中提取的统计模型来识别异常。这种数据驱动的方法可以在不同的用例中用于识别异常事件。更具体地说,我们根据观察到的文件传输的不同属性提出了两种不同的识别策略。由于这些方法捕获数据传输管道的两个不同部分的关键方面,因此它们能够为各自的工作流组件做出准确的标识。目前的异常检测算法仅将文件大小作为主要特征。我们预计,整合更多的信息将提高预测的准确性。计划在更多数据和不同用例中验证识别算法。
{"title":"Identifying Anomalous File Transfer Events in LCLS Workflow","authors":"Mengying Yang, Xinyu Liu, W. Kroeger, A. Sim, Kesheng Wu","doi":"10.1145/3217197.3217203","DOIUrl":"https://doi.org/10.1145/3217197.3217203","url":null,"abstract":"This short paper reports our on-going work to study and identify anomalous file transfers for a large scientific facility known as Linac Coherent Light Source (LCLS). We identify the anomalies based on the statistical models extracted from the recent observations of the file transfer events. This data-driven approach could be used in different use cases to identify unusual events. More specifically, we propose two different identification strategies based on the different properties of the observed file transfers. Because these methods capture key aspects of the two different segments of the data transfer pipeline, they are able to make accurate identifications for their respective workflow components. The current anomaly detection algorithms only make use of the file sizes as the primary feature. We anticipate that integrating more information will improve the prediction accuracy. Additional work is planned to validate the identification algorithms on more data and in different use cases.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115493590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Model Driven Intelligent Orchestration Approach to Service Automation in Large Distributed Infrastructures 大型分布式基础设施中服务自动化的模型驱动智能编排方法
Xi Yang, T. Lehman, R. Kettimuthu, L. Winkler, Eun-Sung Jung
Today's scientific computing applications and workflows operate on heterogeneous and vastly distributed infrastructures. Traditional human-in-the-loop service engineering approach met its insurmountable challenge in dealing with these very complex and diverse networked systems, including conventional and software defined networks, compute, storage, clouds and instruments. Orchestration is the key to integrate and coordinate the networked multi-services and automate end-to-end workflows. In this work, we present a model driven intelligent orchestration approach to this end-to-end automation, which is built upon a semantic modeling solution that supports the full stack of service integration, orchestration, abstraction, and intent and policy representation. We also present the design of a real-world orchestrator called StackV that is able to accommodate highly complex application scenarios such as Software Defined ScienceDMZ (SD-SDMZ) and Hybrid Cloud Inter-Networking (HCIN) by implementing this approach.
今天的科学计算应用程序和工作流在异构和广泛分布的基础设施上运行。传统的人在环服务工程方法在处理这些非常复杂和多样化的网络系统时遇到了难以克服的挑战,包括传统的和软件定义的网络、计算、存储、云和仪器。编排是集成和协调网络多服务以及自动化端到端工作流的关键。在这项工作中,我们提出了一种模型驱动的智能编排方法来实现这种端到端自动化,它建立在一个语义建模解决方案之上,该解决方案支持服务集成、编排、抽象、意图和策略表示的完整堆栈。我们还介绍了一个名为StackV的现实世界编排器的设计,通过实现这种方法,它能够适应高度复杂的应用场景,如软件定义科学mz (SD-SDMZ)和混合云互连网络(HCIN)。
{"title":"A Model Driven Intelligent Orchestration Approach to Service Automation in Large Distributed Infrastructures","authors":"Xi Yang, T. Lehman, R. Kettimuthu, L. Winkler, Eun-Sung Jung","doi":"10.1145/3217197.3217207","DOIUrl":"https://doi.org/10.1145/3217197.3217207","url":null,"abstract":"Today's scientific computing applications and workflows operate on heterogeneous and vastly distributed infrastructures. Traditional human-in-the-loop service engineering approach met its insurmountable challenge in dealing with these very complex and diverse networked systems, including conventional and software defined networks, compute, storage, clouds and instruments. Orchestration is the key to integrate and coordinate the networked multi-services and automate end-to-end workflows. In this work, we present a model driven intelligent orchestration approach to this end-to-end automation, which is built upon a semantic modeling solution that supports the full stack of service integration, orchestration, abstraction, and intent and policy representation. We also present the design of a real-world orchestrator called StackV that is able to accommodate highly complex application scenarios such as Software Defined ScienceDMZ (SD-SDMZ) and Hybrid Cloud Inter-Networking (HCIN) by implementing this approach.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115952619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a control-theory approach for minimizing unused grid resources 一种最小化未使用网格资源的控制论方法
Emma Stahl, A. Yabo, Olivier Richard, B. Bzeznik, B. Robu, É. Rutten
HPC systems are facing more and more variability in their behavior, related to e.g., performance and power consumption, and the fact that they are less predictable requires more runtime management. This can be done in an Autonomic Management feedback loop, in response to monitored information in the systems, by analysis of this data and utilization of the results in order to activate appropriate system-level or application-level feedback mechanisms (e.g., informing schedulers, down-clocking CPUs). One such problem is found in the context of CiGri, a simple, lightweight, scalable and fault tolerant grid system which exploits the unused resources of a set of computing clusters. Computing power left over by the execution of a main HPC application scheduling is used to execute smaller jobs, which are injected as much as the global system allows. This paper presents first results addressing the problem of automated resource management in an HPC infrastructure, using techniques from Control Theory to design a controller that maximizes cluster utilization while avoiding overload. We put in place a mechanism for feedback (Proportional Integral, PI) control system software, through a maximum number of jobs to be sent to the cluster, in response to system information about the current number of jobs processed.
高性能计算系统正面临着越来越多的行为可变性,例如与性能和功耗有关,而且它们难以预测的事实需要更多的运行时管理。这可以在自治管理反馈循环中完成,通过分析这些数据并利用结果来激活适当的系统级或应用程序级反馈机制(例如,通知调度器、降时钟cpu),以响应系统中监视的信息。CiGri是一个简单、轻量级、可扩展和容错的网格系统,它利用了一组计算集群的未使用资源。执行主HPC应用程序调度所剩余的计算能力用于执行较小的作业,这些作业在全局系统允许的情况下被注入。本文提出了解决高性能计算基础设施中自动化资源管理问题的第一个结果,使用控制论的技术来设计一个最大限度地利用集群同时避免过载的控制器。我们设置了一种反馈机制(比例积分,PI)控制系统软件,通过发送到集群的最大作业数量来响应有关当前处理的作业数量的系统信息。
{"title":"Towards a control-theory approach for minimizing unused grid resources","authors":"Emma Stahl, A. Yabo, Olivier Richard, B. Bzeznik, B. Robu, É. Rutten","doi":"10.1145/3217197.3217201","DOIUrl":"https://doi.org/10.1145/3217197.3217201","url":null,"abstract":"HPC systems are facing more and more variability in their behavior, related to e.g., performance and power consumption, and the fact that they are less predictable requires more runtime management. This can be done in an Autonomic Management feedback loop, in response to monitored information in the systems, by analysis of this data and utilization of the results in order to activate appropriate system-level or application-level feedback mechanisms (e.g., informing schedulers, down-clocking CPUs). One such problem is found in the context of CiGri, a simple, lightweight, scalable and fault tolerant grid system which exploits the unused resources of a set of computing clusters. Computing power left over by the execution of a main HPC application scheduling is used to execute smaller jobs, which are injected as much as the global system allows. This paper presents first results addressing the problem of automated resource management in an HPC infrastructure, using techniques from Control Theory to design a controller that maximizes cluster utilization while avoiding overload. We put in place a mechanism for feedback (Proportional Integral, PI) control system software, through a maximum number of jobs to be sent to the cluster, in response to system information about the current number of jobs processed.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130283181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
High-Throughput Neuroanatomy and Trigger-Action Programming: A Case Study in Research Automation 高通量神经解剖学和触发-动作编程:研究自动化的案例研究
Ryan Chard, Rafael Vescovi, Ming Du, Hanyu Li, K. Chard, S. Tuecke, N. Kasthuri, Ian T Foster
Exponential increases in data volumes and velocities are overwhelming finite human capabilities. Continued progress in science and engineering demands that we automate a broad spectrum of currently manual research data manipulation tasks, from data transfer and sharing to acquisition, publication, and analysis. These needs are particularly evident in large-scale experimental science, in which researchers are typically granted short periods of instrument time and must maximize experiment efficiency as well as output data quality and accuracy. To address the need for automation, which is pervasive across science and engineering, we present our experiences using Trigger-Action-Programming to automate a real-world scientific workflow. We evaluate our methods by applying them to a neuroanatomy application in which a synchrotron is used to image cm-scale mouse brains with sub-micrometer resolution. In this use case, data is acquired in real-time at the synchrotron and are automatically passed through a complex automation flow that involves reconstruction using HPC resources, human-in-the-loop coordination, and finally data publication and visualization. We describe the lessons learned from these experiences and outline the design for a new research automation platform.
数据量和速度的指数级增长压倒了有限的人类能力。科学和工程的持续进步要求我们将目前手工研究数据操作任务的广泛范围自动化,从数据传输和共享到获取,发布和分析。这些需求在大规模实验科学中尤其明显,研究人员通常被授予较短的仪器时间,并且必须最大限度地提高实验效率以及输出数据的质量和准确性。为了解决自动化的需求,这在科学和工程领域是普遍存在的,我们展示了我们使用触发器-操作-编程来自动化现实世界的科学工作流的经验。我们通过将其应用于神经解剖学应用来评估我们的方法,其中同步加速器用于以亚微米分辨率成像厘米尺度的小鼠大脑。在这个用例中,数据在同步加速器上实时获取,并自动通过一个复杂的自动化流程,该流程包括使用HPC资源进行重建、人在环协调,以及最后的数据发布和可视化。我们描述了从这些经验中吸取的教训,并概述了一个新的研究自动化平台的设计。
{"title":"High-Throughput Neuroanatomy and Trigger-Action Programming: A Case Study in Research Automation","authors":"Ryan Chard, Rafael Vescovi, Ming Du, Hanyu Li, K. Chard, S. Tuecke, N. Kasthuri, Ian T Foster","doi":"10.1145/3217197.3217206","DOIUrl":"https://doi.org/10.1145/3217197.3217206","url":null,"abstract":"Exponential increases in data volumes and velocities are overwhelming finite human capabilities. Continued progress in science and engineering demands that we automate a broad spectrum of currently manual research data manipulation tasks, from data transfer and sharing to acquisition, publication, and analysis. These needs are particularly evident in large-scale experimental science, in which researchers are typically granted short periods of instrument time and must maximize experiment efficiency as well as output data quality and accuracy. To address the need for automation, which is pervasive across science and engineering, we present our experiences using Trigger-Action-Programming to automate a real-world scientific workflow. We evaluate our methods by applying them to a neuroanatomy application in which a synchrotron is used to image cm-scale mouse brains with sub-micrometer resolution. In this use case, data is acquired in real-time at the synchrotron and are automatically passed through a complex automation flow that involves reconstruction using HPC resources, human-in-the-loop coordination, and finally data publication and visualization. We describe the lessons learned from these experiences and outline the design for a new research automation platform.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121064697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science 第一届科学自主基础设施国际研讨会论文集
{"title":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","authors":"","doi":"10.1145/3217197","DOIUrl":"https://doi.org/10.1145/3217197","url":null,"abstract":"","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133057011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In-Operando Tracking and Prediction of Transition in Material System using LSTM 基于LSTM的物料系统运行中过渡跟踪与预测
Pranjal Sahu, Dantong Yu, K. Yager, Mallesham Dasari, Hong Qin
The structures of many material systems evolve as they are treated with physical processing. For instance, organic and inorganic crystalline materials frequently coarsen over time as they are thermally treated; with domains (grains) rotating and growing in size. When a material system undergoing the structural transformation is probed using x-ray scattering beams, the peaks in the scattering images will sharpen and intensify, and the scattering rings will become increasingly 'textured'. Accurate identification of the transition frame in advance brings multiple benefits to the NSLS-II in-operando experiments of studying material systems such as minimal beamline damage to samples, reduced energy costs, and the optimal sampling of material properties. In this paper, we formulate the prediction and identification of the structural transition event as a classification problem and apply a novel LSTM model to identify sequences having transition event. The preliminary results of the experiments are encouraging and confirm the viability of the detection and prediction of transition in advance. Our ultimate goal is to deploy such a prediction system in the real-world environment at the selected beamline of NSLS-II for improving the efficiency of the experimental facility.
许多材料系统的结构随着它们的物理处理而演变。例如,有机和无机晶体材料在经过热处理后,往往会随着时间的推移而变粗;随着域(颗粒)的旋转和尺寸的增长。当使用x射线散射光束探测正在经历结构转变的材料系统时,散射图像中的峰将变得锐化和强化,散射环将变得越来越“有纹理”。提前准确识别过渡框架为研究材料系统的NSLS-II在操作中实验带来了诸多好处,如最小的光束线损伤样品,降低能源成本,以及材料性能的最佳采样。本文将结构转移事件的预测和识别作为一个分类问题,并应用一种新的LSTM模型来识别具有转移事件的序列。实验的初步结果令人鼓舞,并证实了提前探测和预测跃迁的可行性。我们的最终目标是在NSLS-II的选定光束线上部署这样一个预测系统,以提高实验设施的效率。
{"title":"In-Operando Tracking and Prediction of Transition in Material System using LSTM","authors":"Pranjal Sahu, Dantong Yu, K. Yager, Mallesham Dasari, Hong Qin","doi":"10.1145/3217197.3217204","DOIUrl":"https://doi.org/10.1145/3217197.3217204","url":null,"abstract":"The structures of many material systems evolve as they are treated with physical processing. For instance, organic and inorganic crystalline materials frequently coarsen over time as they are thermally treated; with domains (grains) rotating and growing in size. When a material system undergoing the structural transformation is probed using x-ray scattering beams, the peaks in the scattering images will sharpen and intensify, and the scattering rings will become increasingly 'textured'. Accurate identification of the transition frame in advance brings multiple benefits to the NSLS-II in-operando experiments of studying material systems such as minimal beamline damage to samples, reduced energy costs, and the optimal sampling of material properties. In this paper, we formulate the prediction and identification of the structural transition event as a classification problem and apply a novel LSTM model to identify sequences having transition event. The preliminary results of the experiments are encouraging and confirm the viability of the detection and prediction of transition in advance. Our ultimate goal is to deploy such a prediction system in the real-world environment at the selected beamline of NSLS-II for improving the efficiency of the experimental facility.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114491992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Virtual Environment for Testing Software-Defined Networking Solutions for Scientific Workflows 用于测试科学工作流的软件定义网络解决方案的虚拟环境
Qiang Liu, N. Rao, S. Sen, B. Settlemyer, Hsing-bung Chen, J. Boley, R. Kettimuthu, D. Katramatos
Recent developments in software-defined infrastructures promise that scientific workflows utilizing supercomputers, instruments, and storage systems will be dynamically composed and orchestrated using software at unprecedented speed and scale in the near future. Testing of the underlying networking software, particularly during initial exploratory stages, remains a challenge due to potential disruptions, and resource allocation and coordination needed over the multi-domain physical infrastructure. To overcome these challenges, we develop the Virtual Science Network Environment (VSNE) that emulates the multi-site host, storage, and network infrastructure using Virtual Machines (VMs), wherein the production and nascent software can be tested. Within each VM, which represents a site, the hosts and local-area networks are emulated using Mininet, and the Software-Defined Network (SDN) controllers and service daemon codes are natively run to support dynamic provisioning of network connections. Additionally, Lustre filesystem support at the sites and an emulation of the long-haul network using Mininet, are provided using separate VMs. As case studies, we describe Lustre file transfers using XDD, Red5 streaming service demonstration, and an emulated experiment with remote monitoring and steering modules, all supported over dynamically configured connections using SDN controllers.
软件定义基础设施的最新发展表明,在不久的将来,利用超级计算机、仪器和存储系统的科学工作流程将以前所未有的速度和规模使用软件进行动态组合和编排。底层网络软件的测试,特别是在最初的探索阶段,由于潜在的中断,以及在多域物理基础设施上需要的资源分配和协调,仍然是一个挑战。为了克服这些挑战,我们开发了虚拟科学网络环境(VSNE),它使用虚拟机(vm)模拟多站点主机,存储和网络基础设施,其中可以测试生产和新生软件。在代表一个站点的每个VM中,使用Mininet模拟主机和局域网,并且本机运行软件定义网络(SDN)控制器和服务守护进程代码,以支持网络连接的动态供应。此外,站点上的Lustre文件系统支持和使用Mininet的长途网络模拟都是使用单独的vm提供的。作为案例研究,我们描述了使用XDD的Lustre文件传输,Red5流服务演示,以及远程监控和转向模块的模拟实验,所有这些都支持使用SDN控制器动态配置的连接。
{"title":"Virtual Environment for Testing Software-Defined Networking Solutions for Scientific Workflows","authors":"Qiang Liu, N. Rao, S. Sen, B. Settlemyer, Hsing-bung Chen, J. Boley, R. Kettimuthu, D. Katramatos","doi":"10.1145/3217197.3217202","DOIUrl":"https://doi.org/10.1145/3217197.3217202","url":null,"abstract":"Recent developments in software-defined infrastructures promise that scientific workflows utilizing supercomputers, instruments, and storage systems will be dynamically composed and orchestrated using software at unprecedented speed and scale in the near future. Testing of the underlying networking software, particularly during initial exploratory stages, remains a challenge due to potential disruptions, and resource allocation and coordination needed over the multi-domain physical infrastructure. To overcome these challenges, we develop the Virtual Science Network Environment (VSNE) that emulates the multi-site host, storage, and network infrastructure using Virtual Machines (VMs), wherein the production and nascent software can be tested. Within each VM, which represents a site, the hosts and local-area networks are emulated using Mininet, and the Software-Defined Network (SDN) controllers and service daemon codes are natively run to support dynamic provisioning of network connections. Additionally, Lustre filesystem support at the sites and an emulation of the long-haul network using Mininet, are provided using separate VMs. As case studies, we describe Lustre file transfers using XDD, Red5 streaming service demonstration, and an emulated experiment with remote monitoring and steering modules, all supported over dynamically configured connections using SDN controllers.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115735697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards Autonomic Science Infrastructure: Architecture, Limitations, and Open Issues 走向自主科学基础设施:架构、限制和开放问题
R. Kettimuthu, Zhengchun Liu, Ian T Foster, P. Beckman, A. Sim, Kesheng Wu, W. Liao, Qiao Kang, Ankit Agrawal, A. Choudhary
Scientific computing systems are becoming increasingly complex and indeed are close to reaching a critical limit in manageability when using current human-in-the-loop techniques. In order to address this problem, autonomic, goal-driven management actions based on machine learning must be applied end to end across the scientific computing landscape. Even though researchers proposed architectures and design choices for autonomic computing systems more than a decade ago, practical realization of such systems has been limited, especially in scientific computing environments. Growing interest and recent developments in machine learning have spurred proposals to apply machine learning for goal-based optimization of computing systems in an autonomous fashion. We review recent work that uses machine learning algorithms to improve computer system performance, identify gaps and open issues. We propose a hierarchical architecture that builds on the earlier proposals for autonomic computing systems to realize an autonomous science infrastructure.
科学计算系统正变得越来越复杂,当使用当前的人在循环技术时,在可管理性方面确实接近临界极限。为了解决这个问题,基于机器学习的自主、目标驱动的管理行为必须在整个科学计算领域端到端应用。尽管研究人员在十多年前就提出了自主计算系统的架构和设计选择,但这种系统的实际实现仍然有限,特别是在科学计算环境中。对机器学习日益增长的兴趣和最近的发展促使人们提出将机器学习应用于以自主方式进行基于目标的计算系统优化的建议。我们回顾了最近使用机器学习算法来提高计算机系统性能,识别差距和开放问题的工作。我们提出了一种分层架构,该架构建立在早期自主计算系统的基础上,以实现自主的科学基础设施。
{"title":"Towards Autonomic Science Infrastructure: Architecture, Limitations, and Open Issues","authors":"R. Kettimuthu, Zhengchun Liu, Ian T Foster, P. Beckman, A. Sim, Kesheng Wu, W. Liao, Qiao Kang, Ankit Agrawal, A. Choudhary","doi":"10.1145/3217197.3217205","DOIUrl":"https://doi.org/10.1145/3217197.3217205","url":null,"abstract":"Scientific computing systems are becoming increasingly complex and indeed are close to reaching a critical limit in manageability when using current human-in-the-loop techniques. In order to address this problem, autonomic, goal-driven management actions based on machine learning must be applied end to end across the scientific computing landscape. Even though researchers proposed architectures and design choices for autonomic computing systems more than a decade ago, practical realization of such systems has been limited, especially in scientific computing environments. Growing interest and recent developments in machine learning have spurred proposals to apply machine learning for goal-based optimization of computing systems in an autonomous fashion. We review recent work that uses machine learning algorithms to improve computer system performance, identify gaps and open issues. We propose a hierarchical architecture that builds on the earlier proposals for autonomic computing systems to realize an autonomous science infrastructure.","PeriodicalId":118966,"journal":{"name":"Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131730064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1