A Resource-saving Job Monitoring System of High Performance Computing using Parent and Child Process
Kajornsak Piyoungkorn, Phithak Thaenkaew, C. Vorakulpipat
DOI: 10.22323/1.351.0034
High performance computing has become increasingly important over the past decade. The data volumes processed today have grown so large that ordinary computer systems can no longer handle them: scientific experiments involving big data require both high-speed data processing and support for parallel processing. The usual solution is to divide a job into a number of sections, have each processing unit work on its part of the data at the same time, and then send the computed results back to be combined. This mechanism shortens the time to complete a task and generates more output in the same period. The goal of this study is therefore to maximize efficiency in the use of computing resources, in particular the processing power of the CPU cores. When an HPC system has a large number of concurrent users, the processing resources they request often do not match their actual usage. A system is therefore needed to detect job requests that use computing resources inefficiently, helping both users and system administrators work effectively.
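The divide/process/combine pattern described above can be shown in a minimal sketch (not the authors' implementation), using Python's multiprocessing module as a stand-in for the paper's parent and child processes; the workload, summing squares, is a placeholder:

```python
# Minimal sketch of the divide/process/combine pattern: a parent process
# splits the job into chunks, child workers process them concurrently,
# and the parent combines the partial results. The workload is a placeholder.
from multiprocessing import Pool
import os

def process_chunk(chunk):
    # Each child process handles one slice of the data.
    return sum(x * x for x in chunk)

def run_job(data, n_workers=os.cpu_count()):
    # Parent: divide the job into roughly equal chunks ...
    size = (len(data) + n_workers - 1) // n_workers
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with Pool(processes=n_workers) as pool:
        # ... children process the chunks at the same time ...
        partials = pool.map(process_chunk, chunks)
    # ... and the parent combines the partial results.
    return sum(partials)

if __name__ == "__main__":
    print(run_job(list(range(1_000_000))))
```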
{"title":"A Resource-saving Job Monitoring System of High Performance Computing using Parent and Child Process","authors":"Kajornsak Piyoungkorn, Phithak Thaenkaew, C. Vorakulpipat","doi":"10.22323/1.351.0034","DOIUrl":"https://doi.org/10.22323/1.351.0034","url":null,"abstract":"High performance computing has been more important in the past decade. In the present day, data used for processing becomes enormous. Where a high performance computing resource is needed to help process the data. Some scientific experiments involving big data. Which requires high speed data processing cannot be done by an ordinary computer system. Also, there is a need for support of parallel processing. The solution starts by dividing the job into a number of sections to be processed into parts and the processing unit each processing unit of data at the same time. Then, the system sends the calculated result back to the compiled. This mechanism will speed up the processing time to complete the task and generate more output at the same time. Therefore, a solution in this study is to maximize efficiency when using the resources of the computer which involves the processing power of the processor (CPU Cores).When the HPC system has a large number of concurrent users and requests processing resources that do not match the actual usage. Therefore requires a system to detect job requests that use inefficient computing resources to help users and system administrators to work effectively.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125848819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Toward Single Sign-on Establishment for Inter-Cloud Environment
E. Sakane, Takeshi Nishimura, K. Aida, Motonori Nakamura
DOI: 10.22323/1.351.0028
This paper investigates a mechanism that establishes single sign-on for an inter-cloud computing environment built to match the needs of its users. After arranging the requirements and issues for such a mechanism, a single sign-on system for an inter-cloud computing environment is presented. As a concrete service in the inter-cloud environment, we deal with Amazon Web Service using SAML version 2.0 and implement a prototype system. We also evaluate the prototype implementation and consider its applicability to other services.
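The core of such a SAML 2.0 federation into AWS is exchanging an IdP-issued assertion for temporary credentials via STS. A hedged sketch of that step follows (the paper does not say which client library it uses; boto3 and the ARNs below are illustrative assumptions):

```python
# Sketch of SAML 2.0 sign-on to AWS: trade a SAML assertion for
# temporary STS credentials. Both ARNs are hypothetical placeholders.
import boto3

def saml_sign_on(saml_assertion_b64: str):
    sts = boto3.client("sts")
    resp = sts.assume_role_with_saml(
        RoleArn="arn:aws:iam::123456789012:role/FederatedUser",       # placeholder
        PrincipalArn="arn:aws:iam::123456789012:saml-provider/MyIdP",  # placeholder
        SAMLAssertion=saml_assertion_b64,  # base64-encoded assertion from the IdP
    )
    creds = resp["Credentials"]
    # Temporary credentials only; no long-lived AWS secret is handled.
    return boto3.session.Session(
        aws_access_key_id=creds["AccessKeyId"],
        aws_secret_access_key=creds["SecretAccessKey"],
        aws_session_token=creds["SessionToken"],
    )
```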
{"title":"Toward Single Sign-on Establishment for Inter-Cloud Environment","authors":"E. Sakane, Takeshi Nishimura, K. Aida, Motonori Nakamura","doi":"10.22323/1.351.0028","DOIUrl":"https://doi.org/10.22323/1.351.0028","url":null,"abstract":"This paper investigates a mechanism that establishes single sign-on for inter-cloud computing environment built as the optimized result of the needs of users. Arranging requirements and issues for the mechanism, a single sign-on system for an inter-cloud computing environment is presented. As concrete service in the inter-cloud environment, we deal with Amazon Web Service with SAML version 2.0 and implement a prototype system. We also evaluate the prototype implementation and consider applicability to the other services.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"190 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132346927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Improving efficiency of analysis jobs in CMS
T. Ivanov, S. Belforte, M. Wolf, M. Mascheroni, A. P. Yzquierdo, J. Letts, J. Hernández, L. Cristella, D. Ciangottini, J. Balcas, A. Woodard, K. H. Anampa, B. Bockelman, D. Foyo
DOI: 10.1051/epjconf/201921403006
Hundreds of physicists analyze data collected by the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider using the CMS Remote Analysis Builder and the CMS global pool to exploit the resources of the Worldwide LHC Computing Grid. Efficient use of such an extensive and expensive resource is crucial. At the same time, the CMS collaboration is committed to minimizing time to insight for every scientist, pushing for the fewest possible access restrictions to the full data sample and supporting the free choice of applications to run on the computing resources. Supporting such a variety of workflows while preserving efficient resource usage poses special challenges. In this paper we report on three complementary approaches adopted in CMS to improve the scheduling efficiency of user analysis jobs: automatic job splitting, automated run-time estimates, and automated site selection for jobs.
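To illustrate how the first two approaches fit together, here is a hedged sketch of splitting driven by run-time estimates (a toy illustration of the general idea, not the CMS implementation; the target window and the per-unit estimate are assumed inputs):

```python
# Illustrative sketch (not the CMS tooling): size each job from an
# estimated per-unit run time so jobs fit a target wall-clock window.
def split_by_runtime(n_units: int, est_sec_per_unit: float,
                     target_job_sec: float = 8 * 3600) -> list[int]:
    """Return the number of units assigned to each job so that each
    job's estimated run time stays near the target window."""
    units_per_job = max(1, int(target_job_sec / est_sec_per_unit))
    return [min(units_per_job, n_units - i)
            for i in range(0, n_units, units_per_job)]

# e.g. 10,000 events at ~30 s/event, targeting ~8 h jobs -> 960 events/job
print(split_by_runtime(10_000, 30.0))
```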
{"title":"Improving efficiency of analysis jobs in CMS","authors":"T. Ivanov, S. Belforte, M. Wolf, M. Mascheroni, A. P. Yzquierdo, J. Letts, J. Hernández, L. Cristella, D. Ciangottini, J. Balcas, A. Woodard, K. H. Anampa, B. Bockelman, D. Foyo","doi":"10.1051/epjconf/201921403006","DOIUrl":"https://doi.org/10.1051/epjconf/201921403006","url":null,"abstract":"Hundreds of physicists analyze data collected by the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider using the CMS Remote Analysis Builder and the CMS global pool to exploit the resources of the Worldwide LHC Computing Grid. Efficient use of such an extensive and expensive resource is crucial. At the same time, the CMS collaboration is committed to minimizing time to insight for every scientist, by pushing for fewer possible access restrictions to the full data sample and supports the free choice of applications to run on the computing resources. Supporting such variety of workflows while preserving efficient resource usage poses special challenges. In this paper we report on three complementary approaches adopted in CMS to improve the scheduling efficiency of user analysis jobs: automatic job splitting, automated run time estimates and automated site selection for jobs.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126371743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Simulation of the cache hit rate for data readout at the Tokyo Tier-2 center
T. Kishimoto, J. Tanaka, T. Mashimo, M. Kaneda, N. Matsui
DOI: 10.22323/1.351.0030
The Tokyo Tier-2 center, located in the International Center for Elementary Particle Physics at the University of Tokyo, provides computing resources to the ATLAS experiment in the Worldwide LHC Computing Grid. In order to improve the I/O performance and scalability of the file servers in a future system, the possibility of introducing a cache system using fast devices such as SSDs is under discussion. A simulation has therefore been performed to understand the cache behavior, using past data-access logs from the center. This paper reports the simulation method and discusses its results.
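A log-driven cache simulation of this kind can be sketched in a few lines; the version below assumes an LRU eviction policy and a simple (filename, size) log format, neither of which is specified by the paper:

```python
# Minimal sketch of a cache hit-rate simulation replayed from an access
# log, assuming LRU eviction. Log format and policy are assumptions.
from collections import OrderedDict

def simulate_hit_rate(accesses, cache_bytes):
    """accesses: iterable of (filename, size_bytes) read events."""
    cache = OrderedDict()          # filename -> size, kept in LRU order
    used, hits, total = 0, 0, 0
    for name, size in accesses:
        total += 1
        if name in cache:
            hits += 1
            cache.move_to_end(name)    # refresh LRU position on a hit
        else:
            cache[name] = size
            used += size
            while used > cache_bytes:  # evict least recently used files
                _, evicted = cache.popitem(last=False)
                used -= evicted
    return hits / total if total else 0.0

log = [("f1", 5e9), ("f2", 5e9), ("f1", 5e9), ("f3", 8e9), ("f1", 5e9)]
print(simulate_hit_rate(log, cache_bytes=12e9))   # -> 0.2
```

Replaying real access logs through such a model at several cache sizes gives a hit-rate curve from which the cost/benefit of an SSD tier can be judged.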
{"title":"Simulation of the cache hit rate for data readout at the Tokyo Tier-2 center","authors":"T. Kishimoto, J. Tanaka, T. Mashimo, M. Kaneda, N. Matsui","doi":"10.22323/1.351.0030","DOIUrl":"https://doi.org/10.22323/1.351.0030","url":null,"abstract":"The Tokyo Tier-2 center, which is located in the International Center for Elementary Particle Physics at the University of Tokyo, provides computing resources to the ATLAS experiment in the Worldwide LHC Computing Grid. In order to improve the I/O performance and scalability of file servers in the future system, a possibility of introducing a cache system using fast devices such as SSD is under discussion. Therefore, a simulation has been performed to understand the cache behavior using past data access logs at the center. This paper reports a method of the simulation and gives a discussion about its results.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126286999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Building a minimum viable Security Operations Centre for the modern grid environment
D. Crooks, L. Valsan
DOI: 10.22323/1.351.0010

The modern security landscape affecting grid and cloud sites is constantly evolving, with threats arriving from a range of avenues, including social engineering as well as more direct approaches. It is vital to build up operational security capabilities across the Worldwide LHC Computing Grid (WLCG) in order to improve the defence of the community as a whole. As reported at ISGC 2017 and 2018, the WLCG Security Operations Centres (SOC) Working Group (WG) has been working with sites across the WLCG to develop a Security Operations Centre reference design. We present the current status of a minimum viable SOC design applicable to a range of different WLCG sites, centred around a few key components.

The design uses the Zeek Network Intrusion Detection System to monitor what is happening at the network level in strategic locations: for example at the border between the local cluster and external networks, at the border between different local network domains, or at core infrastructure nodes. The MISP Open Source Threat Intelligence Platform is used to share information regarding relevant security events and the associated Indicators of Compromise (IoCs). By feeding IoCs from MISP into Zeek we have a platform that allows the community to share threat intelligence that is immediately actionable across the entire grid.

The logs produced by Zeek are processed using the Elasticsearch, Logstash, Kibana (Elastic) stack for real-time indexing and visualisation. This provides sites with a powerful tool for incident response and network forensics. The alerts raised by Zeek are further aggregated, correlated, and enriched by an advanced notification processing engine. This ensures that most false positives are automatically whitelisted while at the same time reducing the total number of raised alerts that the computer security team of each site has to manage. By enriching these alerts and adding context about what happened around the moment the malicious activity was detected, the time needed to handle them is greatly reduced.

We present possible deployment strategies for all these components in a grid context, as well as the integration between them. We also report on the current status of work on integrating other sources of data, in particular netflow / sflow, into this model. Lastly, we discuss how making use of these SOC capabilities distributed across the participating sites can increase operational security across the entire grid.
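The MISP-to-Zeek step can be sketched as a small converter that writes IoCs in the tab-separated format read by Zeek's Intelligence Framework. This is an illustration under assumptions, not the working group's actual tooling, and the input attribute list is a simplified stand-in for a real MISP export:

```python
# Sketch: export MISP attributes as a Zeek Intelligence Framework file.
# The attribute dicts below are a simplified stand-in for a MISP export.
ZEEK_TYPE = {          # map MISP attribute types to Zeek intel types
    "ip-dst": "Intel::ADDR",
    "ip-src": "Intel::ADDR",
    "domain": "Intel::DOMAIN",
    "url": "Intel::URL",
    "md5": "Intel::FILE_HASH",
    "sha256": "Intel::FILE_HASH",
}

def write_zeek_intel(attributes, path, source="MISP"):
    with open(path, "w") as f:
        # Zeek intel files are tab-separated with a #fields header line.
        f.write("#fields\tindicator\tindicator_type\tmeta.source\n")
        for attr in attributes:
            ztype = ZEEK_TYPE.get(attr["type"])
            if ztype:  # skip attribute types Zeek cannot match on
                f.write(f"{attr['value']}\t{ztype}\t{source}\n")

write_zeek_intel(
    [{"type": "ip-dst", "value": "203.0.113.7"},
     {"type": "domain", "value": "malicious.example"}],
    "misp.intel",
)
```

Loading the resulting file into Zeek makes every shared indicator immediately actionable in the site's network monitoring, which is the point of the MISP-to-Zeek coupling described above.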
{"title":"Building a minimum viable Security Operations Centre for the modern grid environment","authors":"D. Crooks, L. Valsan","doi":"10.22323/1.351.0010","DOIUrl":"https://doi.org/10.22323/1.351.0010","url":null,"abstract":"The modern security landscape affecting grid and cloud sites is constantly evolving, with threats being seen from a range of avenues, including social engineering as well as more direct approaches. It is vital to build up operational security capabilities across the Worldwide LHC Computing Grid (WLCG) in order to improve the defence of the community as a whole. As reported at ISGC 2017 and 2018, the WLCG Security Operations Centres (SOC) Working Group (WG) has been working with sites across the WLCG to develop a model for a Security Operations Centre reference design. We present the current status of a minimum viable SOC design applicable to a range of different WLCG sites, centred around a few key components. \u0000 \u0000The design uses the Zeek Network Intrusion Detection System for monitoring what is happening at the network level in strategic locations: for example at border between the local cluster and external networks, the border between different local network domains or at core infrastructure nodes. The MISP Open Source Threat Intelligence Platform is used to share information regarding relevant security events and the associated Indicators of Compromise (IoCs). By feeding IoCs from MISP into Zeek we have a platform that allows the community to share threat intelligence that is immediately actionable across the entire grid. \u0000 \u0000The logs produced by Zeek are processed using the Elasticsearch, Logstash, Kibana (Elastic) stack for real time indexing and visualisation. This provides sites with a powerful tool for incident response and network forensics. The alerts raised by Zeek are further aggregated, correlated and enriched by an advanced notification processing engine. This ensures that most false positives are automatically whitelisted while at the same time reducing the total number of raised alerts that need to be managed by the computer security team of each site. By enriching these alerts and adding context of what happened around the moment the malicious activity was detected, the time needed to handle these alerts is greatly reduced. \u0000 \u0000We present possible deployment strategies for all these components in a grid context as well as the integration between them. We also report on the current status of work on integrating other sources of data, in particular using netflow / sflow, into this model. \u0000 \u0000Lastly we discuss how making use of these SOC capabilities distributed across the participating sites can lead to increasing the operational security across the entire grid.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131714912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Blueprint of Log Based Monitoring and Diagnosing Framework in Large Distributed Environments
Yining Zhao, Xiaodong Wang, Haili Xiao, Xue-bin Chi
DOI: 10.22323/1.351.0033
Distributed systems have kept scaling upward since the concept appeared, and they have evolved into environments containing heterogeneous components playing different roles, making it difficult to understand how the whole environment works or whether anything undesired has happened from a security point of view. Logs, produced by devices, sub-systems, and running processes, are a very important source of security knowledge for system maintainers. But there are too many logs, and too many kinds of logs, to deal with, which makes manual checking impossible. In this work we share some of our experiences in log processing and analysis. We summarize the common major steps that appear in most existing log-analysis approaches: log selection, log classification, information analysis, and result feedback. We also present a general framework that monitors events, analyzes hidden information, and diagnoses the health of large distributed computing environments based on logs. Although the framework was initially designed for the maintenance of CNGrid, its process is adaptable to other distributed computing environments.
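The four steps named above can be illustrated with a minimal pipeline over raw log lines; the patterns and threshold here are illustrative assumptions, not the paper's actual rules:

```python
# Minimal sketch of the selection -> classification -> analysis ->
# feedback steps over raw log lines. Patterns and threshold are assumed.
import re
from collections import Counter

PATTERNS = {  # classification: map each line to an event class
    "auth_failure": re.compile(r"authentication failure|Failed password"),
    "disk_error": re.compile(r"I/O error|read-only file system"),
}

def analyze(lines, threshold=5):
    # Selection: keep only lines that look security/health relevant.
    selected = [ln for ln in lines if any(p.search(ln) for p in PATTERNS.values())]
    # Classification: count events per class.
    counts = Counter(cls for ln in selected
                     for cls, p in PATTERNS.items() if p.search(ln))
    # Analysis + feedback: report classes whose volume exceeds a threshold.
    return {cls: n for cls, n in counts.items() if n >= threshold}

logs = ["sshd[42]: Failed password for root"] * 6 + ["kernel: I/O error, dev sda"]
print(analyze(logs))   # -> {'auth_failure': 6}
```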
{"title":"A Blueprint of Log Based Monitoring and Diagnosing Framework in Large Distributed Environments","authors":"Yining Zhao, Xiaodong Wang, Haili Xiao, Xue-bin Chi","doi":"10.22323/1.351.0033","DOIUrl":"https://doi.org/10.22323/1.351.0033","url":null,"abstract":"Distributed systems have kept scaling upward since this concept appears, and they soon evolve to environments that contain heterogeneous components playing different roles, making it difficult to understand how the large environment works or if any undesired matters happened from security point of view. Logs, produced by devices, sub-systems and running processes, are a very important source to help system maintainers to get relative security knowledge. But there are too many logs and too many kinds of logs to deal with, which makes manual checking impossible. In this work we will share some of our experiences in log processing and analyzing. We have summarized some common major steps that appear in most of the existing log analysis approaches, including log selection, log classification, information analyses and result feedback. We also represent a general framework that monitors events, analyzes hidden information and diagnoses the healthy state for large distributed computing environments bases on logs. Although the framework we initially designed was for the maintenance for CNGrid, its process is adaptable to other distributed computing environments.","PeriodicalId":106243,"journal":{"name":"Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127133870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}