ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences最新文献

英文中文

AMPRO-HPCC: A Machine-Learning Tool for Predicting Resources on Slurm HPC Clusters. AMPRO-HPCC:一个用于预测Slurm HPC集群资源的机器学习工具。

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

Pub Date : 2021-10-01

Mohammed Tanash, Daniel Andresen, William Hsu

Determining resource allocations (memory and time) for submitted jobs in High Performance Computing (HPC) systems is a challenging process even for computer scientists. HPC users are highly encouraged to overestimate resource allocation for their submitted jobs, so their jobs will not be killed due to insufficient resources. Overestimating resource allocations occurs because of the wide variety of HPC applications and environment configuration options, and the lack of knowledge of the complex structure of HPC systems. This causes a waste of HPC resources, a decreased utilization of HPC systems, and increased waiting and turnaround time for submitted jobs. In this paper, we introduce our first ever implemented fully-offline, fully-automated, stand-alone, and open-source Machine Learning (ML) tool to help users predict memory and time requirements for their submitted jobs on the cluster. Our tool involves implementing six ML discriminative models from the scikit-learn and Microsoft LightGBM applied on the historical data (sacct data) from Simple Linux Utility for Resource Management (Slurm). We have tested our tool using historical data (saact data) using HPC resources of Kansas State University (Beocat), which covers the years from January 2019 - March 2021, and contains around 17.6 million jobs. Our results show that our tool achieves high predictive accuracy R ² (0.72 using LightGBM for predicting the memory and 0.74 using Random Forest for predicting the time), helps dramatically reduce computational average waiting-time and turnaround time for the submitted jobs, and increases utilization of the HPC resources. Hence, our tool decreases the power consumption of the HPC resources.

在高性能计算(HPC)系统中，为提交的作业确定资源分配(内存和时间)是一个具有挑战性的过程，即使对计算机科学家也是如此。强烈建议HPC用户高估其提交作业的资源分配，这样他们的作业就不会因为资源不足而被终止。由于HPC应用程序和环境配置选项的多样性，以及缺乏对HPC系统复杂结构的了解，会出现对资源分配的高估。这会导致HPC资源的浪费，HPC系统的利用率降低，以及提交作业的等待和周转时间增加。在本文中，我们介绍了我们有史以来第一个实现的完全离线、全自动、独立和开源的机器学习(ML)工具，以帮助用户预测他们在集群上提交的作业的内存和时间需求。我们的工具包括实现来自scikit-learn和Microsoft LightGBM的6个ML判别模型，这些模型应用于来自Simple Linux Utility for Resource Management (Slurm)的历史数据(sact数据)。我们使用堪萨斯州立大学(Beocat)的HPC资源使用历史数据(saact数据)测试了我们的工具，这些数据涵盖了2019年1月至2021年3月的年份，包含了大约1760万个工作岗位。我们的结果表明，我们的工具达到了很高的预测精度r2(使用LightGBM预测内存为0.72，使用Random Forest预测时间为0.74)，有助于显着减少提交作业的计算平均等待时间和周转时间，并提高HPC资源的利用率。因此，我们的工具降低了HPC资源的功耗。

{"title":"AMPRO-HPCC: A Machine-Learning Tool for Predicting Resources on Slurm HPC Clusters.","authors":"Mohammed Tanash, Daniel Andresen, William Hsu","doi":"","DOIUrl":"","url":null,"abstract":"Determining resource allocations (memory and time) for submitted jobs in High Performance Computing (HPC) systems is a challenging process even for computer scientists. HPC users are highly encouraged to overestimate resource allocation for their submitted jobs, so their jobs will not be killed due to insufficient resources. Overestimating resource allocations occurs because of the wide variety of HPC applications and environment configuration options, and the lack of knowledge of the complex structure of HPC systems. This causes a waste of HPC resources, a decreased utilization of HPC systems, and increased waiting and turnaround time for submitted jobs. In this paper, we introduce our first ever implemented fully-offline, fully-automated, stand-alone, and open-source Machine Learning (ML) tool to help users predict memory and time requirements for their submitted jobs on the cluster. Our tool involves implementing six ML discriminative models from the scikit-learn and Microsoft LightGBM applied on the historical data (sacct data) from Simple Linux Utility for Resource Management (Slurm). We have tested our tool using historical data (saact data) using HPC resources of Kansas State University (Beocat), which covers the years from January 2019 - March 2021, and contains around 17.6 million jobs. Our results show that our tool achieves high predictive accuracy R 2 (0.72 using LightGBM for predicting the memory and 0.74 using Random Forest for predicting the time), helps dramatically reduce computational average waiting-time and turnaround time for the submitted jobs, and increases utilization of the HPC resources. Hence, our tool decreases the power consumption of the HPC resources.","PeriodicalId":72112,"journal":{"name":"ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences","volume":"2021 ","pages":"20-27"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9906793/pdf/nihms-1831252.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10760547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Facilitating large data management in research contexts. 促进研究环境中的大数据管理。

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

Pub Date : 2021-01-01 Epub Date: 2021-10-03

Daniel Andresen, Gerrick Teague

Research data management is becoming increasingly complex as the amount of data, metadata and code increases. Often, researchers must obtain multidisciplinary skills to acquire, transfer, share, and compute large datasets. In this paper we present the results of an investigation into providing a familiar web-based experience for researchers to manage their data and code, leveraging popular, well-funded tools and services. We show how researchers can save time and avoid mistakes, and we provide a detailed discussion of our system architecture and implementation, and summarize the new capabilities, and time savings which can be achieved.

随着数据、元数据和代码数量的增加，研究数据管理变得越来越复杂。通常，研究人员必须获得多学科技能来获取、转移、共享和计算大型数据集。在本文中，我们展示了一项调查的结果，该调查旨在为研究人员提供一种熟悉的基于web的体验，以利用流行的、资金充足的工具和服务来管理他们的数据和代码。我们向研究人员展示了如何节省时间和避免错误，并详细讨论了我们的系统架构和实现，并总结了可以实现的新功能和节省的时间。

引用次数: 0

Message from the Program Chairs and Industry Panel Chairs 来自项目主席和行业小组主席的信息

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

Pub Date : 2020-01-01 DOI: 10.1109/ACOMP50827.2020.00005

M. Marchese, Lam-Son Lê, Bob Dao, M. Toulouse, N. Thoai

引用次数: 0

Low-Complexity Encryption Algorithm Considering Energy Balance on Wireless Sensor Networks 考虑能量平衡的无线传感器网络低复杂度加密算法

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

Pub Date : 2019-11-01 DOI: 10.1109/ACOMP.2019.00025

P. N. Huu, Q. Minh, Hieu Nguyen Trong

This paper proposes an effective key-generation scheme applying to data encryption standard (DES) algorithm for wireless sensor networks (WSNs). In the scheme, data encryption is divided into several tasks for multiple nodes along a path from a source node to the base station. We perform simulations to compare distribution and centralization models. The results show that the distributed model obtains more balances in energy consumption compared to the centralization model. The proposed key management method also improves the security level of data by increasing the number of keys with a simple algorithm in WSNs.

提出了一种适用于无线传感器网络数据加密标准DES算法的有效密钥生成方案。在该方案中，数据加密沿着从源节点到基站的路径分为多个节点的多个任务。我们执行模拟来比较分布和集中化模型。结果表明，与集中式模型相比，分布式模型在能源消耗方面更加平衡。所提出的密钥管理方法还通过一种简单的算法增加wsn中密钥的数量，从而提高了数据的安全级别。

引用次数: 1

Building a Product Origins Tracking System Based on Blockchain and PoA Consensus Protocol 构建基于区块链和PoA共识协议的产品来源跟踪系统

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

Pub Date : 2019-11-01 DOI: 10.1109/ACOMP.2019.00012

A. An, P. Diem, L. Lan, Tran Van Toi, Lam Quoc Huy Le Nguyen Binh

In recent years, the traceability of product origins is strongly concerned, particularly for food products as they directly influence human health. Therefore, there have been some efforts to develop product origins tracking systems. In this paper, we propose an approach to building a supply chain management system based on the blockchain technology for agriculture product origins tracking. The supply chain model is borrowed from Walmart's and it is implemented based on the Ethereum framework using the PoA (Proof of Authority) consensus algorithm. Our experiment shows that the proposed system not only fulfills the requirements of a product origins tracking but also takes the advantages of the blockchain technology such as the immutability and security of data, the low cost in making the transactions, and so on.

近年来，产品来源的可追溯性受到强烈关注，特别是食品，因为它们直接影响人类健康。因此，已经有一些努力开发产品来源跟踪系统。本文提出了一种基于区块链技术构建农产品原产地跟踪供应链管理系统的方法。供应链模型借鉴了沃尔玛的模式，并基于以太坊框架，使用PoA(权威证明)共识算法实现。实验表明，该系统不仅满足了产品原产地跟踪的要求，而且充分利用了区块链技术的数据不变性和安全性、交易成本低等优点。

引用次数: 15

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ADVCOMP ... the ... International Conference on Advanced Engineering Computing and Applications in Sciences

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀