Evaluating parameter sweep workflows in high performance computing

SWEET '12 Pub Date : 2012-05-20 DOI:10.1145/2443416.2443418

F. Chirigati, V. S. Sousa, Eduardo S. Ogasawara, Daniel de Oliveira, Jonas Dias, F. Porto, P. Valduriez, M. Mattoso

{"title":"Evaluating parameter sweep workflows in high performance computing","authors":"F. Chirigati, V. S. Sousa, Eduardo S. Ogasawara, Daniel de Oliveira, Jonas Dias, F. Porto, P. Valduriez, M. Mattoso","doi":"10.1145/2443416.2443418","DOIUrl":null,"url":null,"abstract":"Scientific experiments based on computer simulations can be defined, executed and monitored using Scientific Workflow Management Systems (SWfMS). Several SWfMS are available, each with a different goal and a different engine. Due to the exploratory analysis, scientists need to run parameter sweep (PS) workflows, which are workflows that are invoked repeatedly using different input data. These workflows generate a large amount of tasks that are submitted to High Performance Computing (HPC) environments. Different execution models for a workflow may have significant differences in performance in HPC. However, selecting the best execution model for a given workflow is difficult due to the existence of many characteristics of the workflow that may affect the parallel execution. We developed a study to show performance impacts of using different execution models in running PS workflows in HPC. Our study contributes by presenting a characterization of PS workflow patterns (the basis for many existing scientific workflows) and its behavior under different execution models in HPC. We evaluated four execution models to run workflows in parallel. Our study measures the performance behavior of small, large and complex workflows among the evaluated execution models. The results can be used as a guideline to select the best model for a given scientific workflow execution in HPC. Our evaluation may also serve as a basis for workflow designers to analyze the expected behavior of an HPC workflow engine based on the characteristics of PS workflows.","PeriodicalId":143151,"journal":{"name":"SWEET '12","volume":"42 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SWEET '12","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2443416.2443418","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Scientific experiments based on computer simulations can be defined, executed and monitored using Scientific Workflow Management Systems (SWfMS). Several SWfMS are available, each with a different goal and a different engine. Due to the exploratory analysis, scientists need to run parameter sweep (PS) workflows, which are workflows that are invoked repeatedly using different input data. These workflows generate a large amount of tasks that are submitted to High Performance Computing (HPC) environments. Different execution models for a workflow may have significant differences in performance in HPC. However, selecting the best execution model for a given workflow is difficult due to the existence of many characteristics of the workflow that may affect the parallel execution. We developed a study to show performance impacts of using different execution models in running PS workflows in HPC. Our study contributes by presenting a characterization of PS workflow patterns (the basis for many existing scientific workflows) and its behavior under different execution models in HPC. We evaluated four execution models to run workflows in parallel. Our study measures the performance behavior of small, large and complex workflows among the evaluated execution models. The results can be used as a guideline to select the best model for a given scientific workflow execution in HPC. Our evaluation may also serve as a basis for workflow designers to analyze the expected behavior of an HPC workflow engine based on the characteristics of PS workflows.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高性能计算中参数扫描工作流的评估

基于计算机模拟的科学实验可以使用科学工作流管理系统(SWfMS)来定义、执行和监控。有几个SWfMS可用，每个SWfMS都有不同的目标和不同的引擎。由于探索性分析，科学家需要运行参数扫描(PS)工作流，这是使用不同输入数据重复调用的工作流。这些工作流产生大量的任务，这些任务被提交给高性能计算(HPC)环境。在HPC中，工作流的不同执行模型在性能上可能存在显著差异。然而，对于给定的工作流，选择最佳的执行模型是困难的，因为工作流存在许多可能影响并行执行的特征。我们进行了一项研究，以显示在HPC中运行PS工作流时使用不同的执行模型对性能的影响。我们的研究通过描述PS工作流模式(许多现有科学工作流的基础)及其在HPC中不同执行模型下的行为做出了贡献。我们评估了四个并行运行工作流的执行模型。我们的研究在评估的执行模型中测量了小型、大型和复杂工作流的性能行为。计算结果可作为在高性能计算中选择科学工作流执行的最佳模型的指导。我们的评估也可以作为工作流设计者分析基于PS工作流特征的HPC工作流引擎的预期行为的基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

SWEET '12

自引率

0.00%

发文量