A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments.

IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, THEORY & METHODS International Journal of Parallel Programming Pub Date : 2008-04-01 DOI:10.1007/s10766-007-0068-8
George Teodoro, Tulio Tavares, Renato Ferreira, Tahsin Kurc, Wagner Meira, Dorgival Guedes, Tony Pan, Joel Saltz
{"title":"A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments.","authors":"George Teodoro,&nbsp;Tulio Tavares,&nbsp;Renato Ferreira,&nbsp;Tahsin Kurc,&nbsp;Wagner Meira,&nbsp;Dorgival Guedes,&nbsp;Tony Pan,&nbsp;Joel Saltz","doi":"10.1007/s10766-007-0068-8","DOIUrl":null,"url":null,"abstract":"<p><p>Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this work, we present a runtime support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components.</p>","PeriodicalId":14313,"journal":{"name":"International Journal of Parallel Programming","volume":"36 2","pages":"250-266"},"PeriodicalIF":0.9000,"publicationDate":"2008-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10766-007-0068-8","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Parallel Programming","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10766-007-0068-8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 5

Abstract

Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this work, we present a runtime support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分布式环境下科学工作流高效执行的运行时系统。
科学工作流系统的引入是为了响应来自多个科学领域的研究人员的需求,他们需要处理和分析越来越大的数据集。这些系统的设计很大程度上是基于这样的观察:数据分析应用程序可以组成数据计算的管道或网络。在这项工作中,我们提出了一个运行时支持系统,旨在促进分布式计算环境中的这种类型的计算。我们的系统针对数据密集型工作流程进行了优化,其中数据的有效管理和检索,数据处理和数据移动的协调以及中间结果的检查点是关键和具有挑战性的问题。我们的系统的实验评估表明,线性加速可以实现复杂的应用,这是一个由多个数据处理组件组成的网络。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Parallel Programming
International Journal of Parallel Programming 工程技术-计算机:理论方法
CiteScore
4.40
自引率
0.00%
发文量
15
审稿时长
>12 weeks
期刊介绍: International Journal of Parallel Programming is a forum for the publication of peer-reviewed, high-quality original papers in the computer and information sciences, focusing specifically on programming aspects of parallel computing systems. Such systems are characterized by the coexistence over time of multiple coordinated activities. The journal publishes both original research and survey papers. Fields of interest include: linguistic foundations, conceptual frameworks, high-level languages, evaluation methods, implementation techniques, programming support systems, pragmatic considerations, architectural characteristics, software engineering aspects, advances in parallel algorithms, performance studies, and application studies.
期刊最新文献
Accelerating Massively Distributed Deep Learning Through Efficient Pseudo-Synchronous Update Method A Hybrid Machine Learning Model for Code Optimization GPU-Based Algorithms for Processing the k Nearest-Neighbor Query on Spatial Data Using Partitioning and Concurrent Kernel Execution Calculation of Distributed-Order Fractional Derivative on Tensor Cores-Enabled GPU Partitioning-Aware Performance Modeling of Distributed Graph Processing Tasks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1