气候和天气中I/O感知工作流程的潜力

J. Kunkel, L. Pedro
{"title":"气候和天气中I/O感知工作流程的潜力","authors":"J. Kunkel, L. Pedro","doi":"10.14529/jsfi200203","DOIUrl":null,"url":null,"abstract":"The efficient, convenient, and robust execution of data-driven workflows and enhanced data  management are essential for productivity in scientific computing. In HPC, the concerns of storage  and computing are traditionally separated and optimised independently from each other and the  needs of the end-to-end user. However, in complex workflows, this is becoming problematic. These  problems are particularly acute in climate and weather workflows, which as well as becoming  increasingly complex and exploiting deep storage hierarchies, can involve multiple data centres. The key contributions of this paper are: 1) A sketch of a vision for an integrated data-driven  approach, with a discussion of the associated challenges and implications, and 2) An architecture  and roadmap consistent with this vision that would allow a seamless integration into current  climate and weather workflows as it utilises versions of existing tools (ESDM, Cylc, XIOS, and  DDN’s IME). The vision proposed here is built on the belief that workflows composed of data, computing,  and communication-intensive tasks should drive interfaces and hardware configurations to  better support the programming models. When delivered, this work will increase the opportunity  for smarter scheduling of computing by considering storage in heterogeneous storage systems.  We illustrate the performance-impact on an example workload using a model built on measured  performance data using ESDM at DKRZ.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Potential of I/O Aware Workflows in Climate and Weather\",\"authors\":\"J. Kunkel, L. Pedro\",\"doi\":\"10.14529/jsfi200203\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The efficient, convenient, and robust execution of data-driven workflows and enhanced data  management are essential for productivity in scientific computing. In HPC, the concerns of storage  and computing are traditionally separated and optimised independently from each other and the  needs of the end-to-end user. However, in complex workflows, this is becoming problematic. These  problems are particularly acute in climate and weather workflows, which as well as becoming  increasingly complex and exploiting deep storage hierarchies, can involve multiple data centres. The key contributions of this paper are: 1) A sketch of a vision for an integrated data-driven  approach, with a discussion of the associated challenges and implications, and 2) An architecture  and roadmap consistent with this vision that would allow a seamless integration into current  climate and weather workflows as it utilises versions of existing tools (ESDM, Cylc, XIOS, and  DDN’s IME). The vision proposed here is built on the belief that workflows composed of data, computing,  and communication-intensive tasks should drive interfaces and hardware configurations to  better support the programming models. When delivered, this work will increase the opportunity  for smarter scheduling of computing by considering storage in heterogeneous storage systems.  We illustrate the performance-impact on an example workload using a model built on measured  performance data using ESDM at DKRZ.\",\"PeriodicalId\":338883,\"journal\":{\"name\":\"Supercomput. Front. Innov.\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Supercomput. Front. Innov.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14529/jsfi200203\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Supercomput. Front. Innov.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14529/jsfi200203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

高效、方便、健壮地执行数据驱动工作流和增强的数据管理对于科学计算的生产力至关重要。在HPC中,存储和计算的关注点传统上是分开的,并且彼此独立优化,并且端到端用户的需求也是如此。然而,在复杂的工作流中,这就变得有问题了。这些问题在气候和天气工作流程中尤其严重,这些工作流程变得越来越复杂,需要利用深层存储层次结构,可能涉及多个数据中心。本文的主要贡献是:1)概述了集成数据驱动方法的愿景,并讨论了相关的挑战和影响;2)与该愿景一致的架构和路线图,该架构和路线图将允许无缝集成到当前的气候和天气工作流程中,因为它利用了现有工具的版本(ESDM, Cylc, XIOS和DDN的IME)。这里提出的愿景是基于这样一种信念,即由数据、计算和通信密集型任务组成的工作流应该驱动接口和硬件配置,以更好地支持编程模型。当交付时,这项工作将通过考虑异构存储系统中的存储来增加智能调度计算的机会。我们使用在DKRZ使用ESDM测量的性能数据上构建的模型来说明对示例工作负载的性能影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Potential of I/O Aware Workflows in Climate and Weather
The efficient, convenient, and robust execution of data-driven workflows and enhanced data  management are essential for productivity in scientific computing. In HPC, the concerns of storage  and computing are traditionally separated and optimised independently from each other and the  needs of the end-to-end user. However, in complex workflows, this is becoming problematic. These  problems are particularly acute in climate and weather workflows, which as well as becoming  increasingly complex and exploiting deep storage hierarchies, can involve multiple data centres. The key contributions of this paper are: 1) A sketch of a vision for an integrated data-driven  approach, with a discussion of the associated challenges and implications, and 2) An architecture  and roadmap consistent with this vision that would allow a seamless integration into current  climate and weather workflows as it utilises versions of existing tools (ESDM, Cylc, XIOS, and  DDN’s IME). The vision proposed here is built on the belief that workflows composed of data, computing,  and communication-intensive tasks should drive interfaces and hardware configurations to  better support the programming models. When delivered, this work will increase the opportunity  for smarter scheduling of computing by considering storage in heterogeneous storage systems.  We illustrate the performance-impact on an example workload using a model built on measured  performance data using ESDM at DKRZ.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Supercomputer-Based Modeling System for Short-Term Prediction of Urban Surface Air Quality River Routing in the INM RAS-MSU Land Surface Model: Numerical Scheme and Parallel Implementation on Hybrid Supercomputers Data Assimilation by Neural Network for Ocean Circulation: Parallel Implementation Multistage Iterative Method to Tackle Inverse Problems of Wave Tomography Machine Learning Approaches to Extreme Weather Events Forecast in Urban Areas: Challenges and Initial Results
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1