Steven W. D. Chien, Kento Sato, Artur Podobas, Niclas Jansson, S. Markidis, Michio Honda
{"title":"Improving Cloud Storage Network Bandwidth Utilization of Scientific Applications","authors":"Steven W. D. Chien, Kento Sato, Artur Podobas, Niclas Jansson, S. Markidis, Michio Honda","doi":"10.1145/3600061.3603122","DOIUrl":null,"url":null,"abstract":"Cloud providers began to provide managed services to attract scientific applications, which have been traditionally executed on supercomputers. One example is AWS FSx for Lustre, a fully managed parallel file system (PFS) released in 2018. However, due to the nature of scientific applications, the frontend storage network bandwidth is left completely idle for the majority of its lifetime. Furthermore, the pricing model does not match the scalability requirement. We propose iFast, a novel host-side caching mechanism for scientific applications that improves storage bandwidth utilization and end-to-end application performance: by overlapping compute and data writeback through inexpensive local storage. iFast supports the Massage Passing Interface (MPI) library that is widely used by scientific applications and is implemented as a preloaded library. It requires no change to applications, the MPI library, or support from cloud operators. We demonstrate how iFast can accelerate the end-to-end time of a representative scientific application Neko, by 13–40%.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th Asia-Pacific Workshop on Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3600061.3603122","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Cloud providers began to provide managed services to attract scientific applications, which have been traditionally executed on supercomputers. One example is AWS FSx for Lustre, a fully managed parallel file system (PFS) released in 2018. However, due to the nature of scientific applications, the frontend storage network bandwidth is left completely idle for the majority of its lifetime. Furthermore, the pricing model does not match the scalability requirement. We propose iFast, a novel host-side caching mechanism for scientific applications that improves storage bandwidth utilization and end-to-end application performance: by overlapping compute and data writeback through inexpensive local storage. iFast supports the Massage Passing Interface (MPI) library that is widely used by scientific applications and is implemented as a preloaded library. It requires no change to applications, the MPI library, or support from cloud operators. We demonstrate how iFast can accelerate the end-to-end time of a representative scientific application Neko, by 13–40%.
云提供商开始提供托管服务,以吸引传统上在超级计算机上执行的科学应用程序。其中一个例子是2018年发布的完全托管并行文件系统(PFS) AWS FSx for Lustre。然而,由于科学应用的性质,前端存储网络带宽在其生命周期的大部分时间内是完全空闲的。此外,定价模型与可伸缩性需求不匹配。我们提出了iFast,一种用于科学应用的新型主机端缓存机制,它通过廉价的本地存储重叠计算和数据回写,提高了存储带宽利用率和端到端应用性能。iFast支持在科学应用中广泛使用的按摩传递接口(MPI)库,并作为预加载库实现。它不需要更改应用程序、MPI库或云运营商的支持。我们演示了iFast如何将具有代表性的科学应用程序Neko的端到端时间缩短13-40%。