Improving Storage Systems Using Machine Learning

IF 2.6 3区计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE ACM Transactions on Storage Pub Date : 2022-11-21 DOI:10.1145/3568429

I. Akgun, A. S. Aydin, Andrew Burford, Michael McNeill, Michael Arkhangelskiy, E. Zadok

{"title":"Improving Storage Systems Using Machine Learning","authors":"I. Akgun, A. S. Aydin, Andrew Burford, Michael McNeill, Michael Arkhangelskiy, E. Zadok","doi":"10.1145/3568429","DOIUrl":null,"url":null,"abstract":"Operating systems include many heuristic algorithms designed to improve overall storage performance and throughput. Because such heuristics cannot work well for all conditions and workloads, system designers resorted to exposing numerous tunable parameters to users—thus burdening users with continually optimizing their own storage systems and applications. Storage systems are usually responsible for most latency in I/O-heavy applications, so even a small latency improvement can be significant. Machine learning (ML) techniques promise to learn patterns, generalize from them, and enable optimal solutions that adapt to changing workloads. We propose that ML solutions become a first-class component in OSs and replace manual heuristics to optimize storage systems dynamically. In this article, we describe our proposed ML architecture, called KML. We developed a prototype KML architecture and applied it to two case studies: optimizing readahead and NFS read-size values. Our experiments show that KML consumes less than 4 KB of dynamic kernel memory, has a CPU overhead smaller than 0.2%, and yet can learn patterns and improve I/O throughput by as much as 2.3× and 15× for two case studies—even for complex, never-seen-before, concurrently running mixed workloads on different storage devices.","PeriodicalId":49113,"journal":{"name":"ACM Transactions on Storage","volume":"19 1","pages":"1 - 30"},"PeriodicalIF":2.6000,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Storage","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3568429","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 2

Abstract

Operating systems include many heuristic algorithms designed to improve overall storage performance and throughput. Because such heuristics cannot work well for all conditions and workloads, system designers resorted to exposing numerous tunable parameters to users—thus burdening users with continually optimizing their own storage systems and applications. Storage systems are usually responsible for most latency in I/O-heavy applications, so even a small latency improvement can be significant. Machine learning (ML) techniques promise to learn patterns, generalize from them, and enable optimal solutions that adapt to changing workloads. We propose that ML solutions become a first-class component in OSs and replace manual heuristics to optimize storage systems dynamically. In this article, we describe our proposed ML architecture, called KML. We developed a prototype KML architecture and applied it to two case studies: optimizing readahead and NFS read-size values. Our experiments show that KML consumes less than 4 KB of dynamic kernel memory, has a CPU overhead smaller than 0.2%, and yet can learn patterns and improve I/O throughput by as much as 2.3× and 15× for two case studies—even for complex, never-seen-before, concurrently running mixed workloads on different storage devices.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用机器学习改进存储系统

操作系统包括许多启发式算法，这些算法旨在提高整体存储性能和吞吐量。由于这种启发式方法不能很好地适用于所有条件和工作负载，系统设计者不得不向用户公开许多可调参数，从而给用户带来不断优化自己的存储系统和应用程序的负担。在I/O密集型应用程序中，存储系统通常是造成大多数延迟的原因，因此即使是一个小的延迟改进也可能意义重大。机器学习（ML）技术承诺学习模式，从中归纳，并实现适应不断变化的工作负载的最佳解决方案。我们建议ML解决方案成为操作系统中的一流组件，并取代手动启发式来动态优化存储系统。在本文中，我们描述了我们提出的ML架构，称为KML。我们开发了一个原型KML体系结构，并将其应用于两个案例研究：优化预读和NFS读取大小值。我们的实验表明，KML消耗的动态内核内存不到4 KB，CPU开销小于0.2%，但在两个案例研究中，它可以学习模式并将I/O吞吐量提高2.3倍和15倍——即使是以前从未见过的复杂的、在不同存储设备上同时运行混合工作负载的情况。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Storage COMPUTER SCIENCE, HARDWARE & ARCHITECTURE-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

4.20

自引率

5.90%

发文量

审稿时长

>12 weeks

期刊介绍： The ACM Transactions on Storage (TOS) is a new journal with an intent to publish original archival papers in the area of storage and closely related disciplines. Articles that appear in TOS will tend either to present new techniques and concepts or to report novel experiences and experiments with practical systems. Storage is a broad and multidisciplinary area that comprises of network protocols, resource management, data backup, replication, recovery, devices, security, and theory of data coding, densities, and low-power. Potential synergies among these fields are expected to open up new research directions.