On Latency Awareness with Delayed Hits. Gil Einziger, Nadav Keren, Gabriel Scalosub. Proceedings of the 16th ACM International Conference on Systems and Storage, June 5, 2023. doi:10.1145/3579370.3594752

We consider a new locality pattern in the form of burstiness to improve cache effectiveness in workflows where items are requested in possibly infrequent yet costly batches. Adding a cache that handles only bursty items to existing state-of-the-art algorithms shows a significant improvement in overall average time per query.
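To make the idea in this abstract concrete, here is a minimal sketch of an auxiliary cache that admits only bursty items, sitting in front of an unchanged main cache. All names, thresholds, and the burst test itself are illustrative assumptions, not the paper's design.

```python
# Illustrative sketch (not the paper's algorithm): a small side cache that
# admits an item only once it has been requested `burst_k` times within a
# short time window, leaving everything else to the main cache policy.
import time
from collections import OrderedDict, defaultdict

class BurstCache:
    def __init__(self, capacity, burst_k=3, window=1.0):
        self.capacity = capacity
        self.burst_k = burst_k          # requests within `window` that count as a burst
        self.window = window            # seconds
        self.store = OrderedDict()      # LRU order among admitted bursty items
        self.recent = defaultdict(list) # recent request timestamps per key

    def _is_bursty(self, key, now):
        ts = [t for t in self.recent[key] if now - t <= self.window]
        ts.append(now)
        self.recent[key] = ts
        return len(ts) >= self.burst_k

    def get(self, key):
        if key in self.store:
            self.store.move_to_end(key)  # refresh LRU position
            return self.store[key]
        return None                      # miss: fall through to the main cache

    def put(self, key, value, now=None):
        now = time.monotonic() if now is None else now
        if not self._is_bursty(key, now):
            return False                 # not bursty: leave it to the main cache
        self.store[key] = value
        self.store.move_to_end(key)
        if len(self.store) > self.capacity:
            self.store.popitem(last=False)  # evict least-recently-used
        return True
```

A burst of three requests within the window admits the item; isolated requests never pollute the side cache.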
Predicting GPU Failures With High Precision Under Deep Learning Workloads. Heting Liu, Zhichao Li, Cheng Tan, Rongqiu Yang, Guohong Cao, Zherui Liu, Chuanxiong Guo. doi:10.1145/3579370.3594777

Graphics processing units (GPUs) are the de facto standard for processing deep learning (DL) tasks. In large-scale GPU clusters, GPU failures are inevitable and may cause severe consequences: they disrupt distributed training, crash inference services, and lead to service-level-agreement violations. In this paper, we study the problem of predicting GPU failures using machine learning (ML) models in order to mitigate their impact. We train prediction models on a four-month production dataset with 350 million entries at ByteDance. We observe that classic prediction models (GBDT, MLP, LSTM, and 1D-CNN) do not perform well: their predictions are inaccurate and unstable over time. We propose several techniques to improve the precision and stability of predictions, including parallel and cascade model-ensemble mechanisms and a sliding training method. We evaluate the proposed techniques on production workloads, where they improve prediction precision from 46.3% to 85.4%.
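The "sliding training" technique mentioned in this abstract can be sketched as periodically retraining on only the most recent window of samples, so the model tracks drift in failure patterns instead of averaging over stale history. The class and toy "model" below are illustrative assumptions, not the paper's implementation.

```python
# Sketch of sliding training: keep a bounded window of recent samples and
# retrain from scratch on that window whenever new data arrives, so the
# model follows distribution drift. Names here are hypothetical.
from collections import deque

class SlidingTrainer:
    def __init__(self, window_size, train_fn):
        self.window = deque(maxlen=window_size)  # old samples fall out automatically
        self.train_fn = train_fn                 # e.g. fits a GBDT/MLP in practice
        self.model = None

    def add_batch(self, samples):
        self.window.extend(samples)
        # Retrain on the current window only, not on all history.
        self.model = self.train_fn(list(self.window))

# Toy usage: the "model" is just the mean label of the window.
trainer = SlidingTrainer(window_size=4, train_fn=lambda s: sum(s) / len(s))
trainer.add_batch([0, 0, 1, 1])
trainer.add_batch([1, 1])   # the two oldest samples are evicted from the window
print(trainer.model)        # mean over the window [1, 1, 1, 1] -> 1.0
```

The key design choice is that eviction is implicit: `deque(maxlen=...)` discards the oldest samples as new ones arrive.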
Efficient Hashing of Sparse Virtual Disks. Nir Soffer, Erez Waisbard. doi:10.1145/3579370.3594748

Verifying the integrity of a file is a fundamental operation in file transfer. Common tools compute a short hash value that is sent along with the file, but computing this value requires reading the entire file; if the file is huge, the process is slow. We introduce blkhash, a novel hash algorithm optimized for disk images that is up to four orders of magnitude faster than commonly used tools. We implemented a new command-line tool and library that can be used in the virtualization space for verifying storage management operations. Our approach can significantly benefit use cases such as (1) very fast computation of a virtual disk's hash value in software-defined storage, and (2) verifying the content of an entire disk image as part of supply-chain integrity verification or in the context of confidential computing.
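One way a block-oriented hash can exploit sparseness is to hash fixed-size blocks independently and reuse a single precomputed digest for every all-zero block (a hole in a sparse image). The sketch below illustrates that general idea only; it is not the blkhash algorithm itself, and `sparse_hash` is a hypothetical name.

```python
# Illustrative sketch: hash a disk image block by block, substituting one
# precomputed digest for every all-zero block so holes cost almost nothing.
# This shows the shortcut idea, NOT the actual blkhash construction.
import hashlib

BLOCK = 64 * 1024
ZERO_DIGEST = hashlib.sha256(b"\0" * BLOCK).digest()  # computed once, reused

def sparse_hash(data):
    root = hashlib.sha256()
    for off in range(0, len(data), BLOCK):
        blk = data[off:off + BLOCK].ljust(BLOCK, b"\0")  # pad the tail block
        if blk.count(0) == BLOCK:          # all-zero block: skip hashing it
            root.update(ZERO_DIGEST)
        else:
            root.update(hashlib.sha256(blk).digest())
    return root.hexdigest()

image = b"\0" * BLOCK * 100 + b"payload"   # a mostly-sparse image
print(sparse_hash(image))
```

With a real sparse file, the zero-block branch never even reads the hole from disk, which is where the large speedups come from.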
Reducing The Virtual Memory Overhead in Nested Virtualization. Ori Ben Zur, Shai Bergman, M. Silberstein. doi:10.1145/3579370.3594765

Virtualization has become a critical aspect of modern computing, and with the advent of virtualization-based containers, fast nested virtualization has become increasingly important. Nested virtualization is implemented by emulating virtualization capabilities for the guest, which can result in significant overhead. Another source of overhead stems from the address translation mechanisms used to implement virtualization, which typically cause a mix of slower address translation, frequent guest traps, and loss of granularity in page tables. Our research focuses on guest-managed physical memory, using per-VM memory tags to check each VM's access permissions.
Elastic RAID: Implementing RAID over SSDs with Built-in Transparent Compression. Zheng Gu, Jiangpeng Li, Yong Peng, Yang Liu, T. Zhang. doi:10.1145/3579370.3594773

This paper studies how RAID (redundant array of independent disks) could take full advantage of modern SSDs (solid-state drives) with built-in transparent compression. In current practice, RAID users are forced to choose a specific RAID level (e.g., RAID 10 or RAID 5) with a fixed storage cost vs. speed performance trade-off. The commercial market is witnessing the emergence of a new family of SSDs that can internally perform hardware-based lossless compression on each 4KB LBA (logical block address) block, transparent to the host OS and user applications. Beyond straightforwardly reducing RAID storage cost, such SSDs make it possible to relieve RAID users from being locked into a fixed storage cost vs. speed performance trade-off. In particular, RAID systems could opportunistically leverage higher-than-expected runtime user data compressibility to enable dynamic RAID level conversion, improving speed performance without compromising effective storage capacity. This paper presents techniques to enable and optimize the practical implementation of such elastic RAID systems. We implemented a Linux software-based elastic RAID prototype that supports dynamic conversion between RAID 5 and RAID 10. Compared with a baseline software-based RAID 5, under sufficient runtime data compressibility to convert over 60% of user data from RAID 5 to RAID 10, the elastic RAID improves 4KB random write IOPS (I/Os per second) by 42% and 4KB random read IOPS in degraded mode by 46%, while maintaining the same effective storage capacity.
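The capacity argument behind elastic RAID can be illustrated with back-of-the-envelope arithmetic: if data compresses well enough, the space freed by compression can hold full mirror copies (RAID 10) instead of parity (RAID 5) without shrinking the advertised capacity. The function and numbers below are illustrative assumptions, not the paper's conversion policy.

```python
# Back-of-the-envelope check (illustrative, not the paper's policy):
# RAID 10 stores two compressed copies of the data, so mirroring fits
# whenever 2 * logical_size * compression_ratio <= physical capacity.
def can_mirror(logical_bytes, compression_ratio, physical_bytes):
    needed = 2 * logical_bytes * compression_ratio
    return needed <= physical_bytes

TiB = 2**40
# 1 TiB logical on 1.2 TiB physical: 2:1 compression makes mirroring fit...
print(can_mirror(TiB, 0.5, 1.2 * TiB))   # True
# ...but poorly compressible data (ratio 0.9) forces parity instead.
print(can_mirror(TiB, 0.9, 1.2 * TiB))   # False
```

This is why the conversion is opportunistic: the achievable level depends on runtime compressibility, which the system can only observe, not control.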
Speeding up reconstruction of declustered RAID with special mapping. Svetlana Lazareva, G. Petrunin. doi:10.1145/3579370.3594761

ZFS dRAID [2] is known to lay out data blocks using random permutations, and it is this initial condition that accelerates reconstruction. The question we set out to answer is whether there exists a special permutation that pushes reconstruction speed to its theoretical maximum. We present a solution that uses cyclic matrices for the data layout, currently the best way we have found to extract maximum benefit from the initial declustered RAID configuration.
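A cyclic-matrix layout of the kind this abstract alludes to can be sketched as follows: each row of the layout is the identity permutation of disk indices rotated by the row number, so every row is a permutation and rebuild reads spread evenly across disks. This illustrates the general construction only, not the authors' exact matrices.

```python
# Illustrative cyclic layout (not the paper's exact matrices): row r is
# the disk-index sequence rotated by r, so each row is a permutation of
# all disks and load spreads evenly during reconstruction.
def cyclic_layout(num_disks, num_rows):
    return [[(col + row) % num_disks for col in range(num_disks)]
            for row in range(num_rows)]

for row in cyclic_layout(5, 3):
    print(row)
# Row 0: [0, 1, 2, 3, 4]; row 1: [1, 2, 3, 4, 0]; row 2: [2, 3, 4, 0, 1].
```

Because the rotation is deterministic, the mapping needs no stored permutation tables, unlike a randomly permuted layout.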
Self-Adjusting Cache Advertisement and Selection. Itamar Cohen. doi:10.1145/3579370.3594754

We present a lightweight, self-adjusting algorithm for cache-content advertisement and cache selection. Our algorithm increases the hit ratio and mitigates wasteful, unnecessary cache accesses and cache-content advertisements.
ConfZNS: A Novel Emulator for Exploring Design Space of ZNS SSDs. Inho Song, Myounghoon Oh, B. Kim, Seehwan Yoo, Jaedong Lee, Jongmoo Choi. doi:10.1145/3579370.3594772

The ZNS (Zoned NameSpace) interface shifts much of the storage maintenance responsibility from the underlying SSDs (solid-state drives) to the host. In addition, it opens a new opportunity to exploit the internal parallelism of SSDs at both the hardware and software levels. By orchestrating the mapping between zones and SSD-internal resources and by controlling zone allocation among threads, ZNS SSDs provide a distinct performance trade-off between parallelism and isolation. To understand and explore the design space of ZNS SSDs, we present ConfZNS (Configurable ZNS), an easy-to-configure and timing-accurate emulator based on QEMU. ConfZNS allows users to investigate a variety of ZNS SSD internal architectures and how they perform with existing host software. We validate the accuracy of ConfZNS using real ZNS SSDs and explore the performance characteristics of different ZNS SSD designs with real-world applications and environments such as RocksDB, F2FS, and Docker.
Iterator Interface Extended LSM-tree-based KVSSD for Range Queries. Seungjin Lee, Chang-Gyu Lee, Donghyun Min, Inhyuk Park, Woosuk Chung, A. Sivasubramaniam, Youngjae Kim. doi:10.1145/3579370.3594775

Key-Value SSD (KVSSD) has shown great potential for several important classes of emerging data stores due to its high throughput and low latency. When designing a key-value store with range queries, an LSM-tree is considered a better choice than a hash table due to its key ordering. However, the design space for range queries in LSM-tree-based KVSSDs has yet to be explored, despite range queries being one of the most demanding features. In this paper, we investigate the design constraints in LSM-tree-based KVSSDs from the perspective of range queries and propose three design principles. Based on these principles, we present IterKVSSD, an Iterator interface extended LSM-tree-based KVSSD for range queries. We implement IterKVSSD on OpenSSD Cosmos+, and our evaluation shows that it increases range query throughput by up to 4.13× and 7.22× for random and sequential key distributions, respectively, compared to existing KVSSDs.
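The shape of an iterator-style range-query interface over a sorted key-value store can be sketched as below, in the spirit of the extension this abstract describes. The class and method names are hypothetical illustrations, not IterKVSSD's actual command set.

```python
# Minimal sketch of an iterator-style range query over a sorted KV store
# (hypothetical API, not IterKVSSD's actual interface): the iterator is
# positioned at the lower bound and yields entries in key order until it
# reaches the exclusive upper bound.
import bisect

class SortedKV:
    def __init__(self):
        self.keys, self.vals = [], []

    def put(self, k, v):
        i = bisect.bisect_left(self.keys, k)
        if i < len(self.keys) and self.keys[i] == k:
            self.vals[i] = v             # update existing key in place
        else:
            self.keys.insert(i, k)       # insert, keeping keys sorted
            self.vals.insert(i, v)

    def iter_range(self, lo, hi):
        # Open an iterator at lo; yield (key, value) pairs while key < hi.
        i = bisect.bisect_left(self.keys, lo)
        while i < len(self.keys) and self.keys[i] < hi:
            yield self.keys[i], self.vals[i]
            i += 1

kv = SortedKV()
for k in ["b", "d", "a", "c"]:
    kv.put(k, k.upper())
print(list(kv.iter_range("b", "d")))   # [('b', 'B'), ('c', 'C')]
```

The point of the iterator abstraction is that the device can stream results incrementally instead of materializing the whole range per query.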
A Smart Inhaler for Medication Adherence. Itai Dabran, Tom Sofer, N. Bitterman. doi:10.1145/3579370.3594744

Asthma is a common inflammatory condition affecting more than 7 million children in the US alone, and tens of millions more globally. Despite effective preventive medications, medication adherence in children and adolescents is often below 50% [1]. In this paper, we present a novel personalized IoT-based system, integrated into their daily life, for improving children's adherence to inhaler use.