AdaptMD: Balancing Space and Performance in NUMA Architectures With Adaptive Memory Deduplication

IF 3.8 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE IEEE Transactions on Computers Pub Date : 2024-03-14 DOI:10.1109/TC.2024.3375592

Lulu Yao;Yongkun Li;Patrick P. C. Lee;Xiaoyang Wang;Yinlong Xu

{"title":"AdaptMD: Balancing Space and Performance in NUMA Architectures With Adaptive Memory Deduplication","authors":"Lulu Yao;Yongkun Li;Patrick P. C. Lee;Xiaoyang Wang;Yinlong Xu","doi":"10.1109/TC.2024.3375592","DOIUrl":null,"url":null,"abstract":"Memory deduplication effectively relieves the memory space bottleneck by removing duplicate pages, especially in virtualized systems in which virtual machines run the same OS and similar applications. However, due to the non-uniform access latencies in NUMA architectures, memory deduplication poses a trade-off between memory savings and access performance: global deduplication across NUMA nodes realizes high memory savings, but leads to frequent cross-node remote access after deduplication and results in performance degradations. In contrast, local deduplication avoids remote access, but limits deduplication effectiveness. We design AdaptMD, an adaptive memory deduplication system that addresses the space-performance trade-off in NUMA architectures. AdaptMD leverages hotness awareness to globally deduplicate only cold pages to reduce remote access. It also migrates similar applications to the same NUMA node to allow local deduplication without remote access. We further make AdaptMD readily configurable to address various deployment scenarios. Experiments show that AdaptMD achieves high memory savings as in global deduplication, while achieving similar access performance as in local deduplication.","PeriodicalId":13087,"journal":{"name":"IEEE Transactions on Computers","volume":"73 6","pages":"1588-1602"},"PeriodicalIF":3.8000,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computers","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10466365/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Memory deduplication effectively relieves the memory space bottleneck by removing duplicate pages, especially in virtualized systems in which virtual machines run the same OS and similar applications. However, due to the non-uniform access latencies in NUMA architectures, memory deduplication poses a trade-off between memory savings and access performance: global deduplication across NUMA nodes realizes high memory savings, but leads to frequent cross-node remote access after deduplication and results in performance degradations. In contrast, local deduplication avoids remote access, but limits deduplication effectiveness. We design AdaptMD, an adaptive memory deduplication system that addresses the space-performance trade-off in NUMA architectures. AdaptMD leverages hotness awareness to globally deduplicate only cold pages to reduce remote access. It also migrates similar applications to the same NUMA node to allow local deduplication without remote access. We further make AdaptMD readily configurable to address various deployment scenarios. Experiments show that AdaptMD achieves high memory savings as in global deduplication, while achieving similar access performance as in local deduplication.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AdaptMD：利用自适应内存重复数据删除技术平衡 NUMA 架构的空间和性能

重复内存删除通过删除重复页面有效缓解了内存空间瓶颈，尤其是在虚拟机运行相同操作系统和类似应用程序的虚拟化系统中。然而，由于 NUMA 架构的访问延迟不均匀，重复数据删除需要在节省内存和访问性能之间做出权衡：跨 NUMA 节点的全局重复数据删除可节省大量内存，但会导致重复数据删除后频繁的跨节点远程访问，从而导致性能下降。相比之下，本地重复数据删除避免了远程访问，但却限制了重复数据删除的效果。我们设计的 AdaptMD 是一种自适应重复数据删除内存系统，可解决 NUMA 架构中空间与性能之间的权衡问题。AdaptMD 利用热度感知功能，只对冷页面进行全局重复数据删除，以减少远程访问。它还能将类似的应用程序迁移到相同的 NUMA 节点上，从而实现无需远程访问的本地重复数据删除。我们还进一步使 AdaptMD 易于配置，以应对各种部署场景。实验表明，AdaptMD 可以像全局重复数据删除一样节省大量内存，同时实现与本地重复数据删除类似的访问性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Computers 工程技术-工程：电子与电气

CiteScore

6.60

自引率

5.40%

发文量

199

审稿时长

6.0 months

期刊介绍： The IEEE Transactions on Computers is a monthly publication with a wide distribution to researchers, developers, technical managers, and educators in the computer field. It publishes papers on research in areas of current interest to the readers. These areas include, but are not limited to, the following: a) computer organizations and architectures; b) operating systems, software systems, and communication protocols; c) real-time systems and embedded systems; d) digital devices, computer components, and interconnection networks; e) specification, design, prototyping, and testing methods and tools; f) performance, fault tolerance, reliability, security, and testability; g) case studies and experimental and theoretical evaluations; and h) new and important applications and trends.

期刊最新文献

2025 Reviewers List Evaluation of Radiation Resilience, Performance, and Vmin of Sub-3 nm FSFET Based SRAM Arrays Dual-Pronged Deep Learning Preprocessing on Heterogeneous Platforms With CPU, Accelerator and CSD Latency Optimization in Hybrid Memory System for GNNs Fused FP8 Many-Terms Dot Product With Scaling and FP32 Accumulation