POEM: Performance Optimization and Endurance Management for Non-volatile Caches

IF 2 4区计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE ACM Transactions on Design Automation of Electronic Systems Pub Date : 2024-03-27 DOI:10.1145/3653452

Aritra Bagchi, Dharamjeet, Ohm Rishabh, Manan Suri, Preeti Ranjan Panda

{"title":"POEM: Performance Optimization and Endurance Management for Non-volatile Caches","authors":"Aritra Bagchi, Dharamjeet, Ohm Rishabh, Manan Suri, Preeti Ranjan Panda","doi":"10.1145/3653452","DOIUrl":null,"url":null,"abstract":"Non-volatile memories (NVMs) with their high storage density and ultra-low leakage power offer promising potential for redesigning the memory hierarchy in next-generation Multi-Processor Systems-on-Chip (MPSoCs). However, the adoption of NVMs in cache designs introduces challenges such as NVM write overheads and limited NVM endurance. The shared NVM cache in an MPSoC experiences requests from different processor cores and responses from the off-chip memory when the requested data is not present in the cache. Besides, upon evictions of dirty data from higher-level caches, the shared NVM cache experiences another source of write operations, known as writebacks. These sources of write operations: writebacks and responses, further exacerbate the contention for the shared bandwidth of the NVM cache, and create significant performance bottlenecks. Uncontrolled write operations can also affect the endurance of the NVM cache, posing a threat to cache lifetime and system reliability. Existing strategies often address either performance or cache endurance individually, leaving a gap for a holistic solution. This study introduces the Performance Optimization and Endurance Management (POEM) methodology, a novel approach that aggressively bypasses cache writebacks and responses to alleviate the NVM cache contention. Contrary to the existing bypass policies which do not pay adequate attention to the shared NVM cache contention, and focus too much on cache data reuse, POEM’s aggressive bypass significantly improves the overall system performance, even at the expense of data reuse. POEM also employs effective wear leveling to enhance the NVM cache endurance by careful redistribution of write operations across different cache lines. Across diverse workloads, POEM yields an average speedup of \\(34\\% \\) over a naïve baseline and \\(28.8\\% \\) over a state-of-the-art NVM cache bypass technique, while enhancing the cache endurance by \\(15\\% \\) over the baseline. POEM also explores diverse design choices by exploiting a key policy parameter that assigns varying priorities to the two system-level objectives.","PeriodicalId":50944,"journal":{"name":"ACM Transactions on Design Automation of Electronic Systems","volume":"34 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Design Automation of Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3653452","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Non-volatile memories (NVMs) with their high storage density and ultra-low leakage power offer promising potential for redesigning the memory hierarchy in next-generation Multi-Processor Systems-on-Chip (MPSoCs). However, the adoption of NVMs in cache designs introduces challenges such as NVM write overheads and limited NVM endurance. The shared NVM cache in an MPSoC experiences requests from different processor cores and responses from the off-chip memory when the requested data is not present in the cache. Besides, upon evictions of dirty data from higher-level caches, the shared NVM cache experiences another source of write operations, known as writebacks. These sources of write operations: writebacks and responses, further exacerbate the contention for the shared bandwidth of the NVM cache, and create significant performance bottlenecks. Uncontrolled write operations can also affect the endurance of the NVM cache, posing a threat to cache lifetime and system reliability. Existing strategies often address either performance or cache endurance individually, leaving a gap for a holistic solution. This study introduces the Performance Optimization and Endurance Management (POEM) methodology, a novel approach that aggressively bypasses cache writebacks and responses to alleviate the NVM cache contention. Contrary to the existing bypass policies which do not pay adequate attention to the shared NVM cache contention, and focus too much on cache data reuse, POEM’s aggressive bypass significantly improves the overall system performance, even at the expense of data reuse. POEM also employs effective wear leveling to enhance the NVM cache endurance by careful redistribution of write operations across different cache lines. Across diverse workloads, POEM yields an average speedup of \(34\% \) over a naïve baseline and \(28.8\% \) over a state-of-the-art NVM cache bypass technique, while enhancing the cache endurance by \(15\% \) over the baseline. POEM also explores diverse design choices by exploiting a key policy parameter that assigns varying priorities to the two system-level objectives.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

POEM：非易失性高速缓存的性能优化和耐久性管理

非易失性存储器（NVM）具有存储密度高、漏电功率超低的特点，为重新设计下一代多处理器片上系统（MPSoC）的存储器层次结构提供了广阔的前景。然而，在高速缓存设计中采用 NVM 会带来一些挑战，如 NVM 写入开销和有限的 NVM 耐用性。当请求的数据不在高速缓存中时，MPSoC 中的共享 NVM 高速缓存会受到来自不同处理器内核的请求和来自片外内存的响应。此外，从上一级高速缓存中驱逐脏数据时，共享 NVM 高速缓存还会经历另一个写操作源，即回写。这些写操作源（回写和响应）进一步加剧了对 NVM 高速缓存共享带宽的争夺，并造成严重的性能瓶颈。不受控制的写操作还会影响 NVM 缓存的耐用性，对缓存寿命和系统可靠性构成威胁。现有的策略通常是单独解决性能或高速缓存耐久性问题，这就为整体解决方案留下了空白。本研究介绍了性能优化和耐久性管理（POEM）方法，这是一种积极绕过高速缓存回写和响应以缓解 NVM 高速缓存争用的新方法。现有的旁路策略没有充分关注共享的 NVM 缓存争用问题，而是过于关注缓存数据的重用，与此相反，POEM 的积极旁路策略即使以牺牲数据重用为代价，也能显著提高系统的整体性能。POEM 还采用了有效的损耗均衡技术，通过在不同缓存行之间谨慎地重新分配写入操作来提高 NVM 缓存的耐用性。在各种不同的工作负载中，POEM的平均速度比原始基线提高了34%，比最先进的NVM缓存旁路技术提高了28.8%，同时缓存耐用性比基线提高了15%。POEM 还利用关键策略参数为两个系统级目标分配了不同的优先级，从而探索了多样化的设计选择。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Design Automation of Electronic Systems 工程技术-计算机：软件工程

CiteScore

3.20

自引率

7.10%

发文量

105

审稿时长

3 months

期刊介绍： TODAES is a premier ACM journal in design and automation of electronic systems. It publishes innovative work documenting significant research and development advances on the specification, design, analysis, simulation, testing, and evaluation of electronic systems, emphasizing a computer science/engineering orientation. Both theoretical analysis and practical solutions are welcome.