ARC: DVFS-aware asymmetric-retention STT-RAM caches for energy-efficient multicore processors

Proceedings of the International Symposium on Memory Systems Pub Date : 2019-09-30 DOI:10.1145/3357526.3357553

Dhruv Gajaria, Tosiron Adegbija

{"title":"ARC: DVFS-aware asymmetric-retention STT-RAM caches for energy-efficient multicore processors","authors":"Dhruv Gajaria, Tosiron Adegbija","doi":"10.1145/3357526.3357553","DOIUrl":null,"url":null,"abstract":"Relaxed retention (or volatile) spin-transfer torque RAM (STT-RAM) has been widely studied as a way to reduce STT-RAM's write energy and latency overheads. Given a relaxed retention time STT-RAM level one (L1) cache, we analyze the impacts of dynamic voltage and frequency scaling (DVFS)---a common optimization in modern processors---on STT-RAM L1 cache design. Our analysis reveals that, apart from the fact that different applications may require different retention times, the clock frequency, which is typically ignored in most STT-RAM studies, may also significantly impact applications' retention time needs. Based on our findings, we propose an asymmetric-retention core (ARC) design for multicore architectures. ARC features retention time heterogeneity to specialize STT-RAM retention times to applications' needs. We also propose a runtime prediction model to determine the best core on which to run an application, based on the applications' characteristics, their retention time requirements, and available DVFS settings. Results reveal that the proposed approach can reduce the average cache energy by 20.19% and overall processor energy by 7.66%, compared to a homogeneous STT-RAM cache design.","PeriodicalId":355696,"journal":{"name":"Proceedings of the International Symposium on Memory Systems","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Symposium on Memory Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3357526.3357553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Relaxed retention (or volatile) spin-transfer torque RAM (STT-RAM) has been widely studied as a way to reduce STT-RAM's write energy and latency overheads. Given a relaxed retention time STT-RAM level one (L1) cache, we analyze the impacts of dynamic voltage and frequency scaling (DVFS)---a common optimization in modern processors---on STT-RAM L1 cache design. Our analysis reveals that, apart from the fact that different applications may require different retention times, the clock frequency, which is typically ignored in most STT-RAM studies, may also significantly impact applications' retention time needs. Based on our findings, we propose an asymmetric-retention core (ARC) design for multicore architectures. ARC features retention time heterogeneity to specialize STT-RAM retention times to applications' needs. We also propose a runtime prediction model to determine the best core on which to run an application, based on the applications' characteristics, their retention time requirements, and available DVFS settings. Results reveal that the proposed approach can reduce the average cache energy by 20.19% and overall processor energy by 7.66%, compared to a homogeneous STT-RAM cache design.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ARC:用于节能多核处理器的可感知dvfs的非对称保留STT-RAM缓存

松弛保留(或易失性)自旋转移扭矩RAM (STT-RAM)作为一种降低STT-RAM写入能量和延迟开销的方法已被广泛研究。给定一个宽松的STT-RAM一级(L1)缓存保留时间，我们分析了动态电压和频率缩放(DVFS)-现代处理器中常见的优化-对STT-RAM L1缓存设计的影响。我们的分析表明，除了不同的应用程序可能需要不同的保留时间之外，时钟频率(在大多数STT-RAM研究中通常被忽略)也可能显著影响应用程序的保留时间需求。基于我们的发现，我们提出了一种多核架构的不对称保留核(ARC)设计。ARC具有保留时间异质性，可以根据应用程序的需要专门设计STT-RAM保留时间。我们还提出了一个运行时预测模型，根据应用程序的特性、它们的保留时间要求和可用的DVFS设置来确定运行应用程序的最佳核心。结果表明，与均匀的STT-RAM缓存设计相比，该方法可减少平均缓存能量20.19%，总处理器能量7.66%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the International Symposium on Memory Systems

自引率

0.00%

发文量

期刊最新文献

DuoMC: Tight DRAM Latency Bounds with Shared Banks and Near-COTS Performance An LPDDR4 Safety Model for Automotive Applications Zoned FTL: Achieve Resource Isolation via Hardware Virtualization Low-Latency Modulation Scheme for Solid State Drive Toward Computer Vision-based Machine Intelligent Hybrid Memory Management