ARC: DVFS-aware asymmetric-retention STT-RAM caches for energy-efficient multicore processors

Dhruv Gajaria, Tosiron Adegbija
{"title":"ARC: DVFS-aware asymmetric-retention STT-RAM caches for energy-efficient multicore processors","authors":"Dhruv Gajaria, Tosiron Adegbija","doi":"10.1145/3357526.3357553","DOIUrl":null,"url":null,"abstract":"Relaxed retention (or volatile) spin-transfer torque RAM (STT-RAM) has been widely studied as a way to reduce STT-RAM's write energy and latency overheads. Given a relaxed retention time STT-RAM level one (L1) cache, we analyze the impacts of dynamic voltage and frequency scaling (DVFS)---a common optimization in modern processors---on STT-RAM L1 cache design. Our analysis reveals that, apart from the fact that different applications may require different retention times, the clock frequency, which is typically ignored in most STT-RAM studies, may also significantly impact applications' retention time needs. Based on our findings, we propose an asymmetric-retention core (ARC) design for multicore architectures. ARC features retention time heterogeneity to specialize STT-RAM retention times to applications' needs. We also propose a runtime prediction model to determine the best core on which to run an application, based on the applications' characteristics, their retention time requirements, and available DVFS settings. Results reveal that the proposed approach can reduce the average cache energy by 20.19% and overall processor energy by 7.66%, compared to a homogeneous STT-RAM cache design.","PeriodicalId":355696,"journal":{"name":"Proceedings of the International Symposium on Memory Systems","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Symposium on Memory Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3357526.3357553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Relaxed retention (or volatile) spin-transfer torque RAM (STT-RAM) has been widely studied as a way to reduce STT-RAM's write energy and latency overheads. Given a relaxed retention time STT-RAM level one (L1) cache, we analyze the impacts of dynamic voltage and frequency scaling (DVFS)---a common optimization in modern processors---on STT-RAM L1 cache design. Our analysis reveals that, apart from the fact that different applications may require different retention times, the clock frequency, which is typically ignored in most STT-RAM studies, may also significantly impact applications' retention time needs. Based on our findings, we propose an asymmetric-retention core (ARC) design for multicore architectures. ARC features retention time heterogeneity to specialize STT-RAM retention times to applications' needs. We also propose a runtime prediction model to determine the best core on which to run an application, based on the applications' characteristics, their retention time requirements, and available DVFS settings. Results reveal that the proposed approach can reduce the average cache energy by 20.19% and overall processor energy by 7.66%, compared to a homogeneous STT-RAM cache design.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ARC:用于节能多核处理器的可感知dvfs的非对称保留STT-RAM缓存
松弛保留(或易失性)自旋转移扭矩RAM (STT-RAM)作为一种降低STT-RAM写入能量和延迟开销的方法已被广泛研究。给定一个宽松的STT-RAM一级(L1)缓存保留时间,我们分析了动态电压和频率缩放(DVFS)-现代处理器中常见的优化-对STT-RAM L1缓存设计的影响。我们的分析表明,除了不同的应用程序可能需要不同的保留时间之外,时钟频率(在大多数STT-RAM研究中通常被忽略)也可能显著影响应用程序的保留时间需求。基于我们的发现,我们提出了一种多核架构的不对称保留核(ARC)设计。ARC具有保留时间异质性,可以根据应用程序的需要专门设计STT-RAM保留时间。我们还提出了一个运行时预测模型,根据应用程序的特性、它们的保留时间要求和可用的DVFS设置来确定运行应用程序的最佳核心。结果表明,与均匀的STT-RAM缓存设计相比,该方法可减少平均缓存能量20.19%,总处理器能量7.66%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
DuoMC: Tight DRAM Latency Bounds with Shared Banks and Near-COTS Performance An LPDDR4 Safety Model for Automotive Applications Zoned FTL: Achieve Resource Isolation via Hardware Virtualization Low-Latency Modulation Scheme for Solid State Drive Toward Computer Vision-based Machine Intelligent Hybrid Memory Management
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1