物联网终端非易失缓存的弹性自适应预取

Mao Ni, Lan Chen, Xiaoran Hao, Chenji Liu, Yiheng Zhang, Ying Li
{"title":"物联网终端非易失缓存的弹性自适应预取","authors":"Mao Ni, Lan Chen, Xiaoran Hao, Chenji Liu, Yiheng Zhang, Ying Li","doi":"10.1587/elex.19.20220225","DOIUrl":null,"url":null,"abstract":"STT-RAM with high storage density, near-zero leakage energy and CMOS compatibility is regarded as a replacement for SRAM to build large-sized cache, which can effectively alleviate the “memory wall” and improve computing power of IoT terminals. The state-of-the-art Near-Side Prefetch Throttling (NST) oriented to SRAM cache can effectively hide the access latency of off-chip memory. However, it also shows some inadaptability to the long write latency and high write energy of STT-RAM cache. The NST algorithm can not timely alleviate the cache congestion caused by STT-RAM long write latency, moreover, if the STT-RAM cache is congested, adjusting the prefetch distance is invalid to improve the prefetch timeliness. In response to the above problems, this paper novelly proposes a periodic and real-time complementary prefetch algorithm called ENCP for STT-RAM cache. Experiments show that, compared to the best- performed STREAM prefetcher, ENCP can reduce the write energy of STTRAM cache by 8.3% on average and 23% the most and improve the CPU IPC performance by 0.46% on average and 3.1% the most. And the ENCP has better performance and lower dynamic energy compared with NST with almost the same hardware overhead.","PeriodicalId":13437,"journal":{"name":"IEICE Electron. Express","volume":"35 1","pages":"20220225"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Elastic adaptive prefetching for non-volatile cache in IoT terminals\",\"authors\":\"Mao Ni, Lan Chen, Xiaoran Hao, Chenji Liu, Yiheng Zhang, Ying Li\",\"doi\":\"10.1587/elex.19.20220225\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"STT-RAM with high storage density, near-zero leakage energy and CMOS compatibility is regarded as a replacement for SRAM to build large-sized cache, which can effectively alleviate the “memory wall” and improve computing power of IoT terminals. The state-of-the-art Near-Side Prefetch Throttling (NST) oriented to SRAM cache can effectively hide the access latency of off-chip memory. However, it also shows some inadaptability to the long write latency and high write energy of STT-RAM cache. The NST algorithm can not timely alleviate the cache congestion caused by STT-RAM long write latency, moreover, if the STT-RAM cache is congested, adjusting the prefetch distance is invalid to improve the prefetch timeliness. In response to the above problems, this paper novelly proposes a periodic and real-time complementary prefetch algorithm called ENCP for STT-RAM cache. Experiments show that, compared to the best- performed STREAM prefetcher, ENCP can reduce the write energy of STTRAM cache by 8.3% on average and 23% the most and improve the CPU IPC performance by 0.46% on average and 3.1% the most. And the ENCP has better performance and lower dynamic energy compared with NST with almost the same hardware overhead.\",\"PeriodicalId\":13437,\"journal\":{\"name\":\"IEICE Electron. Express\",\"volume\":\"35 1\",\"pages\":\"20220225\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEICE Electron. Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1587/elex.19.20220225\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Electron. Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/elex.19.20220225","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

存储密度高、泄漏能量接近于零、兼容CMOS的STT-RAM被认为可以替代SRAM构建大容量缓存,可以有效缓解“内存墙”,提高物联网终端的计算能力。面向SRAM缓存的近侧预取节流(NST)技术可以有效地隐藏片外存储器的访问延迟。但是,它对STT-RAM缓存的长写时延和高写能量也表现出一定的不适应性。NST算法不能及时缓解STT-RAM长写时延造成的缓存拥塞,如果STT-RAM缓存拥塞,通过调整预取距离来提高预取时效性是无效的。针对上述问题,本文提出了一种针对STT-RAM缓存的周期性实时互补预取算法ENCP。实验表明,与性能最好的STREAM预取器相比,ENCP可使STREAM缓存的写能量平均降低8.3%,最大降低23%;CPU IPC性能平均提高0.46%,最大提高3.1%。与NST相比,ENCP在硬件开销几乎相同的情况下具有更好的性能和更低的动态能量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Elastic adaptive prefetching for non-volatile cache in IoT terminals
STT-RAM with high storage density, near-zero leakage energy and CMOS compatibility is regarded as a replacement for SRAM to build large-sized cache, which can effectively alleviate the “memory wall” and improve computing power of IoT terminals. The state-of-the-art Near-Side Prefetch Throttling (NST) oriented to SRAM cache can effectively hide the access latency of off-chip memory. However, it also shows some inadaptability to the long write latency and high write energy of STT-RAM cache. The NST algorithm can not timely alleviate the cache congestion caused by STT-RAM long write latency, moreover, if the STT-RAM cache is congested, adjusting the prefetch distance is invalid to improve the prefetch timeliness. In response to the above problems, this paper novelly proposes a periodic and real-time complementary prefetch algorithm called ENCP for STT-RAM cache. Experiments show that, compared to the best- performed STREAM prefetcher, ENCP can reduce the write energy of STTRAM cache by 8.3% on average and 23% the most and improve the CPU IPC performance by 0.46% on average and 3.1% the most. And the ENCP has better performance and lower dynamic energy compared with NST with almost the same hardware overhead.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Erratum:A 240μW 17bit ENOB ΔΣ modulator using 2nd-order noise-shaped integrating quantizer [IEICE Electronics Express Vol. 19 (2022) No. 5 pp. 20220038] Model-free double fractional-order integral sliding mode control for permanent magnet synchronous motor based electric mopeds drive system Bayesian neural network based inductance calculations of wireless power transfer systems EDF laser displacement sensor based on bending characteristics of polarization-independent double-pass cascaded-chirped long-period fiber grating A five-level inverter based on differential structure
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1