Performance improvement using two level branch predictor on the mobile processor

N. G. Kim, Hyun Hak Cho, C. M. Eun, O. H. Jeong
{"title":"Performance improvement using two level branch predictor on the mobile processor","authors":"N. G. Kim, Hyun Hak Cho, C. M. Eun, O. H. Jeong","doi":"10.1109/ICCE-TW.2015.7217014","DOIUrl":null,"url":null,"abstract":"In the last few years, the microprocessor of mobile device has been developed into multi-core, multi-issue, and deep pipeline for high-performance. But maximizing the parallelism of a pipeline and deepening pipeline cause more penalties so that branch predictor is more and more important. One of the efficient ways to improve its performance is generally to increase the size of the branch predictor. However, increasing the size of branch predictor necessarily involves the increase of memory indexing time and power consumption. Therefore, to improve the performance of branch predictor considering the mobile environment, we have studied the effect on performance improvement by changing the other factors while remaining the size of the memory fixed with 2-level branch predictor. We experimented to investigate the performance improvement by modifying the associativity of Branch Target Buffer (BTB) and the size of Branch History Register (BHR). The simulation was performed using SimpleScalar 3.0 based on ARM Cortex-A15 and benchmarks from SPEC CPU2000. As a result, in the 2-level branch predictor structure, when reducing the length of BHR in 8 bit, the increase of Instruction Per Cycle (IPC) is the highest as 0.599%. Therefore, we concluded that to modify the length of BHR of branch predictor is most effective way to improve the performance of the processor.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Consumer Electronics - Taiwan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-TW.2015.7217014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the last few years, the microprocessor of mobile device has been developed into multi-core, multi-issue, and deep pipeline for high-performance. But maximizing the parallelism of a pipeline and deepening pipeline cause more penalties so that branch predictor is more and more important. One of the efficient ways to improve its performance is generally to increase the size of the branch predictor. However, increasing the size of branch predictor necessarily involves the increase of memory indexing time and power consumption. Therefore, to improve the performance of branch predictor considering the mobile environment, we have studied the effect on performance improvement by changing the other factors while remaining the size of the memory fixed with 2-level branch predictor. We experimented to investigate the performance improvement by modifying the associativity of Branch Target Buffer (BTB) and the size of Branch History Register (BHR). The simulation was performed using SimpleScalar 3.0 based on ARM Cortex-A15 and benchmarks from SPEC CPU2000. As a result, in the 2-level branch predictor structure, when reducing the length of BHR in 8 bit, the increase of Instruction Per Cycle (IPC) is the highest as 0.599%. Therefore, we concluded that to modify the length of BHR of branch predictor is most effective way to improve the performance of the processor.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在移动处理器上使用两级分支预测器的性能改进
近年来,移动设备微处理器向着多核、多议题、高性能深流水线的方向发展。但是最大化管道的并行性和加深管道会带来更多的代价,因此分支预测器变得越来越重要。提高其性能的有效方法之一通常是增加分支预测器的大小。然而,增加分支预测器的大小必然会增加内存索引时间和功耗。因此,为了在考虑移动环境的情况下提高分支预测器的性能,我们研究了在保持2级分支预测器内存大小不变的情况下,改变其他因素对性能提高的影响。我们通过修改分支目标缓冲区(BTB)的关联性和分支历史寄存器(BHR)的大小来研究性能的提高。仿真采用基于ARM Cortex-A15的SimpleScalar 3.0和SPEC CPU2000的基准测试进行。结果表明,在2级分支预测器结构中,当减少BHR长度为8位时,每周期指令(IPC)的增幅最高,为0.599%。因此,我们认为修改分支预测器的BHR长度是提高处理器性能的最有效方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A fuzzy-rough set based ontology for hybrid recommendation Monitoring system of patient position based on wireless body area sensor network Automation control algorithms in gas mixture for preterm infant oxygen therapy Interframe hole filling for DIBR in 3D videos Automatic recognition of audio event using dynamic local binary patterns
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1