在移动处理器上使用两级分支预测器的性能改进

2015 IEEE International Conference on Consumer Electronics - Taiwan Pub Date : 2015-06-06 DOI:10.1109/ICCE-TW.2015.7217014

N. G. Kim, Hyun Hak Cho, C. M. Eun, O. H. Jeong

{"title":"在移动处理器上使用两级分支预测器的性能改进","authors":"N. G. Kim, Hyun Hak Cho, C. M. Eun, O. H. Jeong","doi":"10.1109/ICCE-TW.2015.7217014","DOIUrl":null,"url":null,"abstract":"In the last few years, the microprocessor of mobile device has been developed into multi-core, multi-issue, and deep pipeline for high-performance. But maximizing the parallelism of a pipeline and deepening pipeline cause more penalties so that branch predictor is more and more important. One of the efficient ways to improve its performance is generally to increase the size of the branch predictor. However, increasing the size of branch predictor necessarily involves the increase of memory indexing time and power consumption. Therefore, to improve the performance of branch predictor considering the mobile environment, we have studied the effect on performance improvement by changing the other factors while remaining the size of the memory fixed with 2-level branch predictor. We experimented to investigate the performance improvement by modifying the associativity of Branch Target Buffer (BTB) and the size of Branch History Register (BHR). The simulation was performed using SimpleScalar 3.0 based on ARM Cortex-A15 and benchmarks from SPEC CPU2000. As a result, in the 2-level branch predictor structure, when reducing the length of BHR in 8 bit, the increase of Instruction Per Cycle (IPC) is the highest as 0.599%. Therefore, we concluded that to modify the length of BHR of branch predictor is most effective way to improve the performance of the processor.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance improvement using two level branch predictor on the mobile processor\",\"authors\":\"N. G. Kim, Hyun Hak Cho, C. M. Eun, O. H. Jeong\",\"doi\":\"10.1109/ICCE-TW.2015.7217014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last few years, the microprocessor of mobile device has been developed into multi-core, multi-issue, and deep pipeline for high-performance. But maximizing the parallelism of a pipeline and deepening pipeline cause more penalties so that branch predictor is more and more important. One of the efficient ways to improve its performance is generally to increase the size of the branch predictor. However, increasing the size of branch predictor necessarily involves the increase of memory indexing time and power consumption. Therefore, to improve the performance of branch predictor considering the mobile environment, we have studied the effect on performance improvement by changing the other factors while remaining the size of the memory fixed with 2-level branch predictor. We experimented to investigate the performance improvement by modifying the associativity of Branch Target Buffer (BTB) and the size of Branch History Register (BHR). The simulation was performed using SimpleScalar 3.0 based on ARM Cortex-A15 and benchmarks from SPEC CPU2000. As a result, in the 2-level branch predictor structure, when reducing the length of BHR in 8 bit, the increase of Instruction Per Cycle (IPC) is the highest as 0.599%. Therefore, we concluded that to modify the length of BHR of branch predictor is most effective way to improve the performance of the processor.\",\"PeriodicalId\":340402,\"journal\":{\"name\":\"2015 IEEE International Conference on Consumer Electronics - Taiwan\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Consumer Electronics - Taiwan\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCE-TW.2015.7217014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Consumer Electronics - Taiwan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-TW.2015.7217014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

近年来，移动设备微处理器向着多核、多议题、高性能深流水线的方向发展。但是最大化管道的并行性和加深管道会带来更多的代价，因此分支预测器变得越来越重要。提高其性能的有效方法之一通常是增加分支预测器的大小。然而，增加分支预测器的大小必然会增加内存索引时间和功耗。因此，为了在考虑移动环境的情况下提高分支预测器的性能，我们研究了在保持2级分支预测器内存大小不变的情况下，改变其他因素对性能提高的影响。我们通过修改分支目标缓冲区(BTB)的关联性和分支历史寄存器(BHR)的大小来研究性能的提高。仿真采用基于ARM Cortex-A15的SimpleScalar 3.0和SPEC CPU2000的基准测试进行。结果表明，在2级分支预测器结构中，当减少BHR长度为8位时，每周期指令(IPC)的增幅最高，为0.599%。因此，我们认为修改分支预测器的BHR长度是提高处理器性能的最有效方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Performance improvement using two level branch predictor on the mobile processor

In the last few years, the microprocessor of mobile device has been developed into multi-core, multi-issue, and deep pipeline for high-performance. But maximizing the parallelism of a pipeline and deepening pipeline cause more penalties so that branch predictor is more and more important. One of the efficient ways to improve its performance is generally to increase the size of the branch predictor. However, increasing the size of branch predictor necessarily involves the increase of memory indexing time and power consumption. Therefore, to improve the performance of branch predictor considering the mobile environment, we have studied the effect on performance improvement by changing the other factors while remaining the size of the memory fixed with 2-level branch predictor. We experimented to investigate the performance improvement by modifying the associativity of Branch Target Buffer (BTB) and the size of Branch History Register (BHR). The simulation was performed using SimpleScalar 3.0 based on ARM Cortex-A15 and benchmarks from SPEC CPU2000. As a result, in the 2-level branch predictor structure, when reducing the length of BHR in 8 bit, the increase of Instruction Per Cycle (IPC) is the highest as 0.599%. Therefore, we concluded that to modify the length of BHR of branch predictor is most effective way to improve the performance of the processor.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 IEEE International Conference on Consumer Electronics - Taiwan

自引率

0.00%

发文量

期刊最新文献

A fuzzy-rough set based ontology for hybrid recommendation Monitoring system of patient position based on wireless body area sensor network Automation control algorithms in gas mixture for preterm infant oxygen therapy Interframe hole filling for DIBR in 3D videos Automatic recognition of audio event using dynamic local binary patterns