Power optimizations for transport triggered SIMD processors

Joonas Multanen, T. Viitanen, Henry Linjamaki, Heikki O. Kultala, P. Jääskeläinen, J. Takala, L. Koskinen, Jesse Simonsson, H. Berg, K. Raiskila, Tommi Zetterman
{"title":"Power optimizations for transport triggered SIMD processors","authors":"Joonas Multanen, T. Viitanen, Henry Linjamaki, Heikki O. Kultala, P. Jääskeläinen, J. Takala, L. Koskinen, Jesse Simonsson, H. Berg, K. Raiskila, Tommi Zetterman","doi":"10.1109/SAMOS.2015.7363689","DOIUrl":null,"url":null,"abstract":"Power consumption in modern processor design is a key aspect. Optimizing the processor for power leads to direct savings in battery energy consumption in case of mobile devices. At the same time, many mobile applications demand high computational performance. In case of large scale computing, low power compute devices help in thermal design and in reducing the electricity bill. This paper presents a case study of a customized low power vector processor design that was synthesized on a 28 nm process technology. The processor has a programmer exposed datapath based on the transport triggered architecture programming model. The paper's focus is on the RTL and microarchitecture level power optimizations applied to the design. Using semiautomated interconnection network and register file optimization algorithm, up to 27% of power savings were achieved. Using this as a baseline and applying register file datapath gating, register file banking and enabling clock gating of individual pipeline stages in pipelined function units, up to 26% of power and energy savings could be achieved with only a 3% area overhead. On top of this, for the measured radio applications, the exposed datapath architecture helped to achieve approximately 18% power improvement in comparison to a VLIW-like architecture by utilizing optimizations unique to transport triggered architectures.","PeriodicalId":346802,"journal":{"name":"2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMOS.2015.7363689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Power consumption in modern processor design is a key aspect. Optimizing the processor for power leads to direct savings in battery energy consumption in case of mobile devices. At the same time, many mobile applications demand high computational performance. In case of large scale computing, low power compute devices help in thermal design and in reducing the electricity bill. This paper presents a case study of a customized low power vector processor design that was synthesized on a 28 nm process technology. The processor has a programmer exposed datapath based on the transport triggered architecture programming model. The paper's focus is on the RTL and microarchitecture level power optimizations applied to the design. Using semiautomated interconnection network and register file optimization algorithm, up to 27% of power savings were achieved. Using this as a baseline and applying register file datapath gating, register file banking and enabling clock gating of individual pipeline stages in pipelined function units, up to 26% of power and energy savings could be achieved with only a 3% area overhead. On top of this, for the measured radio applications, the exposed datapath architecture helped to achieve approximately 18% power improvement in comparison to a VLIW-like architecture by utilizing optimizations unique to transport triggered architectures.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
传输触发SIMD处理器的电源优化
功耗是现代处理器设计中的一个关键方面。在移动设备的情况下,优化处理器的功率可以直接节省电池的能耗。同时,许多移动应用对计算性能的要求也很高。在大规模计算的情况下,低功耗计算设备有助于热设计和减少电费。本文介绍了一个基于28纳米工艺技术合成的定制低功耗矢量处理器设计的案例研究。处理器有一个程序员公开的基于传输触发架构编程模型的数据路径。本文的重点是RTL和微架构级的功率优化应用于设计。采用半自动互连网络和注册文件优化算法,可节省高达27%的电力。以此为基准,在流水线功能单元中应用寄存器文件数据路径门控、寄存器文件银行和启用单个流水线阶段的时钟门控,可以实现高达26%的电力和能源节约,而面积开销仅为3%。最重要的是,对于测量的无线电应用程序,通过利用传输触发体系结构特有的优化,与类似vliw的体系结构相比,公开的数据路径体系结构帮助实现了大约18%的功率改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Deterministic event-based control of Virtual Platforms for MPSoC software debugging Dynamic re-vectorization of binary code Experiences in speeding up computer vision applications on mobile computing platforms A power estimation technique for cycle-accurate higher-abstraction SystemC-based CPU models Framework for parameter analysis of FPGA-based image processing architectures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1