Software Pipelining with Minimal Loop Overhead on Transport Triggered Architecture

Lei Jiang, Yongxin Zhu, Yipeng Wei
{"title":"Software Pipelining with Minimal Loop Overhead on Transport Triggered Architecture","authors":"Lei Jiang, Yongxin Zhu, Yipeng Wei","doi":"10.1109/ICESS.2008.18","DOIUrl":null,"url":null,"abstract":"On transport triggered architectures (TTAs) featuring huge scheduling freedom, parallelism is exploited at not only operation level, but also data transportation level. Software pipelining, an aggressive compiler optimization scheme for exploiting instruction level parallelism across loop iterations, has been studied extensively. However, only few efforts were focused on software pipelining on TTAs. In these existing works, intuitive yet less efficient methods were used, namely either modulo scheduling algorithm with some heuristics or parallel language to implement software pipelining on TTA. We propose a new software pipelining method on TTAs in order to fully evaluate the scope of scheduling freedom of TTA and take advantage of it. In this paper, we formulate the problem of constructing a resource constrained rate-optimal software pipelining with minimal loop overhead on TTAs as an integer linear programming (ILP) problem. The formulated problem is solved with GNU Linear Programming Kit (GLPK). We apply our approach to major loops in Livermore loop benchmarks. Comparing with the previous schedulers implemented with modulo scheduling algorithm, our ILP approach creates schedules which bring significant performance enhancement to applications on TTA.","PeriodicalId":278372,"journal":{"name":"2008 International Conference on Embedded Software and Systems","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Embedded Software and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICESS.2008.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

On transport triggered architectures (TTAs) featuring huge scheduling freedom, parallelism is exploited at not only operation level, but also data transportation level. Software pipelining, an aggressive compiler optimization scheme for exploiting instruction level parallelism across loop iterations, has been studied extensively. However, only few efforts were focused on software pipelining on TTAs. In these existing works, intuitive yet less efficient methods were used, namely either modulo scheduling algorithm with some heuristics or parallel language to implement software pipelining on TTA. We propose a new software pipelining method on TTAs in order to fully evaluate the scope of scheduling freedom of TTA and take advantage of it. In this paper, we formulate the problem of constructing a resource constrained rate-optimal software pipelining with minimal loop overhead on TTAs as an integer linear programming (ILP) problem. The formulated problem is solved with GNU Linear Programming Kit (GLPK). We apply our approach to major loops in Livermore loop benchmarks. Comparing with the previous schedulers implemented with modulo scheduling algorithm, our ILP approach creates schedules which bring significant performance enhancement to applications on TTA.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
传输触发体系结构中最小循环开销的软件流水线
在调度自由度较大的传输触发架构(TTAs)中,并行性不仅在操作层面得到充分利用,而且在数据传输层面得到充分利用。软件流水线是一种利用指令级并行性的编译器优化方案,已经得到了广泛的研究。然而,只有很少的努力集中在TTAs上的软件流水线上。在这些现有的工作中,使用直观但效率较低的方法,即带有启发式的模调度算法或并行语言来实现TTA上的软件流水线。为了充分评估和利用运输工具的调度自由范围,提出了一种新的运输工具的软件流水线化方法。在本文中,我们将构造一个资源受限的速率最优的软件流水线问题表述为整数线性规划(ILP)问题。用GNU线性规划工具包(GLPK)解决了这个公式化的问题。我们将我们的方法应用于利弗莫尔循环基准中的主要循环。与以前使用模调度算法实现的调度程序相比,我们的ILP方法创建的调度程序为TTA上的应用程序带来了显着的性能提升。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Efficient Multi-protocol RFID Interrogator Baseband Processor based on a Reconfigurable Architecture Schedulability Analysis for Fault-Tolerant Hard Real-Time Tasks with Arbitrary Large Deadlines Energy Efficiency of Scratch-Pad Memory at 65 nm and Below: An Empirical Study Formal Schedulability Analysis and Simulation for AADL An Entropy-based Trust Modeling and Evaluation for Wireless Sensor Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1