Fast dependence analysis in a multimedia vectorizing compiler

P. Bulić, V. Gustin
{"title":"Fast dependence analysis in a multimedia vectorizing compiler","authors":"P. Bulić, V. Gustin","doi":"10.1109/IPDPS.2004.1303273","DOIUrl":null,"url":null,"abstract":"There are a number of data dependence tests that have been proposed in the literature. In each test there is a different trade-off between accuracy and efficiency. The most widely used approximate data dependence tests are the Banerjee inequality and the GCD test; whereas the Omega test is a well-known exact data dependence test. We consider parallelization for microprocessors with a multimedia extension (the short SIMD execution model). For the short SIMD parallelism extraction it is essential that, if dependency exists, then the dependence distance is greater than or equal to the number of data processed in the SIMD register. This implies that some loops that could not be vectorized on traditional vector processors can still be parallelized for the short SIMD execution. In all of these tests the parallelization would be prohibited when actually there is no parallelism restriction relating to the short SIMD execution model. We present a new, fast data dependence test for array references with linear subscripts, which is used in a vectorizing compiler for microprocessors with a multimedia extension. Our test is suitable for use in a dependence analyser that is organized as a series of tests, progressively increasing in accuracy, as a replacement for the GCD or Banerjee tests.","PeriodicalId":105726,"journal":{"name":"12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2004-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPS.2004.1303273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

There are a number of data dependence tests that have been proposed in the literature. In each test there is a different trade-off between accuracy and efficiency. The most widely used approximate data dependence tests are the Banerjee inequality and the GCD test; whereas the Omega test is a well-known exact data dependence test. We consider parallelization for microprocessors with a multimedia extension (the short SIMD execution model). For the short SIMD parallelism extraction it is essential that, if dependency exists, then the dependence distance is greater than or equal to the number of data processed in the SIMD register. This implies that some loops that could not be vectorized on traditional vector processors can still be parallelized for the short SIMD execution. In all of these tests the parallelization would be prohibited when actually there is no parallelism restriction relating to the short SIMD execution model. We present a new, fast data dependence test for array references with linear subscripts, which is used in a vectorizing compiler for microprocessors with a multimedia extension. Our test is suitable for use in a dependence analyser that is organized as a series of tests, progressively increasing in accuracy, as a replacement for the GCD or Banerjee tests.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多媒体矢量化编译器的快速依赖分析
文献中提出了许多数据依赖性测试。在每个测试中,在准确性和效率之间都有不同的权衡。最广泛使用的近似数据依赖检验是Banerjee不等式和GCD检验;而Omega检验是一个众所周知的精确数据依赖检验。我们考虑具有多媒体扩展(短SIMD执行模型)的微处理器的并行化。对于短SIMD并行性提取,如果存在依赖性,则依赖性距离大于或等于SIMD寄存器中处理的数据数是至关重要的。这意味着一些不能在传统矢量处理器上矢量化的循环仍然可以在短时间的SIMD执行中并行化。在所有这些测试中,当实际上没有与短SIMD执行模型相关的并行性限制时,并行化将被禁止。我们提出了一种新的、快速的线性下标数组引用的数据依赖性测试方法,并将其用于具有多媒体扩展的微处理器向量化编译器中。我们的测试适用于作为一系列测试组织的依赖分析器,其准确性逐渐增加,作为GCD或Banerjee测试的替代品。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Fast dependence analysis in a multimedia vectorizing compiler Empirical characterization of the latency of long asynchronous pipelines with data-dependent module delays Parallelization and comparison of 3D iterative reconstruction algorithms Efficient monitoring to detect wireless channel failures for MPI programs Exporting processing power of home embedded devices to global computing applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1