集群并行图形架构的全局分层Z空间算法

Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia Pub Date : 2006-11-29 DOI:10.1145/1174429.1174451

A. Santilli, Ewa Huebner

{"title":"集群并行图形架构的全局分层Z空间算法","authors":"A. Santilli, Ewa Huebner","doi":"10.1145/1174429.1174451","DOIUrl":null,"url":null,"abstract":"In this paper we present a new global hierarchical Z-space sort-last algorithm for cluster parallel graphics architectures that improves upon algorithms used so far for high performance super-graphics. The new algorithm bypasses limitations of sort-last tile based parallelization paradigms, and solves some known Z-space parallelization inefficiencies. The algorithm is implemented as a global hierarchical-Z system which allows GPUs to perform high frequency global intra-frame Z-culling and distributed final frame Z-determination. The new algorithm allows for full one-to-one process-GPU coupling with minimal inter-process and inter-GPU communications. This enables maximal input bandwidth, maximum GPU utilization levels, near optimal load balances and improved efficiency when scaled to larger configurations.","PeriodicalId":360852,"journal":{"name":"Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A global hierarchical Z space algorithm for cluster parallel graphics architectures\",\"authors\":\"A. Santilli, Ewa Huebner\",\"doi\":\"10.1145/1174429.1174451\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we present a new global hierarchical Z-space sort-last algorithm for cluster parallel graphics architectures that improves upon algorithms used so far for high performance super-graphics. The new algorithm bypasses limitations of sort-last tile based parallelization paradigms, and solves some known Z-space parallelization inefficiencies. The algorithm is implemented as a global hierarchical-Z system which allows GPUs to perform high frequency global intra-frame Z-culling and distributed final frame Z-determination. The new algorithm allows for full one-to-one process-GPU coupling with minimal inter-process and inter-GPU communications. This enables maximal input bandwidth, maximum GPU utilization levels, near optimal load balances and improved efficiency when scaled to larger configurations.\",\"PeriodicalId\":360852,\"journal\":{\"name\":\"Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1174429.1174451\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1174429.1174451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在本文中，我们提出了一种新的全局分层z空间排序算法，用于集群并行图形架构，改进了迄今为止用于高性能超级图形的算法。新算法绕过了基于最后排序的并行化范式的限制，并解决了一些已知的z空间并行化低效问题。该算法实现为全局分层z系统，允许gpu执行高频全局帧内z剔除和分布式最终帧z确定。新算法允许完全一对一的进程- gpu耦合与最小的进程间和gpu间通信。这可以实现最大的输入带宽，最大的GPU利用率水平，接近最佳负载平衡，并在扩展到更大的配置时提高效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A global hierarchical Z space algorithm for cluster parallel graphics architectures

In this paper we present a new global hierarchical Z-space sort-last algorithm for cluster parallel graphics architectures that improves upon algorithms used so far for high performance super-graphics. The new algorithm bypasses limitations of sort-last tile based parallelization paradigms, and solves some known Z-space parallelization inefficiencies. The algorithm is implemented as a global hierarchical-Z system which allows GPUs to perform high frequency global intra-frame Z-culling and distributed final frame Z-determination. The new algorithm allows for full one-to-one process-GPU coupling with minimal inter-process and inter-GPU communications. This enables maximal input bandwidth, maximum GPU utilization levels, near optimal load balances and improved efficiency when scaled to larger configurations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia

自引率

0.00%

发文量

期刊最新文献

Backwards State-space Reduction for Planning in Dynamic Knowledge Bases Entity-Linking via Graph-Distance Minimization Specifying and Executing Optimizations for Parallel Programs Graph Transformation Planning via Abstraction Strategic Port Graph Rewriting: An Interactive Modelling and Analysis Framework