可伸缩的环境遮挡

EGGH-HPG'12 Pub Date : 2012-06-25 DOI:10.2312/EGGH/HPG12/097-103

M. McGuire, Michael Mara, D. Luebke

{"title":"可伸缩的环境遮挡","authors":"M. McGuire, Michael Mara, D. Luebke","doi":"10.2312/EGGH/HPG12/097-103","DOIUrl":null,"url":null,"abstract":"This paper presents a set of architecture-aware performance and integration improvements for a recent screenspace ambient obscurance algorithm. These improvements collectively produce a 7 x performance increase at 2560 x1600, generalize the algorithm to both forward and deferred renderers, and eliminate the radius- and scene-dependence of the previous algorithm to provide a hard real-time guarantee of fixed execution time. The optimizations build on three strategies: pre-filter the depth buffer to maximize memory hierarchy efficiency; reduce total bandwidth by carefully reconstructing positions and normals at high precision from a depth buffer; and exploit low-level intra- and inter-thread techniques for parallel, floating-point architectures.","PeriodicalId":294868,"journal":{"name":"EGGH-HPG'12","volume":"260 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"41","resultStr":"{\"title\":\"Scalable ambient obscurance\",\"authors\":\"M. McGuire, Michael Mara, D. Luebke\",\"doi\":\"10.2312/EGGH/HPG12/097-103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a set of architecture-aware performance and integration improvements for a recent screenspace ambient obscurance algorithm. These improvements collectively produce a 7 x performance increase at 2560 x1600, generalize the algorithm to both forward and deferred renderers, and eliminate the radius- and scene-dependence of the previous algorithm to provide a hard real-time guarantee of fixed execution time. The optimizations build on three strategies: pre-filter the depth buffer to maximize memory hierarchy efficiency; reduce total bandwidth by carefully reconstructing positions and normals at high precision from a depth buffer; and exploit low-level intra- and inter-thread techniques for parallel, floating-point architectures.\",\"PeriodicalId\":294868,\"journal\":{\"name\":\"EGGH-HPG'12\",\"volume\":\"260 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"41\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EGGH-HPG'12\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2312/EGGH/HPG12/097-103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EGGH-HPG'12","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2312/EGGH/HPG12/097-103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 41

摘要

本文介绍了一种最新的屏幕空间环境模糊算法的一组架构感知性能和集成改进。这些改进共同产生了7倍的性能提高在2560 × 1600，将算法推广到前向和延迟渲染器，并消除了以前的算法的半径和场景依赖，以提供固定执行时间的硬实时保证。优化建立在三个策略上:预过滤深度缓冲区以最大化内存层次效率;通过从深度缓冲区高精度地仔细重建位置和法线来减少总带宽;并为并行浮点架构开发低级的线程内和线程间技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Scalable ambient obscurance

This paper presents a set of architecture-aware performance and integration improvements for a recent screenspace ambient obscurance algorithm. These improvements collectively produce a 7 x performance increase at 2560 x1600, generalize the algorithm to both forward and deferred renderers, and eliminate the radius- and scene-dependence of the previous algorithm to provide a hard real-time guarantee of fixed execution time. The optimizations build on three strategies: pre-filter the depth buffer to maximize memory hierarchy efficiency; reduce total bandwidth by carefully reconstructing positions and normals at high precision from a depth buffer; and exploit low-level intra- and inter-thread techniques for parallel, floating-point architectures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

EGGH-HPG'12

自引率

0.00%

发文量

期刊最新文献

Algorithm and VLSI architecture for real-time 1080p60 video retargeting Maximizing parallelism in the construction of BVHs, octrees, and k-d trees kANN on the GPU with shifted sorting Reducing aliasing artifacts through resampling Design and novel uses of higher-dimensional rasterization