Algorithm for Cooperative CPU-GPU Computing

2013 15th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing Pub Date : 2013-09-23 DOI:10.1109/SYNASC.2013.53

Razvan-Mihai Aciu, H. Ciocarlie

引用次数: 3

Abstract

Many applications have modules which could benefit greatly from the massive parallel numeric computing power provided by GPUs. Renderers, signal processing or simulators are only a few such applications. Due to the weaknesses of the GPUs such as stackless execution model or poor capabilities for pointer exchange with the host, sometimes is not feasible to convert an entire algorithm for GPU, even if it is highly parallel and some of its parts can be greatly accelerated on GPU. In such situations a programmer should have a framework which allows him to split the code flow of a thread in parts and each of these parts will run on the most suitable computing resource, CPU or GPU. For GPU execution, multiple data from host threads will be collected, run on GPU and the results returned to the original threads so they will be able to resume execution on host. In this paper we propose such an algorithm, analyze it and evaluate its practical results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

CPU-GPU协同计算算法

许多应用程序的模块可以从gpu提供的大量并行数字计算能力中受益匪浅。渲染器，信号处理或模拟器只是这样的几个应用程序。由于GPU的弱点，如无堆栈执行模型或与主机的指针交换能力差，有时无法将整个算法转换为GPU，即使它是高度并行的，并且它的某些部分可以在GPU上大大加速。在这种情况下，程序员应该有一个框架，允许他将线程的代码流分成几个部分，每个部分将在最合适的计算资源(CPU或GPU)上运行。对于GPU执行，将收集来自主机线程的多个数据，在GPU上运行，并将结果返回给原始线程，以便它们能够在主机上恢复执行。本文提出了这种算法，并对其实际效果进行了分析和评价。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 15th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing

自引率

0.00%

发文量

期刊最新文献

From the Desktop to the Multi-clouds: The Case of ModelioSaaS Bound Propagation for Arithmetic Reasoning in Vampire Dependence of the Oscillatory Movements of an Unmanned Aerial Vehicle on the Forward Velocity Cph CT Toolbox: CT Reconstruction for Education, Research and Industrial Applications Non-interleaving Operational Semantics for Geographically Replicated Databases