用户友好的GPGPU编程界面

Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe
{"title":"用户友好的GPGPU编程界面","authors":"Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe","doi":"10.1109/NCTM.2017.7872835","DOIUrl":null,"url":null,"abstract":"Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.","PeriodicalId":343372,"journal":{"name":"2017 6th National Conference on Technology and Management (NCTM)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"User-friendly interface for GPGPU programming\",\"authors\":\"Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe\",\"doi\":\"10.1109/NCTM.2017.7872835\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.\",\"PeriodicalId\":343372,\"journal\":{\"name\":\"2017 6th National Conference on Technology and Management (NCTM)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 6th National Conference on Technology and Management (NCTM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCTM.2017.7872835\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th National Conference on Technology and Management (NCTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCTM.2017.7872835","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

计算统一设备架构(CUDA)是我们对高性能计算不断增长的需求的一个有吸引力的替代方案。然而,为了充分发挥CUDA的潜力,至少应该熟悉编程模型,并且应该对内存和缓存架构有一个合理的理解。然而,大多数来自高性能计算领域的专家都没有受过良好的训练,无法开发出能够提取必要性能的高效CUDA程序。在本文中,我们认为可以通过将CUDA架构作为操作矩阵的API来弥合这一差距。我们观察到,许多高要求的科学计算可以表示为矩阵操作,其中对高性能的需求源于矩阵的大小。我们提出了一个软件即服务(SaaS)解决方案来弥合这一差距,领域专家将数据作为矩阵上传,并通过web GUI将操作指定为涉及上传矩阵的方程。然后后端将使用CUDA处理请求并通过GUI返回结果。处理矩阵操作的CUDA代码是高度优化的,领域专家可以简单地使用它们,而不知道底层复杂的细节。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
User-friendly interface for GPGPU programming
Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A parallel version of the in-close algorithm Linguistic features based personality recognition using social media data Dynamic 3D model construction using architectural house plans The effect of roadside elements on drivers' speed selection and lateral position on road User-friendly interface for GPGPU programming
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1