A Memory Centric Kernel Framework for Accelerating Short-Range, Interactive Particle Simulation
Authors: Ian Stewart, Shujia Zhou
Venue: 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Publication date: 2010-05-17
DOI: 10.1109/CCGRID.2010.108 (https://doi.org/10.1109/CCGRID.2010.108)
Citation count: 0
Abstract
The Memory Centric Kernel Framework (MCKF) was developed to maximize the performance of emerging multi- and many-core accelerators such as the IBM Cell B.E. and NVIDIA GPUs. MCKF lets a user decompose the physical space of an application according to the fast memory available on the accelerator. Decomposing the space this way reduces the communication cost of data access, so more of the accelerator's computing power can be put to use. MCKF is both generic and flexible because it encapsulates hardware-specific characteristics. It has been implemented and tested for short-range, interactive particle simulation on IBM Cell B.E. blades.
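
The idea of sizing the spatial decomposition from the fast-memory budget can be illustrated with a small Python sketch. This is not MCKF code and does not reflect its API: the function names, the cell-based layout (cubic cells one cutoff radius wide), the one-cell halo around each tile, the 32-byte particle record, and the choice to reserve half of a 256 KiB Cell SPE local store for particle data are all assumptions made for the example.

```python
from itertools import product


def max_tile_edge(fast_mem_bytes, bytes_per_particle, particles_per_cell):
    """Largest n such that an n^3 tile of cells plus a one-cell halo on
    every side, (n + 2)^3 cells in total, fits in the fast-memory budget.
    Returns 0 if not even a single cell with its halo fits."""
    bytes_per_cell = bytes_per_particle * particles_per_cell
    n = 0
    # Grow the tile while the next size (n + 1 interior cells, +2 halo) still fits.
    while (n + 3) ** 3 * bytes_per_cell <= fast_mem_bytes:
        n += 1
    return n


def make_tiles(domain_cells, tile_edge):
    """Split a 3D grid of cells into tiles of at most tile_edge cells per axis.
    Each tile is a tuple of three (start, stop) cell-index ranges."""
    if tile_edge < 1:
        raise ValueError("fast memory too small for one cell plus halo")
    axis_ranges = [
        [(lo, min(lo + tile_edge, extent)) for lo in range(0, extent, tile_edge)]
        for extent in domain_cells
    ]
    return list(product(*axis_ranges))


# Hypothetical numbers: half of a 256 KiB local store, 32 bytes per particle,
# 8 particles per cell on average, and a 16 x 16 x 16 cell domain.
budget = 128 * 1024
edge = max_tile_edge(budget, bytes_per_particle=32, particles_per_cell=8)
tiles = make_tiles((16, 16, 16), edge)
print(edge, len(tiles))
```

With these assumed numbers each tile is at most 6 cells wide, so the 16-cell axis splits into ranges of 6, 6, and 4 cells. A smaller budget or a larger particle record shrinks the tile, which is the sense in which the decomposition follows the available fast memory rather than the other way around.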