Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu
{"title":"在Xeon Phi处理器上使用内存绑定蒙特卡罗模拟进行性能分析","authors":"Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu","doi":"10.1109/HPCSim.2015.7237074","DOIUrl":null,"url":null,"abstract":"Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance analysis with a memory-bound Monte Carlo simulation on Xeon Phi\",\"authors\":\"Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu\",\"doi\":\"10.1109/HPCSim.2015.7237074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.\",\"PeriodicalId\":134009,\"journal\":{\"name\":\"2015 International Conference on High Performance Computing & Simulation (HPCS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on High Performance Computing & Simulation (HPCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HPCSim.2015.7237074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance analysis with a memory-bound Monte Carlo simulation on Xeon Phi
Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.