L. Meadows, K. Ishikawa, T. Boku, Masashi Horikoshi
This paper provides results using multiple threads and a high-performance MPI implementation of MPI_THREAD_MULTIPLE applied to a Lattice QCD Code (CCS-QCD) and run on the Oakforest-PACS machine. Performance has improved from the baseline code by as much as 1.8x for smaller lattice sizes.
{"title":"Multiple endpoints for improved MPI performance on a lattice QCD code","authors":"L. Meadows, K. Ishikawa, T. Boku, Masashi Horikoshi","doi":"10.1145/3176364.3176375","DOIUrl":"https://doi.org/10.1145/3176364.3176375","url":null,"abstract":"This paper provides results using multiple threads and a high-performance MPI implementation of MPI_THREAD_MULTIPLE applied to a Lattice QCD Code (CCS-QCD) and run on the Oakforest-PACS machine. Performance has improved from the baseline code by as much as 1.8x for smaller lattice sizes.","PeriodicalId":371083,"journal":{"name":"Proceedings of Workshops of HPC Asia","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126817809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}