{"title":"Performance analysis of a cardiac simulation code using IPM","authors":"P. Strazdins, M. Hegland","doi":"10.1145/2133173.2133186","DOIUrl":null,"url":null,"abstract":"This paper details our experiences in performing a detailed performance analysis on a large-scale parallel cardiac simulation by the Chaste software on an Nehalem and Infiniband-based cluster. Our methodology achieves good accuracy for relatively modest amounts of cluster time. The use of sections in the Chaste internal profiler, coupled with the IPM tool, enabled some detailed insights into the performance and scalability of the application.\n For large core counts, our analysis showed that performance was no longer dominated by the linear systems solver. The computationally-intensive components scaled well up to 2048 cores, and poorly scaling and highly imbalanced components associated with program output and miscellaneous functions were limiting scalability.","PeriodicalId":259517,"journal":{"name":"ACM SIGPLAN Symposium on Scala","volume":"PP 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGPLAN Symposium on Scala","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2133173.2133186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper details our experiences in performing a detailed performance analysis on a large-scale parallel cardiac simulation by the Chaste software on an Nehalem and Infiniband-based cluster. Our methodology achieves good accuracy for relatively modest amounts of cluster time. The use of sections in the Chaste internal profiler, coupled with the IPM tool, enabled some detailed insights into the performance and scalability of the application.
For large core counts, our analysis showed that performance was no longer dominated by the linear systems solver. The computationally-intensive components scaled well up to 2048 cores, and poorly scaling and highly imbalanced components associated with program output and miscellaneous functions were limiting scalability.