{"title":"Thread mapping using system-level throughput prediction model for shared memory multicores","authors":"Reshmi Mitra, B. Joshi, R. Adams","doi":"10.1109/PCCC.2014.7017045","DOIUrl":null,"url":null,"abstract":"The primary purpose of the current paper is to design a fast and accurate performance model framework for exploring various thread-to-core mapping strategies (MS) and estimating steady state cycles per instruction (CPI). It is directed towards efficiently exploring these performance metrics for large parallel applications for shared memory multicores. This work establishes a hybrid Markov Chain Model (MCM) and Model Tree (MT) based system-level performance prediction model framework. The model is validated with an Electromagnetics application for 12 different mapping strategies. The average performance prediction error is 0.168% with standard deviation of 3.866%. The total run time of model is of the order of minutes, whereas the actual application execution time is in terms of several days.","PeriodicalId":105442,"journal":{"name":"2014 IEEE 33rd International Performance Computing and Communications Conference (IPCCC)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 33rd International Performance Computing and Communications Conference (IPCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCCC.2014.7017045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The primary purpose of the current paper is to design a fast and accurate performance model framework for exploring various thread-to-core mapping strategies (MS) and estimating steady state cycles per instruction (CPI). It is directed towards efficiently exploring these performance metrics for large parallel applications for shared memory multicores. This work establishes a hybrid Markov Chain Model (MCM) and Model Tree (MT) based system-level performance prediction model framework. The model is validated with an Electromagnetics application for 12 different mapping strategies. The average performance prediction error is 0.168% with standard deviation of 3.866%. The total run time of model is of the order of minutes, whereas the actual application execution time is in terms of several days.