{"title":"An empirical study of the CRAY Y-MP processor using the PERFECT club benchmarks","authors":"S. Vajapeyam, G. Sohi, W. Hsu","doi":"10.1145/115952.115970","DOIUrl":null,"url":null,"abstract":"Characterization of machines, by studying pro~am usage of their architectural and organizational features, IS art essential ~art of the desi~n recess. ln this aper we re ort Y EL an empimcal study of a smg e processor of t e CRAY Y- P, using as benchmarks long-running scientific applications from the PERFECT Club benchmark set. Since the compiler plays a major mle in determining machine utilization and program execution speed, we compile our benchmarks usin the state-of-the-art Cray Research production FORTRA” compiler. We investigate instruction set usage, operation execution counts, sizes of basic blocks in the prorams, and instruction issue rate. We observe, among other 3“ mgs, that the vectorized fraction of the dynamic rogram % operation count ranges from 4% to %% for our bent marks, Instructions that move values between the scalar registers and corresponding backup registers form a si nificant fraction of the dynamic instruction count. Basic %locks which are more than a hundred instructions in size are significant in number; both small and large basic blocks are important from the point of view of pro ram performance. The E","PeriodicalId":187095,"journal":{"name":"[1991] Proceedings. The 18th Annual International Symposium on Computer Architecture","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1991] Proceedings. The 18th Annual International Symposium on Computer Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/115952.115970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20
Abstract
Characterization of machines, by studying pro~am usage of their architectural and organizational features, IS art essential ~art of the desi~n recess. ln this aper we re ort Y EL an empimcal study of a smg e processor of t e CRAY Y- P, using as benchmarks long-running scientific applications from the PERFECT Club benchmark set. Since the compiler plays a major mle in determining machine utilization and program execution speed, we compile our benchmarks usin the state-of-the-art Cray Research production FORTRA” compiler. We investigate instruction set usage, operation execution counts, sizes of basic blocks in the prorams, and instruction issue rate. We observe, among other 3“ mgs, that the vectorized fraction of the dynamic rogram % operation count ranges from 4% to %% for our bent marks, Instructions that move values between the scalar registers and corresponding backup registers form a si nificant fraction of the dynamic instruction count. Basic %locks which are more than a hundred instructions in size are significant in number; both small and large basic blocks are important from the point of view of pro ram performance. The E