Critical path analysis of the TRIPS architecture

2006 IEEE International Symposium on Performance Analysis of Systems and Software Pub Date : 2006-03-19 DOI:10.1109/ISPASS.2006.1620788

R. Nagarajan, Xia Chen, Robert G. McDonald, D. Burger, S. Keckler

{"title":"Critical path analysis of the TRIPS architecture","authors":"R. Nagarajan, Xia Chen, Robert G. McDonald, D. Burger, S. Keckler","doi":"10.1109/ISPASS.2006.1620788","DOIUrl":null,"url":null,"abstract":"Fast, accurate, and effective performance analysis is essential for the design of modern processor architectures and improving application performance. Recent trends toward highly concurrent processors make this goal increasingly difficult. Conventional techniques, based on simulators and performance monitors, are ill-equipped to analyze how a plethora of concurrent events interact and how they affect performance. Prior research has shown the utility of critical path analysis in solving this problem. This analysis abstracts the execution of a program with a dependence graph. With simple manipulations on the graph, designers can gain insights into the bottlenecks of a design. This paper extends critical path analysis to understand the performance of a next-generation, high-ILP architecture. The TRIPS architecture introduces new features not present in conventional superscalar architectures. We show how dependence constraints introduced by these features, specifically the execution model and operand communication links, can be modeled with a dependence graph. We describe a new algorithm that tracks critical path information at a fine-grained level and yet can deliver an order of magnitude (30x) improvement in performance over previously proposed techniques. Finally, we provide a breakdown of the critical path for a select set of benchmarks and show an example where we use this information to improve the performance of a heavily-hand-optimized program by as much as 11%.","PeriodicalId":369192,"journal":{"name":"2006 IEEE International Symposium on Performance Analysis of Systems and Software","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Symposium on Performance Analysis of Systems and Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPASS.2006.1620788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

Fast, accurate, and effective performance analysis is essential for the design of modern processor architectures and improving application performance. Recent trends toward highly concurrent processors make this goal increasingly difficult. Conventional techniques, based on simulators and performance monitors, are ill-equipped to analyze how a plethora of concurrent events interact and how they affect performance. Prior research has shown the utility of critical path analysis in solving this problem. This analysis abstracts the execution of a program with a dependence graph. With simple manipulations on the graph, designers can gain insights into the bottlenecks of a design. This paper extends critical path analysis to understand the performance of a next-generation, high-ILP architecture. The TRIPS architecture introduces new features not present in conventional superscalar architectures. We show how dependence constraints introduced by these features, specifically the execution model and operand communication links, can be modeled with a dependence graph. We describe a new algorithm that tracks critical path information at a fine-grained level and yet can deliver an order of magnitude (30x) improvement in performance over previously proposed techniques. Finally, we provide a breakdown of the critical path for a select set of benchmarks and show an example where we use this information to improve the performance of a heavily-hand-optimized program by as much as 11%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

与贸易有关的知识产权协议》架构的关键路径分析

快速、准确和有效的性能分析对于设计现代处理器架构和提高应用性能至关重要。最近，高并发处理器的发展趋势使得这一目标越来越难以实现。基于模拟器和性能监控器的传统技术无法分析大量并发事件如何相互作用以及它们如何影响性能。先前的研究表明，关键路径分析在解决这一问题方面非常有用。这种分析方法通过依赖关系图对程序的执行进行抽象。通过对图的简单操作，设计人员可以深入了解设计的瓶颈。本文扩展了关键路径分析，以了解下一代高ILP 架构的性能。TRIPS 架构引入了传统超标量架构所不具备的新特性。我们展示了如何通过依赖图来模拟这些特性（特别是执行模型和操作数通信链路）所引入的依赖性约束。我们介绍了一种新算法，该算法可在细粒度级别跟踪关键路径信息，与之前提出的技术相比，性能提高了一个数量级（30 倍）。最后，我们提供了一组选定基准的关键路径明细，并举例说明了我们利用这些信息将一个经过大量手工优化的程序的性能提高了 11%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2006 IEEE International Symposium on Performance Analysis of Systems and Software

自引率

0.00%

发文量

期刊最新文献

Accelerating architectural exploration using canonical instruction segments Simulation sampling with live-points Characterizing the branch misprediction penalty Friendly fire: understanding the effects of multiprocessor prefetches Evaluating the efficacy of statistical simulation for design space exploration