首页 > 最新文献

ACM/IEEE SC 2002 Conference (SC'02)最新文献

英文 中文
Applying Chimera Virtual Data Concepts to Cluster Finding in the Sloan Sky Survey 奇美拉虚拟数据概念在斯隆巡天星团查找中的应用
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10021
J. Annis, Yong Zhao, Jens-S. Vöckler, M. Wilde, S. Kent, Ian T Foster
In many scientific disciplines — especially long running, data- intensive collaborations — it is important to track all aspects of data capture, production, transformation, and analysis. In principle, one can then audit, validate, reproduce, and/or re-run with corrections various data transformations. We have recently proposed and prototyped the Chimera virtual data system, a new database-driven approach to this problem. We present here a major application study in which we apply Chimera to a challenging data analysis problem: the identification of galaxy clusters within the Sloan Digital Sky Survey. We describe the problem, its computational procedures, and the use of Chimera to plan and orchestrate the workflow of thousands of tasks on a data grid comprising hundreds of computers. This experience suggests that a general set of tools can indeed enhance the accuracy and productivity of scientific data reduction and that further development and application of this paradigm will offer great value.
在许多科学学科中——特别是长期运行的、数据密集型的协作——跟踪数据捕获、生产、转换和分析的所有方面是很重要的。原则上,可以对各种数据转换进行审计、验证、复制和/或重新运行。我们最近提出并原型化了Chimera虚拟数据系统,这是一种新的数据库驱动方法来解决这个问题。我们在这里提出了一个主要的应用研究,我们将Chimera应用于一个具有挑战性的数据分析问题:在斯隆数字巡天中识别星系团。我们描述了这个问题,它的计算过程,以及使用Chimera在由数百台计算机组成的数据网格上计划和协调数千个任务的工作流程。这一经验表明,一套通用工具确实可以提高科学数据精简的准确性和生产力,进一步发展和应用这一范式将带来巨大价值。
{"title":"Applying Chimera Virtual Data Concepts to Cluster Finding in the Sloan Sky Survey","authors":"J. Annis, Yong Zhao, Jens-S. Vöckler, M. Wilde, S. Kent, Ian T Foster","doi":"10.1109/SC.2002.10021","DOIUrl":"https://doi.org/10.1109/SC.2002.10021","url":null,"abstract":"In many scientific disciplines — especially long running, data- intensive collaborations — it is important to track all aspects of data capture, production, transformation, and analysis. In principle, one can then audit, validate, reproduce, and/or re-run with corrections various data transformations. We have recently proposed and prototyped the Chimera virtual data system, a new database-driven approach to this problem. We present here a major application study in which we apply Chimera to a challenging data analysis problem: the identification of galaxy clusters within the Sloan Digital Sky Survey. We describe the problem, its computational procedures, and the use of Chimera to plan and orchestrate the workflow of thousands of tasks on a data grid comprising hundreds of computers. This experience suggests that a general set of tools can indeed enhance the accuracy and productivity of scientific data reduction and that further development and application of this paradigm will offer great value.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120852199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 102
STORM: Lightning-Fast Resource Management STORM:闪电般的资源管理
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10057
E. Frachtenberg, F. Petrini, Juan Fernández Peinador, S. Pakin, S. Coll
Although workstation clusters are a common platform for high-performance computing (HPC), they remain more difficult to manage than sequential systems or even symmetric multiprocessors. Furthermore, as cluster sizes increase, the quality of the resource-management subsystem — essentially, all of the code that runs on a cluster other than the applications — increasingly impacts application efficiency. In this paper, we present STORM, a resource-management framework designed for scalability and performance. The key innovation behind STORM is a software architecture that enables resource management to exploit low-level network features. As a result of this HPC-application-like design, STORM is orders of magnitude faster than the best reported results in the literature on two sample resource-management functions: job launching and process scheduling.
尽管工作站集群是高性能计算(HPC)的通用平台,但它们仍然比顺序系统甚至对称多处理器更难管理。此外,随着集群规模的增加,资源管理子系统的质量(本质上是指在集群上运行的所有代码,而不是应用程序)对应用程序效率的影响越来越大。在本文中,我们提出了STORM,一个为可扩展性和性能而设计的资源管理框架。STORM背后的关键创新是一个软件架构,它使资源管理能够利用底层网络特性。由于这种类似hpc应用程序的设计,STORM在两个示例资源管理功能(作业启动和进程调度)上的速度比文献中报道的最佳结果要快几个数量级。
{"title":"STORM: Lightning-Fast Resource Management","authors":"E. Frachtenberg, F. Petrini, Juan Fernández Peinador, S. Pakin, S. Coll","doi":"10.1109/SC.2002.10057","DOIUrl":"https://doi.org/10.1109/SC.2002.10057","url":null,"abstract":"Although workstation clusters are a common platform for high-performance computing (HPC), they remain more difficult to manage than sequential systems or even symmetric multiprocessors. Furthermore, as cluster sizes increase, the quality of the resource-management subsystem — essentially, all of the code that runs on a cluster other than the applications — increasingly impacts application efficiency. In this paper, we present STORM, a resource-management framework designed for scalability and performance. The key innovation behind STORM is a software architecture that enables resource management to exploit low-level network features. As a result of this HPC-application-like design, STORM is orders of magnitude faster than the best reported results in the literature on two sample resource-management functions: job launching and process scheduling.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131940157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 63
QMView and GAMESS: Integration into the World Wide Computational Grid QMView和game:集成到全球计算网格中
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10014
K. Baldridge, J. Greenberg, S. Elbert, S. Mock, P. Papadopoulos
High performance computing, storage, visualization, and database infrastructures are increasing geometrically in complexity as scientists move towards grid-based computing. While this is natural, it has the effect of pushing computational capabilities beyond the reach of scientists because of the time needed to harness the infrastructure. Hiding the complexity of networked resources becomes essential if scientists are to utilize them effectively. In this work, we describe our efforts to integrate various computational chemistry components into a scientific computing environment. We briefly describe improvements we have made to individual components of the chemistry environment as well as future directions, followed by a more in-depth discussion of our strategy for integration into a grid workflow environment based on web services, which enables access to remote resources while shielding users from the complexities of the grid infrastructures. A preliminary schema for storing data obtained from computational chemistry calculations is also described.
随着科学家转向基于网格的计算,高性能计算、存储、可视化和数据库基础设施的复杂性呈几何级增长。虽然这是很自然的,但由于利用基础设施需要时间,它的影响是将计算能力推到科学家无法企及的范围之外。如果科学家要有效地利用网络资源,隐藏网络资源的复杂性就变得至关重要。在这项工作中,我们描述了我们将各种计算化学组件集成到科学计算环境中的努力。我们简要地描述了我们对化学环境的各个组件所做的改进以及未来的方向,然后更深入地讨论了我们集成到基于web服务的网格工作流环境中的策略,该策略使用户能够访问远程资源,同时屏蔽网格基础设施的复杂性。还描述了一种用于存储从计算化学计算中获得的数据的初步模式。
{"title":"QMView and GAMESS: Integration into the World Wide Computational Grid","authors":"K. Baldridge, J. Greenberg, S. Elbert, S. Mock, P. Papadopoulos","doi":"10.1109/SC.2002.10014","DOIUrl":"https://doi.org/10.1109/SC.2002.10014","url":null,"abstract":"High performance computing, storage, visualization, and database infrastructures are increasing geometrically in complexity as scientists move towards grid-based computing. While this is natural, it has the effect of pushing computational capabilities beyond the reach of scientists because of the time needed to harness the infrastructure. Hiding the complexity of networked resources becomes essential if scientists are to utilize them effectively. In this work, we describe our efforts to integrate various computational chemistry components into a scientific computing environment. We briefly describe improvements we have made to individual components of the chemistry environment as well as future directions, followed by a more in-depth discussion of our strategy for integration into a grid workflow environment based on web services, which enables access to remote resources while shielding users from the complexities of the grid infrastructures. A preliminary schema for storing data obtained from computational chemistry calculations is also described.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128168198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Advanced Visualization Technology for Terascale Particle Accelerator Simulations 兆级粒子加速器模拟的高级可视化技术
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10007
K. Ma, G. Schussman, Brett Wilson, K. Ko, J. Qiang, R. Ryne
This paper presents two new hardware-assisted rendering techniques developed for interactive visualization of the terascale data generated from numerical modeling of next-generation accelerator designs. The first technique, based on a hybrid rendering approach, makes possible interactive exploration of large-scale particle data from particle beam dynamics modeling. The second technique, based on a compact texture-enhanced representation, exploits the advanced features of commodity graphics cards to achieve perceptually effective visualization of the very dense and complex electromagnetic fields produced from the modeling of reflection and transmission properties of open structures in an accelerator design. Because of the collaborative nature of the overall accelerator modeling project, the visualization technology developed is for both desktop and remote visualization settings. We have tested the techniques using both time-varying particle data sets containing up to one billion particles per time step and electromagnetic field data sets with millions of mesh elements.
本文介绍了两种新的硬件辅助渲染技术,用于下一代加速器设计的数值建模产生的万亿级数据的交互式可视化。第一种技术基于混合渲染方法,使粒子束动力学建模的大规模粒子数据的交互式探索成为可能。第二种技术基于紧凑的纹理增强表示,利用商品图形卡的先进特性,实现对加速器设计中开放结构的反射和传输特性建模所产生的非常密集和复杂的电磁场的感知有效可视化。由于整个加速器建模项目的协作性质,所开发的可视化技术既适用于桌面可视化设置,也适用于远程可视化设置。我们已经使用每时间步包含多达10亿个粒子的时变粒子数据集和具有数百万网格元素的电磁场数据集对技术进行了测试。
{"title":"Advanced Visualization Technology for Terascale Particle Accelerator Simulations","authors":"K. Ma, G. Schussman, Brett Wilson, K. Ko, J. Qiang, R. Ryne","doi":"10.1109/SC.2002.10007","DOIUrl":"https://doi.org/10.1109/SC.2002.10007","url":null,"abstract":"This paper presents two new hardware-assisted rendering techniques developed for interactive visualization of the terascale data generated from numerical modeling of next-generation accelerator designs. The first technique, based on a hybrid rendering approach, makes possible interactive exploration of large-scale particle data from particle beam dynamics modeling. The second technique, based on a compact texture-enhanced representation, exploits the advanced features of commodity graphics cards to achieve perceptually effective visualization of the very dense and complex electromagnetic fields produced from the modeling of reflection and transmission properties of open structures in an accelerator design. Because of the collaborative nature of the overall accelerator modeling project, the visualization technology developed is for both desktop and remote visualization settings. We have tested the techniques using both time-varying particle data sets containing up to one billion particles per time step and electromagnetic field data sets with millions of mesh elements.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114563286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Library Support for Hierarchical Multi-Processor Tasks 对分层多处理器任务的库支持
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10064
T. Rauber, G. Rünger
The paper considers the modular programming with hierarchically structured multi-processor tasks on top of SPMD tasks for distributed memory machines. The parallel execution requires a corresponding decomposition of the set of processors into a hierarchical group structure onto which the tasks are mapped. This results in a multi-level group SPMD computation model with varying processor group structures. The advantage of this kind of mixed task and data parallelism is a potential to reduce the communication overhead and to increase scalability. We present a runtime library to support the coordination of hierarchically structured multi-processor tasks. The library exploits an extended parallel group SPMD programming model and manages the entire task execution including the dynamic hierarchy of processor groups. The library is built on top of MPI, has an easy-to-use interface, and leads to only a marginal overhead while allowing static planning and dynamic restructuring.
本文研究了分布式内存机在SPMD任务基础上分层结构多处理器任务的模块化编程。并行执行需要将处理器集相应分解为任务映射到的分层组结构。这将产生具有不同处理器组结构的多级组SPMD计算模型。这种混合任务和数据并行性的优点是有可能减少通信开销并提高可伸缩性。我们提出了一个运行库来支持分层结构的多处理器任务的协调。该库利用扩展的并行组SPMD编程模型来管理整个任务执行,包括处理器组的动态层次结构。该库建立在MPI之上,具有易于使用的界面,并且在允许静态规划和动态重构的同时,只会导致很小的开销。
{"title":"Library Support for Hierarchical Multi-Processor Tasks","authors":"T. Rauber, G. Rünger","doi":"10.1109/SC.2002.10064","DOIUrl":"https://doi.org/10.1109/SC.2002.10064","url":null,"abstract":"The paper considers the modular programming with hierarchically structured multi-processor tasks on top of SPMD tasks for distributed memory machines. The parallel execution requires a corresponding decomposition of the set of processors into a hierarchical group structure onto which the tasks are mapped. This results in a multi-level group SPMD computation model with varying processor group structures. The advantage of this kind of mixed task and data parallelism is a potential to reduce the communication overhead and to increase scalability. We present a runtime library to support the coordination of hierarchically structured multi-processor tasks. The library exploits an extended parallel group SPMD programming model and manages the entire task execution including the dynamic hierarchy of processor groups. The library is built on top of MPI, has an easy-to-use interface, and leads to only a marginal overhead while allowing static planning and dynamic restructuring.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114591122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
High Performance Computing Meets Experimental Mathematics 高性能计算与实验数学
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10060
D. Bailey, D. Broadhurst, Yozo Hida, X. Li, Brandon Thompson
In this paper we describe some novel applications of high performance computing in a discipline now known as "experimental mathematics." The paper reviews some recent published work, and then presents some new results that have not yet appeared in the literature. A key technique inovlved in this research is the PSLQ integer relation algorithm (recently named one of ten "algorithms of the century" by Computing in Science and Engineering). This algorithm permits one to recognize a numeric constant in terms of the formula that it satisfies. We present a variant of PSLQ that is well-suited for parallel computation, and give several examples of new mathematical results that we have found using it. Two of these computations were performed on highly parallel computers, since they are not feasible on conventional systems. We also describe a new software package for performing arbitrary precision arithmetic, which is required in this research.
在本文中,我们描述了高性能计算在现在被称为“实验数学”的学科中的一些新应用。本文回顾了最近发表的一些研究成果,并提出了一些尚未在文献中出现的新结果。本研究涉及的一个关键技术是PSLQ整数关系算法(最近被《科学与工程计算》评为十大“世纪算法”之一)。这个算法允许人们根据它所满足的公式来识别一个数值常数。我们提出了一种非常适合并行计算的PSLQ变体,并给出了几个我们使用它发现的新的数学结果的例子。其中两个计算是在高度并行的计算机上执行的,因为它们在传统系统上是不可行的。我们还描述了一个新的软件包,用于执行本研究所需的任意精度算法。
{"title":"High Performance Computing Meets Experimental Mathematics","authors":"D. Bailey, D. Broadhurst, Yozo Hida, X. Li, Brandon Thompson","doi":"10.1109/SC.2002.10060","DOIUrl":"https://doi.org/10.1109/SC.2002.10060","url":null,"abstract":"In this paper we describe some novel applications of high performance computing in a discipline now known as \"experimental mathematics.\" The paper reviews some recent published work, and then presents some new results that have not yet appeared in the literature. A key technique inovlved in this research is the PSLQ integer relation algorithm (recently named one of ten \"algorithms of the century\" by Computing in Science and Engineering). This algorithm permits one to recognize a numeric constant in terms of the formula that it satisfies. We present a variant of PSLQ that is well-suited for parallel computation, and give several examples of new mathematical results that we have found using it. Two of these computations were performed on highly parallel computers, since they are not feasible on conventional systems. We also describe a new software package for performing arbitrary precision arithmetic, which is required in this research.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128913177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Framework for Performance Modeling and Prediction 性能建模与预测框架
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10004
A. Snavely, L. Carrington, N. Wolter, Jesús Labarta, Rosa M. Badia, A. Purkayastha
Cycle-accurate simulation is far too slow for modeling the expected performance of full parallel applications on large HPC systems. And just running an application on a system and observing wallclock time tells you nothing about why the application performs as it does (and is anyway impossible on yet-to-be-built systems). Here we present a framework for performance modeling and prediction that is faster than cycle-accurate simulation, more informative than simple benchmarking, and is shown useful for performance investigations in several dimensions.
周期精确的模拟对于模拟大型HPC系统上完全并行应用程序的预期性能来说太慢了。仅仅在系统上运行应用程序并观察时钟时间并不能告诉您为什么应用程序会这样执行(无论如何,在尚未构建的系统上是不可能的)。在这里,我们提出了一个性能建模和预测框架,它比周期精确的模拟更快,比简单的基准测试更有信息量,并且对几个维度的性能调查很有用。
{"title":"A Framework for Performance Modeling and Prediction","authors":"A. Snavely, L. Carrington, N. Wolter, Jesús Labarta, Rosa M. Badia, A. Purkayastha","doi":"10.1109/SC.2002.10004","DOIUrl":"https://doi.org/10.1109/SC.2002.10004","url":null,"abstract":"Cycle-accurate simulation is far too slow for modeling the expected performance of full parallel applications on large HPC systems. And just running an application on a system and observing wallclock time tells you nothing about why the application performs as it does (and is anyway impossible on yet-to-be-built systems). Here we present a framework for performance modeling and prediction that is faster than cycle-accurate simulation, more informative than simple benchmarking, and is shown useful for performance investigations in several dimensions.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127877141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 263
Better Tiling and Array Contraction for Compiling Scientific Programs 编译科学程序的更好的平铺和阵列收缩
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10040
Geoff Pike, P. Hilfinger
Scientific programs often include multiple loops over the same data; interleaving parts of different loops may greatly improve performance. We exploit this in a compiler for Titanium, a dialect of Java. Our compiler combines reordering optimizations such as loop fusion and tiling with storage optimizations such as array contraction (eliminating or reducing the size of temporary arrays). The programmers we have in mind are willing to spend some time tuning their code and their compiler parameters. Given that, and the difficulty in statically selecting parameters such as tile sizes, it makes sense to provide automatic parameter searching alongside the compiler. Our strategy is to optimize aggressively but to expose the compiler’s decisions to external control. We double or triple the performance of Gauss-Seidel relaxation and multi-grid (versus an optimizing compiler without tiling and array contraction), and we argue that ours is the best compiler for that kind of program.
科学项目通常包括对相同数据的多个循环;不同循环的交叉部分可以大大提高性能。我们在Java方言Titanium的编译器中利用了这一点。我们的编译器将循环融合和平铺等重新排序优化与数组收缩(消除或减少临时数组的大小)等存储优化结合起来。我们心目中的程序员愿意花一些时间调优他们的代码和编译器参数。考虑到这一点,以及静态选择参数(如贴图大小)的困难,在编译器旁边提供自动参数搜索是有意义的。我们的策略是积极优化,但将编译器的决策暴露给外部控制。我们将高斯-赛德尔松弛和多网格的性能提高了一倍或三倍(与没有平铺和数组收缩的优化编译器相比),我们认为我们的编译器是这类程序的最佳编译器。
{"title":"Better Tiling and Array Contraction for Compiling Scientific Programs","authors":"Geoff Pike, P. Hilfinger","doi":"10.1109/SC.2002.10040","DOIUrl":"https://doi.org/10.1109/SC.2002.10040","url":null,"abstract":"Scientific programs often include multiple loops over the same data; interleaving parts of different loops may greatly improve performance. We exploit this in a compiler for Titanium, a dialect of Java. Our compiler combines reordering optimizations such as loop fusion and tiling with storage optimizations such as array contraction (eliminating or reducing the size of temporary arrays). The programmers we have in mind are willing to spend some time tuning their code and their compiler parameters. Given that, and the difficulty in statically selecting parameters such as tile sizes, it makes sense to provide automatic parameter searching alongside the compiler. Our strategy is to optimize aggressively but to expose the compiler’s decisions to external control. We double or triple the performance of Gauss-Seidel relaxation and multi-grid (versus an optimizing compiler without tiling and array contraction), and we argue that ours is the best compiler for that kind of program.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124557505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes 面向易失性节点的可伸缩容错MPI
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10048
G. Bosilca, Aurélien Bouteiller, F. Cappello, Samir Djilali, G. Fedak, C. Germain, T. Hérault, Pierre Lemarinier, O. Lodygensky, F. Magniette, V. Néri, A. Selikhov
Global Computing platforms, large scale clusters and future TeraGRID systems gather thousands of nodes for computing parallel scientific applications. At this scale, node failures or disconnections are frequent events. This Volatility reduces the MTBF of the whole system in the range of hours or minutes. We present MPICH-V, an automatic Volatility tolerant MPI environment based on uncoordinated checkpoint/roll-back and distributed message logging. MPICH-V architecture relies on Channel Memories, Checkpoint servers and theoretically proven protocols to execute existing or new, SPMD and Master-Worker MPI applications on volatile nodes. To evaluate its capabilities, we run MPICH-V within a framework for which the number of nodes, Channels Memories and Checkpoint Servers can be completely configured as well as the node Volatility. We present a detailed performance evaluation of every component of MPICH-V and its global performance for non-trivial parallel applications. Experimental results demonstrate good scalability and high tolerance to node volatility.
全球计算平台、大规模集群和未来的TeraGRID系统聚集了数千个节点,用于计算并行科学应用。在这种规模下,节点故障或断开连接是经常发生的事件。这种波动性降低了整个系统在小时或分钟范围内的MTBF。我们提出了MPICH-V,一个基于非协调检查点/回滚和分布式消息日志的自动容错MPI环境。MPICH-V架构依赖于通道存储器、检查点服务器和理论上经过验证的协议,在易变节点上执行现有或新的SPMD和Master-Worker MPI应用程序。为了评估其功能,我们在一个框架内运行MPICH-V,该框架可以完全配置节点、通道存储器和检查点服务器的数量以及节点波动性。我们给出了MPICH-V的每个组件的详细性能评估及其在非平凡并行应用中的全局性能。实验结果表明,该算法具有良好的可扩展性和对节点波动的容忍度。
{"title":"MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes","authors":"G. Bosilca, Aurélien Bouteiller, F. Cappello, Samir Djilali, G. Fedak, C. Germain, T. Hérault, Pierre Lemarinier, O. Lodygensky, F. Magniette, V. Néri, A. Selikhov","doi":"10.1109/SC.2002.10048","DOIUrl":"https://doi.org/10.1109/SC.2002.10048","url":null,"abstract":"Global Computing platforms, large scale clusters and future TeraGRID systems gather thousands of nodes for computing parallel scientific applications. At this scale, node failures or disconnections are frequent events. This Volatility reduces the MTBF of the whole system in the range of hours or minutes. We present MPICH-V, an automatic Volatility tolerant MPI environment based on uncoordinated checkpoint/roll-back and distributed message logging. MPICH-V architecture relies on Channel Memories, Checkpoint servers and theoretically proven protocols to execute existing or new, SPMD and Master-Worker MPI applications on volatile nodes. To evaluate its capabilities, we run MPICH-V within a framework for which the number of nodes, Channels Memories and Checkpoint Servers can be completely configured as well as the node Volatility. We present a detailed performance evaluation of every component of MPICH-V and its global performance for non-trivial parallel applications. Experimental results demonstrate good scalability and high tolerance to node volatility.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130487962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 344
Multivariate Resource Performance Forecasting in the Network Weather Service 网络气象服务中的多元资源性能预测
Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10039
M. Swany, R. Wolski
This paper describes a new technique in the Network Weather Service for producing multi-variate forecasts. The new technique uses the NWS’s univariate forecasters and emprically gathered Cumulative Distribution Functions (CDFs) to make predictions from correlated measurement streams. Experimental results are shown in which throughput is predicted for long TCP/IP transfers from short NWS network probes.
本文介绍了网络气象服务中产生多变量预报的一种新技术。这项新技术利用美国国家气象局的单变量预报系统和经验收集的累积分布函数(CDFs),从相关的测量流中进行预测。实验结果表明,对来自短NWS网络探针的长TCP/IP传输的吞吐量进行了预测。
{"title":"Multivariate Resource Performance Forecasting in the Network Weather Service","authors":"M. Swany, R. Wolski","doi":"10.1109/SC.2002.10039","DOIUrl":"https://doi.org/10.1109/SC.2002.10039","url":null,"abstract":"This paper describes a new technique in the Network Weather Service for producing multi-variate forecasts. The new technique uses the NWS’s univariate forecasters and emprically gathered Cumulative Distribution Functions (CDFs) to make predictions from correlated measurement streams. Experimental results are shown in which throughput is predicted for long TCP/IP transfers from short NWS network probes.","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122203659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 104
期刊
ACM/IEEE SC 2002 Conference (SC'02)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1