2011 23rd International Symposium on Computer Architecture and High Performance Computing最新文献

英文中文

A Metadata Cluster Based on OSD+ Devices 基于OSD+ Devices的元数据集群

2011 23rd International Symposium on Computer Architecture and High Performance Computing

Pub Date : 2011-10-26 DOI: 10.1109/SBAC-PAD.2011.12

Ana Aviles-González, J. Piernas, Pilar González-Férez

We present the design and implementation of both an enhanced type of OSD device, the OSD+ device, and a metadata cluster based on it. OSD+s support data objects and directory objects. A directory object stores file names and attributes, and supports metadata--related operations. OSD+s profit the directory implementation and features of the underlying file systems used by the storage nodes, achieving a great flexibility, simplicity and small overhead. By using OSD+ devices, we show how a metadata cluster can effectively be managed by all the servers in a system, improving the performance, scalability and availability of the metadata service. The performance of our new metadata cluster has been evaluated and compared with Lustre's. The results show that our proposal obtains a better throughput than Lustre when both use a single metadata server, easily getting improvements of more than 60--80%, and that the performance scales with the number of OSD+s.

我们提出了一种增强类型的OSD设备OSD+设备的设计和实现，以及基于它的元数据集群。OSD+s支持数据对象和目录对象。目录对象存储文件名和属性，并支持与元数据相关的操作。OSD+s利用了存储节点使用的底层文件系统的目录实现和特性，实现了极大的灵活性、简单性和较小的开销。通过使用OSD+设备，我们展示了如何通过系统中的所有服务器有效地管理元数据集群，从而提高元数据服务的性能、可伸缩性和可用性。我们的新元数据集群的性能已经与Lustre的进行了评估和比较。结果表明，在使用单个元数据服务器的情况下，我们的方案获得了比Lustre更好的吞吐量，轻松提高了60 ~ 80%，并且性能随OSD数量的增加而增加。

引用次数: 5

Component-Based Refactoring of Parallel Numerical Simulation Programs: A Case Study on Component-Based Parallel Programming 并行数值模拟程序的基于组件的重构:以基于组件的并行编程为例

2011 23rd International Symposium on Computer Architecture and High Performance Computing

Pub Date : 2011-10-26 DOI: 10.1109/SBAC-PAD.2011.28

Francisco Heron de Carvalho Junior, Cenez Araújo Rezende

Component-based programming has been applied to address the requirements of large scale applications from sciences and engineering with high performance computing (HPC) requirements. However, parallelism has been poorly supported in usual component infrastructures. This paper evidences the efficacy of an HPC platform of parallel components for development and execution of numerical simulation code, mostly found in these applications.

基于组件的编程已经被应用于解决具有高性能计算(HPC)要求的科学和工程大规模应用的需求。然而，在通常的组件基础结构中，并行性一直得不到很好的支持。本文证明了并行组件的高性能计算平台在开发和执行数值模拟代码方面的有效性，这些代码主要用于这些应用程序。

引用次数: 2

Rapid Development of Error-Free Architectural Simulators Using Dynamic Runtime Testing 利用动态运行时测试快速开发无错误架构模拟器

2011 23rd International Symposium on Computer Architecture and High Performance Computing

Pub Date : 2011-10-26 DOI: 10.1109/SBAC-PAD.2011.23

Sasa Tomic, A. Cristal, O. Unsal, M. Valero

Architectural simulator platforms are particularly complex and error-prone programs that aim to simulate all hardware details of a given target architecture. The development of a stable cycle-accurate architectural simulator can easily take several man-years. Discovering and fixing all visible errors in the simulator often requires significant effort, much higher than for writing the simulator in the first place. In addition, there are no guarantees that all programming errors will be eliminated, no matter how much effort is put into it. This paper presents dynamic runtime testing, a methodology for rapid development and accurate error detection in architectural cycle-accurate simulators. In dynamic runtime testing, the simulator execution is dynamically compared with a simple and functionally equivalent emulator. A possible error is detected if any instruction produces different results in the simulator and the emulator. Dynamic testing can help the developers of architectural simulators to get a reliable and accurate verification of functional correctness. Based on our experience, dynamic testing reduced the simulator modification time from 12-18 person-months to 3-4 person-months, and it only modestly reduced the simulator performance (in our case under 20%).

体系结构模拟器平台是特别复杂和容易出错的程序，旨在模拟给定目标体系结构的所有硬件细节。开发一个稳定的周期精确的建筑模拟器很容易花费几个人年的时间。发现和修复模拟器中所有可见的错误通常需要大量的工作，这比最初编写模拟器要高得多。此外，无论付出多少努力，都不能保证消除所有编程错误。本文介绍了动态运行时测试，这是一种在体系结构周期精确模拟器中快速开发和准确检测错误的方法。在动态运行时测试中，将仿真器的执行情况与一个简单且功能相当的仿真器进行动态比较。如果任何指令在模拟器和仿真器中产生不同的结果，则检测到可能的错误。动态测试可以帮助架构模拟器的开发人员获得可靠、准确的功能正确性验证。根据我们的经验，动态测试将模拟器修改时间从12-18人月减少到3-4人月，并且只略微降低了模拟器的性能(在我们的情况下低于20%)。

{"title":"Rapid Development of Error-Free Architectural Simulators Using Dynamic Runtime Testing","authors":"Sasa Tomic, A. Cristal, O. Unsal, M. Valero","doi":"10.1109/SBAC-PAD.2011.23","DOIUrl":"https://doi.org/10.1109/SBAC-PAD.2011.23","url":null,"abstract":"Architectural simulator platforms are particularly complex and error-prone programs that aim to simulate all hardware details of a given target architecture. The development of a stable cycle-accurate architectural simulator can easily take several man-years. Discovering and fixing all visible errors in the simulator often requires significant effort, much higher than for writing the simulator in the first place. In addition, there are no guarantees that all programming errors will be eliminated, no matter how much effort is put into it. This paper presents dynamic runtime testing, a methodology for rapid development and accurate error detection in architectural cycle-accurate simulators. In dynamic runtime testing, the simulator execution is dynamically compared with a simple and functionally equivalent emulator. A possible error is detected if any instruction produces different results in the simulator and the emulator. Dynamic testing can help the developers of architectural simulators to get a reliable and accurate verification of functional correctness. Based on our experience, dynamic testing reduced the simulator modification time from 12-18 person-months to 3-4 person-months, and it only modestly reduced the simulator performance (in our case under 20%).","PeriodicalId":390734,"journal":{"name":"2011 23rd International Symposium on Computer Architecture and High Performance Computing","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122857893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Computing Properties of Large Scalable and Fault-Tolerant Logical Networks 大型可扩展和容错逻辑网络的计算特性

2011 23rd International Symposium on Computer Architecture and High Performance Computing

Pub Date : 2011-10-26 DOI: 10.1109/SBAC-PAD.2011.22

C. Cérin, Yu Lei, Michel Koskas

As the number of processors embedded in high performance computing platforms becomes higher and higher, it is vital to force the developers to enhance the scalability of their codes in order to exploit all the resources of the platforms. This often requires new algorithms, techniques and methods for code development that add to the application code new properties: the presence of faults is no more an occasional event but a challenge. Scalability and Fault-Tolerance issues are also present in hidden part of any platform: the overlay network that is necessary to build for controlling the application or in the runtime system support for messaging which is also required to be scalable and fault tolerant. In this paper, we focus on the computational challenges to experiment with large scale (many millions of nodes) logical topologies. We compute Fault-Tolerant properties of different variants of Binomial Graphs (BMG) that are generated at random. For instance, we exhibit interesting properties regarding the number of links regarding some desired Fault-Tolerant properties and we compare different metrics with the Binomial Graph structure as the reference structure. A software tool has been developed for this study and we show experimental results with topologies containing 21000 nodes. We also explain the computational challenge when we deal with such large scale topologies and we introduce various probabilistic algorithms to solve the problems of computing the conventional metrics.

随着高性能计算平台中嵌入的处理器数量越来越多，迫使开发人员增强其代码的可伸缩性以充分利用平台的所有资源是至关重要的。这通常需要新的算法、技术和代码开发方法，为应用程序代码添加新的属性:错误的出现不再是偶然事件，而是一种挑战。可伸缩性和容错问题也存在于任何平台的隐藏部分:为控制应用程序而构建的覆盖网络，或者在运行时系统中对消息传递的支持，也需要可伸缩性和容错。在本文中，我们关注大规模(数百万个节点)逻辑拓扑实验的计算挑战。我们计算了随机生成的二项图(BMG)的不同变体的容错特性。例如，我们展示了一些关于链接数量的有趣属性，以及一些期望的容错属性，我们用二项式图结构作为参考结构比较了不同的度量。为此研究开发了一个软件工具，并展示了包含21000个节点的拓扑结构的实验结果。我们还解释了处理如此大规模拓扑时的计算挑战，并引入了各种概率算法来解决计算传统度量的问题。

{"title":"Computing Properties of Large Scalable and Fault-Tolerant Logical Networks","authors":"C. Cérin, Yu Lei, Michel Koskas","doi":"10.1109/SBAC-PAD.2011.22","DOIUrl":"https://doi.org/10.1109/SBAC-PAD.2011.22","url":null,"abstract":"As the number of processors embedded in high performance computing platforms becomes higher and higher, it is vital to force the developers to enhance the scalability of their codes in order to exploit all the resources of the platforms. This often requires new algorithms, techniques and methods for code development that add to the application code new properties: the presence of faults is no more an occasional event but a challenge. Scalability and Fault-Tolerance issues are also present in hidden part of any platform: the overlay network that is necessary to build for controlling the application or in the runtime system support for messaging which is also required to be scalable and fault tolerant. In this paper, we focus on the computational challenges to experiment with large scale (many millions of nodes) logical topologies. We compute Fault-Tolerant properties of different variants of Binomial Graphs (BMG) that are generated at random. For instance, we exhibit interesting properties regarding the number of links regarding some desired Fault-Tolerant properties and we compare different metrics with the Binomial Graph structure as the reference structure. A software tool has been developed for this study and we show experimental results with topologies containing 21000 nodes. We also explain the computational challenge when we deal with such large scale topologies and we introduce various probabilistic algorithms to solve the problems of computing the conventional metrics.","PeriodicalId":390734,"journal":{"name":"2011 23rd International Symposium on Computer Architecture and High Performance Computing","volume":"260 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132904436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

The Experience in Designing and Building the High Performance Cluster Netuno 高性能集群Netuno的设计与构建经验

2011 23rd International Symposium on Computer Architecture and High Performance Computing

Pub Date : 2011-10-26 DOI: 10.1109/SBAC-PAD.2011.11

Gabriel P. Silva, Juliana C. Correa, C. Bentes, Sergio Guedes, Mariela Gabioux

This paper presents a description and the evaluation of the Netuno supercomputer, a high-performance cluster installed at Federal University of Rio de Janeiro in Brazil. The results for the High Performance Linpack (HPL) benchmark and two real applications are reported. Since building a high-performance cluster for running a wide range of applications is a non-trivial task, some lessons learned from assembling and operating this cluster, such as the excelent performance of the OpenMPI library, and the importance of the use an efficient parallel file system over the traditional NFS system, can be useful knowledge to support the design of new systems. Currently, Netuno is being heavily used to run large scale simulations in the areas of ocean modeling, meteorology, engineering, physics, and geophysics.

本文介绍了安装在巴西里约热内卢联邦大学的高性能集群Netuno超级计算机的描述和评估。报告了高性能Linpack (HPL)基准测试和两个实际应用程序的结果。由于构建用于运行各种应用程序的高性能集群是一项非常重要的任务，因此从组装和操作该集群中学到的一些经验教训(例如OpenMPI库的出色性能，以及在传统NFS系统之上使用高效并行文件系统的重要性)可以成为支持新系统设计的有用知识。目前，Netuno被大量用于海洋建模、气象学、工程学、物理学和地球物理学等领域的大规模模拟。

引用次数: 1

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2011 23rd International Symposium on Computer Architecture and High Performance Computing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀