广域网上集群并行有限元性能的可行性研究及可预测性

Masae Muraoka, H. Okuda
{"title":"广域网上集群并行有限元性能的可行性研究及可预测性","authors":"Masae Muraoka, H. Okuda","doi":"10.1299/JCST.3.460","DOIUrl":null,"url":null,"abstract":"With the rapid growth of WAN infrastructures and development of Grid middleware, the cluster-of-clusters has become a realistic methodology for executing computationdemanding applications. While distributed computing or loosely connected applications have been successfully ported to the Grid environment, few tightly connected applications such as parallel finite element analysis (FEA) have been attempted. In this paper we focus on an iterative solver which is often used in FEA: the conjugate gradient (CG) method. By using both predictions and numerical experiments we evaluate the performance of the CG method parallelized via domain decomposition. From numerical experiments we measure the inter-cluster execution time and find it to be close to that obtained by the slowest cluster. At the level of performance currently found in most WAN systems, as long as the number of processes is kept adequately small, the increase in communication cost due to WAN is small relative to the computation cost. For a very large test model which realistically requires the utilization of remote resources, the work ratio stays above 86% even for 64 processes. Especially for very large models, using the distributed environment is expected to be a practical methodology even for parallel computations with communication as frequent as the one found in FEA. To evaluate the feasibility of parallel FEA on the C-of-C, we have proposed methods for a priori finding the optimal number of processes.","PeriodicalId":196913,"journal":{"name":"Journal of Computational Science and Technology","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Feasibility Study and Predictability on the Performance of Parallel FEM Using Clusters on WAN\",\"authors\":\"Masae Muraoka, H. Okuda\",\"doi\":\"10.1299/JCST.3.460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid growth of WAN infrastructures and development of Grid middleware, the cluster-of-clusters has become a realistic methodology for executing computationdemanding applications. While distributed computing or loosely connected applications have been successfully ported to the Grid environment, few tightly connected applications such as parallel finite element analysis (FEA) have been attempted. In this paper we focus on an iterative solver which is often used in FEA: the conjugate gradient (CG) method. By using both predictions and numerical experiments we evaluate the performance of the CG method parallelized via domain decomposition. From numerical experiments we measure the inter-cluster execution time and find it to be close to that obtained by the slowest cluster. At the level of performance currently found in most WAN systems, as long as the number of processes is kept adequately small, the increase in communication cost due to WAN is small relative to the computation cost. For a very large test model which realistically requires the utilization of remote resources, the work ratio stays above 86% even for 64 processes. Especially for very large models, using the distributed environment is expected to be a practical methodology even for parallel computations with communication as frequent as the one found in FEA. To evaluate the feasibility of parallel FEA on the C-of-C, we have proposed methods for a priori finding the optimal number of processes.\",\"PeriodicalId\":196913,\"journal\":{\"name\":\"Journal of Computational Science and Technology\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1299/JCST.3.460\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1299/JCST.3.460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着广域网基础设施的快速发展和网格中间件的发展,集群的集群已经成为执行计算要求高的应用程序的一种现实的方法。虽然分布式计算或松散连接的应用程序已经成功地移植到网格环境中,但很少有紧密连接的应用程序(如并行有限元分析(FEA))被尝试。本文重点讨论了有限元分析中常用的一种迭代求解方法:共轭梯度法。通过预测和数值实验,我们评估了通过域分解并行化的CG方法的性能。通过数值实验,我们测量了集群间的执行时间,发现它与最慢集群的执行时间接近。在目前大多数广域网系统的性能水平上,只要进程的数量保持足够小,由于广域网而增加的通信成本相对于计算成本来说是很小的。对于实际需要利用远程资源的非常大的测试模型,即使对于64个进程,工作比率也保持在86%以上。特别是对于非常大的模型,使用分布式环境被认为是一种实用的方法,甚至对于像有限元分析中那样频繁通信的并行计算也是如此。为了评估C-of-C并行有限元分析的可行性,我们提出了先验寻找最优过程数的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Feasibility Study and Predictability on the Performance of Parallel FEM Using Clusters on WAN
With the rapid growth of WAN infrastructures and development of Grid middleware, the cluster-of-clusters has become a realistic methodology for executing computationdemanding applications. While distributed computing or loosely connected applications have been successfully ported to the Grid environment, few tightly connected applications such as parallel finite element analysis (FEA) have been attempted. In this paper we focus on an iterative solver which is often used in FEA: the conjugate gradient (CG) method. By using both predictions and numerical experiments we evaluate the performance of the CG method parallelized via domain decomposition. From numerical experiments we measure the inter-cluster execution time and find it to be close to that obtained by the slowest cluster. At the level of performance currently found in most WAN systems, as long as the number of processes is kept adequately small, the increase in communication cost due to WAN is small relative to the computation cost. For a very large test model which realistically requires the utilization of remote resources, the work ratio stays above 86% even for 64 processes. Especially for very large models, using the distributed environment is expected to be a practical methodology even for parallel computations with communication as frequent as the one found in FEA. To evaluate the feasibility of parallel FEA on the C-of-C, we have proposed methods for a priori finding the optimal number of processes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design and Optimization of a Gas Burner for TPV Application Experimental and Numerical Approaches for Reliability Evaluation of Electronic Packaging Two-Layer Viscous Shallow-Water Equations and Conservation Laws Lattice Boltzmann Simulation of Two-Phase Viscoelastic Fluid Flows An Inexact Balancing Preconditioner for Large-Scale Structural Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1