Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation: Latest Papers

Adapting to load on workstation clusters
Robert Brunner, L. Kalé
Desktop workstations represent a largely untapped source of computational power for parallel computing. Two of the main problems in utilizing these workstations are developing strategies for migrating load so that partially loaded workstations can contribute CPU cycles to the computation, and making dynamically migratable application programs easy to write. This paper describes object arrays, a construct which makes dynamically migratable applications easier to write, and a simple strategy for migrating load on a workstation cluster.
DOI: 10.1109/FMPC.1999.750590
Published: 1999-02-21
Citations: 38
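The abstract above mentions "a simple strategy for migrating load" without describing it. The following is only a minimal sketch of one greedy placement rule, assuming each migratable object has a measured load and each workstation reports the fraction of its CPU that is free for the computation. The names `rebalance`, `objects`, and `capacity` are hypothetical and are not taken from the paper.

```python
def rebalance(objects, capacity):
    """Place migratable objects on workstations according to free CPU.

    objects:  dict mapping object id -> measured load (arbitrary units)
    capacity: dict mapping workstation name -> fraction of CPU available, in (0, 1]
    Returns a dict mapping object id -> workstation name.
    """
    assigned = {ws: 0.0 for ws in capacity}  # load placed on each workstation so far
    placement = {}
    # Place the heaviest objects first (longest-processing-time heuristic).
    for obj, load in sorted(objects.items(), key=lambda kv: (-kv[1], kv[0])):
        # Choose the workstation that would finish its work soonest if it
        # also took this object; ties are broken by workstation name.
        best = min(capacity,
                   key=lambda ws: ((assigned[ws] + load) / capacity[ws], ws))
        assigned[best] += load
        placement[obj] = best
    return placement


if __name__ == "__main__":
    # Workstation "b" is half busy with its owner's jobs, so it gets less work.
    print(rebalance({0: 4, 1: 4, 2: 4, 3: 4}, {"a": 1.0, "b": 0.5}))
```

A partially loaded workstation still receives some objects under this rule, only fewer of them, which matches the goal stated in the abstract of letting such machines contribute CPU cycles.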
Java Grande: software infrastructure for HPCC
G. Fox
We describe the definition, motivation, and current status of Java Grande activities. We introduce three roles for Java in Grande programming, at the client, middleware, or backend tiers of a computing system. We start with Java as a language and describe where it is clearly good and where it could be good. The Java Grande Forum has numerical and distributed computing working groups, and its projects include the study of changes to Java and its runtime to enhance Grande applications and their programming environment community. An important activity is defining seamless interfaces that allow universal access to general hosts. Benchmarks for all sorts of Grande applications are critical. We discuss Java for parallel computing, including message passing (MPI) and data parallelism.
DOI: 10.1109/FMPC.1999.750606
Published: 1999-02-21
Citations: 0
HPF implementation of ARC3D
M. Frumkin, J. Yan
We present an HPF implementation of the ARC3D code, along with profiling and performance data on the SGI Origin 2000. We discuss the advantages and limitations of HPF as a parallel programming language for CFD applications. To achieve good performance, we used data distributions optimized for implementing the implicit and explicit operators of the solver and the boundary conditions. We compare the results with MPI and directive-based implementations.
DOI: 10.1109/FMPC.1999.750587
Published: 1999-02-21
Citations: 4
Distributed control parallelism for multidisciplinary design of a high speed civil transport
D. T. Krasteva, C. Baker, L. T. Watson, B. Grossman, W. Mason, R. Haftka
Large-scale multidisciplinary design optimization (MDO) problems often involve massive computation over vast data sets. Regardless of the MDO problem-solving methodology, advanced computing technologies and architectures are indispensable. The data parallelism inherent in some engineering problems makes massively parallel architectures a natural choice, but efficiently harnessing the power of massive parallelism requires sophisticated algorithms and techniques. This paper presents an effort to apply massively scalable distributed control and dynamic load balancing techniques to the reasonable design space identification phase of a variable complexity approach to the multidisciplinary design optimization of a high speed civil transport (HSCT). The scalability and performance of two dynamic load balancing techniques, random polling and global round robin with message combining, and two termination detection schemes, token passing and global task count, are studied. The extent to which such techniques apply to other MDO paradigms, and the potential for parallel multidisciplinary design with current large-scale disciplinary codes, are of particular interest.
DOI: 10.1109/FMPC.1999.750597
Published: 1999-02-21
Citations: 2
The preliminary evaluation of MBP-light with two protocol policies for a massively parallel processor-JUMP-1
Hiroaki Inoue, K. Anjo, J. Yamamoto, J. Tanabe, Masaki Wakabayashi, M. Sato, H. Amano, K. Hiraki
A massively parallel processor called JUMP-1 has been developed to build an efficient cache-coherent distributed shared memory (DSM) on a large system with more than 1000 processors. Here, the dedicated processor that manages the DSM of JUMP-1, called MBP-light (MBP stands for Memory Based Processor), is introduced, and its preliminary performance with two protocol policies, update and invalidate, is evaluated. The simulation results indicate that, under both policies, simple operations such as the tag check and the collection and generation of acknowledgment packets are mostly processed by the hardware mechanisms in MBP-light without the aid of the core processor. Also, the buffer-register architecture adopted by the core processor in MBP-light is exploited well enough to process a protocol transaction under both policies.
DOI: 10.1109/FMPC.1999.750609
Published: 1999-02-21
Citations: 2
Latency tolerant algorithms for WAN based workstation clusters
Bernd Helzer, M. Clement, Q. Snell, Brigham Young
One of the biggest differences between traditional supercomputers and workstation clusters is the latency involved in sending a message between processors. Wide Area Network (WAN) based workstation clusters can experience significant latency between machines at different geographical locations. Advances in network technology can yield marginal improvements, but speed-of-light delays cannot be reduced. This research develops stencil algorithms that are more tolerant of latency. These algorithms can be used to solve finite element problems as well as other problems that rely on neighbor communication. Latency tolerant algorithms are essential if a large number of machines on the Internet are to be used to perform a parallel computation.
DOI: 10.1109/FMPC.1999.750584
Published: 1999-02-21
Citations: 0
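The abstract above does not say how its stencil algorithms tolerate latency. One widely used approach, shown here purely as an assumption and not as the authors' method, is to exchange a deeper halo of `k` ghost cells once every `k` steps instead of one ghost cell every step. The sketch below applies this to a 1-D three-point averaging stencil split across two simulated machines; the function names and the choice of stencil are hypothetical.

```python
def step(u):
    """One sweep of a 3-point averaging stencil; the two end values stay fixed."""
    inner = [(u[i - 1] + u[i] + u[i + 1]) / 3.0 for i in range(1, len(u) - 1)]
    return [u[0]] + inner + [u[-1]]


def serial(u, steps):
    """Reference result computed on a single machine."""
    for _ in range(steps):
        u = step(u)
    return u


def two_domain(u, steps, k):
    """Same computation on two halves that exchange k ghost cells every k steps.

    Assumes 1 <= k <= len(u) // 2. Returns (result, number of messages sent).
    """
    mid = len(u) // 2
    left, right = u[:mid], u[mid:]
    messages = 0
    done = 0
    while done < steps:
        burst = min(k, steps - done)
        # One exchange: each side receives `burst` ghost cells from its neighbor.
        left_ext = left + right[:burst]
        right_ext = left[-burst:] + right
        messages += 2
        # Each local step invalidates one more ghost cell from the outside in,
        # so after `burst` steps only the ghost cells are stale.
        for _ in range(burst):
            left_ext = step(left_ext)
            right_ext = step(right_ext)
        left = left_ext[:mid]      # drop the stale ghost cells
        right = right_ext[burst:]
        done += burst
    return left + right, messages


if __name__ == "__main__":
    u0 = [float(i % 5) for i in range(16)]
    result, msgs = two_domain(u0, steps=10, k=4)
    print(msgs, max(abs(a - b) for a, b in zip(result, serial(u0, 10))))
```

The deeper halo costs some redundant computation on the ghost cells, which is the usual trade-off when message latency dominates.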
New algorithms for efficient mining of association rules
Li Shen, Hong Shen, Ling Cheng
Discovery of association rules is an important data mining task. Several algorithms have been proposed to solve this problem. Most of them require repeated passes over the database, which incurs huge I/O overhead and, in parallel settings, high synchronization expense. A few algorithms try to reduce these costs, but they have weaknesses, such as often requiring costly pre-processing to obtain a vertical database layout and performing much redundant computation in parallel settings. We propose new association mining algorithms that overcome these drawbacks by minimizing the I/O cost and effectively controlling the computation cost. Experiments on well-known synthetic data show that our algorithms consistently outperform Apriori, one of the best algorithms for association mining, by factors ranging from 2 to 4 in most cases. Our algorithms are also easy to parallelize, and we present a parallelization of them based on a shared-nothing architecture. We observe that our parallel approach exploits parallelism more fully than two of the best existing parallel algorithms.
DOI: 10.1109/FMPC.1999.750605
Published: 1999-02-21
Citations: 41
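To make the "repeated passes over the database" cost in the abstract above concrete, here is a minimal sketch of the classic level-wise Apriori baseline that the paper compares against. It is not the authors' new algorithm, which the abstract does not describe; the function name and the toy data are illustrative assumptions.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support count} for all frequent itemsets."""
    transactions = [frozenset(t) for t in transactions]
    # Pass 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)
    k = 2
    while frequent:
        prev = set(frequent)
        # Join frequent (k-1)-itemsets into k-item candidates, then prune any
        # candidate that has an infrequent (k-1)-subset.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        candidates = {c for c in candidates
                      if all(frozenset(s) in prev for s in combinations(c, k - 1))}
        # One full scan of the database per level: this is the repeated-pass cost.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result


if __name__ == "__main__":
    db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
    for itemset, support in sorted(apriori(db, 3).items(),
                                   key=lambda kv: (len(kv[0]), sorted(kv[0]))):
        print(sorted(itemset), support)
```

Each iteration of the `while` loop rescans every transaction, so a database that does not fit in memory is read from disk once per itemset size.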
Distributed applet-based certifiable processing in client/server environments
Hongxia Jin, G. Sullivan, G. Masson
We describe and demonstrate the concept of Distributed Applet-based Certifiable Processing (DACP) in client/server environments for checking the correctness of computational results. DACP offers a low-overhead framework for Web-based client/server environments in which a server can partition a given computational problem into a set of sub-problems, distribute these sub-problems across a network to clients, and then efficiently certify the correctness of the sub-problem results returned by the clients before assembling them into a final answer to the original problem. The resource and time advantages of the DACP methodology are directly related to the effectiveness and efficiency offered by an innovative distributed implementation of the certification-trail approach to computational result checking. As a proof of concept, we apply the DACP methodology to a class of important computationally intensive problems. Our experimental assessment of DACP, performed using Java applets that we have developed, emphatically indicates that DACP offers significant advantages over other known result correctness checking techniques for reliable distributed computing in client/server environments.
DOI: 10.1109/FMPC.1999.750583
Published: 1999-02-21
Citations: 1
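The certification-trail idea named in the abstract above is that the party doing the work returns, along with its answer, extra data that lets a checker verify the answer more cheaply than recomputing it. The sketch below illustrates this with sorting, which is an assumed example problem; the abstract does not name the problems the paper uses, and `client_sort` and `server_certify` are hypothetical names.

```python
def client_sort(values):
    """Client side: sort the input and also return a certification trail.

    The trail is the permutation p such that output[i] == values[p[i]].
    """
    trail = sorted(range(len(values)), key=lambda i: values[i])
    output = [values[i] for i in trail]
    return output, trail


def server_certify(values, output, trail):
    """Server side: check the returned result in O(n) time without re-sorting."""
    n = len(values)
    if len(output) != n or len(trail) != n:
        return False
    seen = [False] * n
    for pos, src in enumerate(trail):
        # The trail must be a permutation of 0..n-1.
        if not isinstance(src, int) or not 0 <= src < n or seen[src]:
            return False
        seen[src] = True
        # Each output element must be the input element the trail points to.
        if output[pos] != values[src]:
            return False
        # The output must be in nondecreasing order.
        if pos > 0 and output[pos - 1] > output[pos]:
            return False
    return True


if __name__ == "__main__":
    data = [5, 2, 9, 2]
    out, trail = client_sort(data)
    print(out, server_certify(data, out, trail))
```

The checker rejects a wrong answer or a malformed trail, so an untrusted or faulty client cannot pass off an incorrect result by supplying a bad trail.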
Parallel simulation of two-phase flow problems using the finite element method
S. Aliabadi, Khalil Shujaee, T. Tezduyar
Parallel computation of unsteady two-phase flow problems is performed using a stabilized finite element method. The finite element formulations are written for fixed meshes and are based on the Navier-Stokes equations and an advection equation governing the motion of the interface function. The interface function, which takes two distinct values, serves as a marker identifying each fluid. This function is advected with the fluid velocity throughout the computational domain. To increase the accuracy of the method, an interface-sharpening/mass conservation algorithm is designed. The method has been implemented on the CRAY T3E and the IBM SP/6000 using the MPI libraries. We show the effectiveness of the method in simulating complex 3D problems, such as a two-fluid interface in a centrifuge tube, the operational stability of a partially filled tanker truck driving over a bump, and the hydrodynamic stability of ships.
DOI: 10.1109/FMPC.1999.750591
Published: 1999-02-21
Citations: 3
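The abstract above names its governing equations without writing them out. As a sketch only, and assuming the standard incompressible form with an interface function $\phi$ that equals 1 in fluid A and 0 in fluid B (the specific values, the body-force term, and the linear blending of material properties are assumptions not stated in the abstract), the system can be written as:

```latex
\begin{aligned}
\rho\left(\frac{\partial \mathbf{u}}{\partial t} + \mathbf{u}\cdot\nabla\mathbf{u}\right)
  &= -\nabla p + \nabla\cdot\bigl(2\mu\,\boldsymbol{\varepsilon}(\mathbf{u})\bigr) + \rho\,\mathbf{g},
  \qquad \boldsymbol{\varepsilon}(\mathbf{u}) = \tfrac{1}{2}\bigl(\nabla\mathbf{u} + (\nabla\mathbf{u})^{T}\bigr), \\
\nabla\cdot\mathbf{u} &= 0, \\
\frac{\partial \phi}{\partial t} + \mathbf{u}\cdot\nabla\phi &= 0, \\
\rho &= \phi\,\rho_A + (1-\phi)\,\rho_B, \qquad \mu = \phi\,\mu_A + (1-\phi)\,\mu_B .
\end{aligned}
```

The third line is the advection equation for the interface function: $\phi$ is carried along with the fluid velocity $\mathbf{u}$, so the boundary between its two values tracks the interface between the fluids on a fixed mesh.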
Scalability analysis of multidimensional wavefront algorithms on large-scale SMP clusters
A. Hoisie, O. Lubeck, H. Wasserman
We develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message passing environment. The model combines the separate contributions of computation and communication wavefronts. We validate the model on three supercomputer systems, with up to 500 processors, using data from an ASCI deterministic particle transport application, although the model is general to any wavefront algorithm implemented on a 2-D processor domain. We also use the model to estimate the performance and scalability of wavefront algorithms on the 100-TFLOPS computer systems expected to exist within the next decade. Our model shows that on a 1-billion-cell problem, single-node computation speed, not inter-processor communication performance as is widely believed, is the bottleneck. Finally, we present preliminary considerations that reveal the additional complexity associated with modeling wavefront algorithms on reduced-connectivity network topologies, such as clusters of SMPs.
DOI: 10.1109/FMPC.1999.750452
Published: 1999-02-21
Citations: 40
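The abstract above does not give the performance model itself. The following is a simplified textbook pipeline model, offered as an assumption rather than the authors' formulation: on a `px` by `py` processor grid, a wavefront that starts in one corner needs `px + py - 2` stages to reach the opposite corner, and each processor then works through `blocks` units of work. All names and the per-stage cost `t_comp + t_comm` are hypothetical.

```python
def wavefront_time(px, py, blocks, t_comp, t_comm):
    """Closed-form time for one sweep: pipeline fill plus steady-state work."""
    stages = blocks + (px - 1) + (py - 1)
    return stages * (t_comp + t_comm)


def efficiency(px, py, blocks, t_comp, t_comm):
    """Fraction of the sweep time a processor spends on useful computation."""
    return blocks * t_comp / wavefront_time(px, py, blocks, t_comp, t_comm)


def simulate(px, py, blocks, t_comp, t_comm):
    """Check the closed form by replaying the dependencies directly.

    Block b on processor (i, j) can start only after the same block has
    finished on the west and north neighbors and block b-1 has finished locally.
    """
    cost = t_comp + t_comm
    done = {}
    for b in range(blocks):
        for i in range(px):
            for j in range(py):
                ready = max(done.get((i - 1, j, b), 0.0),
                            done.get((i, j - 1, b), 0.0),
                            done.get((i, j, b - 1), 0.0))
                done[(i, j, b)] = ready + cost
    return done[(px - 1, py - 1, blocks - 1)]


if __name__ == "__main__":
    print(wavefront_time(4, 4, 10, 1.0, 0.5), simulate(4, 4, 10, 1.0, 0.5))
    print(efficiency(4, 4, 10, 1.0, 0.5))
```

In this simplified model, efficiency falls as the processor grid grows relative to the number of blocks per processor, because more of the sweep is spent filling the pipeline.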