首页 > 最新文献

Proceedings of the Fifth Distributed Memory Computing Conference, 1990.最新文献

英文 中文
Hierarchical Domain Decomposition With Unitary Load Balancing For Electromagnetic Particle-In-Cell Codes 单元内电磁粒子码的统一负载均衡层次域分解
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556303
P. M. Campbell, Se, Albuquerque, Edward, A., Carmona, D. Walker
This paper presents a new approach to parallelizing particle-in-cell (PIC) algorithms used in the numeri- cal simulation of three-dimensional plasmas on MIMD multicomputers. Two new concepts are introduced: unitary load balance and hierarchical decomposition. The combined load for particle and field calculations ouer the time step is balanced together to form a single spatial decomposition. The unitary load scheme permits the load to be approzimately balanced while requiring less communication. Decomposition and dynamic bal- ancing as performed in each of the coordinate directions independently (hierarchical), and is particularly efi- cient when load imbalance propagates preferentially in a given direction. The hierarchical decomposition also minimizes the amount of particles that cross bound- ary regions, thereby decreasing communication. A local load balancing method is also introduced which allows rows or columns of processors to perform dynamic load balancing locally and in parallel.
本文提出了一种用于MIMD多计算机三维等离子体数值模拟的并行化粒子池(PIC)算法。引入了两个新概念:统一负载平衡和分层分解。粒子和场计算在时间步长的组合载荷被平衡在一起,形成一个单一的空间分解。单一负载方案允许负载在需要较少通信的情况下大致平衡。分解和动态平衡在每个坐标方向上独立执行(分层),并且当负载不平衡优先在给定方向上传播时特别有效。分层分解还使跨越边界区域的粒子数量最小化,从而减少了通信。还引入了一种本地负载平衡方法,允许处理器的行或列在本地并行执行动态负载平衡。
{"title":"Hierarchical Domain Decomposition With Unitary Load Balancing For Electromagnetic Particle-In-Cell Codes","authors":"P. M. Campbell, Se, Albuquerque, Edward, A., Carmona, D. Walker","doi":"10.1109/DMCC.1990.556303","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556303","url":null,"abstract":"This paper presents a new approach to parallelizing particle-in-cell (PIC) algorithms used in the numeri- cal simulation of three-dimensional plasmas on MIMD multicomputers. Two new concepts are introduced: unitary load balance and hierarchical decomposition. The combined load for particle and field calculations ouer the time step is balanced together to form a single spatial decomposition. The unitary load scheme permits the load to be approzimately balanced while requiring less communication. Decomposition and dynamic bal- ancing as performed in each of the coordinate directions independently (hierarchical), and is particularly efi- cient when load imbalance propagates preferentially in a given direction. The hierarchical decomposition also minimizes the amount of particles that cross bound- ary regions, thereby decreasing communication. A local load balancing method is also introduced which allows rows or columns of processors to perform dynamic load balancing locally and in parallel.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128128179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
An Interactive Environment for Data Partitioning and Distribution 数据分区和分布的交互式环境
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556331
V. Balasundaram, Geoffrey C. Fox, K. Kennedy, U. Kremer
An a.pproach to distributed riieiiiory pa.ralle1 programining that has recently become popular is oue where the programmer explicitly specilies t.he data decoiriposit.ion using language extensions, and a. compiler geuerates all the coiriinunicatioii. While this frees the prograiniuer froin tlie tedium of thinking about message-passing, no assistance is provided in determining the data decouiposition scheme that gives the best performance on tlie target machine. In this paper, we propose an interactive software tool that provides assistance for this very task. The proposed tool also computes performance estimates for any chosen data partitioning scheme, allowing tlie programmer to experiment with several different stra.tegies without ever running the program on the rnacliine.
一种分布式历史数据处理方法。最近流行起来的并行编程是程序员显式指定数据解码的编程。使用语言扩展,编译器生成所有的通信。虽然这将程序员从考虑消息传递的单调乏味中解放出来,但是在确定在目标机器上提供最佳性能的数据解耦方案方面没有提供任何帮助。在本文中,我们提出了一个交互式软件工具,为这项任务提供帮助。所提出的工具还计算任何选择的数据分区方案的性能估计,允许编程人员尝试几种不同的方案。没有在生产线上运行程序的情况下。
{"title":"An Interactive Environment for Data Partitioning and Distribution","authors":"V. Balasundaram, Geoffrey C. Fox, K. Kennedy, U. Kremer","doi":"10.1109/DMCC.1990.556331","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556331","url":null,"abstract":"An a.pproach to distributed riieiiiory pa.ralle1 programining that has recently become popular is oue where the programmer explicitly specilies t.he data decoiriposit.ion using language extensions, and a. compiler geuerates all the coiriinunicatioii. While this frees the prograiniuer froin tlie tedium of thinking about message-passing, no assistance is provided in determining the data decouiposition scheme that gives the best performance on tlie target machine. In this paper, we propose an interactive software tool that provides assistance for this very task. The proposed tool also computes performance estimates for any chosen data partitioning scheme, allowing tlie programmer to experiment with several different stra.tegies without ever running the program on the rnacliine.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"194 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121612961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 98
Hypercube Algorithm for Image Decomposition and Analysis in the Wavelet Representation 小波表示中图像分解与分析的超立方体算法
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.555380
T. Huntsberger, B.A. Huntsberger
The tremendous amount of data contained in an image oftentimes precludes the extraction of useful information in real-time environments. A multiresolution representation can be used to obtain structural properties of a single image or sequences of images[8]. These structural properties are useful for such operations as texture analysis, image segmentation, object identification and stereo mat ching[7,12,11].
在实时环境中,图像中包含的大量数据常常阻碍了有用信息的提取。多分辨率表示可用于获取单幅图像或图像序列的结构属性[8]。这些结构属性对于纹理分析、图像分割、目标识别和立体拼接等操作非常有用[7,12,11]。
{"title":"Hypercube Algorithm for Image Decomposition and Analysis in the Wavelet Representation","authors":"T. Huntsberger, B.A. Huntsberger","doi":"10.1109/DMCC.1990.555380","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555380","url":null,"abstract":"The tremendous amount of data contained in an image oftentimes precludes the extraction of useful information in real-time environments. A multiresolution representation can be used to obtain structural properties of a single image or sequences of images[8]. These structural properties are useful for such operations as texture analysis, image segmentation, object identification and stereo mat ching[7,12,11].","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115564436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Embeddings, Communication and Performance of Algorithms in Faulty Hypercubes 故障超立方体中算法的嵌入、通信和性能
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556409
Jin-Kun Wang, F. Ozguner
In many parallel algorithms in hypercubes, linear arrays and 2-D meshes are embedded for the computations that require local communication while the hypercube topology is used for global communication. In this paper, embedding and global data communication schemes are developed for faulty hypercubes and studied in the context of algorithms. The schemes are also applicable to incomplete hypercubes resulting from the allocation of subcubes to different users.
在许多超立方体并行算法中,线性阵列和二维网格被嵌入到需要局部通信的计算中,而超立方体拓扑被用于全局通信。本文提出了故障超立方体的嵌入和全局数据通信方案,并在算法的背景下进行了研究。该方案也适用于由于将子数据集分配给不同用户而导致的不完整超数据集。
{"title":"Embeddings, Communication and Performance of Algorithms in Faulty Hypercubes","authors":"Jin-Kun Wang, F. Ozguner","doi":"10.1109/DMCC.1990.556409","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556409","url":null,"abstract":"In many parallel algorithms in hypercubes, linear arrays and 2-D meshes are embedded for the computations that require local communication while the hypercube topology is used for global communication. In this paper, embedding and global data communication schemes are developed for faulty hypercubes and studied in the context of algorithms. The schemes are also applicable to incomplete hypercubes resulting from the allocation of subcubes to different users.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129001744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Empirical Comparison of Heuristic Load Distribution in Point-to-Point Multicomputer Networks 点对点多计算机网络中启发式负载分配的经验比较
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556309
D. Grunwald, B. Nazief, D. Reed
The study compared several load placement algorithms using instrumented programs and synthetic program models. Salient characteristics of these program traces (total computation time, total number of messages sent, and average message time) span two orders of magnitude. Load distribution algorithms determine the initial placement for processes, a precursor to the more general problem of load redistribution. It is found that desirable workload distribution strategies will place new processes globally, rather than locally, to spread processes rapidly, but that local information should be used to refine global placement.
该研究比较了几种使用仪表程序和综合程序模型的负载分配算法。这些程序跟踪的显著特征(总计算时间、发送的消息总数和平均消息时间)跨越两个数量级。负载分配算法决定进程的初始位置,这是更普遍的负载再分配问题的前兆。研究发现,理想的工作负载分布策略将在全球而不是在本地放置新流程,以快速传播流程,但是应该使用本地信息来优化全局放置。
{"title":"Empirical Comparison of Heuristic Load Distribution in Point-to-Point Multicomputer Networks","authors":"D. Grunwald, B. Nazief, D. Reed","doi":"10.1109/DMCC.1990.556309","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556309","url":null,"abstract":"The study compared several load placement algorithms using instrumented programs and synthetic program models. Salient characteristics of these program traces (total computation time, total number of messages sent, and average message time) span two orders of magnitude. Load distribution algorithms determine the initial placement for processes, a precursor to the more general problem of load redistribution. It is found that desirable workload distribution strategies will place new processes globally, rather than locally, to spread processes rapidly, but that local information should be used to refine global placement.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129076007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Transferring Parallel Processing Technology To Undergraduate Computer Science Students 向计算机专业本科生传授并行处理技术
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556405
W. Thacker, O.E. Katter
To compete successfully in a world that is becoming more international and more competitive, we must increasingly become more effective in using technology, especially in our educational institutions. It is clear that there is a strong parallel between the way current developments in information technology are significantly effecting our business organizations and the way machines helped to transform our society during the Industrial Revolution. Academic organizations must accelerate the integration of new information technology, such as parallel processing methodology into the classroom for our economic growth plans to be achieved. This paper discusses the Winthrop College Computer Science Department's methodology to increase the parallel processing content and experience in our educational offerings.
为了在这个日益国际化、竞争日益激烈的世界中成功竞争,我们必须越来越有效地利用技术,尤其是在我们的教育机构中。很明显,当前信息技术的发展对我们的商业组织产生了重大影响,而机器在工业革命期间帮助改变了我们的社会,这两者之间有着很强的相似性。为了实现我们的经济增长计划,学术组织必须加速将新的信息技术,如并行处理方法整合到课堂中。本文讨论了Winthrop学院计算机科学系在我们的教育产品中增加并行处理内容和经验的方法。
{"title":"Transferring Parallel Processing Technology To Undergraduate Computer Science Students","authors":"W. Thacker, O.E. Katter","doi":"10.1109/DMCC.1990.556405","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556405","url":null,"abstract":"To compete successfully in a world that is becoming more international and more competitive, we must increasingly become more effective in using technology, especially in our educational institutions. It is clear that there is a strong parallel between the way current developments in information technology are significantly effecting our business organizations and the way machines helped to transform our society during the Industrial Revolution. Academic organizations must accelerate the integration of new information technology, such as parallel processing methodology into the classroom for our economic growth plans to be achieved. This paper discusses the Winthrop College Computer Science Department's methodology to increase the parallel processing content and experience in our educational offerings.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128784747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypercube Simulation of Electric Fish Potentials 鱼电势的超立方体模拟
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.555422
Roy Williams, B. Rasnow, Christopher Assad
We present a simulation of the electrosensory input of the weakly electric fish Apteronotus leptorhynchus. This fish senses its environment by producing a sinusoidal voltage difference between its body and tail sections, causing an electric field and a current distribution in the surrounding water. If an object is nearby which has different electrical conductivity from the surrounding water, the current distribution is disturbed on the skin of the fish. The fish senses this difference from the usual current distribution, and infers the presence and location of the object. Mathematically, the problem is to solve a potential equation in the domain exterior to the fish with Cauchy boundary conditions, in the presence of an induced dipole arising from the object, and extract the potential difference across the fish skin. We have created an unstructured triangular mesh covering the two-dimensional manifold of the fish skin, using the distributed Irregular Mesh Environment (DIME), then used the Boundary Element Method to solve for the potential derivative at the fish skin. The computational problem is the solution of a full set of simultaneous linear equations, where there is an equation for each node of the boundary mesh, typically about 100 - 200. We have used an NCUBE hypercube to calculate the matrix elements and solve these equations, once for each relative position of the fish and the test object. We present some early results from the simulation.
我们提出了弱电鱼的电感觉输入的模拟leptorhynchus。这种鱼通过在身体和尾巴之间产生正弦电压差来感知环境,从而在周围的水中产生电场和电流分布。如果附近有一个物体的导电性与周围的水不同,那么鱼皮肤上的电流分布就会受到干扰。鱼感觉到这种不同于通常的水流分布,并推断出物体的存在和位置。在数学上,问题是在物体产生的感应偶极子存在的情况下,在具有柯西边界条件的鱼外域求解势方程,并提取鱼皮上的电位差。利用分布式不规则网格环境(DIME)在鱼皮的二维流形上创建了一个非结构化的三角形网格,然后利用边界元法求解鱼皮处的势导数。计算问题是求解一套完整的联立线性方程,其中边界网格的每个节点都有一个方程,通常约为100 - 200。我们使用了一个NCUBE超立方体来计算矩阵元素并求解这些方程,对于鱼和测试对象的每个相对位置都求解一次。我们给出了模拟的一些早期结果。
{"title":"Hypercube Simulation of Electric Fish Potentials","authors":"Roy Williams, B. Rasnow, Christopher Assad","doi":"10.1109/DMCC.1990.555422","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555422","url":null,"abstract":"We present a simulation of the electrosensory input of the weakly electric fish Apteronotus leptorhynchus. This fish senses its environment by producing a sinusoidal voltage difference between its body and tail sections, causing an \u0000electric field and a current distribution in the surrounding \u0000water. If an object is nearby which has different electrical conductivity from the surrounding water, the current distribution is disturbed on the skin of the fish. The fish senses this difference from the usual current distribution, and infers the presence and location of the object. \u0000 \u0000Mathematically, the problem is to solve a potential equation in the domain exterior to the fish with Cauchy boundary conditions, in the presence of an induced dipole arising from the object, and extract the potential difference across the fish skin. \u0000 \u0000We have created an unstructured triangular mesh covering the two-dimensional manifold of the fish skin, using the distributed Irregular Mesh Environment (DIME), then used the Boundary Element Method to solve for the potential derivative at the fish skin. \u0000 \u0000The computational problem is the solution of a full set of simultaneous linear equations, where there is an equation for each node of the boundary mesh, typically about 100 - 200. We have used an NCUBE hypercube to calculate the matrix elements and solve these equations, once for each relative position of the fish and the test object. We present some early results from the simulation.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124661796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Studies of Electron-Molecule Collisions on the Mark IIIfp Hypercube Mark IIIfp超立方体上电子-分子碰撞的研究
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.555426
P. Hipes, C. Winstead, M. Lima, V. McKoy
We report on a distributed memory implementation and initial applications of a program for calculating electron-molecule collision cross sections. Runs on the Mark IIIfp hypercube show that large-grain MIMD machines are well suited for these applications. Some results of studies of e^--Si_2H_6 and e^--SiF_4 collisions will be discussed.
我们报告了一个计算电子-分子碰撞截面的程序的分布式存储器实现和初步应用。在Mark IIIfp超立方体上的运行表明,大粒度MIMD机器非常适合这些应用。讨论了e^—Si_2H_6和e^—SiF_4碰撞的一些研究结果。
{"title":"Studies of Electron-Molecule Collisions on the Mark IIIfp Hypercube","authors":"P. Hipes, C. Winstead, M. Lima, V. McKoy","doi":"10.1109/DMCC.1990.555426","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555426","url":null,"abstract":"We report on a distributed memory implementation and initial applications of a program for calculating electron-molecule collision cross sections. Runs on the Mark IIIfp hypercube show that large-grain MIMD machines are well suited for these applications. Some results of studies of e^--Si_2H_6 and e^--SiF_4 collisions will be discussed.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116695540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A General Framework for Complex Time-Driven Simulations on Hypercubes 超立方体上复杂时间驱动模拟的通用框架
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.555372
D. Meier, K. Cloud, J. C. Horvath, L.D. Allan, W. Hammond, H. Maxfield
We describe a general framework for building and running complex time-driven simulations with several levels of concurrency. The framework has been implemented on the Caltech/JPL Mark IIIfp hypercube using the Centaur communications protocol. Our framework allows the programmer to break the hypercube up into one or more subcubes of arbitrary size (task parallelism). Each subcube runs a separate application using data parallelism and synchronous communications internal to the subcube. Communications between subcubes are performed with asynchronous messages. Subcubes can each define their own parameters and commands which drive their particular application. These are collected and organized by the Control Processor (CP) in order that the entire simulation can be driven from a single command-driven shell. This system allows several programmers to develop disjoint pieces of a large simulation in parallel and to then integrate them with little effort. Each programmer is, of course, also able to take advantage of the separate data and I/O processors on each hypercube node in order to overlap calculation and communication (on-board parallelism) as well as the pipelined floating point processor on each node (pipelined processor parallelism). We show, as an example of the framework, a large space defense simulation. Functions (sensing, tracking, etc.) each comprise a subcube; functions are collected into defense platforms (satellites); and many platforms comprise the defense architecture. Software in the CP uses simple input to determine the node allocation to each function based on the desired defense architecture and number of platforms simulated in the hypercube. This allows many different architectures to be simulated. The set of simulated platforms, the results, and the messages between them are shown on color graphics displays. The methods used herein can be generalized to other simulations of a similar nature in a straightforward manner.
我们描述了一个通用框架,用于构建和运行具有多个并发级别的复杂时间驱动模拟。该框架已在加州理工学院/喷气推进实验室Mark IIIfp超立方体上使用Centaur通信协议实现。我们的框架允许程序员将超立方体分解为一个或多个任意大小的子立方体(任务并行性)。每个子多维数据集使用子多维数据集内部的数据并行性和同步通信运行一个单独的应用程序。子数据集之间的通信是通过异步消息执行的。每个子数据集都可以定义自己的参数和命令,以驱动其特定的应用程序。这些由控制处理器(CP)收集和组织,以便可以从单个命令驱动的shell驱动整个仿真。该系统允许多个程序员并行开发大型仿真的不相交部分,然后以很少的努力将它们集成在一起。当然,每个程序员也能够利用每个超立方体节点上独立的数据和I/O处理器来重叠计算和通信(板上并行性)以及每个节点上的流水线浮点处理器(流水线处理器并行性)。作为框架的一个例子,我们展示了一个大型空间防御模拟。每个功能(传感、跟踪等)都包含一个子立方体;功能收集到防御平台(卫星);许多平台组成了防御体系结构。CP中的软件使用简单的输入,根据所需的防御体系结构和超立方体中模拟的平台数量,确定每个功能的节点分配。这允许模拟许多不同的体系结构。在彩色图形显示器上显示了模拟平台集、结果和它们之间的消息。本文使用的方法可以以直接的方式推广到具有类似性质的其他模拟。
{"title":"A General Framework for Complex Time-Driven Simulations on Hypercubes","authors":"D. Meier, K. Cloud, J. C. Horvath, L.D. Allan, W. Hammond, H. Maxfield","doi":"10.1109/DMCC.1990.555372","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555372","url":null,"abstract":"We describe a general framework for building and running complex time-driven simulations with several levels of concurrency. The framework has been implemented on the Caltech/JPL Mark IIIfp hypercube using the Centaur communications protocol. Our framework allows the programmer to break the hypercube up into one or more subcubes of arbitrary size (task parallelism). Each subcube runs a separate application using data parallelism and synchronous communications internal to the subcube. Communications between subcubes are performed with asynchronous messages. Subcubes can each define their own parameters and commands which drive their particular application. These are collected and organized by the Control Processor (CP) in order that the entire simulation can be driven from a single command-driven shell. This system allows several programmers to develop disjoint pieces of a large simulation in parallel and to then integrate them with little effort. Each programmer is, of course, also able to take advantage of the separate data and I/O processors on each hypercube node in order to overlap calculation and communication (on-board parallelism) as well as the pipelined floating point processor on each node (pipelined processor parallelism). \u0000 \u0000We show, as an example of the framework, a large space defense simulation. Functions (sensing, tracking, etc.) each comprise a subcube; functions are collected into defense platforms (satellites); and many platforms comprise the defense architecture. Software in the CP uses simple input to determine the node allocation to each function based on the desired defense architecture and number of platforms simulated in the hypercube. This allows many different architectures to be simulated. The set of simulated platforms, the results, and the messages between them are shown on color graphics displays. The methods used herein can be generalized to other simulations of a similar nature in a straightforward manner.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"165 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125171224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Distributed Algorithms for Multi-Channel Broadcast Networks 多通道广播网络的分布式算法
Pub Date : 1990-04-08 DOI: 10.1109/DMCC.1990.556387
Xiaojun Guan, M. Langston
A distributed algorithm is time-space optimal if it achieves optimal speedup and if it uses only a constant amount of extra space when the number of processors is fixed. In this brief paper, we outline a distributed algorithm for merging that, given a multi-channel broadcast network with k processors, merges two sorted lists of total length n in O(n/b+logk) time and O(k) extra space, and are thus time-space optimal for any fixed value of IC that satisfies n 2 klogk.
如果分布式算法实现了最佳加速,并且在处理器数量固定时仅使用恒定数量的额外空间,则该算法是时间-空间最优的。在这篇简短的文章中,我们概述了一种分布式归并算法,给定一个有k个处理器的多通道广播网络,在O(n/b+logk)时间和O(k)额外空间内归并两个总长度为n的排序列表,因此对于满足n2klogk的任何固定IC值都是时空最优的。
{"title":"Distributed Algorithms for Multi-Channel Broadcast Networks","authors":"Xiaojun Guan, M. Langston","doi":"10.1109/DMCC.1990.556387","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556387","url":null,"abstract":"A distributed algorithm is time-space optimal if it achieves optimal speedup and if it uses only a constant amount of extra space when the number of processors is fixed. In this brief paper, we outline a distributed algorithm for merging that, given a multi-channel broadcast network with k processors, merges two sorted lists of total length n in O(n/b+logk) time and O(k) extra space, and are thus time-space optimal for any fixed value of IC that satisfies n 2 klogk.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122277601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the Fifth Distributed Memory Computing Conference, 1990.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1