首页 > 最新文献

[1993] Proceedings Seventh International Parallel Processing Symposium最新文献

英文 中文
CMMD I/O: a parallel Unix I/O cmd I/O: Unix并行I/O
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262828
Michael L. Best, A. Greenberg, C. Stanfill, L. W. Tucker
The authors propose a library providing Unix file system support for highly parallel distributed-memory computers. CMMD I/O supports Unix I/O commands on the CM-5 supercomputer. The overall objective of the library is to provide the node level parallel programmer with routines for opening, reading, writing a file, and so forth. The default behavior mimics standard Unix running on each node; individual nodes can independently perform file system operations. New extensions to the standard Unix file descriptor semantics provide for co-operative parallel I/O. New functions provide access to very large (multi-gigabyte) files.<>
作者提出了一个为高度并行分布式内存计算机提供Unix文件系统支持的库。CMMD I/O在CM-5超级计算机上支持Unix I/O命令。该库的总体目标是为节点级并行程序员提供用于打开、读取、写入文件等的例程。默认行为模仿在每个节点上运行的标准Unix;单个节点可以独立执行文件系统操作。标准Unix文件描述符语义的新扩展提供了协作并行I/O。新功能提供了对非常大(几gb)文件的访问。
{"title":"CMMD I/O: a parallel Unix I/O","authors":"Michael L. Best, A. Greenberg, C. Stanfill, L. W. Tucker","doi":"10.1109/IPPS.1993.262828","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262828","url":null,"abstract":"The authors propose a library providing Unix file system support for highly parallel distributed-memory computers. CMMD I/O supports Unix I/O commands on the CM-5 supercomputer. The overall objective of the library is to provide the node level parallel programmer with routines for opening, reading, writing a file, and so forth. The default behavior mimics standard Unix running on each node; individual nodes can independently perform file system operations. New extensions to the standard Unix file descriptor semantics provide for co-operative parallel I/O. New functions provide access to very large (multi-gigabyte) files.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115161131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
Barrier synchronization in distributed-memory multiprocessors using rendezvous primitives 分布式内存多处理器中使用会合原语的屏障同步
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262826
S. Gupta, D. Panda
This paper deals with barrier synchronization in wormhole routed distributed-memory multiprocessors. New rendezvous and multirendezvous synchronization primitives are proposed to implement a barrier between two and multiple processors, respectively. These primitives reduce the number of communication steps required to implement a barrier; thus, significantly reducing the synchronization overhead for networks with high communication start-up cost. Two algorithms for barrier synchronization on k-ary n-cube networks are presented. The rendezvous primitive allows one to synchronize all processors in nlog/sub 2/(k) steps. The multirendezvous primitive allows one to synchronize an arbitrary subset of processors in optimal number of communication steps depending on the ratio of the communication start-up (t/sub s/) to the link-propagation (t/sub p/) cost.<>
研究了虫洞路由分布式存储多处理器中的屏障同步问题。提出了新的会合和多会合同步原语,分别实现了两个处理器和多个处理器之间的屏障。这些原语减少了实现屏障所需的通信步骤;因此,显著降低了通信启动成本高的网络的同步开销。提出了k元n立方网络上的两种屏障同步算法。会合原语允许以nlog/sub 2/(k)步同步所有处理器。多交会原语允许人们根据通信启动(t/sub s/)与链路传播(t/sub p/)成本的比率,以最优通信步数同步任意处理器子集。
{"title":"Barrier synchronization in distributed-memory multiprocessors using rendezvous primitives","authors":"S. Gupta, D. Panda","doi":"10.1109/IPPS.1993.262826","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262826","url":null,"abstract":"This paper deals with barrier synchronization in wormhole routed distributed-memory multiprocessors. New rendezvous and multirendezvous synchronization primitives are proposed to implement a barrier between two and multiple processors, respectively. These primitives reduce the number of communication steps required to implement a barrier; thus, significantly reducing the synchronization overhead for networks with high communication start-up cost. Two algorithms for barrier synchronization on k-ary n-cube networks are presented. The rendezvous primitive allows one to synchronize all processors in nlog/sub 2/(k) steps. The multirendezvous primitive allows one to synchronize an arbitrary subset of processors in optimal number of communication steps depending on the ratio of the communication start-up (t/sub s/) to the link-propagation (t/sub p/) cost.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132120287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Efficient off-line routing of permutations on restricted access expanded delta networks 限制接入扩展增量网络中排列的高效脱机路由
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262894
I. Scherson, R. Subramanian
This paper presents an off-line algorithm for routing permutations on expanded delta networks (EDNs) with restricted access. Restricted access means that the number of elements to be permuted may exceed the number of inputs to the EDN. For every N-element permutation on an M-input EDN, the algorithm computes a routing that takes exactly 3N/M passes (assuming M divides N). On a certain class of EDNs, the number of passes can be reduced to 2N/M. For example, for every 16 K-element permutation on the 1 K-input global network of the MasPar MP-1 and MP-2, the algorithm computes a routing that takes exactly 32 passes. The time complexity of the algorithm is Theta (NlogN) sequentially, and Theta (log/sup 2/N) on an N-processor PRAM.<>
提出了一种用于限制接入的扩展增量网络(edn)路由置换的离线算法。限制访问意味着要排列的元素数量可能超过EDN的输入数量。对于M输入EDN上的每一个N个元素的排列,该算法计算出一条恰好需要3N/M次的路由(假设M除以N)。对于某一类EDN,通过次数可以减少到2N/M。例如,在MasPar MP-1和MP-2的1个k输入的全局网络中,对于每16个k元素的排列,该算法计算的路由恰好需要32次通过。算法的时间复杂度依次为Theta (NlogN),在N处理器的PRAM上为Theta (log/sup 2/N)。
{"title":"Efficient off-line routing of permutations on restricted access expanded delta networks","authors":"I. Scherson, R. Subramanian","doi":"10.1109/IPPS.1993.262894","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262894","url":null,"abstract":"This paper presents an off-line algorithm for routing permutations on expanded delta networks (EDNs) with restricted access. Restricted access means that the number of elements to be permuted may exceed the number of inputs to the EDN. For every N-element permutation on an M-input EDN, the algorithm computes a routing that takes exactly 3N/M passes (assuming M divides N). On a certain class of EDNs, the number of passes can be reduced to 2N/M. For example, for every 16 K-element permutation on the 1 K-input global network of the MasPar MP-1 and MP-2, the algorithm computes a routing that takes exactly 32 passes. The time complexity of the algorithm is Theta (NlogN) sequentially, and Theta (log/sup 2/N) on an N-processor PRAM.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116075581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A cluster-M based mapping methodology 基于集群m的映射方法
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262885
M. Eshaghian-Wilner, M. Shaaban
Cluster-M is a new parallel programming paradigm for designing portable software. The two main components of this paradigm are cluster-M specifications and cluster-M representations. Cluster-M specifications are high level machine independent parallel code which are mapped onto cluster-M representations, system graphs representing the topologies of the underlying architectures. An algorithm for generating cluster-M representations is presented. Also, a set of high-level constructs essential for writing cluster-M specifications are shown. Using these components, an efficient methodology is proposed to map parallel algorithms onto architectures.<>
Cluster-M是一种新的用于设计可移植软件的并行编程范式。该范式的两个主要组成部分是集群m规范和集群m表示。Cluster-M规范是高级机器独立的并行代码,它们被映射到Cluster-M表示,即表示底层架构拓扑的系统图。提出了一种生成簇- m表示的算法。此外,还显示了编写cluster-M规范所必需的一组高级结构。利用这些组件,提出了一种将并行算法映射到体系结构上的有效方法。
{"title":"A cluster-M based mapping methodology","authors":"M. Eshaghian-Wilner, M. Shaaban","doi":"10.1109/IPPS.1993.262885","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262885","url":null,"abstract":"Cluster-M is a new parallel programming paradigm for designing portable software. The two main components of this paradigm are cluster-M specifications and cluster-M representations. Cluster-M specifications are high level machine independent parallel code which are mapped onto cluster-M representations, system graphs representing the topologies of the underlying architectures. An algorithm for generating cluster-M representations is presented. Also, a set of high-level constructs essential for writing cluster-M specifications are shown. Using these components, an efficient methodology is proposed to map parallel algorithms onto architectures.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"20 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113958079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Parallel algorithms for height balancing binary trees 高度平衡二叉树的并行算法
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262903
Srinivasan Venkatraman, Alicia Kime, K. Srinivas
The authors present a simple parallel algorithm to height-balance a binary tree. The algorithm accepts any arbitrary binary tree as its input and yields an optimally shaped binary tree. For any arbitrary binary tree of n nodes the algorithm has a time complexity of O(lgn) and utilizes O(n) processors on a EREW PRAM model. The algorithm uses Euler tours and list ranking, which form the building blocks for many parallel algorithms.<>
提出了一种简单的二叉树高度平衡并行算法。该算法接受任意二叉树作为输入,并生成最优形状的二叉树。对于任意n个节点的二叉树,该算法的时间复杂度为0 (lgn),在EREW PRAM模型上使用了O(n)个处理器。该算法使用欧拉巡回和列表排序,它们构成了许多并行算法的构建块
{"title":"Parallel algorithms for height balancing binary trees","authors":"Srinivasan Venkatraman, Alicia Kime, K. Srinivas","doi":"10.1109/IPPS.1993.262903","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262903","url":null,"abstract":"The authors present a simple parallel algorithm to height-balance a binary tree. The algorithm accepts any arbitrary binary tree as its input and yields an optimally shaped binary tree. For any arbitrary binary tree of n nodes the algorithm has a time complexity of O(lgn) and utilizes O(n) processors on a EREW PRAM model. The algorithm uses Euler tours and list ranking, which form the building blocks for many parallel algorithms.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"188 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121066848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Supporting insertions and deletions in striped parallel filesystems 支持在条纹并行文件系统中插入和删除
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262921
T. Johnson
The dramatic improvements in the processing rates of parallel computers are turning many compute-bound jobs into IO-bound jobs. Parallel file systems have been proposed to better match IO throughput to processing power. Many parallel file systems stripe files across numerous disks; each disk has its own controller. A striped file can be appended (or prepended) to and maintain its structure. However, a block can't be inserted into or deleted from the middle of the file, since this would destroy the round robin striping structure of the file. The author presents a distributed file structure that maintains files in indexed striped extents on a message passing multiprocessor. This approach allows highly parallel random and sequential reads, and also allows insertion and deletion into the middle of the file.<>
并行计算机处理速度的显著提高正在将许多与计算机相关的工作转变为与io相关的工作。为了更好地匹配IO吞吐量和处理能力,人们提出了并行文件系统。许多并行文件系统在许多磁盘上分条文件;每个磁盘都有自己的控制器。条纹文件可以追加(或预先添加)到它的结构中并维持它的结构。但是,不能在文件中间插入或删除块,因为这会破坏文件的轮循分条结构。作者提出了一种分布式文件结构,该结构在消息传递的多处理器上以索引条纹区维护文件。这种方法允许高度并行的随机和顺序读取,也允许在文件中间插入和删除。
{"title":"Supporting insertions and deletions in striped parallel filesystems","authors":"T. Johnson","doi":"10.1109/IPPS.1993.262921","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262921","url":null,"abstract":"The dramatic improvements in the processing rates of parallel computers are turning many compute-bound jobs into IO-bound jobs. Parallel file systems have been proposed to better match IO throughput to processing power. Many parallel file systems stripe files across numerous disks; each disk has its own controller. A striped file can be appended (or prepended) to and maintain its structure. However, a block can't be inserted into or deleted from the middle of the file, since this would destroy the round robin striping structure of the file. The author presents a distributed file structure that maintains files in indexed striped extents on a message passing multiprocessor. This approach allows highly parallel random and sequential reads, and also allows insertion and deletion into the middle of the file.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114065302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fast parallel algorithms for model checking using BDDs 基于bdd的模型检测快速并行算法
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262834
Insup Lee, S. Rajasekaran
Binary decision diagrams (BDDs) have recently been used in model checking to verify systems with a large number of states (of the order of 5*10/sup 20/). Representing both the state space and the state transition graph as BDDs has been demonstrated to alleviate the problem of state space explosion. But there are limitations to this heuristic approach. Even systems of reasonable complexity have many more states. Also, the BDD approach might fail even on some simple systems. The authors propose the use of parallelism to extend the applicability of BDDs in model checking. They present fast algorithms for model checking that employ BDDs. The algorithms presented are much faster than the best known previous algorithms.<>
二进制决策图(bdd)最近被用于模型检查,以验证具有大量状态(5*10/sup / 20/)的系统。将状态空间和状态转移图同时表示为bdd,可以缓解状态空间爆炸的问题。但这种启发式方法也有局限性。即使是合理复杂的系统也有更多的状态。此外,BDD方法甚至可能在一些简单的系统上失败。作者建议使用并行性来扩展bdd在模型检查中的适用性。他们提出了使用bdd进行模型检查的快速算法。所提出的算法比以前最著名的算法要快得多。
{"title":"Fast parallel algorithms for model checking using BDDs","authors":"Insup Lee, S. Rajasekaran","doi":"10.1109/IPPS.1993.262834","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262834","url":null,"abstract":"Binary decision diagrams (BDDs) have recently been used in model checking to verify systems with a large number of states (of the order of 5*10/sup 20/). Representing both the state space and the state transition graph as BDDs has been demonstrated to alleviate the problem of state space explosion. But there are limitations to this heuristic approach. Even systems of reasonable complexity have many more states. Also, the BDD approach might fail even on some simple systems. The authors propose the use of parallelism to extend the applicability of BDDs in model checking. They present fast algorithms for model checking that employ BDDs. The algorithms presented are much faster than the best known previous algorithms.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"260 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122464285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A parallel MSF algorithm for planar graphs on a mesh and applications to image processing 网格上平面图形的并行MSF算法及其在图像处理中的应用
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262882
David Nassimi
The author presents an efficient O(n) parallel algorithm for finding a minimum-cost spanning forest (MSF) of a weighted undirected planar graph with n/sup 2/ edges, on an n*n mesh-connected computer. He also obtains efficient MSF-based O(n) algorithms for several application problems in image processing. In particular, he shows that an MSF can be used to obtain more efficient and elegant O(n) algorithms for the 'k-width connectivity' problem and the 'optical clustering' problem.<>
本文提出了在n*n网格连接计算机上求n/sup 2/条边的加权无向图的最小代价生成森林(MSF)的高效O(n)并行算法。他还为图像处理中的几个应用问题获得了高效的基于msf的O(n)算法。特别是,他展示了MSF可以用来获得更有效和优雅的O(n)算法来解决“k-宽度连接”问题和“光聚类”问题。
{"title":"A parallel MSF algorithm for planar graphs on a mesh and applications to image processing","authors":"David Nassimi","doi":"10.1109/IPPS.1993.262882","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262882","url":null,"abstract":"The author presents an efficient O(n) parallel algorithm for finding a minimum-cost spanning forest (MSF) of a weighted undirected planar graph with n/sup 2/ edges, on an n*n mesh-connected computer. He also obtains efficient MSF-based O(n) algorithms for several application problems in image processing. In particular, he shows that an MSF can be used to obtain more efficient and elegant O(n) algorithms for the 'k-width connectivity' problem and the 'optical clustering' problem.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122834845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Image processing with the MGAP: a cost effective solution 使用MGAP进行图像处理:具有成本效益的解决方案
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262835
R. Bajwa, R. Owens, M. J. Irwin
Image processing applications are suitable candidates for parallelism and have at least in part motivated the design and development of some of the pioneering massively parallel processing systems including the CLIP family, the DAP, the MPP and the GAPP. By exploiting design techniques and architectures suitable for VLSI technology one can now build hardware which provides comparable performance at a fraction of the cost it took for these earlier designs. The authors describe the use of a fine-grained, massively parallel VLSI processor array, the Micro-Grained Array Processor (MGAP) for image processing applications. The array and its support systems, in their current configuration, are designed to be used as a co-processor board in a desk-top workstation. The array can be used for applications other than image processing as well. The versatility of the array and the single broad design provide a cost effective solution for a variety of parallelizable tasks.<>
图像处理应用是并行的合适候选人,并且至少在一定程度上推动了一些开创性的大规模并行处理系统的设计和开发,包括CLIP家族,DAP, MPP和GAPP。通过开发适合VLSI技术的设计技术和架构,现在可以构建硬件,以提供与这些早期设计相当的性能,而成本只是这些早期设计的一小部分。作者描述了用于图像处理应用的细粒度大规模并行VLSI处理器阵列,微粒度阵列处理器(MGAP)的使用。该阵列及其支持系统,在其当前配置中,被设计用作桌面工作站的协处理器板。该阵列也可以用于图像处理以外的应用程序。阵列的多功能性和单宽设计为各种可并行化任务提供了经济有效的解决方案
{"title":"Image processing with the MGAP: a cost effective solution","authors":"R. Bajwa, R. Owens, M. J. Irwin","doi":"10.1109/IPPS.1993.262835","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262835","url":null,"abstract":"Image processing applications are suitable candidates for parallelism and have at least in part motivated the design and development of some of the pioneering massively parallel processing systems including the CLIP family, the DAP, the MPP and the GAPP. By exploiting design techniques and architectures suitable for VLSI technology one can now build hardware which provides comparable performance at a fraction of the cost it took for these earlier designs. The authors describe the use of a fine-grained, massively parallel VLSI processor array, the Micro-Grained Array Processor (MGAP) for image processing applications. The array and its support systems, in their current configuration, are designed to be used as a co-processor board in a desk-top workstation. The array can be used for applications other than image processing as well. The versatility of the array and the single broad design provide a cost effective solution for a variety of parallelizable tasks.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"141 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122914100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A partially asynchronous and iterative algorithm for distributed load balancing 分布式负载平衡的部分异步迭代算法
Pub Date : 1993-04-13 DOI: 10.1109/IPPS.1993.262906
Jianjian Song
Defining tasks as independent entities with identical execution time and workload as the number of tasks, the author proposes a partially asynchronous and iterative algorithm for distributed load balancing, shows its properties, and reports its simulation results. The algorithm converges geometrically according to a theorem proved elsewhere. He proves that the algorithm can achieve the maximum load imbalance of not more than (/sup d///sub 2/) tasks, where d is the diameter of a network. His simulation of a synchronous version of the algorithm not only validated the properties but also showed that the algorithm could produce much smaller load imbalances for hypercubes. The obtained imbalances for hypercubes of order up to ten were no more than two tasks and 56% of the sample runs produced only one task difference, as opposed to the theoretical maximum of six tasks.<>
将任务定义为执行时间和工作负载与任务数量相同的独立实体,提出了一种部分异步迭代的分布式负载均衡算法,展示了该算法的特性,并报告了仿真结果。该算法根据另一个已证明的定理在几何上收敛。他证明了该算法可以实现不超过(/sup d///sub 2/)个任务的最大负载不平衡,其中d为网络的直径。他对该算法的同步版本的模拟不仅验证了这些特性,而且还表明该算法可以为超立方体产生更小的负载不平衡。对于高达10阶的超立方体,获得的不平衡不超过两个任务,56%的样本运行只产生一个任务差异,而不是理论最大值6个任务。
{"title":"A partially asynchronous and iterative algorithm for distributed load balancing","authors":"Jianjian Song","doi":"10.1109/IPPS.1993.262906","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262906","url":null,"abstract":"Defining tasks as independent entities with identical execution time and workload as the number of tasks, the author proposes a partially asynchronous and iterative algorithm for distributed load balancing, shows its properties, and reports its simulation results. The algorithm converges geometrically according to a theorem proved elsewhere. He proves that the algorithm can achieve the maximum load imbalance of not more than (/sup d///sub 2/) tasks, where d is the diameter of a network. His simulation of a synchronous version of the algorithm not only validated the properties but also showed that the algorithm could produce much smaller load imbalances for hypercubes. The obtained imbalances for hypercubes of order up to ten were no more than two tasks and 56% of the sample runs produced only one task difference, as opposed to the theoretical maximum of six tasks.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126449985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 65
期刊
[1993] Proceedings Seventh International Parallel Processing Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1