首页 > 最新文献

Proceedings Sixth International Parallel Processing Symposium最新文献

英文 中文
Assignment of ADT modules to processors 将ADT模块分配给处理器
Pub Date : 1992-03-23 DOI: 10.1109/IPPS.1992.223069
L. Welch
The utilization of reusable software components can help to reduce the complexity of developing and maintaining parallel programs, but can lead to inefficiencies. The potential inefficiencies are addressed by providing a model of parallel execution (asynchronous remote procedure call, or ARPC) that not only speeds up programs, but also encourages the development of layered software by increasing parallelism in correspondence to increases in layering. The paper presents an efficient algorithm for assigning the reusable modules of a program to the processing elements of a parallel computer that supports ARPC. The objectives of the assignment algorithm are to permit maximum inter-module parallelism with the fewest possible PEs, and to prevent deadlock. The algorithm differs from previous solutions to the assignment problem in that the modules to be assigned are generic abstract data type modules, not procedures, tasks, or processes.<>
使用可重用的软件组件可以帮助减少开发和维护并行程序的复杂性,但可能导致效率低下。通过提供并行执行模型(异步远程过程调用或ARPC),可以解决潜在的低效率问题,该模型不仅可以加快程序的速度,而且还可以通过增加并行性来增加分层,从而鼓励分层软件的开发。本文提出了一种将程序可重用模块分配给支持ARPC的并行计算机处理单元的有效算法。赋值算法的目标是以最少的pe实现最大的模块间并行性,并防止死锁。该算法与以前的赋值问题解决方案的不同之处在于,要赋值的模块是通用的抽象数据类型模块,而不是过程、任务或进程
{"title":"Assignment of ADT modules to processors","authors":"L. Welch","doi":"10.1109/IPPS.1992.223069","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223069","url":null,"abstract":"The utilization of reusable software components can help to reduce the complexity of developing and maintaining parallel programs, but can lead to inefficiencies. The potential inefficiencies are addressed by providing a model of parallel execution (asynchronous remote procedure call, or ARPC) that not only speeds up programs, but also encourages the development of layered software by increasing parallelism in correspondence to increases in layering. The paper presents an efficient algorithm for assigning the reusable modules of a program to the processing elements of a parallel computer that supports ARPC. The objectives of the assignment algorithm are to permit maximum inter-module parallelism with the fewest possible PEs, and to prevent deadlock. The algorithm differs from previous solutions to the assignment problem in that the modules to be assigned are generic abstract data type modules, not procedures, tasks, or processes.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134477941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Parallel heap: improved and simplified 并行堆:改进和简化
Pub Date : 1992-03-23 DOI: 10.1109/IPPS.1992.223004
S. Prasad, N. Deo
Describes a new updated version of the data structure parallel heap. Employing p processors, a parallel heap allows detections of Theta (p) highest-priority items and insertion of Theta (p) new items each in O(logn) time on an EREW PRAM where n is the size of the parallel heap. Furthermore, it can efficiently utilize processors in the range 1 through n. This version does not require dedicated maintenance processors, and performs insertion and deletion in place.<>
描述数据结构并行堆的新更新版本。使用p个处理器,一个并行堆允许在一个EREW PRAM上在O(logn)时间内检测Theta (p)个最高优先级项并插入Theta (p)个新项,其中n是并行堆的大小。此外,它可以有效地利用范围为1到n的处理器。该版本不需要专门的维护处理器,并且可以就地执行插入和删除。
{"title":"Parallel heap: improved and simplified","authors":"S. Prasad, N. Deo","doi":"10.1109/IPPS.1992.223004","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223004","url":null,"abstract":"Describes a new updated version of the data structure parallel heap. Employing p processors, a parallel heap allows detections of Theta (p) highest-priority items and insertion of Theta (p) new items each in O(logn) time on an EREW PRAM where n is the size of the parallel heap. Furthermore, it can efficiently utilize processors in the range 1 through n. This version does not require dedicated maintenance processors, and performs insertion and deletion in place.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122544401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A distributed data-balanced dictionary based on the B-link tree 基于B-link树的分布式数据平衡字典
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223026
T. Johnson, A. Colbrook
Many concurrent dictionary data structures have been proposed, but usually in the context of shared memory multiprocessors. The paper presents an algorithm for a concurrent distributed B-tree that can be implemented on message passing parallel computers. This distributed B-tree (the dB-tree) replicates the interior nodes in order to improve parallelism and reduce message passing. It is shown how the dB-tree algorithm can be used to build an efficient, highly parallel, data-balanced distributed dictionary, the dE-tree.<>
已经提出了许多并发字典数据结构,但通常是在共享内存多处理器的背景下。本文提出了一种可在消息传递并行计算机上实现的并发分布式b树算法。这个分布式b树(dB-tree)复制内部节点,以提高并行性并减少消息传递。它展示了如何使用dB-tree算法来构建一个高效、高度并行、数据均衡的分布式字典dE-tree. b>
{"title":"A distributed data-balanced dictionary based on the B-link tree","authors":"T. Johnson, A. Colbrook","doi":"10.1109/IPPS.1992.223026","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223026","url":null,"abstract":"Many concurrent dictionary data structures have been proposed, but usually in the context of shared memory multiprocessors. The paper presents an algorithm for a concurrent distributed B-tree that can be implemented on message passing parallel computers. This distributed B-tree (the dB-tree) replicates the interior nodes in order to improve parallelism and reduce message passing. It is shown how the dB-tree algorithm can be used to build an efficient, highly parallel, data-balanced distributed dictionary, the dE-tree.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"475 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123044267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
A parallel approach to hybrid range image segmentation 一种并行的混合距离图像分割方法
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223024
Nicholas Giolmas, D. Watson, D. Chelberg, H. Siegel
Parallel processing methods are an attractive means to achieve significant speedup of computationally expensive image understanding algorithms, such as those applied to range images. Mixed-mode parallel systems are ideally suited to this area because of the flexibility in using the different modes of parallelism. The trade-offs of using different parallel modes are examined through the implementation of hybrid range segmentation operations, characteristic of a broad class of low level image processing algorithms. Alternative means of distributing data among the processing elements that achieve improved performance are considered. Results comparing different implementations on a single reconfigurable parallel processor. PASM, indicate some generally applicable guidelines for the effective parallelization of vision algorithms.<>
并行处理方法是一种有吸引力的手段,可以实现计算成本高的图像理解算法的显著加速,例如那些应用于距离图像的算法。混合模式并行系统非常适合这一领域,因为它可以灵活地使用不同的并行模式。使用不同的并行模式的权衡是通过混合范围分割操作的实现进行检查,这是一大类低水平图像处理算法的特点。考虑了在处理元素之间分布数据以提高性能的替代方法。在单个可重构并行处理器上不同实现的比较结果。为视觉算法的有效并行化提供了一些普遍适用的准则。
{"title":"A parallel approach to hybrid range image segmentation","authors":"Nicholas Giolmas, D. Watson, D. Chelberg, H. Siegel","doi":"10.1109/IPPS.1992.223024","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223024","url":null,"abstract":"Parallel processing methods are an attractive means to achieve significant speedup of computationally expensive image understanding algorithms, such as those applied to range images. Mixed-mode parallel systems are ideally suited to this area because of the flexibility in using the different modes of parallelism. The trade-offs of using different parallel modes are examined through the implementation of hybrid range segmentation operations, characteristic of a broad class of low level image processing algorithms. Alternative means of distributing data among the processing elements that achieve improved performance are considered. Results comparing different implementations on a single reconfigurable parallel processor. PASM, indicate some generally applicable guidelines for the effective parallelization of vision algorithms.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125899805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Evaluation of connected component labeling algorithms on shared and distributed memory multiprocessors 共享和分布式存储多处理器上连接组件标记算法的评价
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223019
A. Choudhary, R. Thakur
Presents parallel implementations of connected component labeling for grey level images on the iPSC/2 E iPSC/860 hypercubes and on the Encore Multimax shared memory multiprocessor. Several partitioning and mapping strategies including multi-dimensional divide and conquer, block decomposition and scatter decomposition are used. Implementation results, performance evaluation and comparison for all the mapping strategies are reported.<>
提出了在iPSC/2 E iPSC/860超立方体和Encore Multimax共享内存多处理器上并行实现灰度图像的连通组件标记。采用了多维分治、块分解和散点分解等划分和映射策略。报告了所有映射策略的实现结果、性能评价和比较。
{"title":"Evaluation of connected component labeling algorithms on shared and distributed memory multiprocessors","authors":"A. Choudhary, R. Thakur","doi":"10.1109/IPPS.1992.223019","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223019","url":null,"abstract":"Presents parallel implementations of connected component labeling for grey level images on the iPSC/2 E iPSC/860 hypercubes and on the Encore Multimax shared memory multiprocessor. Several partitioning and mapping strategies including multi-dimensional divide and conquer, block decomposition and scatter decomposition are used. Implementation results, performance evaluation and comparison for all the mapping strategies are reported.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123237813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Design and analysis of fault-detecting and fault-locating schedules for computation DAGs 计算dag的故障检测和故障定位计划设计与分析
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223022
S. Yajnik, N. Jha
The paper investigates issues concerning the construction of fault-detecting and fault-locating schedules for multiprocessor systems. It develops conditions for a schedule to be fault-detecting or fault-locating and further uses these conditions to propose schemes for construction of the schedules. Lower-bounds on the length of the schedules are calculated and for the special case of binary computation trees, it is shown that the schedules meet the lower-bounds in most cases. A method for actual fault diagnosis from the results of the fault-locating schedules for binary computation trees is also proposed.<>
本文研究了多处理机系统故障检测与定位调度的构建问题。给出了故障检测或故障定位的条件,并利用这些条件提出了故障检测或故障定位的方案。计算了调度长度的下界,对于二叉计算树的特殊情况,表明在大多数情况下调度满足下界。提出了一种利用二叉计算树的故障定位调度结果进行实际故障诊断的方法。
{"title":"Design and analysis of fault-detecting and fault-locating schedules for computation DAGs","authors":"S. Yajnik, N. Jha","doi":"10.1109/IPPS.1992.223022","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223022","url":null,"abstract":"The paper investigates issues concerning the construction of fault-detecting and fault-locating schedules for multiprocessor systems. It develops conditions for a schedule to be fault-detecting or fault-locating and further uses these conditions to propose schemes for construction of the schedules. Lower-bounds on the length of the schedules are calculated and for the special case of binary computation trees, it is shown that the schedules meet the lower-bounds in most cases. A method for actual fault diagnosis from the results of the fault-locating schedules for binary computation trees is also proposed.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123578484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Two-state self-stabilizing algorithms 二态自稳定算法
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223047
M. Flatebo, A. Datta
A distributed system consists of a set of loosely connected state machines which do not share a global memory. All the possible global states of the system can be split up into legal and illegal states. A self-stabilizing system is a network of processors, which, when started from an arbitrary (and possibly illegal) initial state, always returns to a legal state in a finite number of steps. One issue in designing self-stabilizing algorithms is the number of state required by each machine. The paper presents algorithms which will be self-stabilizing while only requiring each machine in the network to have two states. Probability is used in some of the algorithms in order to make this possible. The algorithms are given along with correctness proofs.<>
分布式系统由一组松散连接的状态机组成,这些状态机不共享全局内存。系统中所有可能的全局状态都可以分为合法状态和非法状态。自稳定系统是一个处理器网络,当从任意(可能是非法的)初始状态启动时,总是在有限的步骤中返回到合法状态。设计自稳定算法的一个问题是每台机器所需的状态数。本文提出了一种自稳定算法,该算法只要求网络中的每台机器具有两种状态。概率在一些算法中被使用,以使这成为可能。给出了算法并给出了正确性证明。
{"title":"Two-state self-stabilizing algorithms","authors":"M. Flatebo, A. Datta","doi":"10.1109/IPPS.1992.223047","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223047","url":null,"abstract":"A distributed system consists of a set of loosely connected state machines which do not share a global memory. All the possible global states of the system can be split up into legal and illegal states. A self-stabilizing system is a network of processors, which, when started from an arbitrary (and possibly illegal) initial state, always returns to a legal state in a finite number of steps. One issue in designing self-stabilizing algorithms is the number of state required by each machine. The paper presents algorithms which will be self-stabilizing while only requiring each machine in the network to have two states. Probability is used in some of the algorithms in order to make this possible. The algorithms are given along with correctness proofs.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126617128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
The effects of communication overhead on the speedup of parallel 3-D finite element applications 通信开销对并行三维有限元加速的影响
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222972
V. Taylor, B. Nour-Omid, D. Messerschmitt
The use of parallel processors for implementing the finite element method has made feasible the analyses of large applications, especially three-dimensional applications. The speedup, however, is limited by the interprocessor communication requirements. The authors analyze the effects of interprocessor communications on the resultant speedup of the parallel execution of regular three-dimensional finite element applications. They derive the speedup expressions for the hypercube and mesh topologies. These expressions can be used to analyze the results of different partitioning and mapping strategies.<>
利用并行处理器实现有限元法,使大型应用,特别是三维应用的分析成为可能。然而,这种加速受到处理器间通信需求的限制。作者分析了处理器间通信对常规三维有限元程序并行执行的最终加速的影响。推导了超立方体和网格拓扑的加速表达式。这些表达式可以用来分析不同分区和映射策略的结果。
{"title":"The effects of communication overhead on the speedup of parallel 3-D finite element applications","authors":"V. Taylor, B. Nour-Omid, D. Messerschmitt","doi":"10.1109/IPPS.1992.222972","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222972","url":null,"abstract":"The use of parallel processors for implementing the finite element method has made feasible the analyses of large applications, especially three-dimensional applications. The speedup, however, is limited by the interprocessor communication requirements. The authors analyze the effects of interprocessor communications on the resultant speedup of the parallel execution of regular three-dimensional finite element applications. They derive the speedup expressions for the hypercube and mesh topologies. These expressions can be used to analyze the results of different partitioning and mapping strategies.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115195825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A systolic algorithm and architecture for Galois field arithmetic 伽罗瓦域算法的一种收缩算法和结构
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223032
M. Kovač, N. Ranganathan, M. Varanasi
Finite or Galois fields are used in numerous applications like error correcting codes, digital signal processing and cryptography. The design of efficient methods for Galois field arithmetic such as multiplication and division is critical for these applications. The paper presents a new algorithm for computing multiplication and division in GF(2/sup m/). A systolic architecture is described for implementing the algorithm which can produce a new result every clock cycle. The architecture can be realized as a VLSI chip that can yield a computational rate of 40 million multiplications/divisions per second.<>
有限域或伽罗瓦域用于许多应用,如纠错码、数字信号处理和密码学。设计有效的伽罗瓦域算法,如乘法和除法,对这些应用至关重要。本文提出了GF(2/sup m/)的一种新的乘除算法。描述了实现该算法的收缩结构,该结构可以在每个时钟周期产生新的结果。该架构可以实现为一个VLSI芯片,可以产生每秒4000万次乘法/除法的计算速率。
{"title":"A systolic algorithm and architecture for Galois field arithmetic","authors":"M. Kovač, N. Ranganathan, M. Varanasi","doi":"10.1109/IPPS.1992.223032","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223032","url":null,"abstract":"Finite or Galois fields are used in numerous applications like error correcting codes, digital signal processing and cryptography. The design of efficient methods for Galois field arithmetic such as multiplication and division is critical for these applications. The paper presents a new algorithm for computing multiplication and division in GF(2/sup m/). A systolic architecture is described for implementing the algorithm which can produce a new result every clock cycle. The architecture can be realized as a VLSI chip that can yield a computational rate of 40 million multiplications/divisions per second.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122167718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
The impact of wiring constraints on hierarchical network performance 布线约束对分层网络性能的影响
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222964
W. Hsu, P. Yew
A unified approach, incorporating architectural and packaging issues, is necessary in the design of high performance computer networks. Clustering enables the authors to exploit the physical hierarchy imposed by packaging. Previously the authors examined the clustering of hypercube networks within the context of wiring constraints (see 1991 Int. Conf. on Parallel Processing, Aug. 1991). The authors extend their earlier work to compare the performance of hypercubes and meshes. They consider two cost constraints, bisection width and package pinout, and examine flat and clustered meshes and hypercubes. They find that the relative performance of networks depends on the chosen wiring constraint, as well as system configuration and message granularity.<>
在高性能计算机网络的设计中,必须采用统一的方法,将体系结构和封装问题结合起来。集群使作者能够利用包装所施加的物理层次结构。以前,作者在布线约束的背景下研究了超立方体网络的聚类(参见1991 Int。关于并行处理,1991年8月)。作者扩展了他们早期的工作,比较了超立方体和网格的性能。他们考虑了两个成本约束,平分宽度和封装引脚,并检查了平面和聚类网格和超立方体。他们发现网络的相对性能取决于所选择的连接约束,以及系统配置和消息粒度。
{"title":"The impact of wiring constraints on hierarchical network performance","authors":"W. Hsu, P. Yew","doi":"10.1109/IPPS.1992.222964","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222964","url":null,"abstract":"A unified approach, incorporating architectural and packaging issues, is necessary in the design of high performance computer networks. Clustering enables the authors to exploit the physical hierarchy imposed by packaging. Previously the authors examined the clustering of hypercube networks within the context of wiring constraints (see 1991 Int. Conf. on Parallel Processing, Aug. 1991). The authors extend their earlier work to compare the performance of hypercubes and meshes. They consider two cost constraints, bisection width and package pinout, and examine flat and clustered meshes and hypercubes. They find that the relative performance of networks depends on the chosen wiring constraint, as well as system configuration and message granularity.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128378270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Proceedings Sixth International Parallel Processing Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1