
Proceedings Sixth International Parallel Processing Symposium: Latest Papers

A new parallel algorithm for breadth-first search on interval graphs
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223055
Sajal K. Das, Calvin Ching-Yuen Chen
The authors design an efficient parallel algorithm for constructing a breadth-first spanning tree of an interval graph. Their novel approach is based on elegantly capturing the structure of a given collection of intervals. This structure reveals important properties of the corresponding interval graph, and is found to be instrumental in solving many other problems including the computation of a breadth-depth spanning tree, which they report for the first time. The algorithm requires O(log n) time employing O(n) processors on the EREW PRAM model.
Citations: 0
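As a point of reference for the entry above, here is a minimal sequential sketch of a breadth-first spanning tree on an interval graph. It is not the authors' EREW PRAM algorithm, which the abstract does not detail. The function names `interval_graph` and `bfs_tree`, the closed-interval overlap rule, and the sample intervals are all assumptions made for illustration.

```python
from collections import deque


def interval_graph(intervals):
    """Build adjacency lists; two closed intervals are adjacent when they overlap."""
    n = len(intervals)
    adj = [[] for _ in range(n)]
    for i in range(n):
        a1, b1 = intervals[i]
        for j in range(i + 1, n):
            a2, b2 = intervals[j]
            if a1 <= b2 and a2 <= b1:  # overlap test for closed intervals
                adj[i].append(j)
                adj[j].append(i)
    return adj


def bfs_tree(adj, root=0):
    """Return a parent map describing a breadth-first spanning tree from root."""
    parent = {root: None}
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in parent:
                parent[v] = u
                queue.append(v)
    return parent


# Hypothetical input: five intervals on the real line.
intervals = [(0, 3), (2, 5), (4, 8), (7, 9), (1, 2)]
tree = bfs_tree(interval_graph(intervals))
print(tree)
```

This naive version builds the graph in quadratic time and visits vertices one at a time, so it only shows what the output looks like, not how the reported O(log n) bound is achieved.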
The interplay between granularity, performance and availability in a replicated Linda tuple space
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222976
S. Kambhatla, J. Walpole
Replication is a common method for increasing the availability of data in a distributed environment. The authors' interest is in the application of replication techniques in the domain of parallel processing. They explore the issues concerning degree of replication and granularity in the context of a distributed and highly available Linda tuple space. In particular, they study the performance effects of varying the number of replicas and the granularities of replication and concurrency control. Traditionally, when using replication in databases, the granularity of replication and that of concurrency control have been the same (for example, at the file level (D.K. Gifford, 1979)). This is not an inherent requirement, however. The authors show by detailed simulation of a replicated Linda tuple space that it is useful to separate the two granularities and that this separation is an important design issue, especially in parallel processing systems.
Citations: 1
A paradigm for distributed deadlock avoidance in multicomputer networks
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222993
J. P. Samantarai
A paradigm for avoiding buffer deadlock in point-to-point multicomputer networks is presented which is ideal for today's high-connectivity, load-sharing networks. Unlike the traditional resource ordering principle, this paradigm not only allows unrestricted routing but uses the existence of multiple paths to its direct advantage. Deadlock is avoided entirely using exchange buffers which are not used for message queues, thus eliminating queueing overhead. The paradigm is topology-independent, imposes no routing restrictions, and uses states of neighboring links only, so that it can be built into the link-level protocol, providing unrestricted deadlock-free routing while operating transparently to any fault-tolerant topology-specific routing algorithm.
Citations: 0
Conflict-free scheduling of nested loop algorithms on lower dimensional processor arrays
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223054
Zhenhui Yang, Weijia Shang, J. Fortes
In practice, it is interesting to map n-dimensional algorithms, or algorithms with n nested loops, onto (k-1)-dimensional arrays where k < n. The paper considers some open problems in a previous work by Shang and Fortes (1990). A procedure is proposed to test whether or not a given mapping has computational conflicts, and a lower bound on the total execution time is provided. Based on the testing procedure and the lower bound, the complexity and the optimality of the optimization procedure in the previous work are improved. The integer programming formulation is also discussed and used to find the optimal time mapping for the 5-dimensional bit level matrix multiplication algorithm into a 2-dimensional bit level processor array.
Citations: 9
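To make the notion of a computational conflict in the entry above concrete, the sketch below checks a linear mapping by brute force: it enumerates a finite index set and reports whether two distinct index points land on the same (time, processor) image. This is an illustrative exhaustive check, not the testing procedure proposed in the paper. The name `has_conflict`, the convention that the first row of `T` gives the time and the remaining rows give processor coordinates, and the sample matrices are assumptions.

```python
from itertools import product


def has_conflict(T, upper_bounds):
    """Return (True, (j1, j2)) if two index points share an image under T, else (False, None).

    T is a list of integer rows; the index set is the box 0..upper_bounds[i] in each dimension.
    """
    seen = {}
    for j in product(*(range(u + 1) for u in upper_bounds)):
        image = tuple(sum(t * x for t, x in zip(row, j)) for row in T)
        if image in seen:
            return True, (seen[image], j)
        seen[image] = j
    return False, None


# Hypothetical 3-nested-loop algorithm mapped onto a 1-dimensional array (k = 2).
bounds = (2, 2, 2)
conflicting = [[1, 1, 1], [1, 0, 0]]    # time = j1 + j2 + j3, processor = j1
conflict_free = [[1, 3, 9], [1, 0, 0]]  # time = j1 + 3*j2 + 9*j3, processor = j1

print(has_conflict(conflicting, bounds))
print(has_conflict(conflict_free, bounds))
```

The exhaustive check costs time proportional to the size of the index set, which is the kind of cost a dedicated testing procedure would aim to avoid.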
IDPS: a massively parallel heuristic search algorithm
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223042
A. Mahanti, C. J. Daniels
Presents an efficient SIMD parallel algorithm, called IDPS (iterative deepening parallel search). The performance of four variants of IDPS is studied through experiments conducted on the well-known test-bed problem for search algorithms, the 15-puzzle. During the experiments, data were gathered under two different static load-balancing schemes. Under the first scheme, an average efficiency of approximately 3/4 was obtained for 4 K, 8 K, and 16 K processors. Under the second scheme, average efficiencies of 0.92 and 0.76 were obtained for 8 K and 16 K processors, respectively. It is also shown that for admissible search, linear or superlinear average speedup can be obtained for problems of significant size.
Citations: 3
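The efficiencies reported in the entry above can be turned into implied speedups using the standard definition efficiency = speedup / processors. The short calculation below does this for the second load-balancing scheme. It assumes that "K" means 1024 processors, which the abstract does not state; the helper name `implied_speedup` is hypothetical.

```python
def implied_speedup(efficiency, processors):
    """Speedup implied by the definition efficiency = speedup / processors."""
    return efficiency * processors


K = 1024  # assumption: "8 K" is read as 8 * 1024 processors

# Average efficiencies quoted in the abstract for the second scheme.
reported = {8 * K: 0.92, 16 * K: 0.76}

for processors, efficiency in reported.items():
    print(processors, round(implied_speedup(efficiency, processors)))
```

Under this reading, the lower efficiency at 16 K processors still corresponds to a larger absolute speedup than at 8 K.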
Exploiting concurrency among tasks in partitionable parallel processing systems
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223076
W. Nation, A. A. Maciejewski, H. Siegel
One benefit of partitionable parallel processing systems is their ability to execute multiple, independent tasks simultaneously. Previous work has identified conditions such that, when there are k tasks to be processed, partitioning the system such that all k tasks are processed simultaneously results in a minimum overall execution time. An alternate condition is developed that provides additional insight into the effects of parallelism on execution time. This result, and previous results, however, assume that execution times are data independent. It is shown that data-dependent tasks do not necessarily execute faster when processed simultaneously even if the condition is met. A model is developed that provides for the possible variability of a task's execution time and is used in a new framework to study the problem of finding an optimal mapping for identical, independent data-dependent execution time tasks onto partitionable systems. Extension of this framework to situations where the k tasks are non-identical is discussed.
Citations: 1
Vector Hartley transform employing multiprocessors
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223038
R. Mahapatra, Akhilesh Kumar
Many parallel implementations for signal processing transforms have already been reported. The implementation of Hou's FHT algorithm (1987) has been studied on three multiprocessor architectures (MPAs): multiprocessors connected through a shared bus; multiprocessors connected by an indirect binary n-cube multistage interconnection network; and mesh-connected multiprocessors. The article analyzes the performance of a vector Hartley transform algorithm on these MPAs.
Citations: 1
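For readers unfamiliar with the transform named in the entry above, the sketch below evaluates the one-dimensional discrete Hartley transform directly from its definition, H(k) = sum over t of x(t) * (cos(2*pi*t*k/N) + sin(2*pi*t*k/N)). It is a quadratic-time reference version, not Hou's FHT algorithm and not the vector transform analyzed in the paper. The function name `dht` and the sample signal are assumptions.

```python
import math


def dht(x):
    """Direct O(N^2) discrete Hartley transform of a real sequence."""
    n = len(x)
    return [
        sum(
            x[t] * (math.cos(2 * math.pi * t * k / n) + math.sin(2 * math.pi * t * k / n))
            for t in range(n)
        )
        for k in range(n)
    ]


# Hypothetical real-valued input signal.
signal = [1.0, 2.0, 0.0, -1.0]
spectrum = dht(signal)

# The transform is its own inverse up to a factor of 1/N.
recovered = [value / len(signal) for value in dht(spectrum)]
print(spectrum)
print(recovered)
```

Because the forward and inverse transforms share the same kernel, one routine serves both directions.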
Multi-packet selection on mesh-connected processor arrays
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222999
D. Krizanc, L. Narayanan
The authors show efficient, deterministic algorithms for selection on the mesh-connected processor array, in the case when there are several elements at every processor. In particular, on a p-processor mesh with $N \ge p$ elements, stored $N/p$ at every processor, they show that selection can be performed in $O(\min(p \log(N/p), \max(N/p^{2/3}, \sqrt{p})))$ communication steps. The best previously known results were based on sorting and required $O(N/\sqrt{p})$ communication steps, for $N \ge p$.
Citations: 9
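The two bounds quoted in the entry above can be compared numerically. The sketch below evaluates the expressions inside the O-notation for one choice of N and p. Constant factors are hidden by the asymptotic notation, so the numbers indicate growth only, not actual step counts. The logarithm base 2, the function names, and the sample sizes are assumptions.

```python
import math


def selection_steps(n_elements, p):
    """Expression inside the selection bound: min(p log(N/p), max(N / p^(2/3), sqrt(p)))."""
    return min(
        p * math.log2(n_elements / p),
        max(n_elements / p ** (2 / 3), math.sqrt(p)),
    )


def sorting_steps(n_elements, p):
    """Expression inside the earlier sorting-based bound: N / sqrt(p)."""
    return n_elements / math.sqrt(p)


# Hypothetical sizes: about a million elements on a 32 x 32 mesh.
N, p = 2**20, 2**10
print(selection_steps(N, p))
print(sorting_steps(N, p))
```

For these sizes the selection expression is roughly a third of the sorting-based one.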
Memory requirements to balance thus asymptotically full-speedup FFT computation on processor arrays
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223045
J. Shieh
The paper proves that for a linearly-connected array of $\alpha$ processors or a mesh-connected array of $\alpha^2$ processors, where each processor has computation bandwidth $C$, I/O bandwidth $I$ and $C/I = \log m$, $\Omega(m^{\alpha})$ memory size is required in each processor to minimize the I/O requirement in balancing the FFT computation. Then it presents balanced FFT algorithms on these arrays to meet their memory size lower bounds. These algorithms are time optimal, exhibiting full speedups.
Citations: 1
Bussed interconnection networks from trees
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223015
C. M. Fiduccia
Pin limitations are a fundamental obstacle in the construction of massively parallel computers. The paper introduces a class of d-dimensional bussed hypercubes that can perform simultaneous bidirectional communication across any dimension using $d+1$, rather than $2d$, ports per node. Each network $Q_d(T)$ is based on a tree $T$, which specifies the 'shape' of the busses, and can perform $d(d+1)/2$ permutations $\pi_{ij}(x) = x \oplus c_{ij}$ via a simple global command. This construction is then generalized to any $d$ permutations $\Pi = (\pi_1, \ldots, \pi_d)$ of any set of nodes $X$. Given any edge-labeled directed tree $T$, whose $k$th arc is associated with the permutation $\pi_k$, a bussed network $N(\Pi, T)$ is constructed that can, in one clock tick, perform any of the $O(d^2)$ permutations arising from the paths in the tree $T$.
Citations: 0