首页 > 最新文献

Proceedings Sixth International Parallel Processing Symposium最新文献

英文 中文
The impact of task-length parameters on the performance of the random load-balancing algorithm 任务长度参数对随机负载均衡算法性能的影响
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223067
Y. Ben-Asher, Aviad Cohen, A. Schuster, J. F. Sibeyn
Considers the problem of dynamic load balancing in an n processors parallel system. The authors focus on the algorithm which randomly assigns newly generated tasks to processors for execution. This process is modeled by randomly throwing weighted balls into n holes. For a given program A, the ball weights (task lengths) are chosen according to an unknown probability distribution D(A) with expectation mu , maximum M and minimum m. For any A, D(A) and a constant 0< in >
研究了n处理器并行系统的动态负载平衡问题。作者重点研究了将新生成的任务随机分配给处理器执行的算法。这个过程是通过将加权球随机扔进n个洞来模拟的。对于给定的程序a,球权(任务长度)根据未知概率分布D(a)选择,期望为mu,最大M和最小M。对于任意a, D(a)和常数0< in >
{"title":"The impact of task-length parameters on the performance of the random load-balancing algorithm","authors":"Y. Ben-Asher, Aviad Cohen, A. Schuster, J. F. Sibeyn","doi":"10.1109/IPPS.1992.223067","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223067","url":null,"abstract":"Considers the problem of dynamic load balancing in an n processors parallel system. The authors focus on the algorithm which randomly assigns newly generated tasks to processors for execution. This process is modeled by randomly throwing weighted balls into n holes. For a given program A, the ball weights (task lengths) are chosen according to an unknown probability distribution D(A) with expectation mu , maximum M and minimum m. For any A, D(A) and a constant 0< in <or=0.5, they derive an upper bound on the number of processes which A needs to generate in order for the algorithm to achieve optimal load balancing with very high probability, so that the run-time is optimal up to a factor of (1+ in )/sup 2/. Using the relation derived, the programmer may control the load-balancing of his program by modifying the global parameters of the generated processes.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121262012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Routing BPC permutations in VLSI VLSI中路由BPC排列
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223061
H. Alnuweiri
A large number of the permutations realized by interconnection networks in parallel processing systems and digital arithmetic circuits, fall in the class of bit-permute-complement (BPC) permutations. The paper presents a methodology for routing this class of permutations in VLSI, under various I/O, area, and time trade-offs. The resulting VLSI designs can route a BPC permutation of size N, using a chip with N/Q I/O pins, O(N/sup 2//Q/sup 2/) area, and O(wQ) time, where w is the word length of the permuted elements and 1>
并行处理系统和数字算术电路中互连网络实现的大量排列都属于比特-置换-补位(BPC)排列。本文提出了在各种I/O、面积和时间权衡下,在VLSI中路由这类排列的方法。由此产生的VLSI设计可以路由大小为N的BPC排列,使用具有N/Q I/O引脚,O(N/sup 2//Q/sup 2/)面积和O(wQ)时间的芯片,其中w是排列元素的字长,1>
{"title":"Routing BPC permutations in VLSI","authors":"H. Alnuweiri","doi":"10.1109/IPPS.1992.223061","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223061","url":null,"abstract":"A large number of the permutations realized by interconnection networks in parallel processing systems and digital arithmetic circuits, fall in the class of bit-permute-complement (BPC) permutations. The paper presents a methodology for routing this class of permutations in VLSI, under various I/O, area, and time trade-offs. The resulting VLSI designs can route a BPC permutation of size N, using a chip with N/Q I/O pins, O(N/sup 2//Q/sup 2/) area, and O(wQ) time, where w is the word length of the permuted elements and 1<or=Q<or= square root N/w.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"145 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122583200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
VLSI implementation of a 256*256 crossbar interconnection network VLSI实现了一个256*256的横杆互连网络
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223031
Kyusun Choi, W. Adams
Despite the fact that a crossbar interconnection network is desirable in parallel processing systems due to its flexibility of configuration and simplicity of control, many of the crossbars developed up to this time are small in size. The paper presents the analysis of VLSI layout size and signal delay of the previous crossbar circuits. Also a circuit with better layout size and signal delay is presented in comparison. Based on the new circuit, the feasibility of the implementation is shown for a 256*256 crossbar on a 1cm/sup 2/ CMOS VLSI chip.<>
尽管由于配置的灵活性和控制的简单性,交叉棒互连网络在并行处理系统中是理想的,但到目前为止开发的许多交叉棒的尺寸都很小。本文分析了VLSI的布局尺寸和以往交叉电路的信号延迟。并给出了一种具有更好的布局尺寸和信号延迟的电路。基于该电路,在1cm/sup 2/ CMOS VLSI芯片上实现256*256横条的可行性得到了验证。
{"title":"VLSI implementation of a 256*256 crossbar interconnection network","authors":"Kyusun Choi, W. Adams","doi":"10.1109/IPPS.1992.223031","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223031","url":null,"abstract":"Despite the fact that a crossbar interconnection network is desirable in parallel processing systems due to its flexibility of configuration and simplicity of control, many of the crossbars developed up to this time are small in size. The paper presents the analysis of VLSI layout size and signal delay of the previous crossbar circuits. Also a circuit with better layout size and signal delay is presented in comparison. Based on the new circuit, the feasibility of the implementation is shown for a 256*256 crossbar on a 1cm/sup 2/ CMOS VLSI chip.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"139 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132607843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Distributed algorithms for shortest-path, deadlock-free routing and broadcasting in a class of interconnection topologies 一类互连拓扑中最短路径、无死锁路由和广播的分布式算法
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222963
Jenshiuh Liu, W. Hsu
A class of novel interconnection topologies called the generalized Fibonacci cubes is presented. The generalized Fibonacci cubes include the hypercubes, the recently proposed Fibonacci cubes (W.-J. Hsu, Proc. Int. Conf. on Parallel Processing, p.1722-3 (1991)), and some other asymmetric interconnection topologies bridging between the two mentioned above. The generalized Fibonacci cubes can serve as a framework for studying degraded hypercubes due to faulty nodes or links. Previously known algorithms for hypercubes do not generalize to this class of interconnection topologies. The authors present distributed routing and broadcasting algorithms that can be applied to all members of this class of interconnection topologies. It is shown that their distributed routing algorithm always finds a shortest and deadlock-free path. The broadcasting algorithms are designed and evaluated based on both the all-port and the 1-port communication models. The all-port broadcasting algorithm is provably optimal in terms of minimizing routing steps. An upper bound for the 1-port broadcasting algorithm is determined, which is shown to be optimal for certain cases.<>
提出了一类新的互连拓扑,称为广义斐波那契立方。广义斐波那契立方体包括超立方体、最近提出的斐波那契立方体(w - j。徐先生,法官。并行处理,p.1722-3(1991)),以及在上述两者之间桥接的其他一些非对称互连拓扑。广义Fibonacci多维数据集可以作为研究由故障节点或链路引起的退化超多维数据集的框架。以前已知的超立方体算法不能推广到这类互连拓扑。作者提出了分布式路由和广播算法,可以应用于这类互连拓扑的所有成员。结果表明,他们的分布式路由算法总能找到一条最短且无死锁的路径。基于全端口和单端口通信模型设计和评估了广播算法。证明了全端口广播算法在最小化路由步骤方面是最优的。确定了1端口广播算法的上界,这在某些情况下是最优的。
{"title":"Distributed algorithms for shortest-path, deadlock-free routing and broadcasting in a class of interconnection topologies","authors":"Jenshiuh Liu, W. Hsu","doi":"10.1109/IPPS.1992.222963","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222963","url":null,"abstract":"A class of novel interconnection topologies called the generalized Fibonacci cubes is presented. The generalized Fibonacci cubes include the hypercubes, the recently proposed Fibonacci cubes (W.-J. Hsu, Proc. Int. Conf. on Parallel Processing, p.1722-3 (1991)), and some other asymmetric interconnection topologies bridging between the two mentioned above. The generalized Fibonacci cubes can serve as a framework for studying degraded hypercubes due to faulty nodes or links. Previously known algorithms for hypercubes do not generalize to this class of interconnection topologies. The authors present distributed routing and broadcasting algorithms that can be applied to all members of this class of interconnection topologies. It is shown that their distributed routing algorithm always finds a shortest and deadlock-free path. The broadcasting algorithms are designed and evaluated based on both the all-port and the 1-port communication models. The all-port broadcasting algorithm is provably optimal in terms of minimizing routing steps. An upper bound for the 1-port broadcasting algorithm is determined, which is shown to be optimal for certain cases.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127343058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Analyzing performance of sequencing mechanisms for simple layered task systems 简单分层任务系统排序机制性能分析
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223051
A. Tayyab, J. G. Kuhl
Considers the problem of sequencing a set of parallel activities in the presence of nonzero overheads. Mechanisms for sequence control may range from explicit inter-task synchronization to more restrictive mechanisms such as blocking barriers. It is highly desirable to be able to base the choice of a sequence control mechanism for a parallel algorithm upon specific characteristics of the algorithm's structure and the underlying architecture. The paper presents approximate models for simple layered task systems that can predict overall performance and provide a useful understanding of key performance parameters and tradeoffs. The analytic results are compared with simulation to demonstrate their accuracy. Some simple applications of the model are presented that demonstrate non-intuitive behavior of layered graphs with inter-task versus barrier-based sequencing.<>
考虑在存在非零开销的情况下对一组并行活动排序的问题。序列控制的机制可以从显式的任务间同步到更严格的机制,如阻塞屏障。能够基于算法结构和底层架构的特定特征来选择并行算法的序列控制机制是非常可取的。本文提出了简单分层任务系统的近似模型,可以预测整体性能,并提供对关键性能参数和权衡的有用理解。将分析结果与仿真结果进行了比较,验证了分析结果的准确性。给出了该模型的一些简单应用,展示了任务间排序与基于屏障排序的分层图的非直观行为
{"title":"Analyzing performance of sequencing mechanisms for simple layered task systems","authors":"A. Tayyab, J. G. Kuhl","doi":"10.1109/IPPS.1992.223051","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223051","url":null,"abstract":"Considers the problem of sequencing a set of parallel activities in the presence of nonzero overheads. Mechanisms for sequence control may range from explicit inter-task synchronization to more restrictive mechanisms such as blocking barriers. It is highly desirable to be able to base the choice of a sequence control mechanism for a parallel algorithm upon specific characteristics of the algorithm's structure and the underlying architecture. The paper presents approximate models for simple layered task systems that can predict overall performance and provide a useful understanding of key performance parameters and tradeoffs. The analytic results are compared with simulation to demonstrate their accuracy. Some simple applications of the model are presented that demonstrate non-intuitive behavior of layered graphs with inter-task versus barrier-based sequencing.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114332577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Efficient process migration in the EMPS multiprocessor system EMPS多处理机系统中有效的进程迁移
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223071
G. Dijk, M. V. Gils
The process migration facility in the Eindhoven multiprocessor system (EMPS) is presented. In the EMPS system, mailboxes are used for interprocess communication. These mailboxes provide transparency of location for communicating processes. The major advantages of mailbox communication in the EMPS system are: (1) interprocess communication can proceed without losing messages; and (2) the communication paths can be updated very efficiently when a process moves to another processor. By redirecting the mailbox connection of the migrating process, the communication paths of all processes connected to the same mailbox are updated.<>
介绍了Eindhoven多处理器系统(EMPS)中的进程迁移功能。在EMPS系统中,邮箱用于进程间通信。这些邮箱为通信过程提供了位置的透明性。EMPS系统中邮箱通信的主要优点是:(1)进程间通信可以在不丢失消息的情况下进行;(2)当一个进程移动到另一个处理器时,通信路径可以非常有效地更新。通过重定向迁移进程的邮箱连接,更新连接到同一邮箱的所有进程的通信路径
{"title":"Efficient process migration in the EMPS multiprocessor system","authors":"G. Dijk, M. V. Gils","doi":"10.1109/IPPS.1992.223071","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223071","url":null,"abstract":"The process migration facility in the Eindhoven multiprocessor system (EMPS) is presented. In the EMPS system, mailboxes are used for interprocess communication. These mailboxes provide transparency of location for communicating processes. The major advantages of mailbox communication in the EMPS system are: (1) interprocess communication can proceed without losing messages; and (2) the communication paths can be updated very efficiently when a process moves to another processor. By redirecting the mailbox connection of the migrating process, the communication paths of all processes connected to the same mailbox are updated.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117025972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Space-optimal linear processor allocation for systolic arrays synthesis 收缩阵列合成空间最优线性处理器配置
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223033
Y. Wong, J. Delosme
The mapping of a systolic algorithm onto a regularly connected array architecture can be considered as a linear transformation problem. However, to derive the 'optimal' transformation is difficult because the necessary optimizations involve discrete decision variables and the cost functions do not usually have closed-form expressions. The paper considers the derivation of a space-optimal (minimum processor count) mapping of a given time performance. Utilizing some recent results from the geometry of numbers, it is shown that the solution space for this discrete optimization problem can be nicely bounded and hence, the optimal solution can be efficiently determined with enumeration for practical cases. Examples are provided to demonstrate the effectiveness of this approach.<>
收缩算法到规则连接阵列结构的映射可以看作是一个线性变换问题。然而,导出“最优”转换是困难的,因为必要的优化涉及离散决策变量和成本函数通常不具有封闭形式的表达式。本文考虑给定时间性能的空间最优(最少处理器数)映射的推导。利用数几何的一些最新结果,证明了该离散优化问题的解空间可以很好地有界,因此,在实际情况下,可以用枚举法有效地确定最优解。举例说明了这种方法的有效性。
{"title":"Space-optimal linear processor allocation for systolic arrays synthesis","authors":"Y. Wong, J. Delosme","doi":"10.1109/IPPS.1992.223033","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223033","url":null,"abstract":"The mapping of a systolic algorithm onto a regularly connected array architecture can be considered as a linear transformation problem. However, to derive the 'optimal' transformation is difficult because the necessary optimizations involve discrete decision variables and the cost functions do not usually have closed-form expressions. The paper considers the derivation of a space-optimal (minimum processor count) mapping of a given time performance. Utilizing some recent results from the geometry of numbers, it is shown that the solution space for this discrete optimization problem can be nicely bounded and hence, the optimal solution can be efficiently determined with enumeration for practical cases. Examples are provided to demonstrate the effectiveness of this approach.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115068453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Hash table in massively parallel systems 大规模并行系统中的哈希表
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222988
I. Yen, F. Bastani
The authors look at the performance and new collision resolution strategies for hash tables in massively parallel systems. The results show that using a hash table with linear probing yields O(logN) time performance for handling M accesses by N processors when the load factor of the table is 50%, where N is the size of the hash table. This is better than the performance of using sorted arrays. Two phase hashing gives an average time complexity O(logN) for M simultaneous accesses to a hash table of size N even when the table has 100% load. Simulation results also show that hypercube hashing significantly outperforms linear probing and double hashing.<>
作者研究了大规模并行系统中哈希表的性能和新的冲突解决策略。结果表明,当表的负载因子为50%时,使用具有线性探测的哈希表处理N个处理器的M次访问产生O(logN)时间性能,其中N是哈希表的大小。这比使用排序数组的性能要好。对于同时访问大小为N的哈希表的M次访问,即使表具有100%的负载,两阶段哈希的平均时间复杂度为O(logN)。仿真结果还表明,超立方体哈希算法明显优于线性探测和双哈希算法。
{"title":"Hash table in massively parallel systems","authors":"I. Yen, F. Bastani","doi":"10.1109/IPPS.1992.222988","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222988","url":null,"abstract":"The authors look at the performance and new collision resolution strategies for hash tables in massively parallel systems. The results show that using a hash table with linear probing yields O(logN) time performance for handling M accesses by N processors when the load factor of the table is 50%, where N is the size of the hash table. This is better than the performance of using sorted arrays. Two phase hashing gives an average time complexity O(logN) for M simultaneous accesses to a hash table of size N even when the table has 100% load. Simulation results also show that hypercube hashing significantly outperforms linear probing and double hashing.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123317800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
The 'Mobius cubes': improved cubelike networks for parallel computation “莫比乌斯立方体”:改进的并行计算立方体网络
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222997
P. Cull, S. Larson
The Mobius cubes are created, by rearranging in a systematic manner, some of the edges of a hypercube. This rearrangement results in smaller distances between processors; where distance is the number of communication links which must be traversed. The authors show that the n-dimensional Mobius cubes have a diameter of about n/2 and expected distance of about n/3. These distances are a considerable savings over the diameter of n and expected distance of n/2 for the n-dimensional hypercube. The authors show that the Mobius cubes have a slightly more complicated algorithm than the hypercube. While the asymmetry of the Mobius cubes may give rise to communications bottlenecks, they report preliminary experiments showing the bottle necks are not significant. They compare their Mobius cubes to other variants and indicate some advantages for the Mobius cubes.<>
莫比乌斯立方体是通过以系统的方式重新排列超立方体的一些边缘而创建的。这种重排导致处理器之间的距离更小;其中,距离是必须穿越的通信链路的数量。作者证明了n维莫比乌斯立方体的直径约为n/2,期望距离约为n/3。对于n维超立方体,这些距离相对于n的直径和n/2的期望距离是相当大的节省。作者表明,莫比乌斯立方体的算法比超立方体稍微复杂一些。虽然莫比乌斯立方体的不对称性可能会导致通信瓶颈,但他们报告说,初步实验表明,瓶颈并不重要。他们将Mobius立方体与其他变体进行了比较,并指出了Mobius立方体的一些优势。
{"title":"The 'Mobius cubes': improved cubelike networks for parallel computation","authors":"P. Cull, S. Larson","doi":"10.1109/IPPS.1992.222997","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222997","url":null,"abstract":"The Mobius cubes are created, by rearranging in a systematic manner, some of the edges of a hypercube. This rearrangement results in smaller distances between processors; where distance is the number of communication links which must be traversed. The authors show that the n-dimensional Mobius cubes have a diameter of about n/2 and expected distance of about n/3. These distances are a considerable savings over the diameter of n and expected distance of n/2 for the n-dimensional hypercube. The authors show that the Mobius cubes have a slightly more complicated algorithm than the hypercube. While the asymmetry of the Mobius cubes may give rise to communications bottlenecks, they report preliminary experiments showing the bottle necks are not significant. They compare their Mobius cubes to other variants and indicate some advantages for the Mobius cubes.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125253978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Using a functional language and graph reduction to program multiprocessor machines or functional control of imperative programs 使用函数式语言和图形化简对多处理机或命令式程序的功能控制进行编程
Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223017
Lal George, G. Lindstrom
Describes an effective means for programming shared memory multiprocessors whereby a set of sequential activities are linked together for execution in parallel. The glue for this linkage is provided by a functional language implemented via graph reduction and demand evaluation. The full power of functional programming is used to obtain succinct, high level specifications of parallel computations. The imperative procedures that constitute the sequential activities facilitate efficient utilization of individual processing elements, while the mechanisms inherent in graph reduction synchronize and schedule these activities.<>
描述一种对共享内存多处理器进行编程的有效方法,通过这种方法,一组顺序活动被链接在一起并行执行。这种联系的粘合剂是由一种函数式语言提供的,这种语言通过图形化简和需求评估来实现。函数式编程的全部功能被用来获得简洁、高层次的并行计算规范。构成顺序活动的命令式过程促进了对单个处理元素的有效利用,而图简化中固有的机制则同步和调度这些活动。
{"title":"Using a functional language and graph reduction to program multiprocessor machines or functional control of imperative programs","authors":"Lal George, G. Lindstrom","doi":"10.1109/IPPS.1992.223017","DOIUrl":"https://doi.org/10.1109/IPPS.1992.223017","url":null,"abstract":"Describes an effective means for programming shared memory multiprocessors whereby a set of sequential activities are linked together for execution in parallel. The glue for this linkage is provided by a functional language implemented via graph reduction and demand evaluation. The full power of functional programming is used to obtain succinct, high level specifications of parallel computations. The imperative procedures that constitute the sequential activities facilitate efficient utilization of individual processing elements, while the mechanisms inherent in graph reduction synchronize and schedule these activities.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130008732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Proceedings Sixth International Parallel Processing Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1