Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing: latest articles

Self-stabilizing algorithms for tree metrics

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472220

A. Datta, T. Gonzalez, V. Thiagarajan

This paper presents self-stabilizing algorithms for finding the diameter, centroid(s) and median(s) of a tree. The algorithms compute these metrics in a finite number of steps. The distributed tree-structured system is maintained by a separate self-stabilizing spanning tree protocol running over a graph. This makes the system resilient to transient failures, from which it is guaranteed to recover after a finite number of moves.

Citations: 2
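
The three metrics named in the tree-metrics abstract above can be made concrete with a small example. The following is a centralized, sequential sketch only; it is not the self-stabilizing distributed algorithm of the paper, whose rules the abstract does not give. The function names and the adjacency-dictionary representation are assumptions made for illustration.

```python
from collections import deque

def bfs_distances(adj, source):
    """Hop distance from source to every node of the tree."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def diameter(adj):
    """Tree diameter via two BFS passes: farthest node from a farthest node."""
    start = next(iter(adj))
    first = bfs_distances(adj, start)
    far = max(first, key=first.get)
    return max(bfs_distances(adj, far).values())

def centroids(adj):
    """Nodes whose removal leaves no component with more than n/2 nodes."""
    n = len(adj)
    root = next(iter(adj))
    parent = {root: None}
    order, stack = [], [root]
    while stack:                      # iterative preorder traversal
        u = stack.pop()
        order.append(u)
        for v in adj[u]:
            if v != parent[u]:
                parent[v] = u
                stack.append(v)
    size = {u: 1 for u in adj}
    for u in reversed(order):         # children are processed before parents
        if parent[u] is not None:
            size[parent[u]] += size[u]
    result = []
    for u in adj:
        largest = n - size[u]         # component containing the parent side
        for v in adj[u]:
            if v != parent[u]:
                largest = max(largest, size[v])
        if 2 * largest <= n:
            result.append(u)
    return sorted(result)

def medians(adj):
    """Nodes minimising the sum of distances to all other nodes."""
    total = {u: sum(bfs_distances(adj, u).values()) for u in adj}
    best = min(total.values())
    return sorted(u for u in adj if total[u] == best)

# A path on four nodes: 0 - 1 - 2 - 3
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
print(diameter(path), centroids(path), medians(path))
```

On a tree the centroid set and the median set coincide, which the example path illustrates with its two middle nodes.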

Internetworking between TP4 and TCP through protocol complementation

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472200

S. Das, P. Dhar

Like protocol conversion, protocol complementation is an approach to network interconnection. This paper describes internetworking between TP4 and TCP at the transport level through protocol complementation, which addresses the need for interoperability between ISO-OSI and the Internet. From the given CFSM specifications of protocol $P[P_s, P_r]$ of TP4 and $Q[Q_s, Q_r]$ of TCP, we have constructed a composite protocol CFSM $R_{PQ}$ for the converter, which may be inserted as a virtual layer to provide a uniform view to the users. An attempt has been made to implement the converter through the Estelle-C (Extended State Transition Language) compiler, a formal description tool for protocol specification and verification.

Citations: 1

A parallel implementation on CM5 of a multi-spectral cooperative segmentation

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472280

P. Bonnin, C. Maurette, B. Hoelzener-Douarin, E. Pissaloux

The paper focuses on the problem of multi-spectral image segmentation, which leads, through the data fusion of several mono-spectral images, to reliable and robust vision systems for military or industrial purposes. The proposed approach does not fit the classical taxonomy of image data fusion methods: data fusion is performed during the parallel segmentation of the different images. The presented algorithm has been implemented on the Connection Machine CM5 in the data programming style.

Citations: 1

Implementation of an optimal parallel algorithm for arithmetic expression parsing

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472290

H. Suresh

The development of efficient algorithms for parallel computer architectures is an ongoing research area, and in the recent past a great volume of theoretical work has been carried out in search of suitable algorithms for concurrent processing environments. This paper reports the results obtained in implementing an optimal parallel algorithm developed by Deng and Iyengar (1992) in the esoteric area of arithmetic expression parsing. The C code developed and tested on an IBM-compatible personal computer in this investigative study is a simple recursive descent parser and may be used for parallel parsing of arithmetic expressions. The algorithm was developed to suit the SIMD parallel architecture and to avoid any communication bottlenecks posed by the PVM system; however, the design and structure of the code readily permit porting to a parallel computer system.

Citations: 0
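
The abstract above describes its code as a simple recursive descent parser for arithmetic expressions. As a rough illustration of that technique, here is a minimal sequential sketch in Python. It is not the authors' C code and does not implement the Deng and Iyengar parallel algorithm; the grammar, the tuple-based syntax tree and all names are assumptions.

```python
import operator
import re

# Assumed grammar:
#   expr   := term (('+' | '-') term)*
#   term   := factor (('*' | '/') factor)*
#   factor := NUMBER | '(' expr ')' | '-' factor
TOKEN = re.compile(r"\s*(\d+\.?\d*|[-+*/()])")
OPS = {"+": operator.add, "-": operator.sub,
       "*": operator.mul, "/": operator.truediv}

def tokenize(text):
    """Split the input into number and operator tokens."""
    text = text.rstrip()
    tokens, pos = [], 0
    while pos < len(text):
        match = TOKEN.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character near position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens

class Parser:
    def __init__(self, tokens):
        self.tokens = tokens
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def take(self):
        token = self.peek()
        self.pos += 1
        return token

    def expr(self):
        node = self.term()
        while self.peek() in ("+", "-"):      # left-associative
            op = self.take()
            node = (op, node, self.term())
        return node

    def term(self):
        node = self.factor()
        while self.peek() in ("*", "/"):      # binds tighter than + and -
            op = self.take()
            node = (op, node, self.factor())
        return node

    def factor(self):
        token = self.take()
        if token == "(":
            node = self.expr()
            if self.take() != ")":
                raise ValueError("expected ')'")
            return node
        if token == "-":
            return ("neg", self.factor())
        if token is None or token in ("+", "*", "/", ")"):
            raise ValueError(f"unexpected token {token!r}")
        return float(token)

def parse(text):
    """Return a nested-tuple syntax tree for the whole input."""
    parser = Parser(tokenize(text))
    tree = parser.expr()
    if parser.peek() is not None:
        raise ValueError(f"unexpected token {parser.peek()!r}")
    return tree

def evaluate(node):
    """Evaluate a syntax tree produced by parse()."""
    if isinstance(node, float):
        return node
    if node[0] == "neg":
        return -evaluate(node[1])
    op, left, right = node
    return OPS[op](evaluate(left), evaluate(right))

print(evaluate(parse("2 + 3 * (4 - 1)")))
```

Each grammar rule maps to one method, which is what makes the approach easy to read and to port.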

Variable tracking technique: a single-pass method to determine data dependence

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472183

D. Wo, K. Forward

This paper presents a new data dependence checking technique called the variable tracking technique (VTT). It is a single-pass data dependence checking method which locates dependent statements in a serial computer program. VTT produces a schedule which lists the operations in the source code in groups. The operations in a particular group can be executed concurrently. The user is not required to provide a profile of the program to the compiler; hence VTT is suitable for applications which automate the process of exploiting parallelism. Here we describe the use of this technique in gacc, a parallelising compiler which compiles C functions to field programmable gate array (FPGA) circuits. The results presented in this paper show that VTT has been instrumental in gaining improved performance from a parallelising compiler which automates the process of executing the computationally intensive portion of the program in hardware.

Citations: 0
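
The abstract above describes a single pass that places operations into groups whose members can run concurrently. The sketch below shows one generic way such a pass could work, by tracking the group of the last write and last read of each variable. It is a hypothetical illustration and not the VTT algorithm itself, which the abstract does not specify; the statement format `(target, sources)` and the function name `schedule` are assumptions.

```python
def schedule(statements):
    """Group straight-line assignments into levels that may run concurrently.

    statements: list of (target, sources) pairs, e.g. ("a", ["x", "y"])
    for a = x + y. Returns a list of groups, each a list of statement
    indices; groups must execute in order.
    """
    last_write = {}   # variable -> group of its most recent write
    last_read = {}    # variable -> latest group in which it is read
    groups = []
    for index, (target, sources) in enumerate(statements):
        level = 0
        for var in sources:
            if var in last_write:            # flow dependence (read after write)
                level = max(level, last_write[var] + 1)
        if target in last_write:             # output dependence (write after write)
            level = max(level, last_write[target] + 1)
        if target in last_read:              # anti dependence (write after read)
            level = max(level, last_read[target] + 1)
        for var in sources:
            last_read[var] = max(last_read.get(var, -1), level)
        last_write[target] = level
        while len(groups) <= level:
            groups.append([])
        groups[level].append(index)
    return groups

program = [
    ("a", ["x", "y"]),   # 0: a = x + y
    ("b", ["x"]),        # 1: b = x * 2
    ("c", ["a", "b"]),   # 2: c = a + b
    ("x", ["c"]),        # 3: x = c
    ("d", ["y"]),        # 4: d = y
]
print(schedule(program))   # [[0, 1, 4], [2], [3]]
```

Statements 0, 1 and 4 share no dependences and land in the first group, while statement 3 must wait for statement 2 and for the earlier reads of `x`.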

Token distribution on reconfigurable d-dimensional meshes

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472203

G. Turner, H. Schroder

We propose an algorithm to solve the Token Distribution problem, a static variant of the load balancing problem, on $d$-dimensional reconfigurable meshes with toroidal connections and side length $n$. No other algorithms have been proposed under this model of computation. We show that for token size $T$, the discrepancy $\Delta$ between the maximum and minimum number of tokens per PE can be reduced to 1 in at most $2n\Delta(T+4d)$ steps.

Citations: 9
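
To make the quantities in the token-distribution abstract above concrete, the following sketch computes the discrepancy of a token placement and evaluates the step bound as it reads in the abstract, 2nΔ(T+4d). It does not implement the balancing algorithm itself. Treating Δ as the initial discrepancy is an assumption, as are the function names and the example placement.

```python
from itertools import product

def discrepancy(load):
    """Difference between the largest and smallest token count over all PEs."""
    counts = list(load.values())
    return max(counts) - min(counts)

def step_bound(n, delta, token_size, d):
    """Upper bound on steps as stated in the abstract: 2 * n * delta * (T + 4d)."""
    return 2 * n * delta * (token_size + 4 * d)

# Hypothetical example: a 2-dimensional mesh with side length 4 (16 PEs).
n, d = 4, 2
load = {pe: 0 for pe in product(range(n), repeat=d)}
load[(0, 0)] = 5
load[(3, 1)] = 2

delta = discrepancy(load)
print(delta, step_bound(n, delta, token_size=1, d=d))   # 5 360
```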

Accurate analysis of multistage interconnection networks using finite output-buffered switching elements

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472274

B. Zhou, Mohammed Atiquzzaman

Many of the existing analytical models for output-buffered switching elements (SEs) assume uniform traffic and infinite buffers at each output port of an SE. Moreover, because of simplifying assumptions, the results are not accurate. It is important to develop an accurate analytical model to tailor the design of the network parameters and to optimize the network performance by proper dimensioning of the buffers. The objective of this paper is to develop an accurate model for the performance of MINs that use finite output-buffered SEs and operate in the presence of nonuniform traffic patterns. It is shown that the proposed analytical model is much more accurate than existing models.

Citations: 3

An optimal lower bound on the maximum speedup in multiprocessors with clusters

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472251

L. Lundberg, H. Lennerstad

We consider an ideal multiprocessor system with q processors and a centralized scheduler without overhead that selects processes from one common pool, permitting dynamic relocation of processes. A parallel program P consisting of n processes is executed on this system and terminates when all processes are completed. Due to synchronizations, processes may be blocked while waiting for events in other processes. The parallel program is executed using some schedule of processes to processors, resulting in a speedup $\sigma$. We then consider an ideal multiprocessor with k clusters containing u processors each. In this system processes may not be relocated between clusters. Finding a schedule which results in maximum speedup is NP-hard. Here, we present a formula for the optimal lower bound on the maximum speedup for program P, as a function of q, n, $\sigma$, k and u. We also present a formula for the optimal lower bound when the number of processes (n) is unknown. Using these results we are able to decide if a certain schedule is close to optimal or if it is worthwhile to look for other schedules. This is demonstrated by evaluating the speedup of a specific schedule of a particular program.

Citations: 14

X-cube: a variation of cube-connected-cycles network with lower average routing steps

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472176

D. Gee, Hong Shen

A fundamental and important research area in parallel computing is the design of high-performance interconnection networks for connecting the processors in parallel computers. This paper presents a new interconnection network, the X-cube, a variant of the Cube-Connected-Cycles (CCC) network. Compared with a CCC of the same size, it has the same degree and the same worst-case diameter, but fewer routing steps in the average case. Associated with this network are a construction algorithm, which shows how the network is built, and a routing algorithm, which describes how messages are passed in the network. The proposed network is validated and its performance is evaluated experimentally through implementation of these algorithms. A number of comparisons are made between this network and three existing networks: the mesh, the hypercube, and the CCC.

Citations: 0
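
The abstract above compares the X-cube with the standard CCC on degree, diameter and average routing steps. Since the abstract does not give the X-cube construction, the sketch below builds only the baseline CCC and measures those three quantities by breadth-first search. The node labelling `(x, i)` follows the usual textbook definition; the function names are assumptions.

```python
from collections import deque

def ccc(d):
    """Adjacency lists of the cube-connected-cycles network CCC(d).

    Node (x, i) is position i on the cycle that replaces hypercube
    corner x. This sketch assumes d >= 3 so every cycle is simple.
    """
    if d < 3:
        raise ValueError("this sketch assumes d >= 3")
    adj = {}
    for x in range(2 ** d):
        for i in range(d):
            adj[(x, i)] = [
                (x, (i + 1) % d),     # next node on the local cycle
                (x, (i - 1) % d),     # previous node on the local cycle
                (x ^ (1 << i), i),    # hypercube link across dimension i
            ]
    return adj

def distances(adj, source):
    """Hop distance from source to every node, by breadth-first search."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def diameter_and_mean(adj):
    """Worst-case and average shortest-path length over all ordered pairs."""
    worst, total, pairs = 0, 0, 0
    for source in adj:
        dist = distances(adj, source)
        worst = max(worst, max(dist.values()))
        total += sum(dist.values())
        pairs += len(dist) - 1
    return worst, total / pairs

network = ccc(3)
worst, mean = diameter_and_mean(network)
print(len(network), worst, round(mean, 3))
```

The same `diameter_and_mean` helper could be applied to any other adjacency dictionary, which is how a variant topology might be compared against this baseline.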

Dynamic load sharing within workstation clusters

Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1995-04-19 DOI: 10.1109/ICAPP.1995.472295

P. Williams, R. Togneri

Workstation clusters, in which user processes run on one specified machine, create the potential for load imbalance. An analysis is conducted to determine how various system resources interact with one another in terms of job throughput and user interaction time. Quantitative and qualitative analyses of theoretical load sharing methods are used in the development of a well-engineered system which is configurable, reliable and has a standard interface. Performance evaluation and system testing show very positive results.

Citations: 4
