A tensor product formulation of Strassen's matrix multiplication algorithm with memory reduction
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262814
B. Kumar, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan
A programming methodology based on tensor products has been used for designing and implementing block recursive algorithms for parallel and vector multiprocessors. A previous tensor product formulation of Strassen's matrix multiplication algorithm requires working arrays of size O(7^n) for multiplying 2^n × 2^n matrices. The authors present a modified tensor product formulation of Strassen's algorithm in which the size of the working arrays is reduced to O(4^n). The modified formulation exhibits sufficient parallel and vector operations for efficient implementation. Performance results on the Cray Y-MP are presented.

{"title":"A tensor product formulation of Strassen's matrix multiplication algorithm with memory reduction","authors":"B. Kumar, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan","doi":"10.1109/IPPS.1993.262814","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262814","url":null,"abstract":"A programming methodology based on tensor products has been used for designing and implementing block recursive algorithms for parallel and vector multiprocessors. A previous tensor product formulation of Strassen's matrix multiplication algorithm requires working arrays of size O(7/sup n/) for multiplying 2/sup n/*2/sup n/ matrices. The authors present a modified tensor product formulation of Strassen's algorithm in which the size of working arrays can be reduced to O(4/sup n/). The modified formulation exhibits sufficient parallel and vector operations for efficient implementation. Performance results on the Cray Y-MP are presented.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132027539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automatic parallelization of LINPACK routines on distributed memory parallel processors
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262774
M. Neeracher, R. Rühl
Distributed memory parallel processors (DMPPs) have no hardware support for a global address space. However, conventional programs written in a sequential imperative language such as Fortran typically manipulate a small number of large arrays. The Oxygen compiler, developed as part of the K2 project, accepts conventional Fortran code augmented with code- and data-distribution directives. These directives support a global name space through a run-time mechanism called data consistency analysis. Many sequential Fortran programs can be efficiently parallelized with Oxygen directives inserted manually by the user into the sequential code. This work presents an analysis pass, added to the compiler, that suggests which directives to insert into the code. Automatic parallelization of LINPACK routines was attempted and results are given.

{"title":"Automatic parallelization of LINPACK routines on distributed memory parallel processors","authors":"M. Neeracher, R. Rühl","doi":"10.1109/IPPS.1993.262774","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262774","url":null,"abstract":"Distributed memory parallel processors (DMPPs) have no hardware support for a global address space. However, conventional programs written in a sequential imperative language such as Fortran typically manipulate few, large arrays. The Oxygen compiler, developed as part of the K2 project, accepts conventional Fortran code, augmented with code and data distribution directives. These directives support a global name space through a run-time mechanism called data consistency analysis. Many sequential Fortran programs can be efficiently parallelized, with Oxygen directives introduced manually by the user into the sequential code. This work presents an analysis pass added to the compiler that makes suggestions for the directives to be inserted into the code. Automatic parallelization of LINPACK routines was attempted and results are given.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133195735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scheduling in and out forests in the presence of communication delays
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262886
T. Varvarigou, V. Roychowdhury, T. Kailath, E. Lawler
The authors consider the problem of scheduling tasks on multiprocessor architectures in the presence of communication delays. Given a set of dependent tasks, the scheduling problem is to allocate the tasks to processors such that the pre-specified precedence constraints among the tasks are obeyed and certain cost measures (such as computation time) are minimized. Several cases of the scheduling problem have been proven to be NP-complete. Nevertheless, there are polynomial-time algorithms for several interesting special cases of the general scheduling problem. Most of these results, however, do not take into consideration the delays due to message passing among processors. The authors study the increase in time complexity of the scheduling problem due to the introduction of communication delays. In particular, they address the open problem of scheduling out-forests (in-forests) in a multiprocessor system of m identical processors when communication delays are considered. They present the first known polynomial-time algorithms for computing an optimal schedule when the number of available processors is given and bounded, and both computation and communication delays are assumed to take one unit of time.

{"title":"Scheduling in and out forests in the presence of communication delays","authors":"T. Varvarigou, V. Roychowdhury, T. Kailath, E. Lawler","doi":"10.1109/IPPS.1993.262886","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262886","url":null,"abstract":"The authors consider the problem of scheduling tasks on multiprocessor architectures in the presence of communication delays. Given a set of dependent tasks, the scheduling problem is to allocate the tasks to processors such that the pre-specified precedence constraints among the tasks are obeyed and certain cost-measures (such as computation time) are minimized. Several cases of the scheduling problem have been proven to be NP-complete. Nevertheless, there are polynomial time algorithms for several interesting special cases of the general scheduling problem. Most of these results, however, do not take into consideration the delays due to message passing among processors. The authors study the increase in time complexity of the scheduling problem due to the introduction of communication delays. In particular, they address the open problem of scheduling out-forests (in-forests) in a multiprocessor system of m identical processors when communication delays are considered. They present first known polynomial time algorithms for the computation of the optimal schedule when the number of available processors is given and bounded and both computation and communication delays are assumed to take one unit of time.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128982889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Explicit parallel structuring for rule-based programming
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262829
Shiow-yang Wu, J. Browne
This paper presents semantically based explicit parallel structuring for rule-based programming systems. Explicit parallel structuring appears to be necessary, since compile-time dependency analysis of sequential programs has not yielded large-scale parallelism and run-time analysis for parallelism is restricted by the execution cost of the analysis. Simple language extensions specifying the semantics of rules are used to define parallel execution behavior at the rule level. Type definitions for working-memory elements are extended to include relationships within and among objects, which define the parallelism allowed on instances of object types. The first result presented is that the algorithms implemented by commonly used benchmark rule-based programs contain scalable parallelism. The second result is that much of that parallelism can be captured by simple and modest extensions of rule-based languages, analogous to the models and constructs used to specify parallel structures in imperative programming languages. A sketch is given of a comprehensive language system that exploits the specification of parallelism-defining semantics in both the object-definition and executable segments of rule-based programs.

{"title":"Explicit parallel structuring for rule-based programming","authors":"Shiow-yang Wu, J. Browne","doi":"10.1109/IPPS.1993.262829","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262829","url":null,"abstract":"This paper presents semantically-based explicit parallel structuring for rule-based programming systems. Explicit parallel structuring appears to be necessary since compile-time dependency analysis of sequential programs has not yielded large scale parallelism and run-time analysis for parallelism is restricted by the execution cost of the analysis. Simple language extensions specifying semantics of rules are used to define parallel execution behavior at the rule level. Type definitions for working memory elements are extended to include relationships within and among objects which define the parallelism allowed on instances of object types. The first result presented is that the algorithms implemented by commonly used benchmark rule-based programs contain scalable parallelism. The second result is that much of that parallelism can be captured by simple and modest extensions of rule-based languages which are analogies of models and constructs used for specification of parallel structures in imperative programming languages. A sketch is given for a comprehensive language system which exploits specification of semantics defining parallel structures in both object-definition and executable segments of rule-based programs.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116285269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A portable parallel algorithm for VLSI circuit extraction
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262922
B. Ramkumar, P. Banerjee
The authors describe a new portable algorithm for parallel circuit extraction. The algorithm is built as part of the ongoing ProperCAD project: a portable object-oriented parallel environment for CAD applications built on top of the CHARM system. The algorithm, unlike prior approaches such as PACE, is asynchronous and is based on a coarse-grained dataflow execution model. Performance of circuit extraction is presented on four parallel machines: an Encore Multimax, a Sequent Symmetry, an nCUBE 2 hypercube, and a network of Sun SPARC workstations. The extractor runs unchanged on all these machines.

{"title":"A portable parallel algorithm for VLSI circuit extraction","authors":"B. Ramkumar, P. Banerjee","doi":"10.1109/IPPS.1993.262922","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262922","url":null,"abstract":"The authors describe a new portable algorithm for parallel circuit extraction. The algorithm is built as part of the ongoing ProperCAD project: a portable object-oriented parallel environment for CAD applications that is built on top of the CHARM system. The algorithm, unlike prior approaches like PACE is asynchronous and is based on a coarse-grained dataflow execution model. Performance of circuit extraction is presented on four parallel machines: an Encore Multimax, a Sequent Symmetry, a NCUBE 2 hypercube, and a network of Sun Sparc workstations. The extractor runs unchanged on all these machines.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"485 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116691718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Why BSP computers? (bulk-synchronous parallel computers)
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262847
L. Valiant
The author gives a summary of some of the arguments favoring the adoption of the bulk-synchronous parallel (BSP) model as a standard for parallel computing. First, he argues that for parallel computing to become a major industry, agreement has to be reached on a standard model at a level intermediate between the language and architecture levels. He goes on to list the factors that make the BSP model attractive as a standard at this intermediate, or bridging, level. Finally, he provides some reasons for favoring it over the shared-memory (PRAM) model, which is an alternative candidate for this role.

{"title":"Why BSP computers? (bulk-synchronous parallel computers)","authors":"L. Valiant","doi":"10.1109/IPPS.1993.262847","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262847","url":null,"abstract":"The author gives a summary of some of the arguments favoring the adoption of the bulk-synchronous parallel (BSP) model as a standard for parallel computing. First, he argues that for parallel computing to become a major industry, agreement has to be reached on a standard model at a level intermediate between the language and architecture levels. He goes on to list the factors that make the BSP model attractive as a standard at this intermediate or bridging level. Finally, he provides some reasons for favoring it over the shared memory or PRAM model which is an alternative candidate for this role.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134520510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mapping onto three classes of parallel machines: a case study using the cyclic reduction algorithm
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262888
G. Saghi, H. Siegel, J. L. Gray
Mapping cyclic reduction, a well-known approach for the parallel solution of tridiagonal systems of equations, onto the MasPar MP-1, nCUBE 2, and PASM parallel machines is discussed. Each of these represents a different mode of parallelism. Issues addressed are SIMD/MIMD trade-offs, the effect on execution time of increasing the number of processors used, the impact of the inter-processor communication network on performance, the importance of predicting algorithm performance as a function of the mapping used, and the advantages of a partitionable system. Analytical results are validated by experimentation on all three machines.

{"title":"Mapping onto three classes of parallel machines: a case study using the cyclic reduction algorithm","authors":"G. Saghi, H. Siegel, J. L. Gray","doi":"10.1109/IPPS.1993.262888","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262888","url":null,"abstract":"Mapping cyclic reduction, a known approach for the parallel solution of tridiagonal systems of equations, onto the MasPar MP-1, nCUBE 2, and PASM parallel machines is discussed. Each of these represents a different mode of parallelism. Issues addressed are SIMD/MIMD trade-offs, the effect on execution time of increasing the number of processors used, the impact of the inter-processor communications network on performance, the importance of predicting algorithm performance as a function of the mapping used, and the advantages of a partitionable system. Analytical results are validated by experimentation on all three machines.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125994287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Load balancing of DOALL loops in the Perfect Club
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262868
G. Elsesser, Viet N. Ngo, S. Bhattacharya, W. Tsai
The speedup achieved by concurrent execution of loop iterations is determined by load balance and several other factors, so no single strategy provides maximum speedup for all classes of programs and all target architectures. Hence, the selection of a load-balancing strategy must be guided by characteristics of both the application domain and the target machine architecture. The authors study loop load balance in the context of the well-known Perfect Club benchmarks. Several static and dynamic characteristics of DOALL loops are observed and interpreted. Late arrival of processors is identified as a significant source of load imbalance. A scheme for processor preallocation is proposed, and the advantages and applicability of this scheme are demonstrated by analytical estimates as well as experimental evaluation on a Cray YMP-8.

{"title":"Load balancing of DOALL loops in the Perfect Club","authors":"G. Elsesser, Viet N. Ngo, S. Bhattacharya, W. Tsai","doi":"10.1109/IPPS.1993.262868","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262868","url":null,"abstract":"The speedup achieved by concurrent execution of loop iterations is determined by load balance and several other factors, so no single strategy provides maximum speedup for all classes of programs and all target architectures. Hence, the selection of a load balancing strategy must be guided by characteristics of both the application domain and the target machine architecture. The authors study loop load balance in the context of the well known Perfect Club benchmark. Several static and dynamic characteristics of DOALL loops are observed and interpreted. Late arrival of processors is identified as a significant source of load imbalance. A scheme for processor preallocation is proposed and the advantages and applicability of this scheme are demonstrated by analytical estimates as well as experimental evaluation on a Cray YMP-8.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127947439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A multi-level hierarchical cache coherence protocol for multiprocessors
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262871
Craig Anderson, J. Baer
In order to meet the computational needs of the next decade, shared-memory multiprocessors must be scalable. Though single shared-bus architectures have been successful in the past, lack of bus bandwidth restricts the number of processors that can be effectively put on a single-bus machine. One architecture that has been proposed to solve the limited-bandwidth problem consists of processors connected via a tree hierarchy of buses. The authors present a tool to study a shared-memory system based on such a hierarchy of buses. They highlight the main features of a hierarchical cache coherence protocol and give some preliminary performance results obtained via an instruction-level simulator.

{"title":"A multi-level hierarchical cache coherence protocol for multiprocessors","authors":"Craig Anderson, J. Baer","doi":"10.1109/IPPS.1993.262871","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262871","url":null,"abstract":"In order to meet the computational needs of the next decade, shared-memory processors must be scalable. Though single shared-bus architectures have been successful in the past, lack of bus bandwidth restricts the number of processors that can be effectively put on a single bus machine. One architecture that has been proposed to solve the limited bandwidth problem consists of processors connected via a tree hierarchy of buses. The authors present a tool to study a hierarchical bus based shared-memory system. They highlight the main features of a hierarchical cache coherence protocol and give some preliminary performance results obtained via an instruction level simulator.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130119150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The data-parallel Ada run-time system, simulation and empirical results
Pub Date: 1993-04-13 | DOI: 10.1109/IPPS.1993.262808
H. G. Mayer, Stefan Jähnichen
The Parallel Ada Run-Time System (PARTS), developed at TUB, is the target of an experimental translator that maps sequential Ada to a shared-memory multiprocessor. Other modules of the parallel compiler are not covered. The paper summarizes the multiprocessor run-time system; it explains the instructions that activate multiple processors, leading to SPMD execution, and discusses the scheduling policy. Default architectural attributes of PARTS can be custom-tailored for each run without recompilation. The experiments exposed different machine personalities by measuring execution-time profiles of the vector product run on different architectures. The goal is to determine experimentally how well a shared-memory architecture scales with increasing problem size, and how well the problem size scales for a fixed multiprocessor configuration. The measurements expose the advantages of shared-memory multiprocessor architectures in exploiting one dimension of parallelism. However, scalability is limited by the number of memory ports. Therefore another architectural dimension of parallelism, distributed memory, must be combined with shared memory to achieve teraFLOP performance.

{"title":"The data-parallel Ada run-time system, simulation and empirical results","authors":"H. G. Mayer, Stefan Jähnichen","doi":"10.1109/IPPS.1993.262808","DOIUrl":"https://doi.org/10.1109/IPPS.1993.262808","url":null,"abstract":"The Parallel Ada Run-Time System (PARTS), developed at TUB, is the target of an experimental translator that maps sequential Ada to a shared-memory multi-processor. Other modules of the parallel compiler are not explained. The paper summarizes the multi-processor run-time system; it explains those instructions that activate multiple processors leading to SPMD execution and discusses the scheduling policy Default architectural attributes of PARTS can be custom-tailored for each run without re-compile. The experiments exposed different machine personalities by measuring execution time profiles of the vector product run on different architectures. The goal is to find experimentally, how well a shared-memory architecture scales up to an increasing problem size, and how well the problem size scales up for a fixed multi-processor configuration. The measurements expose the advantages of shared-memory multi-processor architectures to exploit one dimension of parallelism. However, scalability is limited to the number of memory ports. Therefore another architectural dimension of parallelism, distributed-memory, must be combined with shared memories to achieve Tera-FLOP performance.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130467785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}