"An efficient group communication architecture over ATM networks"
Sung-Yong Park, Joohan Lee, S. Hariri
Proceedings Seventh Heterogeneous Computing Workshop (HCW'98), 30 March 1998. DOI: 10.1109/HCW.1998.666551

The NYNET Communication System (NCS), named for the ATM wide-area network testbed in New York state, is a multithreaded message-passing tool developed at Syracuse University that provides low-latency, high-throughput communication services over Asynchronous Transfer Mode (ATM)-based high-performance distributed computing (HPDC) environments. NCS provides flexible and scalable group communication services based on dynamic grouping and tree-based multicasting. The NCS architecture separates the data and control functions, allowing group operations to be implemented efficiently by using the control connections to transfer status information (e.g. topology and routing information). Furthermore, NCS provides several different algorithms for group communication and allows programmers to select an appropriate algorithm at runtime. The authors present an overview of the general architecture of NCS and the multicasting services it provides, and they analyze and compare the performance of NCS with that of other message-passing tools such as p4, PVM, and MPI in terms of both primitive and application-level performance.
"The relative performance of various mapping algorithms is independent of sizable variances in run-time predictions"
R. Armstrong, D. Hensgen, T. Kidd
DOI: 10.1109/HCW.1998.666547

The authors study the performance of four mapping algorithms: two naive ones, opportunistic load balancing (OLB) and limited best assignment (LBA), and two intelligent greedy algorithms, one running in O(nm) time and one in O(n²m) time. All of these algorithms except OLB use expected run-times to assign jobs to machines. Because expected run-times are rarely deterministic in modern networked and server-based systems, the authors first use experimentation to determine plausible run-time distributions. Using these distributions, they then run simulations to determine how the mapping algorithms perform. The comparisons show that the greedy algorithms produce schedules that, when executed, outperform those of the naive algorithms, even though exact run-times are not available to the schedulers. The authors conclude that intelligent mapping algorithms are beneficial even when the expected completion time of a job is not deterministic.
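The abstract contrasts OLB, which ignores run-time estimates, with greedy heuristics that use them. The sketch below is a minimal illustration of that contrast, not the paper's code: it assumes the O(nm) greedy rule is "assign each job, in order, to the machine with the minimum expected completion time", and all function names and data are hypothetical.

```python
def olb(jobs, n_machines):
    """Opportunistic load balancing: each job goes to the machine that
    becomes available soonest, without consulting expected run-times."""
    ready = [0.0] * n_machines
    mapping = []
    for runtimes in jobs:          # runtimes[m] = this job's time on machine m
        m = min(range(n_machines), key=lambda i: ready[i])
        mapping.append(m)
        ready[m] += runtimes[m]    # the actual run-time accrues regardless
    return mapping

def greedy_mct(jobs, n_machines):
    """O(nm) greedy: each job goes to the machine with the minimum
    expected completion time (current ready time + expected run-time)."""
    ready = [0.0] * n_machines
    mapping = []
    for runtimes in jobs:
        m = min(range(n_machines), key=lambda i: ready[i] + runtimes[i])
        mapping.append(m)
        ready[m] += runtimes[m]
    return mapping

# jobs[j][m] = expected run-time of job j on machine m
jobs = [[3.0, 6.0], [2.0, 1.0], [4.0, 4.0]]
print(greedy_mct(jobs, 2))  # → [0, 1, 1]
```

In the paper's setting, the run-times fed to the mapper are draws from a distribution rather than exact values; the study's point is that greedy mappers of this style still beat the naive ones under that uncertainty.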
"Dynamic, competitive scheduling of multiple DAGs in a distributed heterogeneous environment"
Michael A. Iverson, F. Özgüner
DOI: 10.1109/HCW.1998.666546

With the advent of large-scale heterogeneous environments, there is a need for matching and scheduling algorithms that allow multiple DAG-structured applications to share the computational resources of a network. The paper presents a matching and scheduling framework in which multiple applications compete for the computational resources on the network. In this environment, each application makes its own scheduling decisions, so no centralized scheduling resource is required. Applications need no direct knowledge of one another; the only knowledge of other applications arrives indirectly through load estimates (such as queue lengths). The paper also presents algorithms for each portion of the scheduling framework. One is a modification of a static scheduling algorithm, the DLS algorithm, first presented by Sih and Lee (1993). Others attempt to predict future task arrivals by modeling them as Poisson random processes. A series of simulations examines the performance of these algorithms in this environment and compares it with that of a more conventional, single-user environment.
"Modeling the slowdown of data-parallel applications in homogeneous and heterogeneous clusters of workstations"
S. Figueira, F. Berman
DOI: 10.1109/HCW.1998.666548

Data-parallel applications executing in multi-user clustered environments share resources with other applications. Since this sharing dramatically affects the performance of individual applications, it is critical to estimate its effect, i.e. the application slowdown, in order to predict application behavior. The authors develop a new approach for predicting the slowdown imposed on data-parallel applications executing on homogeneous and heterogeneous clusters of workstations. The model synthesizes the slowdown on each machine used by an application into a contention measure, the aggregate slowdown factor, which is used to adjust the application's execution time to account for the aggregate load. The model is parameterized by the work (or data) partitioning policy employed by the application, the local slowdown (due to contention from other users) present on each node of the cluster, and the relative weight (capacity) associated with each node. This model provides a basis for predicting realistic execution times for distributed data-parallel applications in production clustered environments.
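The abstract describes the model only at a high level, and the paper's actual formula for the aggregate slowdown factor is not reproduced here. The sketch below illustrates the general idea under our own simplifying assumption that, for a fixed work partition, the most-delayed node determines the finish time; all names are hypothetical.

```python
def aggregate_slowdown(local_slowdown, work_share):
    """Combine per-node local slowdowns into a single contention factor.
    Each node's slowdown is weighted by its work share relative to an
    even partition; the slowest weighted node dominates, since a
    data-parallel run finishes only when its last node finishes."""
    n = len(local_slowdown)
    return max(s * w * n for s, w in zip(local_slowdown, work_share))

def predicted_time(dedicated_time, local_slowdown, work_share):
    """Adjust the dedicated-cluster execution time for the aggregate load."""
    return dedicated_time * aggregate_slowdown(local_slowdown, work_share)

# Two nodes with equal work shares; the second sees a 1.5x contention slowdown:
print(predicted_time(10.0, [1.0, 1.5], [0.5, 0.5]))  # → 15.0
```

The parameters mirror the abstract's list: the partitioning policy enters through `work_share`, per-node contention through `local_slowdown`, and node capacity could be folded into either; the max-based combination is an assumption, not the paper's formula.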
"A dynamic matching and scheduling algorithm for heterogeneous computing systems"
Muthucumaru Maheswaran, H. Siegel
DOI: 10.1109/HCW.1998.666545

A heterogeneous computing system provides a variety of different machines, orchestrated to perform an application whose subtasks have diverse execution requirements. The subtasks must be assigned to machines (matching) and ordered for execution (scheduling) such that the overall application execution time is minimized. A new dynamic mapping (matching and scheduling) heuristic called the hybrid remapper is presented. The hybrid remapper is based on a centralized policy and improves a statically obtained initial matching and scheduling by remapping to reduce the overall execution time. The remapping is non-preemptive, and the execution of the hybrid remapper can be overlapped with the execution of the subtasks. During application execution, the hybrid remapper uses run-time values for subtask completion times and machine availability times whenever possible; it therefore bases its decisions on a mixture of run-time and expected values. The potential of the hybrid remapper to improve the performance of initial static mappings is demonstrated through simulation studies.
"Specification and control of cooperative work in a heterogeneous computing environment"
G. Hoyos-Rivera, Esther Martínez González, H. Rios-Figueroa, V. G. Sánchez-Arias, H. Acosta-Mesa, N. Lopez-Benitez
DOI: 10.1109/HCW.1998.666549

The implementation of an interface to support cooperative work in a heterogeneous computing environment is based on previously proposed definitions referred to as the cooperative work model (CWM) and the cooperative work language (CWL). The interface for cooperative work (ICW) and the graphical interface for cooperative work (GICW) are the two main components of a tool for setting up and controlling a cooperative working environment on a general-purpose heterogeneous computing platform. This tool is described, along with some desired characteristics that would improve its effectiveness. The specification and control of a virtual parallel machine are illustrated with an algorithm for 3D reconstruction from two stereoscopic images, and test results for this application are reported.
"Implementing distributed synthetic forces simulations in metacomputing environments"
S. Brunett, D. Davis, T. D. Gottschalk, P. Messina, C. Kesselman
DOI: 10.1109/HCW.1998.666543

A distributed, parallel implementation of the widely used Modular Semi-Automated Forces (ModSAF) Distributed Interactive Simulation (DIS) is presented, with scalable parallel processors (SPPs) used to simulate more than 50,000 individual vehicles. The single-SPP code is portable and has been used on a variety of SPP architectures for simulations with up to 15,000 vehicles. A general metacomputing framework for DIS on multiple SPPs is discussed, and results are presented for an initial system that uses explicit Gateway processes to manage communications among the SPPs. The 50,000-vehicle simulations utilized 1,904 processors at six sites across seven time zones, including platforms from three manufacturers. Ongoing activities to both simplify and enhance the metacomputing system using Globus are described.
"On the interaction between mobile processes and objects"
S. Jagannathan, R. Kelsey
DOI: 10.1109/HCW.1998.666555

Java's remote method invocation mechanism provides a number of features that extend the functionality of traditional client/server-based distributed systems. However, several characteristics of the language limit its utility as a vehicle for expressing lightweight mobile processes. Among these are its highly imperative sequential core, the close coupling of control and state that follows from its object model, and the fact that remote method calls are not properly tail-recursive. These features reduce the likelihood that Java can easily support process and object mobility for programs that exhibit complex communication and distribution patterns.