Proceedings 11th International Parallel Processing Symposium最新文献

英文中文

Predicting queue times on space-sharing parallel computers 预测空间共享并行计算机的排队时间

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580894

A. Downey

We present statistical techniques for predicting the queue times experienced by jobs submitted to a space-sharing parallel machine with first-come-first-served (FCFS) scheduling. We apply these techniques to trace data from the Intel Paragon at the San Diego Supercomputer Center and the IBM SP2 at the Cornell Theory Center. We show that it is possible to predict queue times with accuracy that is acceptable for several intended applications. The coefficient of correlation between our predicted queue times and the actual queue times from simulated schedules is between 0.65 and 0.72.

我们提出了使用先到先服务(FCFS)调度来预测提交给空间共享并行机的作业所经历的排队时间的统计技术。我们应用这些技术来跟踪来自圣地亚哥超级计算机中心的英特尔Paragon和康奈尔理论中心的IBM SP2的数据。我们表明，对于几个预期的应用程序，预测队列时间的准确性是可以接受的。我们预测的队列时间与模拟调度的实际队列时间之间的相关系数在0.65到0.72之间。

引用次数: 176

Reducing waiting costs in user-level communication 减少用户级通信的等待成本

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580930

Stefanos N. Damianakis, Yuqun Chen, E. Felten

Describes a mechanism for reducing the cost of waiting for messages in architectures that allow user-level communication libraries. We reduce waiting costs in two ways: by reducing the cost of servicing interrupts, and by carefully controlling when the system uses interrupts and when it uses polling. We have implemented our mechanism on the SHRIMP multicomputer and integrated it with our user-level sockets library. Experiments show that a hybrid spin-then-block strategy offers good performance in a wide variety of situations, and that speeding up the interrupt path significantly improves performance.

描述一种机制，用于减少在允许用户级通信库的体系结构中等待消息的成本。我们通过两种方式减少等待成本:通过减少服务中断的成本，以及通过仔细控制系统何时使用中断和何时使用轮询。我们已经在SHRIMP多计算机上实现了我们的机制，并将其与我们的用户级套接字库集成。实验表明，混合自旋-阻塞策略在多种情况下都具有良好的性能，加速中断路径显著提高了性能。

引用次数: 12

Design and evaluation of data storage and retrieval strategies in a distributed memory continuous media server 分布式存储连续介质服务器中数据存储与检索策略的设计与评价

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580927

C. Srinilta, D. Jadav, A. Choudhary

High-performance servers and high-speed networks will form the backbone of the infrastructure required for distributed multimedia information systems. Given that the goal of such a server is to support hundreds of interactive data streams simultaneously, various tradeoffs are possible with respect to the storage of data on secondary memory and its retrieval In this paper, we identify and evaluate these tradeoffs. We evaluate the effect of varying the stripe factor and also the performance of batched retrieval of disk-resident data. We develop a methodology to predict the stream capacity of such a server. The evaluation is done for both uniform and skewed access patterns. Experimental results on the Intel Paragon computer are presented.

高性能服务器和高速网络将构成分布式多媒体信息系统所需基础设施的骨干。考虑到这样一个服务器的目标是同时支持数百个交互式数据流，在次要内存上存储数据及其检索方面可能存在各种权衡。在本文中，我们识别并评估了这些权衡。我们评估了改变条带因子的影响以及批量检索磁盘驻留数据的性能。我们开发了一种方法来预测这种服务器的流容量。对均匀访问模式和倾斜访问模式都进行了评估。给出了在Intel Paragon计算机上的实验结果。

引用次数: 7

A reliable hardware barrier synchronization scheme 一个可靠的硬件屏障同步方案

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580908

Rajeev Sivaram, C. Stunkel, D. Panda

Barrier synchronization is a crucial operation for parallel systems. Many schemes have been proposed in the literature to achieve fast barrier synchronization through software, hardware, or a combination of these mechanisms. However few of these schemes emphasize fault-tolerant barrier operations. In this paper, we describe inexpensive support that can be added to network switches for achieving reliable hardware-based barrier synchronization while recovering from lost or corrupted messages. Necessary modifications to the switch architecture and the associated fault-tolerant message-passing protocols are presented. The protocols are optimized for the no-fault case while providing means to detect the failure of any step of the operation and to recover from it. The proposed scheme shows significant potential for use in parallel systems, especially the emerging systems based on networks of workstations.

屏障同步是并行系统的关键操作。文献中提出了许多方案，通过软件、硬件或这些机制的组合来实现快速屏障同步。然而，这些方案中很少强调容错屏障操作。在本文中，我们描述了可以添加到网络交换机的廉价支持，以实现可靠的基于硬件的屏障同步，同时从丢失或损坏的消息中恢复。对交换机体系结构和相关的容错消息传递协议进行了必要的修改。该协议针对无故障情况进行了优化，同时提供了检测操作任何步骤失败并从中恢复的方法。所提出的方案在并行系统，特别是基于工作站网络的新兴系统中显示出巨大的应用潜力。

引用次数: 28

Work-time optimal k-merge algorithms on the PRAM PRAM上工作时间最优k-归并算法

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580913

Tatsuya Hayashi, K. Nakano, S. Olariu

The k-merge problem, given a collection of k, (2/spl les/k/spl les/n), sorted sequences of total length a asks to merge them into a new sorted sequence. The main contribution of the work is to propose simple and intuitive work-time optimal algorithms for the k-merge problem on two PRAM models. Specifically their k-merge algorithms perform O(nlogk) work and run in O(log n) time on the EREW-PRAM and in O (log log n+log k) time on the CREW-PRAM, respectively.

k-归并问题，给定一个k的集合，(2/spl les/k/spl les/n)，总长度为a的有序序列，要求将它们归并成一个新的有序序列。该工作的主要贡献是针对两个PRAM模型的k-归并问题提出了简单直观的工作时间最优算法。具体来说，它们的k-归并算法分别在eew - pram和CREW-PRAM上执行O(nlogk)的工作，运行时间分别为O(log n)和O(log n+log k)。

引用次数: 12

Interactive visual exploration of distributed computations 分布式计算的交互式可视化探索

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580864

Delbert Hart, Eileen T. Kraemer

Program understanding is central to the development of distributed computations, from the initial coding phase, through testing and debugging, to maintenance and support. Our goal is to support programmers unfamiliar with a particular distributed computation in developing a reasonable understanding of the workings of a program, without requiring that they examine the details of the code itself. Toward, this goal, we propose query based visualization, a novel exploratory approach to understanding distributed computations. The key features of the approach are the use of queries as a device for searching the state space, visual presentation techniques adapted from program animation, and the ability to navigate through the state space using visual interactions. All views correspond to globally consistent snapshots of the computation. A working prototype demonstrates the technical feasibility of the approach.

程序理解是分布式计算开发的核心，从最初的编码阶段，到测试和调试，再到维护和支持。我们的目标是支持不熟悉特定分布式计算的程序员合理地理解程序的工作原理，而不需要他们检查代码本身的细节。为了实现这一目标，我们提出了基于查询的可视化，这是一种新的理解分布式计算的探索性方法。该方法的主要特点是使用查询作为搜索状态空间的工具、采用来自程序动画的可视化表示技术，以及使用可视化交互在状态空间中导航的能力。所有视图都对应于计算的全局一致快照。一个工作样机证明了该方法的技术可行性。

引用次数: 26

Real-time parallel MPEG-2 decoding in software 实时并行MPEG-2解码软件

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580889

A. Bilas, Jason E. Fritts, J. Singh

The growing demand for high quality compressed video has led to an increasing need for real-time MPEG decoding at greater resolutions and picture sizes. With the widespread availability of small-scale multiprocessors, a parallel software implementation may provide an effective solution to the decoding problem. We present a parallel decoder for the MPEG standard, implemented on a shared memory multiprocessor. Goal of this work is to provide an all-software solution for real-time, high-quality video decoding and to investigate the important properties of this application as they pertain to multiprocessor systems. Both coarse and fine grained implementations are considered for parallelizing the decoder. The coarse-grained approach exploits parallelism at the group of pictures level, while the fine-grained approach parallelizes within pictures, at the slice level. A comparative evaluation of these methods is made, with results presented in terms of speedup, memory requirements, load balance, synchronization time, and temporal and spatial locality. Both methods demonstrate very good speedups and locality properties.

对高质量压缩视频的需求不断增长，导致对更高分辨率和更大图像尺寸的实时MPEG解码的需求不断增加。随着小型多处理器的广泛使用，并行软件实现可以为解码问题提供有效的解决方案。我们提出了一个MPEG标准的并行解码器，实现在一个共享内存多处理器上。这项工作的目标是为实时，高质量的视频解码提供全软件解决方案，并研究该应用程序的重要特性，因为它们与多处理器系统有关。对于并行化解码器，考虑了粗粒度和细粒度实现。粗粒度方法利用图片组级别的并行性，而细粒度方法利用图片内的并行性，即片级别。对这些方法进行了比较评估，并在加速、内存需求、负载平衡、同步时间以及时间和空间局部性方面给出了结果。这两种方法都展示了非常好的加速和局部性。

引用次数: 73

Causality filters: a tool for the online visualization and steering of parallel and distributed programs 因果关系过滤器:一个工具，用于在线可视化和并行和分布式程序的指导

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580862

Eileen T. Kraemer

Interactive program steering is a promising technique for improving the performance of parallel and distributed applications. Steering decisions are typically based on visual presentations of some subset of the computation's current state, a historical view of the computation's behavior or views of metrics based on the program's performance. As in any endeavor good decisions require accurate information. However the distributed nature of the collection process may result in distortions in the portrayal of the program's execution. These distortions stem from the merging of streams of information from distributed collection points into a single stream without enforcing the ordering relationships that held among the program components that produced the information. An ordering filter placed at the point at which the streams are merged can ensure a valid ordering, leading to more accurate visualizations and better informed steering decisions. In this paper we describe the implementation of such filters in the Falcon interactive steering toolkit, and present a methodology for their specification for automated generation.

交互式程序控制是一种很有前途的技术，可以提高并行和分布式应用程序的性能。指导决策通常基于计算当前状态的某些子集的可视化表示、计算行为的历史视图或基于程序性能的度量视图。在任何努力中，正确的决定都需要准确的信息。然而，收集过程的分布式特性可能导致对程序执行的描述失真。这些扭曲源于将来自分布式收集点的信息流合并为单个流，而没有强制执行产生信息的程序组件之间的顺序关系。将排序过滤器放置在流合并的位置，可以确保有效的排序，从而实现更准确的可视化和更明智的指导决策。在本文中，我们描述了这种过滤器在Falcon交互式转向工具包中的实现，并提出了一种用于自动生成它们的规范的方法。

{"title":"Causality filters: a tool for the online visualization and steering of parallel and distributed programs","authors":"Eileen T. Kraemer","doi":"10.1109/IPPS.1997.580862","DOIUrl":"https://doi.org/10.1109/IPPS.1997.580862","url":null,"abstract":"Interactive program steering is a promising technique for improving the performance of parallel and distributed applications. Steering decisions are typically based on visual presentations of some subset of the computation's current state, a historical view of the computation's behavior or views of metrics based on the program's performance. As in any endeavor good decisions require accurate information. However the distributed nature of the collection process may result in distortions in the portrayal of the program's execution. These distortions stem from the merging of streams of information from distributed collection points into a single stream without enforcing the ordering relationships that held among the program components that produced the information. An ordering filter placed at the point at which the streams are merged can ensure a valid ordering, leading to more accurate visualizations and better informed steering decisions. In this paper we describe the implementation of such filters in the Falcon interactive steering toolkit, and present a methodology for their specification for automated generation.","PeriodicalId":145892,"journal":{"name":"Proceedings 11th International Parallel Processing Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128832929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

SPAX: a new parallel processing system for commercial applications SPAX:用于商业应用的新型并行处理系统

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580991

W. Hahn, K. Rim, Soo-Won Kim

A new parallel processing system for commercial applications, called SPAX, is described. SPAX cost-effectively overcomes the SMP limitation by providing scalability of the parallel processing system and application portability of the SMP. We also describe a new system network, called Xcent-Net, which interconnects hundreds of multiprocessor PC boards in SPAX. It is a hierarchical network that provides incremental scalability with minimum re-wiring when the user's requirement is changed. This is based on the low latency crossbar routers on each hierarchy, which consist a router-cloud and provide up to 2.67 Gbytes/sec/router-cloud of bandwidth. We briefly describe the preliminary evaluation result that shows Xcent-Net will not be the bottleneck in the system running a typical commercial application.

介绍了一种新的商业应用并行处理系统SPAX。SPAX通过提供并行处理系统的可伸缩性和SMP的应用程序可移植性，经济有效地克服了SMP的限制。我们还描述了一个新的系统网络，称为Xcent-Net，它在SPAX中互连了数百个多处理器PC板。它是一个分层网络，当用户的需求发生变化时，它以最少的重新布线提供增量可伸缩性。这是基于每个层次结构上的低延迟交叉栏路由器，它由一个路由器云组成，并提供高达2.67 gb /秒/路由器云的带宽。我们简要地描述了初步评估结果，该结果表明Xcent-Net不会成为运行典型商业应用程序的系统的瓶颈。

引用次数: 3

Deadlock-free fault-tolerant routing in the multi-dimensional crossbar network and its implementation for the Hitachi SR2201 多维交叉棒网络中的无死锁容错路由及其在日立SR2201上的实现

Proceedings 11th International Parallel Processing Symposium

Pub Date : 1997-04-01 DOI: 10.1109/IPPS.1997.580925

Y. Yasuda, Hiroaki Fujii, Hideya Akashi, Y. Inagami, Teruo Tanaka, Junji Nakagoshi, Hideo Wada, Tsutomu Sumimoto

We have developed a hardware detour path selection facility for the Hitachi SR2201 parallel computer, which uses a multi-dimensional crossbar as an inter-processor network to ensure operating efficiency and high reliability when a part of the network is faulty. When this hardware facility is used, packets are transmitted to their destination along alternative paths to avoid the fault. However, changing the routing may cause deadlock. This paper describes a deadlock-free fault-tolerant routing scheme that can be used by the detour path selection facility to avoid deadlock, and its implementation for the SR2201.

我们为日立SR2201并行计算机开发了一种硬件绕行路径选择工具，该工具使用多维交叉棒作为处理器间网络，在部分网络故障时确保运行效率和高可靠性。当使用该硬件设施时，数据包沿着备选路径传输到目的地以避免故障。但是，更改路由可能会导致死锁。本文描述了一种无死锁容错路由方案，该方案可用于绕行路径选择设施以避免死锁，并在SR2201上实现。

引用次数: 9

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings 11th International Parallel Processing Symposium

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀