Architectural implications of the NAS MG and FT parallel benchmarks
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574038
Yuzhong Sun, Jianyong Wang, Zhiwei Xu
This paper characterizes the structure and resource requirements of the NAS Parallel Benchmarks (NPB), a popular benchmark suite used to evaluate various parallel computers. The phase parallel model is used to obtain parameter values for memory, I/O, and communication latency and bandwidth requirements. These quantitative parameters are useful in the design and evaluation of various parallel computers. The results of this study are being used in designing Dawning 2000, NCIC's second-generation MPP.
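The abstract does not reproduce the phase parallel model's equations; purely as a hedged sketch of what such a cost model typically looks like (the symbols and the interaction term below are assumptions, not notation taken from the paper):

```latex
% Assumed sketch of a phase-parallel execution-time estimate: the program
% is a sequence of k phases, phase i has sequential work T_i(n) and degree
% of parallelism DOP_i, and phases are separated by interaction
% (communication/synchronization) overhead T_interact,i.
T(n, p) \;=\; \sum_{i=1}^{k} \frac{T_i(n)}{\min(\mathrm{DOP}_i,\, p)}
        \;+\; \sum_{i=1}^{k} T_{\mathrm{interact},i}(n, p)
```

The memory, I/O, and communication parameters measured in the paper would enter through the per-phase work and interaction terms.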
{"title":"Architectural implications of the NAS MG and FT parallel benchmarks","authors":"Yuzhong Sun, Jianyong Wang, Zhiwei Xu","doi":"10.1109/APDC.1997.574038","DOIUrl":"https://doi.org/10.1109/APDC.1997.574038","url":null,"abstract":"This paper characterizes the structure and resource requirements of the NAS Parallel Benchmarks (NPB), a popular benchmark suite used to evaluate various parallel computers. The phase parallel model is used to obtain parameter values for memory, I/O, and communication latency and bandwidth requirements. These quantitative parameters are useful in the design and evaluation of various parallel computers. The results of this study is being used in designing Dawning 2000, which is NCIC's second generation MPP.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115292049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reduced communication protocol for clusters
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574049
Shuo Di, Weimin Zheng
With the development of CPUs and communication networks, workstation clusters using the message-passing mechanism have come to play a crucial role in network computing. Today's clusters are mainly connected by networks running traditional communication protocols (such as TCP/IP). The high overheads of these protocols prevent many parallel applications running on clusters from efficiently using the potential computational power provided by the workstations and the networks. One way to solve this problem is to construct a reduced communication protocol. This paper gives a detailed analysis of the overheads produced by traditional protocols and provides some global strategies for designing a reduced communication protocol. Our implementation method for such a protocol is described here, together with some core algorithms and test results.
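The abstract does not give the protocol's wire format or programming interface; the sketch below only illustrates the general idea of trimming the per-message software path, here by sending a tiny fixed header over a connectionless UDP socket. The header fields, the 1400-byte payload limit, and the function name mini_send are all hypothetical.

```c
/* Illustrative sketch only (not the paper's protocol): a minimal framing
   layer over UDP that avoids TCP's connection and reliability machinery.
   Reliability and fragmentation are deliberately ignored here. */
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdint.h>
#include <string.h>
#include <sys/socket.h>

struct mini_hdr {            /* 8-byte header                           */
    uint32_t src_rank;       /* logical sender id within the cluster    */
    uint32_t length;         /* payload length in bytes                 */
};

int mini_send(int sock, const struct sockaddr_in *dst,
              uint32_t rank, const void *payload, uint32_t len)
{
    char buf[sizeof(struct mini_hdr) + 1400];      /* one frame's worth  */
    struct mini_hdr h = { htonl(rank), htonl(len) };
    if (len > 1400) return -1;                     /* no fragmentation   */
    memcpy(buf, &h, sizeof h);
    memcpy(buf + sizeof h, payload, len);
    return (int)sendto(sock, buf, sizeof h + len, 0,
                       (const struct sockaddr *)dst, sizeof *dst);
}
```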
{"title":"Reduced communication protocol for clusters","authors":"Shuo Di, Weimin Zheng","doi":"10.1109/APDC.1997.574049","DOIUrl":"https://doi.org/10.1109/APDC.1997.574049","url":null,"abstract":"With the development of CPUs and communication networks, workstation clusters using message-passing mechanism become a crucial role in the field of network computing. Today's clusters are mainly connected by networks running traditional communication protocols (such as TCP/IP). The high overheads of these protocols make many parallel applications running on clusters inefficient using the potential computation power provided by the workstations and the networks. A method to solve this problem is to construct reduced communication protocol. This paper gives a detailed analysis of overheads produced by traditional protocols and provides some global strategies to design a reduced communication protocol. Our implementation method of such a protocol is described here together with some core algorithms and the testing results.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132503121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Solving sparse least squares problems on massively distributed memory computers
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574029
L. Yang
In this paper we study the parallel aspects of PCGLS, a basic iterative method whose main idea is to organize the computation of the conjugate gradient method, with a preconditioner, applied to the normal equations, and of the incomplete modified Gram-Schmidt (IMGS) preconditioner, for solving sparse least squares problems on massively parallel distributed memory computers. The performance of these methods on this kind of architecture is always limited by the global communication required for the inner products. We describe the parallelization of PCGLS and the IMGS preconditioner with two improvements. One is to assemble the results of a number of inner products collectively; the other is to create situations in which communication can be overlapped with computation. A theoretical model of computation and communication phases is presented which allows us to determine the number of processors that minimizes the runtime. Several numerical experiments on the Parsytec GC/PowerPlus are presented.
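The first improvement, assembling several inner products into one collective operation, can be sketched as follows. This is a hedged illustration rather than the paper's code, and it uses present-day MPI for concreteness; the function name combined_dots is invented.

```c
/* Sketch: compute <r,r> and <p,Ap> with a single MPI_Allreduce instead of
   two, halving the number of global synchronizations per CG iteration. */
#include <mpi.h>

void combined_dots(const double *r, const double *p, const double *Ap,
                   int n_local, double *rr, double *pAp)
{
    double local[2] = {0.0, 0.0}, global[2];
    for (int i = 0; i < n_local; i++) {
        local[0] += r[i] * r[i];       /* local part of <r,r>  */
        local[1] += p[i] * Ap[i];      /* local part of <p,Ap> */
    }
    /* One collective carries both partial sums across all processors. */
    MPI_Allreduce(local, global, 2, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);
    *rr  = global[0];
    *pAp = global[1];
}
```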
{"title":"Solving sparse least squares problems on massively distributed memory computers","authors":"L. Yang","doi":"10.1109/APDC.1997.574029","DOIUrl":"https://doi.org/10.1109/APDC.1997.574029","url":null,"abstract":"In this paper we study the parallel aspects of PCGLS, a basic iterative method whose main idea is to organize the computation of conjugate gradient method with preconditioner applied to normal equations, and incomplete modified Gram-Schmidt (IMGS) preconditioner for solving sparse least squares problems on massively parallel distributed memory computers. The performance of these methods on this kind of architecture is always limited because of the global communication required for the inner products. We describe the parallelization of PCGLS and IMGS preconditioner by two ways of improvement. One is to assemble the results of a number of inner products collectively and the other is to create situations when communication can be overlapped with computation. A theoretical model of computation and communication phases is presented which allows us to decide the number of processors that minimizes the runtime. Several numerical experiments on Parsytec GC/PowerPlus are presented.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"11651 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114508062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On distributed snapshot algorithms
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574046
L. He, Yongqiang Sun
Snapshot algorithms are fundamental for many distributed applications and must often be executed repeatedly. We present three snapshot algorithms. The first is based on the assumption of global time; it computes channel states using several schemes. Taking a consistent cut as the global time instant, we show that the algorithm is applicable to existing snapshot algorithms. The second is a real token-passing-based algorithm for non-FIFO asynchronous distributed systems; its control-message complexity is O(n). The last algorithm is the repeated version of the second one. Using this algorithm, processes can obtain consistent global states concurrently, at their convenience.
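The abstract does not detail the token-passing algorithm, so the sketch below is only a toy simulation of why one circulation of a token around a ring costs O(n) control messages; the data layout is invented for illustration.

```c
/* Toy simulation (not the paper's algorithm): a token visits n processes
   around a ring, each process records its local state into the token, so
   a full snapshot needs exactly n token hops, i.e. O(n) control messages. */
#include <stdio.h>
#define N 4

struct token { int state[N]; int hops; };

int main(void) {
    int local_state[N] = {10, 20, 30, 40};   /* stand-ins for process states */
    struct token tok = { {0}, 0 };
    for (int p = 0; p < N; p++) {            /* one circulation of the ring  */
        tok.state[p] = local_state[p];
        tok.hops++;
    }
    printf("snapshot after %d hops:", tok.hops);
    for (int p = 0; p < N; p++) printf(" %d", tok.state[p]);
    printf("\n");
    return 0;
}
```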
{"title":"On distributed snapshot algorithms","authors":"L. He, Yongqiang Sun","doi":"10.1109/APDC.1997.574046","DOIUrl":"https://doi.org/10.1109/APDC.1997.574046","url":null,"abstract":"Snapshot algorithms are fundamental for many distributed applications and must often be executed repeatedly. We present three snapshot algorithms. The first one is based on the assumption of global time, it computes channel states using several schemes. Taking consistent cut for global time instant, we show that the algorithm is applicable for existing snapshot algorithms. The second one is a real token passing based algorithm for non-FIFO asynchronous distributed systems. Its message complexity of control messages is O(n). The last algorithm is the repeated version of the second one. Using this algorithm, processes can get consistent global states at their convenience concurrently.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"20 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114265018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A lifetime-sensitive scheduling method
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574054
Xinda Lu, Y. Hu, Jing Chen
This paper presents a lifetime-sensitive scheduling method. By shortening the lifetimes of variables in the scheduling phase, it can lighten register pressure in the register allocation phase, reduce spill code, and produce more efficient object code. Preliminary experimental results show that the method is effective.
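As a hedged illustration of the effect (not the paper's scheduling algorithm): reordering independent operations so that each temporary is consumed right after it is defined shortens live ranges, which is exactly what lowers register pressure.

```c
/* Two equivalent orderings of the same computation.  In f_long, t1 and t2
   are live at the same time, so two registers are tied up by temporaries;
   in f_short, each temporary dies before the next is defined, so one
   register suffices.  A lifetime-sensitive scheduler prefers the latter. */
int f_long(int a, int b, int c, int d, int e, int f) {
    int t1 = a * b;
    int t2 = c * d;          /* t1 is still live here */
    int r1 = t1 + e;
    int r2 = t2 + f;
    return r1 ^ r2;
}

int f_short(int a, int b, int c, int d, int e, int f) {
    int t1 = a * b;
    int r1 = t1 + e;         /* t1 dies immediately   */
    int t2 = c * d;
    int r2 = t2 + f;         /* t2 dies immediately   */
    return r1 ^ r2;
}
```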
{"title":"A lifetime-sensitive scheduling method","authors":"Xinda Lu, Y. Hu, Jing Chen","doi":"10.1109/APDC.1997.574054","DOIUrl":"https://doi.org/10.1109/APDC.1997.574054","url":null,"abstract":"This paper presents a lifetime-sensitive scheduling method. By shortening lifetimes of variables in scheduling phase, it can lighten register pressure in register allocation phase, lessen spill codes and result in more efficient object codes. The preliminary experimental results show that this method is an effective scheduling method.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"174 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114586903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automatic generation of parallel compiler-partial evaluation of parallel lambda language
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574060
Yongqiang Sun, Kai Lin, Yijia Chen
We describe in this paper a partial evaluator for a parallel programming language. The parallel language we present is a combination of the lambda calculus and a message-passing communication mechanism. By improving some techniques originally used for partial evaluation of sequential languages and introducing some new methods, we successfully solve the problems caused by internal semantic differences between the lambda calculus and message passing in our partial evaluator for the parallel language.
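The paper's evaluator handles the lambda calculus plus message passing; as a generic, heavily simplified illustration of the underlying idea only, the toy below folds the static parts of an expression and leaves a residual program over the unknown input (the expression language and all names are invented for this sketch).

```c
/* Toy partial evaluator (not the paper's): constants, one dynamic
   variable x, addition and multiplication.  Specialization folds every
   subexpression that does not depend on x. */
#include <stdio.h>
#include <stdlib.h>

typedef enum { CONST, VAR, ADD, MUL } Kind;
typedef struct Expr { Kind kind; int value; struct Expr *l, *r; } Expr;

static Expr *mk(Kind k, int v, Expr *l, Expr *r) {
    Expr *e = malloc(sizeof *e);
    e->kind = k; e->value = v; e->l = l; e->r = r;
    return e;
}

/* Return a residual expression with all static subcomputations done. */
static Expr *peval(Expr *e) {
    if (e->kind == CONST || e->kind == VAR) return e;
    Expr *l = peval(e->l), *r = peval(e->r);
    if (l->kind == CONST && r->kind == CONST)
        return mk(CONST, e->kind == ADD ? l->value + r->value
                                        : l->value * r->value, NULL, NULL);
    return mk(e->kind, 0, l, r);
}

int main(void) {
    /* (2 + 3) * x specializes to the residual program 5 * x. */
    Expr *e = mk(MUL, 0,
                 mk(ADD, 0, mk(CONST, 2, NULL, NULL), mk(CONST, 3, NULL, NULL)),
                 mk(VAR, 0, NULL, NULL));
    Expr *res = peval(e);
    printf("residual: %d * x\n", res->l->value);   /* prints: residual: 5 * x */
    return 0;
}
```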
{"title":"Automatic generation of parallel compiler-partial evaluation of parallel lambda language","authors":"Yongqiang Sun, Kai Lin, Yijia Chen","doi":"10.1109/APDC.1997.574060","DOIUrl":"https://doi.org/10.1109/APDC.1997.574060","url":null,"abstract":"We describe in this paper a partial evaluator for a parallel programming language. The parallel language we present is a combination of lambda calculus and message passing communication mechanism. By improving some techniques originally used for partial evaluation of sequential language and introducing some new methods, we successfully solve the problems caused by some internal semantic differences between lambda calculus and message passing in our partial evaluator for the parallel language.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114779895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Language support for synchronous parallel critical sections
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574018
C. Kessler, H. Seidl
We introduce a new parallel programming paradigm, namely synchronous parallel critical sections. Such parallel critical sections must be seen in the context of switching between synchronous and asynchronous modes of computation. Thread farming makes it possible to generate bunches of threads that solve independent subproblems asynchronously and in parallel. In contrast, synchronous parallel critical sections allow bunches of asynchronous parallel threads to be organized so that they execute a certain task jointly and synchronously. We show how the PRAM language Fork95 can be extended with a construct, join, supporting parallel critical sections. We explain its semantics and implementation, and discuss possible applications.
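Fork95's join construct is not reproduced here; purely as a rough analogy in POSIX threads (not Fork95, and without the group formation and nesting a real join provides), a bunch of otherwise asynchronous threads can gather at a barrier and execute a region together.

```c
/* Rough analogy only: NTHREADS asynchronous workers gather at a barrier,
   pass through a "synchronous" region together, then continue
   asynchronously. */
#include <pthread.h>
#include <stdio.h>

#define NTHREADS 4
static pthread_barrier_t gate;

static void *worker(void *arg) {
    long id = (long)arg;
    /* ... asynchronous phase: each thread works on its own subproblem ... */
    pthread_barrier_wait(&gate);          /* enter the section together */
    printf("thread %ld inside the synchronous section\n", id);
    pthread_barrier_wait(&gate);          /* leave the section together */
    /* ... asynchronous again ... */
    return NULL;
}

int main(void) {
    pthread_t t[NTHREADS];
    pthread_barrier_init(&gate, NULL, NTHREADS);
    for (long i = 0; i < NTHREADS; i++)
        pthread_create(&t[i], NULL, worker, (void *)i);
    for (int i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    pthread_barrier_destroy(&gate);
    return 0;
}
```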
{"title":"Language support for synchronous parallel critical sections","authors":"C. Kessler, H. Seidl","doi":"10.1109/APDC.1997.574018","DOIUrl":"https://doi.org/10.1109/APDC.1997.574018","url":null,"abstract":"We introduce a new parallel programming paradigm, namely synchronous parallel critical sections. Such parallel critical sections must be seen in the context of switching between synchronous and asynchronous modes of computation. Thread farming allows to generate bunches of threads to solve independent subproblems asynchronously and in parallel. Opposed to that, synchronous parallel critical sections allow to organize bunches of asynchronous parallel threads to execute certain task jointly and synchronously. We show how the PRAM language Fork95 can be extended by a construct join supporting parallel critical sections. We explain its semantics and implementation, and discuss possible applications.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"199 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116780133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A new architecture for branch-intensive loops
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574039
Zhizhong Tang, Chihong Zhang, Sifei Lv, Tao Yu
A new VLIW architecture, called GPMB (Global Pipelining of Multi-Branch), is discussed in this paper. The GPMB architecture can handle branch-intensive programs efficiently. With the concept of a next address function, GPMB regards branching as correctly calculating the next address. The next address function is implemented in both hardware and software in GPMB. A brief description of GPMB and a detailed example are included. A comparison with other architectures is also presented.
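The abstract does not define the next address function concretely; the fragment below is only a hypothetical illustration of the idea of treating multiway branching as an address computation (the table layout and names are invented, not GPMB's mechanism).

```c
/* Hypothetical illustration: pack the outcomes of two branch conditions
   into an index and look the successor address up in a table, so
   "branching" reduces to computing the next address in one step. */
#include <stdint.h>

uint32_t next_address(const uint32_t targets[4], int cond0, int cond1)
{
    unsigned idx = ((unsigned)(cond1 != 0) << 1) | (unsigned)(cond0 != 0);
    return targets[idx];     /* one table lookup instead of two branches */
}
```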
{"title":"A new architecture for branch-intensive loops","authors":"Zhizhong Tang, Chihong Zhang, Sifei Lvand, Tao Yu","doi":"10.1109/APDC.1997.574039","DOIUrl":"https://doi.org/10.1109/APDC.1997.574039","url":null,"abstract":"A new VLIW architecture, called GPMB (Global Pipelining of Multi-Branch), is discussed in this paper. The GPMB architecture can handle branch-intensive programs efficiently. With the concept of next address function, GPMB regards branching as correctly calculating the next address. The next address function is implemented by hardware and software in GPMB. A brief description of GPMB and a detailed example are included. A comparison with other architectures is also presented in this paper.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129051712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Coherent parallel programming in C∥
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574021
Zhiwei Xu, K. Hwang
This paper presents the coherent parallel programming concept using a new parallel language called C∥ (pronounced C Parallel). The C∥ language is based on the standard C language with a small set of extended constructs for parallelism and process interaction. At the core of C∥ is a structured construct called the coherent region, which facilitates the development of coherent programs, i.e., parallel programs that are structured, determinate, terminative, and compositional. We present the basic features of C∥ and show that the coherent region is a versatile construct.
{"title":"Coherent parallel programming in C/spl par/","authors":"Zhiwei Xu, K. Hwang","doi":"10.1109/APDC.1997.574021","DOIUrl":"https://doi.org/10.1109/APDC.1997.574021","url":null,"abstract":"This paper presents the coherent parallel programming concept using a new parallel language called C/spl par/ (pronounced C Parallel). The C/spl par/ language is based on the standard C language with a small set of extended constructs for parallelism and process interaction. At the core of C/spl par/ is a structured construct called coherent region, which facilitates the development of coherent programs, i.e., parallel programs that are structured, determinate, terminative, and compositional. We present the basic features of C/spl par/ and show that coherent region is a versatile construct.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117003701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An efficient parallel texture classification for image retrieval
Pub Date: 1997-03-19, DOI: 10.1109/APDC.1997.574009
J. You, H. Shen, H. Cohen
This paper proposes an efficient parallel approach to texture classification for image retrieval. The idea behind this method is to pre-extract texture features, in terms of texture energy measured with a 'tuned' mask, and store them in a multi-scale, multi-orientation texture class database organized as a two-dimensional linked list for querying. Each texture class sample in the database can thus be traced by its texture energy in a two-dimensional row-sorted matrix. Parallel searching strategies are introduced for quickly identifying the entries closest to the input texture throughout the given texture energy matrix. In contrast to traditional search methods, our approach incorporates different computation patterns for different numbers of available processors and provides robust, work-optimal parallel algorithms for row search and minimum finding based on the accelerated cascading technique and a dynamic processor allocation scheme. Applications of the proposed parallel search and multisearch algorithms to both single-image and multiple-image classification are discussed. The time complexity analysis shows that our proposal speeds up the classification task in a simple but dynamic manner. Examples are presented of the texture classification task applied to image retrieval of Brodatz textures, comprising various orientations and scales.
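The paper's PRAM algorithms (accelerated cascading, dynamic processor allocation) are not reproduced here; the sketch below only shows the underlying sequential query on a row-sorted energy matrix, where each row is binary-searched for the query energy and the globally closest entry is kept. Rows are independent, so in a parallel setting they could be searched by separate processors; all names are invented for this sketch.

```c
/* Sketch: find the entry of a row-sorted matrix closest in value to a
   query energy q.  Each row costs O(log cols) via binary search. */
#include <math.h>
#include <stddef.h>

void closest_entry(const double *m, size_t rows, size_t cols, double q,
                   size_t *best_r, size_t *best_c)
{
    double best = INFINITY;
    *best_r = 0; *best_c = 0;
    for (size_t r = 0; r < rows; r++) {
        const double *row = m + r * cols;
        size_t lo = 0, hi = cols;            /* first index with row[i] >= q */
        while (lo < hi) {
            size_t mid = lo + (hi - lo) / 2;
            if (row[mid] < q) lo = mid + 1; else hi = mid;
        }
        /* the closest entry in this row is at lo or lo - 1 */
        for (size_t c = (lo > 0 ? lo - 1 : 0); c < cols && c <= lo; c++) {
            double d = fabs(row[c] - q);
            if (d < best) { best = d; *best_r = r; *best_c = c; }
        }
    }
}
```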
{"title":"An efficient parallel texture classification for image retrieval","authors":"J. You, H. Shen, H. Cohen","doi":"10.1109/APDC.1997.574009","DOIUrl":"https://doi.org/10.1109/APDC.1997.574009","url":null,"abstract":"This paper proposes an efficient parallel approach to texture classification for image retrieval. The idea behind this method is to pre-extract texture features in terms of texture energy measurement associated with a 'tuned' mask and store them in a multi-scale and multi-orientation texture class database via a two-dimensional linked list for query. Thus each texture class sample in the database can be traced by its texture energy in a two-dimensional row sorted matrix. The parallel searching strategies are introduced for fast identifying the entities closest to the input texture throughout the given texture energy matrix. In contrast to the traditional search methods, our approach incorporates different computation patterns for different cases of available processor numbers and concerns with robust and work-optimal parallel algorithms for row-search and minimum-find based an the accelerated cascading technique and the dynamic processor allocation scheme. Applications of the proposed parallel search and multisearch algorithms to both single image classification and multiple image classification are discussed. The time complexity analysis shows that our proposal will speed up the classification tasks in a simple but dynamic manner. Examples are presented of the texture classification task applied to image retrieval of Brodatz textures, comprising various orientations and scales.","PeriodicalId":413925,"journal":{"name":"Proceedings. Advances in Parallel and Distributed Computing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127048985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}