Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing最新文献

英文中文

Multi-node broadcasting in hypercubes and star graphs 超立方体和星图中的多节点广播

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651479

Y. Tseng

In a hypercube or a star graph, given an unknown number of nodes located at unknown positions each intending to broadcast a message, we propose an efficient routing algorithm to solve this problem using asymptotically optimal or near-optimal transmission time.

在超立方体或星图中，给定位于未知位置的未知数量的节点，每个节点都打算广播消息，我们提出了一种有效的路由算法，使用渐近最优或接近最优的传输时间来解决这个问题。

引用次数: 10

Multiple dependent queries execution using critical path scheduling in parallel databases 在并行数据库中使用关键路径调度执行多个依赖查询

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651534

K.H. Liu, C. Leung, Y. Jiang

Multiple processors are employed to improve the performance of database systems and the parallelism can be exploited at three levels in query processing: intra-operation, inter-operation, and inter-query parallelism. Intra-operation and inter-operation parallelism are also called intra-query parallelism which has been studied extensively. In contrast, inter-query parallelism has received little attention particularly for multiple dependent queries. We develop a decompression algorithm, CPS, for coping with multiple dependent queries which are represented by a directed graph, and the algorithm makes use of the activity analysis of critical path analysis, and the resource scheduling and levelling of project management. A simulation study has been conducted and the results show that the proposed algorithm outperforms other existing methods and is able to provide a global optimal solution when the number of processors available is sufficient.

使用多处理器来提高数据库系统的性能，并且在查询处理中可以在三个层次上利用并行性:操作内并行性、操作间并行性和查询间并行性。操作内并行和操作间并行又称为查询内并行，已被广泛研究。相比之下，查询间并行性很少受到关注，特别是对于多个依赖查询。我们开发了一种解压算法CPS，用于处理由有向图表示的多个依赖查询，该算法利用了关键路径分析的活动分析和项目管理的资源调度和水平。仿真研究结果表明，该算法在处理器数量足够的情况下，能够提供全局最优解，优于现有的算法。

引用次数: 0

Efficient run-time scheduling for parallelizing partially parallel loops 用于并行化部分并行循环的高效运行时调度

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651508

Tsung-Chuan Huang, Po-Hsueh Hsu, Tze-Nan Sheng

We propose an efficient run-time technique to find an optimal parallel execution schedule for partially parallel loops in which synchronizations between iterations are needed to ensure correct program semantics. For efficiency, we combine conventional mark phase and scheduler phase into a single parallel scheduler. The scheduler divides the loop iterations into several chunks then executes the iterations in one chunk in parallel. Our scheme not only runs fast but also achieves an optimal schedule. In addition, an atomic bit-vector operation is introduced to avoid global synchronization overhead and ensure the larger wavefront number is kept when the wavefront number of an iteration will be concurrently updated during scheduling.

我们提出了一种有效的运行时技术来为部分并行循环找到最佳并行执行计划，其中迭代之间需要同步以确保正确的程序语义。为了提高效率，我们将传统的标记阶段和调度阶段合并为一个并行调度程序。调度器将循环迭代划分为几个块，然后并行地在一个块中执行迭代。该方案不仅运行速度快，而且实现了最优调度。此外，还引入了原子位向量操作，避免了全局同步开销，并确保在调度过程中迭代的波前数同时更新时保持较大的波前数。

引用次数: 5

Mapping of finite-element grids onto parallel computers using neural networks 利用神经网络将有限元网格映射到并行计算机上

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651533

R. Tan, V. Lakshmi Narasimhan

In this paper, LSOM (Load-balancing Self-Organizing Map), a neural network based on Kohonen's self-organizing map is proposed for the problem of mapping finite-element method (FEM) grids to distributed-memory parallel computers with mesh interconnection networks. The rough global ordering produced by LSOM is then combined with the local refinement Kernighan-Lin algorithm (called LSOM-KL) to obtain the solution. LSOM-KL obtained a load imbalance of less than 0.1% and a low number of hops, comparable to results obtained with commonly used recursive bisection methods.

本文提出了一种基于Kohonen自组织映射的神经网络LSOM (Load-balancing Self-Organizing Map)，用于将有限元网格映射到具有网格互连网络的分布式存储并行计算机上。然后将LSOM生成的粗糙全局排序与局部细化Kernighan-Lin算法(称为LSOM- kl)结合得到解。LSOM-KL获得了小于0.1%的负载不平衡和较低的跳数，与常用的递归对分方法得到的结果相当。

引用次数: 0

Optimal task clustering using Hopfield net 基于Hopfield网络的最优任务聚类

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651513

Weiping Zhu, Tyng-Yeu Liang, C. Shieh

To achieve high performance in a distributed system, the tasks of a program have to be carefully clustered and assigned to processors. In this paper we present a static method to cluster tasks and allocate them to processors. The proposed method relies on the Hopfield neural network to achieve optimum or near-optimum task clustering in terms of load balancing and communication cost. Experimental studies show that this method indeed can find optimal or near-optimal mapping for those programs used in our tests.

为了在分布式系统中实现高性能，程序的任务必须小心地集群并分配给处理器。在本文中，我们提出了一种静态方法来集群任务并将它们分配给处理器。该方法利用Hopfield神经网络在负载均衡和通信开销方面实现最优或接近最优的任务聚类。实验研究表明，该方法确实可以为我们测试中使用的程序找到最优或接近最优的映射。

引用次数: 0

Real-time obstacle detection on a massively parallel linear architecture 基于大规模并行线性结构的实时障碍物检测

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651520

M. Bertozzi, A. Broggi, A. Fascioli

This paper presents a real-time solution to the problem of obstacle detection in automotive applications using image processing techniques. To speed-up the processing a massively parallel engine has been used and the algorithms tuned to match the specific features of the computing architecture. The system acquires pairs of stereo images, checks for correspondences, and remaps the resulting image in a new domain to ease the following processing steps. The whole processing is performed on PAPRICA-3, a massively parallel system whose processing elements are disposed on a linear array; the proposed system can reach video rate performance.

本文提出了一种利用图像处理技术实时解决汽车应用中障碍物检测问题的方法。为了加快处理速度，使用了大规模并行引擎，并调整了算法以匹配计算体系结构的特定特征。该系统获取一对对立体图像，检查对应关系，并将结果图像重新映射到一个新的域中，以简化以下处理步骤。整个处理过程在PAPRICA-3上进行，PAPRICA-3是一个大规模并行系统，其处理单元配置在线性阵列上;所提出的系统可以达到视频速率的性能。

引用次数: 4

The Pilgrim: a new consistency protocol for distributed shared memory 朝圣者:分布式共享内存的新一致性协议

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651495

H. Guyennet, J. Lapayre, M. Tréhel

We propose a new consistency protocol named Pilgrim for distributed shared memory (DSM) where different shared objects are replicated at each site. This protocol provides both reliable consistency and guaranteed performance. This protocol is discussed and proved with a finite state automaton, and we demonstrate its qualities.

本文提出了一种新的分布式共享内存(DSM)一致性协议Pilgrim，该协议在每个站点复制不同的共享对象。该协议提供了可靠的一致性和有保证的性能。用有限状态自动机对该协议进行了讨论和证明，并证明了它的性质。

引用次数: 6

Adaptive learning schemes for the modified probabilistic neural network 改进概率神经网络的自适应学习方案

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651526

A. Zaknich, C.J.S. de Silva

The modified probabilistic neural network was initially derived from Specht's (1990) probabilistic neural network classifier and developed for nonlinear time series analysis. It can be described as a vector quantised reduced form of Specht's general regression neural network. It is typically trained with a known set of representative data pairs. This is quite satisfactory for stationary data statistics, but for the nonstationary case it is necessary to be able to adapt the network during operation. This paper describes adaptive learning schemes for the modified probabilistic neural network for both stationary and nonstationary data statistics. A nonlinear control problem is used to illustrate and compare the network's learning ability with that of the general regression and radial basis function neural networks.

修正概率神经网络最初来源于Specht(1990)的概率神经网络分类器，是为非线性时间序列分析而发展起来的。它可以被描述为Specht广义回归神经网络的矢量量化简化形式。它通常使用一组已知的代表性数据对进行训练。对于平稳的数据统计，这是相当令人满意的，但对于非平稳的情况，必须能够适应网络的运行。本文描述了修正概率神经网络在平稳和非平稳数据统计中的自适应学习方案。用一个非线性控制问题来说明和比较该网络与一般回归和径向基函数神经网络的学习能力。

引用次数: 4

An object-oriented data cache architecture for programmable parallel digital signal processors 面向对象的可编程并行数字信号处理器数据缓存体系结构

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651483

J. Kneip

The paper describes concept and implementation of a data cache architecture with concurrent conflict free access to shared data for DSPs with parallel, synchronized processing units. It utilizes techniques known from object-oriented software design to achieve efficient and programmer friendly on-chip storage of data. The cache internally uses virtual 1D or 2D address spaces directly assigned to data structures instead of a conventional, linear address space. Data within the cache are distributed to a number of memory banks. Virtual local addresses are used for data location and hit/miss detection to minimize cost and memory latency. The object-oriented cache is fully transparent to programmer and compiler, reduces the amount of address calculations to be performed, exploits the 2D spatial locality typical for image processing algorithms and can be integrated into a standard RISC processor pipeline.

本文描述了一种数据缓存架构的概念和实现，该架构具有并行、同步处理单元的dsp对共享数据的并发无冲突访问。它利用面向对象软件设计的技术来实现高效和程序员友好的芯片上数据存储。缓存内部使用直接分配给数据结构的虚拟1D或2D地址空间，而不是传统的线性地址空间。缓存中的数据分布到多个内存库中。虚拟本地地址用于数据定位和命中/未命中检测，以最小化成本和内存延迟。面向对象的缓存对程序员和编译器完全透明，减少了要执行的地址计算量，利用了图像处理算法典型的二维空间局域性，并且可以集成到标准的RISC处理器管道中。

引用次数: 0

A new method for transparent fault tolerance of distributed programs on a network of workstations using alternative schedules 采用可选调度的工作站网络上分布式程序的透明容错新方法

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

Pub Date : 1997-12-10 DOI: 10.1109/ICAPP.1997.651515

D. Das, P. Dasgupta, P. Das

In this paper, we devise a new method for transparent fault tolerance of distributed programs running on a cluster of networked workstations. We use the concept of alternative schedules for this purpose. Such schedules are generated from static task graphs at compile-time. At run-time a distributed program can use these alternatives to switch from one schedule to another if some machine/s become faulty. We have devised fast but efficient mechanisms for switching among schedules at run-time. This enables fault recovery from any number of simultaneous machine faults any number of times. The correctness of the resultant algorithm is ensured through prevention of direct data sharing among local tasks on a machine. Such a transparent fault tolerant strategy is easily implementable on a network of workstations running PVM-like softwares.

在本文中，我们设计了一种新的方法对运行在网络工作站集群上的分布式程序进行透明容错。为此，我们使用了可选时间表的概念。这样的调度是在编译时从静态任务图生成的。在运行时，如果某些机器出现故障，分布式程序可以使用这些替代方案从一个调度切换到另一个调度。我们已经设计了快速而有效的机制来在运行时在计划之间切换。这使得从任意数量的同时发生的机器故障中恢复故障成为可能。通过防止在机器上的本地任务之间直接共享数据来确保生成算法的正确性。这种透明的容错策略很容易在运行类似pvm的软件的工作站网络上实现。

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 3rd International Conference on Algorithms and Architectures for Parallel Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀