Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing最新文献

英文中文

A congestion control mechanism for wormhole networks 虫洞网络拥塞控制机制

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-01 DOI: 10.1109/EMPDP.2001.904965

Elvira Baydal, P. López, J. Duato

Deadlock avoidance and recovery techniques suffer from severe performance degradation when the network is close to or beyond saturation. Many parallel applications produce bursty traffic that may saturate the network during some intervals, and increase execution time. Therefore, the use of techniques that prevent network saturation are of crucial importance in both deadlock avoidance and recovery strategies. Several mechanisms have been proposed in the literature to reach this goal. However some of them do not work well under all network load conditions. Others introduce some penalty when the network is not fully saturated, or complicate network and/or node implementation. In this paper we propose a new mechanism to avoid network saturation that overcomes these drawbacks. In this mechanism, each node estimates network traffic locally by using the percentage of free virtual output channels that can be used for forwarding a message towards its destination. When this number surpasses a threshold value, network congestion is assumed to exist and message injection is forbidden.

当网络接近或超过饱和状态时，死锁避免和恢复技术的性能会严重下降。许多并行应用会产生突发流量，可能会在某些时间间隔内使网络饱和，并增加执行时间。因此，在避免死锁和恢复策略中，使用防止网络饱和的技术至关重要。为实现这一目标，文献中提出了多种机制。然而，其中一些机制并不能在所有网络负载条件下都很好地发挥作用。还有一些机制会在网络未完全饱和时引入一些惩罚，或使网络和/或节点的实现变得复杂。在本文中，我们提出了一种避免网络饱和的新机制，克服了这些缺点。在这一机制中，每个节点通过使用可用于向目的地转发信息的空闲虚拟输出通道的百分比来估算本地网络流量。当这一数字超过阈值时，网络拥塞就会被假定存在，信息注入就会被禁止。

引用次数: 21

ULSI architectures for artificial neural networks 用于人工神经网络的ULSI架构

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 1900-01-01 DOI: 10.1109/EMPDP.2001.905072

U. Ruckert

Three different hardware implementations of artificial neural networks are presented. The chips are model-specific integrated circuits for neural associative memories, self-organizing feature maps and local cluster neural networks. Some of the key implementational issues are considered and especially the question of resource-efficiency is discussed.

介绍了人工神经网络的三种不同的硬件实现。这些芯片是用于神经联想记忆、自组织特征图和局部聚类神经网络的特定模型集成电路。审议了一些关键的执行问题，特别是讨论了资源效率问题。

引用次数: 0

Hardware prefetching in bus-based multiprocessors: pattern characterization and cost-effective hardware 基于总线的多处理器中的硬件预取:模式表征和低成本硬件

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 1900-01-01 DOI: 10.1109/EMPDP.2001.905061

M. Garzarán, J. L. Briz, P. Ibáñez, V. Viñals

Data prefetching has been widely studied as a technique to hide memory access latency in multiprocessors. Most recent research on hardware prefetching focuses either on uniprocessors, or on distributed shared memory (DSM) and other non bus-based organizations. However, in the context of bus-based SMPs, prefetching poses a number of problems related to the lack of scalability and limited bus bandwidth of these modest-sized machines. This paper considers how the number of processors and the memory access patterns in the program influence the relative performance of sequential and non-sequential prefetching mechanisms in a bus-based SMP. We compare the performance of four inexpensive hardware prefetching techniques, varying the number of processors. After a breakdown of the results based on a performance model, we propose a cost-effective hardware prefetching solution for implementing on such modest-sized multiprocessors.

数据预取作为一种隐藏多处理器存储器访问延迟的技术已经得到了广泛的研究。最近对硬件预取的研究主要集中在单处理器、分布式共享内存(DSM)和其他非基于总线的组织。然而，在基于总线的smp上下文中，预取带来了许多与这些中等大小的机器缺乏可伸缩性和有限的总线带宽相关的问题。本文研究了在基于总线的SMP中，程序中的处理器数量和存储器访问模式如何影响顺序和非顺序预取机制的相对性能。我们比较了四种便宜的硬件预取技术的性能，改变了处理器的数量。在对基于性能模型的结果进行细分之后，我们提出了一种经济有效的硬件预取解决方案，用于在这种中等大小的多处理器上实现。

引用次数: 8

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀