首页 > 最新文献

2008 International Symposium on Parallel and Distributed Computing最新文献

英文 中文
An Architecture for an Adaptive Run-time Prediction System 自适应运行时预测系统的体系结构
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.34
C. Glasner, J. Volkert
This article describes a system for run-time prediction of applications in heterogeneous environments. To exploit the power of computational grids, scheduling systems need profound information about the job to be executed. The run-time of a job is - beside others - not only dependent of its kind and complexity but also of the adequacy and load of the remote host where it will be executed. Accounting and billing are additional aspects that have to be considered when creating a schedule. Currently predictions are achieved by using descriptive models of the applications or by applying statistical methods to former jobs mostly neglecting the behaviour of users. Motivated by this, we propose a method that is not only based on the characteristics of a job but also takes the behaviour of single users and groups of similar users respectively into account. The basic idea of our approach is to cluster users, hosts and jobs and apply multiple methods in order to detect similarities and create forecasts. This is achieved by tagging jobs with attributes and by deriving predictions for similar attributed jobs whereas the recent behaviour of a user determines which predictions are finally taken.
本文描述了一个用于异构环境中应用程序运行时预测的系统。为了利用计算网格的力量,调度系统需要关于要执行的任务的深刻信息。作业的运行时间不仅取决于作业的类型和复杂性,还取决于执行作业的远程主机的适当性和负载。会计和账单是创建计划时必须考虑的额外方面。目前的预测是通过使用应用程序的描述性模型或通过将统计方法应用于以前的工作来实现的,这些工作大多忽略了用户的行为。受此启发,我们提出了一种方法,该方法不仅基于工作的特征,而且还分别考虑了单个用户和相似用户群体的行为。我们的方法的基本思想是将用户、主机和作业聚类,并应用多种方法来检测相似性并创建预测。这是通过标记带有属性的作业,并通过对类似的属性作业进行预测来实现的,而用户最近的行为决定了最终采用哪些预测。
{"title":"An Architecture for an Adaptive Run-time Prediction System","authors":"C. Glasner, J. Volkert","doi":"10.1109/ISPDC.2008.34","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.34","url":null,"abstract":"This article describes a system for run-time prediction of applications in heterogeneous environments. To exploit the power of computational grids, scheduling systems need profound information about the job to be executed. The run-time of a job is - beside others - not only dependent of its kind and complexity but also of the adequacy and load of the remote host where it will be executed. Accounting and billing are additional aspects that have to be considered when creating a schedule. Currently predictions are achieved by using descriptive models of the applications or by applying statistical methods to former jobs mostly neglecting the behaviour of users. Motivated by this, we propose a method that is not only based on the characteristics of a job but also takes the behaviour of single users and groups of similar users respectively into account. The basic idea of our approach is to cluster users, hosts and jobs and apply multiple methods in order to detect similarities and create forecasts. This is achieved by tagging jobs with attributes and by deriving predictions for similar attributed jobs whereas the recent behaviour of a user determines which predictions are finally taken.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"154 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122688206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Large Scale Parallel Hybrid GMRES Method for the Linear System on Grid System 网格系统上线性系统的大规模并行混合GMRES方法
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.21
Ye Zhang, Guy Bergére, S. Petiton
The method GMRES is used widely to solve the large sparse linear systems. In this paper, we will present an effective parallel hybrid asynchronous method, which combines the typical parallel method GMRES with the Least Square method that needs some eigenvalues obtained from a parallel Arnoldi process. And we will apply it on a Grid Computing platform Grid5000. Grid computing in general is a special type of parallel computing. It intends to deliver high-performance computing over distributed platforms for computation and data-intensive applications by making use of a very large amount of resources. From the numeric results, we will present this hybrid method has some advantage for some matrices compared to the general method GMRES.
GMRES方法被广泛应用于求解大型稀疏线性系统。本文提出了一种有效的并行混合异步方法,该方法将典型的并行方法GMRES与需要从并行Arnoldi过程中获得特征值的最小二乘法相结合。并将其应用于网格计算平台Grid5000。网格计算一般是一种特殊类型的并行计算。它打算通过利用大量资源,在分布式平台上为计算和数据密集型应用程序提供高性能计算。从数值结果来看,与通用方法GMRES相比,这种混合方法对某些矩阵具有一定的优势。
{"title":"Large Scale Parallel Hybrid GMRES Method for the Linear System on Grid System","authors":"Ye Zhang, Guy Bergére, S. Petiton","doi":"10.1109/ISPDC.2008.21","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.21","url":null,"abstract":"The method GMRES is used widely to solve the large sparse linear systems. In this paper, we will present an effective parallel hybrid asynchronous method, which combines the typical parallel method GMRES with the Least Square method that needs some eigenvalues obtained from a parallel Arnoldi process. And we will apply it on a Grid Computing platform Grid5000. Grid computing in general is a special type of parallel computing. It intends to deliver high-performance computing over distributed platforms for computation and data-intensive applications by making use of a very large amount of resources. From the numeric results, we will present this hybrid method has some advantage for some matrices compared to the general method GMRES.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115432382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Realistic Simulation of Large Scale Distributed Systems using Monitoring 基于监控的大规模分布式系统仿真
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.37
C. Dobre, Corina Stratan, V. Cristea
In this paper we present an extension to MONARC, a generic simulator for large scale distributed systems, which allows realistic evaluation of various actual distributed system technologies based on real-world monitored data supplied by MonALISA. The field of modelling and simulation was long-time seen as a viable solution to develop new algorithms and technologies and to enable the development of large-scale distributed systems, where analytical validations are prohibited by the nature of the encountered problems. This paper presents a novel approach to combining two distributed systems domains, monitoring and simulation, highlighting a realistic solution to the problem of accurately evaluating various distributed systems technologies using simulation. We also present a simulation study which demonstrates the interoperability between the simulation framework and the monitoring instrument, demonstrating important properties of the US LHCNet research network in the context of the LHC experiments in CERN.
在本文中,我们提出了对MONARC的扩展,MONARC是一个用于大规模分布式系统的通用模拟器,它允许基于MonALISA提供的真实监测数据对各种实际分布式系统技术进行现实评估。长期以来,建模和模拟领域被视为开发新算法和技术的可行解决方案,并使大规模分布式系统的发展成为可能,在这些系统中,由于遇到的问题的性质,分析验证是禁止的。本文提出了一种结合监控和仿真两个分布式系统领域的新方法,强调了使用仿真来准确评估各种分布式系统技术的现实解决方案。此外,我们还进行了模拟研究,演示了模拟框架与监测仪器之间的互操作性,展示了美国LHCNet研究网络在CERN大型强子对撞机实验背景下的重要特性。
{"title":"Realistic Simulation of Large Scale Distributed Systems using Monitoring","authors":"C. Dobre, Corina Stratan, V. Cristea","doi":"10.1109/ISPDC.2008.37","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.37","url":null,"abstract":"In this paper we present an extension to MONARC, a generic simulator for large scale distributed systems, which allows realistic evaluation of various actual distributed system technologies based on real-world monitored data supplied by MonALISA. The field of modelling and simulation was long-time seen as a viable solution to develop new algorithms and technologies and to enable the development of large-scale distributed systems, where analytical validations are prohibited by the nature of the encountered problems. This paper presents a novel approach to combining two distributed systems domains, monitoring and simulation, highlighting a realistic solution to the problem of accurately evaluating various distributed systems technologies using simulation. We also present a simulation study which demonstrates the interoperability between the simulation framework and the monitoring instrument, demonstrating important properties of the US LHCNet research network in the context of the LHC experiments in CERN.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126786843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A MILP Approach to Schedule Parallel Independent Tasks 并行独立任务调度的MILP方法
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.59
A. Goldman, Yanik Ngoko
We propose a new mixed integer linear programming approach to solve the classical problem of scheduling independent parallel tasks without preemption. We propose a formulation where the goal is to minimize the makespan.Then we show the flexibility of this approach by extending the result to the contiguous case. We validate this approach with some experiments on the execution times and comparing the optimal results with the solutions provided by list algorithms.
针对独立并行任务的无抢占调度问题,提出了一种新的混合整数线性规划方法。我们提出了一个以最小化完工时间为目标的公式。然后我们通过将结果扩展到连续情况来展示这种方法的灵活性。我们通过一些关于执行时间的实验验证了这种方法,并将最优结果与列表算法提供的解决方案进行了比较。
{"title":"A MILP Approach to Schedule Parallel Independent Tasks","authors":"A. Goldman, Yanik Ngoko","doi":"10.1109/ISPDC.2008.59","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.59","url":null,"abstract":"We propose a new mixed integer linear programming approach to solve the classical problem of scheduling independent parallel tasks without preemption. We propose a formulation where the goal is to minimize the makespan.Then we show the flexibility of this approach by extending the result to the contiguous case. We validate this approach with some experiments on the execution times and comparing the optimal results with the solutions provided by list algorithms.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124202586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Towards a Grid Oriented Architecture for Symbolic Computing 面向网格的符号计算体系结构
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.46
Georgiana Macariu, A. Cârstea, M. Frîncu, D. Petcu
One of the benefits of the current service-oriented architectures is the easy static composition of geographically scattered services into complex applications. Dynamic composition is more difficult to achieve with the current technologies. We propose a practical solution for dynamic composition of the facilities provided by computer algebra systems, based on Grid services and the WS-BPEL standard Web service orchestration language. Moreover, we introduce a methodology for migrating from Web services to Grid services using databases for persistence.
当前面向服务的体系结构的好处之一是可以轻松地将地理上分散的服务静态组合到复杂的应用程序中。动态合成是目前技术难以实现的。我们基于网格服务和WS-BPEL标准Web服务编排语言,为计算机代数系统提供的功能的动态组合提出了一个实用的解决方案。此外,我们还介绍了一种使用数据库实现持久性从Web服务迁移到网格服务的方法。
{"title":"Towards a Grid Oriented Architecture for Symbolic Computing","authors":"Georgiana Macariu, A. Cârstea, M. Frîncu, D. Petcu","doi":"10.1109/ISPDC.2008.46","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.46","url":null,"abstract":"One of the benefits of the current service-oriented architectures is the easy static composition of geographically scattered services into complex applications. Dynamic composition is more difficult to achieve with the current technologies. We propose a practical solution for dynamic composition of the facilities provided by computer algebra systems, based on Grid services and the WS-BPEL standard Web service orchestration language. Moreover, we introduce a methodology for migrating from Web services to Grid services using databases for persistence.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124953999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Computing Equilibria in Bimatrix Games by Parallel Support Enumeration 用并行支持枚举法计算双矩阵对策的均衡
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.38
J. Widger, Daniel Grosu
We consider the problem of computing all Nash equilibria in bimatrix games (i.e., nonzero-sum two-player noncooperative games). Computing all Nash equilibria for large bimatrix games using single-processor computers is not feasible due to the exponential time required by the existing algorithms. We consider the use of parallel computing which allows us to solve larger games. We design and implement a parallel algorithm for computing all Nash Equilibria in bimatrix games. The algorithm computes all Nash equilibria by searching all possible supports of mixed strategies. We perform experiments on a cluster computing system to evaluate the performance of the parallel algorithm.
我们考虑双矩阵对策(即非零和二人非合作对策)中所有纳什均衡的计算问题。由于现有算法需要指数级的时间,使用单处理器计算机计算大型双矩阵博弈的所有纳什均衡是不可行的。我们考虑使用并行计算来解决更大的游戏。我们设计并实现了一种计算双矩阵博弈中所有纳什均衡的并行算法。该算法通过搜索混合策略的所有可能支持来计算所有纳什均衡。我们在一个集群计算系统上进行了实验,以评估并行算法的性能。
{"title":"Computing Equilibria in Bimatrix Games by Parallel Support Enumeration","authors":"J. Widger, Daniel Grosu","doi":"10.1109/ISPDC.2008.38","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.38","url":null,"abstract":"We consider the problem of computing all Nash equilibria in bimatrix games (i.e., nonzero-sum two-player noncooperative games). Computing all Nash equilibria for large bimatrix games using single-processor computers is not feasible due to the exponential time required by the existing algorithms. We consider the use of parallel computing which allows us to solve larger games. We design and implement a parallel algorithm for computing all Nash Equilibria in bimatrix games. The algorithm computes all Nash equilibria by searching all possible supports of mixed strategies. We perform experiments on a cluster computing system to evaluate the performance of the parallel algorithm.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125010094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A Monitoring Architecture for High-Speed Networks in Large Scale Distributed Collaborations 面向大规模分布式协作的高速网络监控体系结构
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.33
Alexandru Costan, C. Dobre, V. Cristea, R. Voicu
In this paper we present the architecture of a distributed framework that allows real-time accurate monitoring of large scale high-speed networks. An important component of a large-scale distributed collaboration is the complex network infrastructure on which it relies. For monitoring and controlling the networking resources an adequate instrument should offer the possibility to collect and store the relevant monitoring information, presenting significant perspectives and synthetic views of how the large distributed system performs. We therefore developed within the MonALISA monitoring framework a system able to collect, store, process and interpret the large volume of status information related to the US LHCNet research network. The system uses flexible mechanisms for data representation, providing access optimization and decision support, being able to present real-time and long-time history information through global or specific views and to take further automated control actions based on them.
在本文中,我们提出了一个分布式框架的架构,可以实现对大规模高速网络的实时精确监控。大规模分布式协作的一个重要组成部分是它所依赖的复杂网络基础设施。为了监测和控制网络资源,适当的工具应该提供收集和存储相关监测信息的可能性,提供有关大型分布式系统如何执行的重要视角和综合视图。因此,我们在MonALISA监测框架内开发了一个能够收集、存储、处理和解释与美国LHCNet研究网络相关的大量状态信息的系统。该系统采用灵活的数据表示机制,提供访问优化和决策支持,能够通过全局或特定视图呈现实时和长期的历史信息,并基于这些信息采取进一步的自动化控制行动。
{"title":"A Monitoring Architecture for High-Speed Networks in Large Scale Distributed Collaborations","authors":"Alexandru Costan, C. Dobre, V. Cristea, R. Voicu","doi":"10.1109/ISPDC.2008.33","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.33","url":null,"abstract":"In this paper we present the architecture of a distributed framework that allows real-time accurate monitoring of large scale high-speed networks. An important component of a large-scale distributed collaboration is the complex network infrastructure on which it relies. For monitoring and controlling the networking resources an adequate instrument should offer the possibility to collect and store the relevant monitoring information, presenting significant perspectives and synthetic views of how the large distributed system performs. We therefore developed within the MonALISA monitoring framework a system able to collect, store, process and interpret the large volume of status information related to the US LHCNet research network. The system uses flexible mechanisms for data representation, providing access optimization and decision support, being able to present real-time and long-time history information through global or specific views and to take further automated control actions based on them.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117118422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
FPGA Generators of Combinatorial Configurations in a Linear Array Model 线性阵列模型中组合配置的FPGA生成器
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.48
Zbigniew Kokosinski, Pawel Halesiak
In this paper we describe hardware implementations of generators of combinatorial objects. For implementation several systolic algorithms were selected that generate combinatorial configurations in a linear array model. The algorithms generate such objects as combinations, combinations with repetitions, t-ary trees, partitions, and variations with repetitions. The generators were implemented in VHLD with Xilinx Foundation ISE software and tested on Digilent development boards with Xilinx FPGAs. Implementation data obtained for various input parameters and FPGA devices are given.
本文描述了组合对象生成器的硬件实现。为了实现,选择了几种收缩算法,在线性阵列模型中生成组合配置。算法生成诸如组合、重复组合、t- tree、分区和重复变化等对象。发生器在VHLD中使用Xilinx Foundation ISE软件实现,并在带有Xilinx fpga的Digilent开发板上进行了测试。给出了各种输入参数和FPGA器件的实现数据。
{"title":"FPGA Generators of Combinatorial Configurations in a Linear Array Model","authors":"Zbigniew Kokosinski, Pawel Halesiak","doi":"10.1109/ISPDC.2008.48","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.48","url":null,"abstract":"In this paper we describe hardware implementations of generators of combinatorial objects. For implementation several systolic algorithms were selected that generate combinatorial configurations in a linear array model. The algorithms generate such objects as combinations, combinations with repetitions, t-ary trees, partitions, and variations with repetitions. The generators were implemented in VHLD with Xilinx Foundation ISE software and tested on Digilent development boards with Xilinx FPGAs. Implementation data obtained for various input parameters and FPGA devices are given.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125673302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Data Partitioning and Placement Schemes for Matrix Multiplications on a PIM Architecture PIM体系结构中矩阵乘法的数据划分和放置方案
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.7
J. Cha, S. Gupta
Data intensive applications require massive data transfers between storage and processing units. VLSI scaling has increased the sizes of dynamic memories as well as speeds and capabilities of processing units to a point where, for many such applications, storage and computational processing capabilities are no longer the main limiting factors. Despite this fact, most current architectures fail to meet the performance requirements for such data intensive applications. In this paper, we describe a PIM architecture that harnesses the benefits of VLSI scaling to accelerate matrix operations that constitute the core of many data-intensive applications. We then present data partitioning and placement schemes that are efficient in terms of the computational complexities and internode communication cost. Such approaches are evaluated and analyzed under various computing environments. We also discuss on how to apply such partitioning and placement schemes to each matrix when chains of matrix operations are given as a task.
数据密集型应用需要在存储和处理单元之间传输大量数据。VLSI的扩展已经增加了动态存储器的尺寸以及处理单元的速度和能力,对于许多此类应用来说,存储和计算处理能力不再是主要的限制因素。尽管如此,大多数当前架构都无法满足此类数据密集型应用程序的性能需求。在本文中,我们描述了一种PIM架构,该架构利用VLSI扩展的优势来加速构成许多数据密集型应用核心的矩阵运算。然后,我们提出了在计算复杂性和节点间通信成本方面有效的数据分区和放置方案。这些方法在各种计算环境下进行了评估和分析。我们还讨论了当矩阵运算链作为一个任务给定时,如何将这种划分和放置方案应用于每个矩阵。
{"title":"Data Partitioning and Placement Schemes for Matrix Multiplications on a PIM Architecture","authors":"J. Cha, S. Gupta","doi":"10.1109/ISPDC.2008.7","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.7","url":null,"abstract":"Data intensive applications require massive data transfers between storage and processing units. VLSI scaling has increased the sizes of dynamic memories as well as speeds and capabilities of processing units to a point where, for many such applications, storage and computational processing capabilities are no longer the main limiting factors. Despite this fact, most current architectures fail to meet the performance requirements for such data intensive applications. In this paper, we describe a PIM architecture that harnesses the benefits of VLSI scaling to accelerate matrix operations that constitute the core of many data-intensive applications. We then present data partitioning and placement schemes that are efficient in terms of the computational complexities and internode communication cost. Such approaches are evaluated and analyzed under various computing environments. We also discuss on how to apply such partitioning and placement schemes to each matrix when chains of matrix operations are given as a task.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131724282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Improved Parallel Algorithm for Computing Approximate Inverses by Reducing Synchronizations 一种减少同步的近似逆计算改进并行算法
Pub Date : 2008-07-01 DOI: 10.1109/ISPDC.2008.18
G. Gravvanis, K. M. Giannoutakis
A new parallel algorithm, based on the concept of anti diagonal wave pattern, for computing approximate inverses, is introduced for symmetric multiprocessor systems. The parallel normalized approximate inverses are used in conjunction with parallel normalized preconditioned conjugate gradient-type schemes, for the efficient solution of sparse finite element linear systems. The parallel implementation issues of the new algorithm are discussed and the parallel performance is presented, using OpenMP.
针对对称多处理器系统,提出了一种基于反对角波形概念的近似求逆并行算法。将并行归一化近似逆与并行归一化预条件共轭梯度型格式相结合,用于稀疏有限元线性系统的有效求解。讨论了新算法的并行实现问题,并给出了在OpenMP环境下的并行性能。
{"title":"An Improved Parallel Algorithm for Computing Approximate Inverses by Reducing Synchronizations","authors":"G. Gravvanis, K. M. Giannoutakis","doi":"10.1109/ISPDC.2008.18","DOIUrl":"https://doi.org/10.1109/ISPDC.2008.18","url":null,"abstract":"A new parallel algorithm, based on the concept of anti diagonal wave pattern, for computing approximate inverses, is introduced for symmetric multiprocessor systems. The parallel normalized approximate inverses are used in conjunction with parallel normalized preconditioned conjugate gradient-type schemes, for the efficient solution of sparse finite element linear systems. The parallel implementation issues of the new algorithm are discussed and the parallel performance is presented, using OpenMP.","PeriodicalId":125975,"journal":{"name":"2008 International Symposium on Parallel and Distributed Computing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120904617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2008 International Symposium on Parallel and Distributed Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1