
2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW): Latest Articles

AuctionNet: Market oriented task scheduling in heterogeneous distributed environments
Han Zhao, Xiaolin Li
We propose a suite of market-oriented task scheduling algorithms to build an AuctionNet for heterogeneous distributed environments. In such environments, for example peer-to-peer systems and desktop grids/clouds, computing nodes are autonomous and owned by different organizations. Given the heterogeneity and dynamism of systems, applications, and local policies, efficient and fair task scheduling is a challenging issue. To cope with this complexity in a distributed and noncooperative environment, we propose to use market-oriented incentive mechanisms to regulate task scheduling in a distributed manner. Further, to accommodate multiple objectives and criteria, we adopt a combined approach that leverages the advantages of both hypergraph theory and incentive mechanisms. We first formulate a general framework of market-oriented task scheduling in distributed systems. We then present two algorithms for task-bundle scheduling. Preliminary results demonstrate the satisfactory performance of our proposed algorithms. The remaining work to complete the PhD dissertation is then presented. The proposed research carries significant intellectual merit and potential broader impact in the following aspects. (1) We propose the notion of a task-bundle for the first time in the literature. Product bundling has long been a common marketing strategy in daily life. In the emerging commercial clouds and desktop clouds, the task-bundle could be a useful concept for computing and storage markets. (2) We propose efficient distributed mechanisms that are well suited to such distributed systems. A novel algorithm combining hypergraph and incentive mechanisms achieves multi-objective optimization. (3) We conduct a rigorous analytical study and prove that our algorithms ensure efficiency and fairness while maximizing social welfare. (4) Overall, this proposal lays a solid foundation and sheds light on future research and real-world applications in the broad area of task scheduling in distributed systems.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470798 (published 2010-04-19)
Citations: 11
TiNy threads on BlueGene/P: Exploring many-core parallelisms beyond The traditional OS
Handong Ye, R. Pavel, A. Landwehr, G. Gao
Operating systems (OSes) have long been considered a cornerstone of the modern computer system, and the conventional operating system model targets computers designed around the sequential execution model. However, with the rapid progress of multi-core/many-core technologies, we argue that OSes must be adapted to the underlying hardware platform to fully exploit parallelism. To illustrate this, our paper reports a study on how to perform such an adaptation for the IBM BlueGene/P multi-core system.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470690 (published 2010-04-19)
Citations: 2
Coping with uncertainty in scheduling problems
Louis-Claude Canon
Large-scale distributed systems such as Grids constitute computational environments that are essential to academic and industry needs. However, they exhibit uncertain behavior because of their continually increasing scale. We propose to revisit traditional scheduling problems in these environments by considering uncertainty in the models.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470809 (published 2010-04-19)
Citations: 0
Analysis of random time-based switching for file sharing in peer-to-peer networks
Keqin Li
The expected file download time of the randomized time-based switching algorithm for peer selection and file downloading in a peer-to-peer (P2P) network is still unknown. The main contribution of this paper is to analyze the expected file download time of the time-based switching algorithm for file sharing in P2P networks when the service capacity of a source peer is totally correlated over time, namely, when the service capacity of a source peer takes the same fixed value in every time slot. A recurrence relation is developed to characterize the expected file download time of the time-based switching algorithm. It is proved that for two or more heterogeneous source peers and sufficiently large file size, the expected file download time of the time-based switching algorithm is less than, and can be arbitrarily less than, the expected download time of the chunk-based switching algorithm and the expected download time of the permanent connection algorithm. It is shown that the expected file download time of the time-based switching algorithm lies between the file size divided by the harmonic mean of service capacities and the file size divided by the arithmetic mean of service capacities. Numerical examples and data are presented to demonstrate our analytical results.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470934 (published 2010-04-19)
Citations: 5
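The range stated in the abstract of "Analysis of random time-based switching for file sharing in peer-to-peer networks" can be illustrated numerically. The sketch below is not taken from the paper: the function name `download_time_bounds`, the file size, and the capacity values are assumptions chosen for illustration. It relies only on the fact that the arithmetic mean is never smaller than the harmonic mean, so the file size divided by the arithmetic mean is the smaller endpoint of the range.

```python
from statistics import fmean, harmonic_mean


def download_time_bounds(file_size, capacities):
    """Return (lower, upper) endpoints of the range quoted in the abstract.

    file_size and capacities are hypothetical inputs; the units only need
    to be consistent (for example megabytes and megabytes per second).
    """
    # Arithmetic mean >= harmonic mean, so dividing by it gives the lower endpoint.
    lower = file_size / fmean(capacities)
    # Dividing by the harmonic mean gives the upper endpoint.
    upper = file_size / harmonic_mean(capacities)
    return lower, upper


# Two heterogeneous source peers with assumed capacities 10 and 40.
lower, upper = download_time_bounds(1000.0, [10.0, 40.0])
print(f"expected download time lies between {lower} and {upper}")
```

With these assumed values the arithmetic mean is 25 and the harmonic mean is 16, so the range is 40 to 62.5 time units. The gap widens as the peers become more heterogeneous and closes when all capacities are equal.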
Adapting to NAT timeout values in P2P overlay networks
Richard Price, P. Tiňo
Nodes within existing P2P networks typically exchange periodic keep-alive messages in order to maintain network connections between neighbours. Keep-alive messages serve a dual purpose: they are used to detect node failures and to prevent idle connections from being expired by NAT devices. However, despite the wide use of keep-alive messages, the interval between them is typically fixed below the timeout value of most NAT devices based upon crude rules of thumb. Furthermore, although many studies have addressed traversing NAT devices and others seek to improve failure detection in P2P overlay networks, the limitations of NAT devices have received little research attention. This paper explores algorithms which allow nodes to adapt to the timeout values of individual NAT devices and investigates the resulting trade-offs.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470785 (published 2010-04-19)
Citations: 10
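The abstract of "Adapting to NAT timeout values in P2P overlay networks" does not describe the algorithms it explores, so the following is only one plausible way a node might adapt its keep-alive interval, not the authors' method. It is a minimal sketch assuming a hypothetical probe `connection_survives(idle_seconds)` that reports whether a connection is still usable after staying idle for the given time. The function names, the starting bounds, and the simulated 120-second NAT timeout are all assumptions.

```python
def adapt_keepalive_interval(connection_survives, low=15.0, high=600.0, tolerance=5.0):
    """Binary-search the longest idle interval (seconds) the NAT mapping survives.

    connection_survives is a hypothetical probe. low is assumed to be safe
    and high is assumed to exceed the NAT timeout.
    """
    while high - low > tolerance:
        mid = (low + high) / 2
        if connection_survives(mid):
            low = mid   # mapping survived: a longer interval may still be safe
        else:
            high = mid  # mapping expired: the timeout is below mid
    return low          # largest interval observed to be safe


def simulated_nat(timeout_seconds):
    """Stand-in for a real NAT device: mappings expire once idle time reaches the timeout."""
    return lambda idle_seconds: idle_seconds < timeout_seconds


interval = adapt_keepalive_interval(simulated_nat(120.0))
print(f"keep-alive interval: {interval:.1f} s")
```

The trade-off mentioned in the abstract shows up directly in this sketch: every failed probe costs a dropped connection, while a smaller `tolerance` buys a longer interval (fewer keep-alive messages) at the price of more probes.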
High performance Peer-to-Peer distributed computing with application to obstacle problem
Nguyen The Tung, D. E. Baz, P. Spitéri, Guillaume Jourjon, M. Chau
This paper deals with high performance Peer-to-Peer computing applications. We concentrate on the solution of large scale numerical simulation problems via distributed iterative methods. We present the current version of an environment that allows direct communication between peers. This environment is based on a self-adaptive communication protocol. The protocol configures itself automatically and dynamically according to application requirements, such as the scheme of computation, and elements of context, such as topology, by choosing the most appropriate communication mode between peers. A first series of computational experiments is presented and analyzed for the obstacle problem.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470930 (published 2010-04-19)
Citations: 25
Towards dynamic reconfigurable load-balancing for hybrid desktop platforms
A. Binotto, C. Pereira, D. Fellner
High-performance platforms are required by applications that perform massive calculations. Currently, desktop accelerators (such as GPUs) form a powerful heterogeneous platform in conjunction with multi-core CPUs. To improve application performance on these hybrid platforms, load balancing plays an important role in distributing the workload. However, this scheduling problem is challenging since the cost of a task at a Processing Unit (PU) is non-deterministic and depends on parameters that cannot be known a priori, such as input data, online creation of tasks, and scenario changes. Therefore, self-adaptive computing is a promising paradigm, as it can provide flexibility to explore computational resources and improve performance in different execution scenarios. This paper presents ongoing PhD research focused on a dynamic and reconfigurable scheduling strategy based on timing profiling for desktop accelerators. Preliminary results analyze the performance of solvers for SLEs (Systems of Linear Equations) over a hybrid CPU and multi-GPU platform applied to a CFD (Computational Fluid Dynamics) application. The choice of the best solver, as well as its scheduling, must be made dynamically, considering online parameters, in order to achieve better application performance.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470804 (published 2010-04-19)
Citations: 24
Analyzing the trade-off between multiple memory controllers and memory channels on multi-core processor performance
J. Sancho, M. Lang, D. Kerbyson
The increasing core count of current and future processors poses critical challenges for the memory subsystem, which must efficiently handle concurrent memory requests. The current trend is to increase the number of memory channels available to the processor's memory controller. In this paper we investigate the effectiveness of this approach on the performance of parallel scientific applications. Specifically, we explore the trade-off between employing multiple memory channels per memory controller and the use of multiple memory controllers. Experiments conducted on two current state-of-the-art multicore processors, a 6-core AMD Istanbul and a 4-core Intel Nehalem-EP, for a wide range of production applications show that there is a diminishing return when increasing the number of memory channels per memory controller. In addition, we show that this performance degradation can be efficiently addressed by increasing the ratio of memory controllers to channels while keeping the number of memory channels constant. Significant performance improvements, up to 28%, can be achieved with this scheme when using two memory controllers, each with one channel, compared with one controller with two memory channels.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470812 (published 2010-04-19)
Citations: 8
Stack protection unit as a step towards securing MPSoCs
S. Lukovic, P. Pezzino, Leandro Fiorin
Reconfigurable technologies are becoming popular as an instrument not only for verification and prototyping but also for commercial implementation of Multi-Processor System-on-Chip (MPSoC) architectures. These systems, in particular Network-on-Chip (NoC) based ones, have emerged as a design strategy to cope with the increased requirements and complexity of modern applications. However, the increasing heterogeneity, coupled with the possibility of reconfiguration, makes security one of the major concerns in MPSoC design. In this work, we show a solution for FPGA-based designs against one of the most widespread types of attack: code injection. Our response to this challenge takes the form of a Stack Protection Unit (SPU) embedded into processing cores. The MicroBlaze soft-core processor serves as a case study for verification of the proposed solution in FPGA technology.
DOI: https://doi.org/10.1109/IPDPSW.2010.5470728 (published 2010-04-19)
Citations: 11
An interoperable & optimal data grid solution for heterogeneous and SOA based Grid- GARUDA
Payal Saluja, Prahlada Rao B.B., V. Shashidhar, A. Paventhan, Neetu Sharma
Storage plays an important role in meeting the requirements of data-intensive applications in a Grid computing environment. Current scientific applications perform complex computational analysis and consume/produce hundreds of terabytes of data. The authors of this paper have surveyed available data grid solutions, viz., Storage Resource Broker (SRB), Grid File System (GFS), Storage Resource Manager (SRM), iRODS and WS-DAI, and presented their operational experiences in the Service Oriented Architecture (SOA) based GARUDA grid. SOA introduces further challenges for the storage system: availability, security, scalability and performance. Based on the survey, the authors proposed the GARUDA-Storage Resource Manager (GSRM), which adheres to SRM specifications. GSRM is a disk-based SRM implementation based on the DPM (Disk Pool Manager) architecture. It addresses various aspects such as virtualization, security, latency, performance, and data availability. We discussed how the GSRM architecture can leverage CDAC's Parallel File System (C-PFS).
DOI: https://doi.org/10.1109/IPDPSW.2010.5470914 (published 2010-04-19)
Citations: 4
Journal
2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)