
2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS): Latest Papers

A distributed spectrum sharing algorithm in cognitive radio networks
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097848
Wei Sun, Jiadi Yu, Tong Liu
In this paper, we study a social welfare maximization problem for spectrum sharing in cognitive radio networks. To make full use of spectrum resources, the spectrum owned by a licensed primary user (PU) can be leased to secondary users (SUs) for data transmission. We first formulate the social welfare of a cognitive radio network, accounting for the cost the primary user incurs in sharing spectrum and the utility the secondary users gain from transmitting data. Social welfare maximization is a convex optimization problem that standard methods can solve in a centralized manner. However, each secondary user's utility function contains private information, which makes centralized methods inapplicable. To overcome this challenge, we propose an iterative distributed algorithm based on a pricing-based decomposition framework. We prove theoretically that the proposed algorithm converges to the optimal solution. Numerical simulations show that the algorithm achieves optimal social welfare and converges quickly.
Citations: 3
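The abstract above does not spell out the algorithm, so the following is only a minimal sketch of how a pricing-based decomposition can work in general. It assumes logarithmic SU utilities `w * log(1 + x)`, a quadratic PU cost `0.5 * c * y**2`, and a plain subgradient price update; these forms, the step size, and all function names are illustrative assumptions, not the authors' formulation. The point is that each user responds only to the announced price, so no utility function has to be revealed.

```python
import math


def su_best_response(weight, price):
    """Secondary user: maximise weight*log(1+x) - price*x over x >= 0 (assumed utility)."""
    return max(0.0, weight / price - 1.0)


def pu_best_response(cost_coeff, price):
    """Primary user: maximise price*y - 0.5*cost_coeff*y**2 over y >= 0 (assumed cost)."""
    return max(0.0, price / cost_coeff)


def distributed_pricing(weights, cost_coeff, step=0.1, iters=200, price=1.0):
    """Raise the price when demand exceeds supply and lower it otherwise."""
    for _ in range(iters):
        demand = sum(su_best_response(w, price) for w in weights)
        supply = pu_best_response(cost_coeff, price)
        # Subgradient step on the price; keep it strictly positive.
        price = max(1e-9, price + step * (demand - supply))
    demands = [su_best_response(w, price) for w in weights]
    return price, demands


if __name__ == "__main__":
    final_price, final_demands = distributed_pricing([1.0, 2.0, 3.0], cost_coeff=1.0)
    print(final_price, final_demands)
```

At the fixed point, total demand equals the spectrum the PU is willing to supply, which is the optimality condition of the assumed welfare problem. For this toy instance the price settles where the lowest-weight SU is priced out entirely.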
TSUBAME-KFC: A modern liquid submersion cooling prototype towards exascale becoming the greenest supercomputer in the world
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097829
Toshio Endo, Akira Nukada, S. Matsuoka
Modern supercomputer performance is principally limited by power. TSUBAME-KFC is a state-of-the-art prototype for our next-generation TSUBAME3.0 supercomputer and a step towards future exascale systems. In collaboration with Green Revolution Cooling and others, TSUBAME-KFC submerges compute nodes, configured with extremely high processor/component density, in non-toxic, low-viscosity oil with a high flash point of 260 degrees Celsius; the oil is cooled using an ambient/evaporative cooling tower. This minimizes cooling power while keeping all semiconductor components at low temperature to reduce leakage current. Numerous off-line and on-line power and temperature sensors are installed throughout the system and constantly monitored, so that the effect of voltage/frequency control can be observed immediately. As a result, TSUBAME-KFC ranked No. 1 in the world on the Green500 in Nov. 2013 and Jun. 2014, leading the nearest competitors by over 20%.
Citations: 15
LEB-MAC: Load and energy balancing MAC protocol for energy harvesting powered wireless sensor networks
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097857
Huey-Ing Liu, Wen He, Winston K.G. Seah
Ambient energy from solar, vibration, heat, and wind provides alternative energy sources to power sensors and extend the lifetime of wireless sensor networks, which have traditionally been powered by batteries. This paper aims to enhance the performance of energy-harvesting-powered wireless sensor networks in three aspects: relaying, scheduling, and medium access control. To better adapt to the characteristics of energy harvesting, an asynchronous receiver-initiated duty-cycling approach is preferred in such networks. This reduces the duty cycle of senders and regulates the active and sleep intervals according to the energy levels of sensors. When nodes run out of power and need time to recharge, network holes or voids develop, forcing data packets to detour via other paths. The proposed relaying strategy aims to prevent holes by balancing the load across the network according to nodes' energy harvesting characteristics. This balancing follows naturally from asynchronous duty cycling, because transmissions are scheduled according to the receiver's availability. The simulation results show that our scheme outperforms other protocols in terms of sender duty cycle, end-to-end delay, and delivery ratio, especially in challenging conditions where other protocols fail.
Citations: 24
Characteristics and perspectives of wearable smart devices and industrial ecosystem
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097907
Sang Cheol Han, Youn-Hee Han, Huijun Kim
The wearable smart device industry is booming and has recently been highlighted by the market as a candidate for the post-smartphone industry. By their nature, wearable smart devices must treat visual style, as in fashion, as being just as important as sophisticated functionality. Their industrial ecosystem also has unique characteristics that differ from those of the smartphone industry. In this paper, we argue that there are a huge number of vertical markets for wearable smart device products and services. We also suggest that small and medium-sized companies should pursue such vertical markets more aggressively with application products and services, components, and solutions.
Citations: 7
A Decreasing k-means algorithm for the Disk Covering Tour Problem in wireless sensor networks
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097906
Jia-Jiun Yang, Jehn-Ruey Jiang, Yung-Liang Lai
This paper studies the Disk Covering Tour Problem (DCTP), with the aim of reducing the energy a mobile robot consumes in moving to serve sensor nodes in a wireless sensor network (WSN). Given the locations of a set of sensor nodes and the starting location of the mobile robot, the DCTP is to find a minimum-cost tour through a sequence of tour stops such that every sensor node lies within a specified distance of some tour stop. We propose an algorithm, called Decreasing k-means (Dk-means), to find an approximate solution to the DCTP. The idea is to select a minimum number of disks (circles of a fixed radius) that cover all sensor nodes, and then to find a minimum-cost tour passing through all disk centers. Simulation results show that the proposed algorithm outperforms the related CSP (Covering Salesman Problem) algorithm and the QiF algorithm.
Citations: 2
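The two-stage idea in the abstract above (cover the sensors with few fixed-radius disks, then tour the disk centers) can be sketched as follows. The details are assumptions made for illustration: k is lowered one step at a time until plain k-means no longer fits every cluster inside the radius, the centers are seeded by farthest-point selection so the run is deterministic, and the tour uses a nearest-neighbour heuristic. The paper's actual Dk-means procedure and tour solver may differ.

```python
import math


def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])


def kmeans(points, k, iters=50):
    """Lloyd's k-means with deterministic farthest-point seeding."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: dist(p, centers[i]))
            groups[nearest].append(p)
        # An empty group keeps its previous center.
        updated = [
            (sum(p[0] for p in g) / len(g), sum(p[1] for p in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if updated == centers:
            break
        centers = updated
    return centers


def covers(points, centers, radius):
    """True if every point lies within `radius` of some center."""
    return all(min(dist(p, c) for c in centers) <= radius for p in points)


def decreasing_kmeans(points, radius):
    """Lower k until the k-means centers stop covering all points."""
    best = list(points)  # k = n is always feasible: one disk per sensor
    for k in range(len(points) - 1, 0, -1):
        centers = kmeans(points, k)
        if not covers(points, centers, radius):
            break
        best = centers
    return best


def nearest_neighbour_tour(start, stops):
    """Greedy closed tour: start -> nearest unvisited stop -> ... -> start."""
    tour, remaining, here = [start], list(stops), start
    while remaining:
        here = min(remaining, key=lambda s: dist(here, s))
        remaining.remove(here)
        tour.append(here)
    tour.append(start)
    return tour
```

Stopping at the first infeasible k keeps the sketch short, but it is a heuristic: a smaller k could in principle still be feasible under a different clustering.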
MapReduce based parallel suffix tree construction for human genome
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097867
Umesh Chandra Satish, Praveenkumar Kondikoppa, Seung-Jong Park, Manish Patil, R. Shah
Genome indexing is the basis for many bioinformatics applications. Read mapping (sequence alignment) is one such application, where the goal is to align millions of short reads against a reference genome. Several read-mapping tools are available, relying on different indexing techniques to expedite alignment. However, many of these contemporary alignment programs are sequential and memory intensive, and cannot easily be scaled to larger genomes. The suffix tree is one of the most widely used data structures for indexing strings (genomes). Building a scalable suffix-tree-based tool is particularly challenging because of the difficulties involved in constructing the suffix tree in parallel. Several suffix tree construction techniques have been proposed to date, focusing on the space-time tradeoff. Most of these existing works address construction on a uniprocessor and cannot easily be extended to utilize modern multi-processor systems. In this paper, we investigate and propose a MapReduce-based parallel construction of the suffix tree. We demonstrate the performance of the algorithm on a commodity cluster using up to 32 nodes, each with 8 GB of primary memory.
Citations: 11
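The abstract above does not describe how the work is split, so the sketch below shows one common way to parallelise suffix indexing in a MapReduce style, simulated in a single process: the map step keys every suffix by its first few characters, and each reduce step independently builds the subtree for one key. The fixed-length prefix partitioning, the function names, and the use of an uncompressed trie (rather than a path-compressed suffix tree) are all simplifying assumptions, not the authors' design.

```python
from collections import defaultdict


def map_phase(text, prefix_len):
    """Map: emit (prefix, start) for every suffix of the text."""
    for start in range(len(text)):
        yield text[start:start + prefix_len], start


def shuffle(pairs):
    """Group suffix start positions by their prefix key."""
    buckets = defaultdict(list)
    for key, start in pairs:
        buckets[key].append(start)
    return buckets


def reduce_phase(text, starts):
    """Reduce: build an uncompressed suffix trie for one bucket of suffixes."""
    root = {}
    for start in starts:
        node = root
        for ch in text[start:]:
            node = node.setdefault(ch, {})
        node[None] = start  # leaf marker: where this suffix begins
    return root


def build(text, prefix_len=2):
    """Run map, shuffle, and reduce; each reduce call could run on its own worker."""
    buckets = shuffle(map_phase(text, prefix_len))
    return {key: reduce_phase(text, starts) for key, starts in buckets.items()}


def leaves(node):
    """Collect all suffix start positions stored below a node."""
    found, stack = [], [node]
    while stack:
        current = stack.pop()
        for key, child in current.items():
            if key is None:
                found.append(child)
            else:
                stack.append(child)
    return found


def find(subtrees, pattern):
    """Return sorted start positions of every occurrence of the pattern."""
    hits = []
    for key, root in subtrees.items():
        shared = min(len(key), len(pattern))
        if key[:shared] != pattern[:shared]:
            continue  # this bucket cannot contain a match
        node = root
        for ch in pattern:
            node = node.get(ch)
            if node is None:
                break
        else:
            hits.extend(leaves(node))
    return sorted(hits)
```

For example, `find(build("GATTACA$"), "A")` looks only in the buckets whose key starts with `A`. The trie here takes quadratic space, so it is suitable only for illustrating the partitioning, not for genome-scale input.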
Lightweight online power monitoring and control for mobile applications
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097845
Bo Wang, Xinghui Zhao, David Chiu
Limited battery power has long been a challenge for mobile applications. As a result, work on power monitoring and management has attracted great interest. In this paper, we propose a model that estimates the power consumption of mobile applications at run time, based on application-specific per-action power profiling. In addition, we have developed online optimization techniques that help maximize the user experience while conserving power. Our power model is lightweight and flexible: any mobile application can use it as a plugin, and it supports user-defined optimization mechanisms. We evaluate the approach in a case study of a mobile application for field studies. The experimental results show that our model accurately captures the application's power consumption and can be used to optimize power consumption according to users' needs.
Citations: 3
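As a rough illustration of per-action power profiling with a pluggable, user-defined policy, as described in the abstract above, the sketch below keeps a running energy estimate by adding a profiled cost each time the application reports an action. The class name, the action names, the joule values, and the callback interface are all hypothetical; the paper's model and plugin API are not given in the abstract.

```python
class PowerMonitor:
    """Running energy estimate from a per-action profile (all values hypothetical)."""

    def __init__(self, profile_j, budget_j, on_over_budget=None):
        self.profile_j = dict(profile_j)      # action name -> joules per action
        self.budget_j = budget_j              # energy budget for the session
        self.on_over_budget = on_over_budget  # user-defined optimisation hook
        self.consumed_j = 0.0

    def record(self, action, count=1):
        """Add the profiled cost of an action and fire the hook once over budget."""
        self.consumed_j += self.profile_j[action] * count
        if self.on_over_budget is not None and self.consumed_j > self.budget_j:
            self.on_over_budget(self)


if __name__ == "__main__":
    def throttle(monitor):
        # A real policy might lower the GPS sampling rate or defer uploads.
        print("over budget:", monitor.consumed_j, "J")

    monitor = PowerMonitor({"gps_fix": 2.0, "photo": 1.5, "upload": 4.0}, 10.0, throttle)
    for action in ["gps_fix", "gps_fix", "photo", "upload", "photo"]:
        monitor.record(action)
```

Keeping the estimate as a table lookup plus one addition per action is what makes this kind of monitor cheap enough to run inside the application itself.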
SLA-based energy aware scheduling of precedence-constrained applications on DVFS-enabled clusters
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097826
Xuedi Chen, Kenli Li, Chubo Liu, Kuan-Ching Li
Energy-aware scheduling has been a critical issue in high-performance clusters owing to their high operating cost, environmental impact, and low reliability. Dynamic voltage/frequency scaling (DVFS) is an existing technique for reducing the energy consumption of applications. In this paper, we develop an energy-aware scheduling algorithm called EASLA for precedence-constrained applications in the context of a Service Level Agreement (SLA) on DVFS-enabled cluster systems. Because of the dependencies among tasks and the makespan extension, some slack may remain underused. The main idea of the EASLA algorithm is to distribute each slack to a set of tasks and scale their frequencies down to minimize energy consumption. Specifically, it first finds the maximum set of independent tasks for each task, and then iteratively allocates each slack to the maximum independent set that yields the largest total energy reduction. We test randomly generated graphs and two real-world applications in our experiments. The experimental results show that our scheduling algorithm saves up to 22.68% and 12.01% of energy consumption compared with the GreedyDVS and EvenlyDVS algorithms, respectively.
Citations: 5
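To see why handing slack to a task saves energy under DVFS, the sketch below works through the arithmetic for a single task. It is not the EASLA algorithm: it assumes the textbook model in which supply voltage scales with frequency, so dynamic power grows as f^3 and the energy of a task with a fixed cycle count grows as f^2. The constant `k`, the frequency bounds, and the function names are illustrative assumptions.

```python
def scaled_frequency(cycles, f_max, f_min, slack_s):
    """Lowest frequency that finishes the task within its original time plus slack."""
    stretched_time_s = cycles / f_max + slack_s
    return max(f_min, cycles / stretched_time_s)


def dynamic_energy(cycles, freq, k=1.0):
    """Energy = power * time = (k * freq**3) * (cycles / freq) under the assumed model."""
    return k * cycles * freq ** 2


if __name__ == "__main__":
    cycles, f_max, f_min = 2e9, 2e9, 0.8e9  # a 1-second task at full speed
    f_low = scaled_frequency(cycles, f_max, f_min, slack_s=1.0)
    ratio = dynamic_energy(cycles, f_low) / dynamic_energy(cycles, f_max)
    print(f_low, ratio)
```

In this instance, one second of slack lets the task run at half the maximum frequency. Because the saving is quadratic in frequency, where the slack goes matters, which is the allocation question a scheduler of this kind has to answer.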
Effective multi-GPU communication using multiple CUDA streams and threads
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097919
Mohammed Sourouri, T. Gillberg, S. Baden, Xing Cai
In the context of multiple GPUs that share the same PCIe bus, we propose a new communication scheme that leads to a more effective overlap of communication and computation. Multiple CUDA streams and OpenMP threads are adopted so that data can simultaneously be sent and received. A representative 3D stencil example is used to demonstrate the effectiveness of our scheme. We compare the performance of our new scheme with an MPI-based state-of-the-art scheme. Results show that our approach outperforms the state-of-the-art scheme, being up to 1.85× faster. However, our performance results also indicate that the current underlying PCIe bus architecture needs improvements to handle the future scenario of many GPUs per node.
Citations: 26
POPS: A popularity-aware live streaming service
Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097873
Karine Pires, Sébastien Monnet, Pierre Sens
Live streaming has become very popular, and many systems, such as justin.tv, have emerged. They aim to collect users' live streams and serve them to viewers using broadcasting servers. However, the huge variation in the total number of viewers and the great heterogeneity in stream popularity generally imply over-provisioning, leading to significant resource waste. In this paper, we show that there is a trade-off between the number of servers involved in broadcasting the streams and the bandwidth usage among the servers. We also stress the importance of predicting stream popularity in order to place streams efficiently on the servers. We propose POPS, a live streaming service that uses popularity predictions to map live streams onto the servers.
Citations: 4