
2015 International Conference on High Performance Computing & Simulation (HPCS): Latest Papers

Improved scheduling algorithm in VCL Cloud computing environment on CloudSim
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237047
Omar Khedher, M. Jarraya
Cloud computing has empowered academic communities by reducing the cost and administrative burden of IT infrastructure. As one of several cloud computing solutions, Virtual Cloud Laboratory (VCL) has improved the experience of many universities by providing hands-on labs for each course and class. VCL has multiple deployment requirements. Although its architecture is simple, operating a large VCL deployment can be challenging. In addition, examining the performance of resource allocation policies and different scheduling algorithms could help to quantify the workload running in a real, expanded VCL environment. This study aims to identify the limitations of the default scheduler in VCL and to propose an enhanced scheduler. By simulating a VCL cloud computing environment, it becomes possible to evaluate different scheduling algorithms and their impact on performance.
Citations: 2
Immortalizing many-core systems early experiences of the horizon 2020 action IMMORTAL
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237092
J. Raik
A dedicated fault monitoring network supplemented with fault management capabilities is required for controlling the dynamic reconfiguration and resource isolation within many-core systems as well as for keeping track of the system health status and reporting it to the software layer. It is imperative to develop such cross-layer fault management capabilities to be deployed in safety-critical domains but also in applications where access for maintenance would be time-consuming and costly, e.g. base stations in the telecommunications area. European universities (Tallinn University of Technology, Graz University of Technology and the University of Twente) have teamed up with IBM, the German Space Centre DLR and two SMEs (Recore Systems and Testonica Lab) in a new EU funded research project IMMORTAL (Integrated Modelling, Fault Management, Verification and Reliable Design Environment for Cyber-Physical Systems) in a quest to improve reliability and extend the lifetime of emerging many-core based cyber-physical systems.
Citations: 0
Requirement checklist for infrastructure monitoring of swift
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237099
Pragya Jain, A. Goel, S. Gupta
Operating and managing complex cloud storage infrastructure requires effective and efficient monitoring. Data must be collected from the major components and analyzed to improve system performance and resource utilization. Swift is an open source object storage cloud system from OpenStack that is also widely used by many key commercial cloud providers. In this paper, we present a detailed list of the attributes required for monitoring Swift storage infrastructure. Some of the processes associated with object storage need to be monitored, as do the resources the storage software uses. The infrastructure associated with storage, and its usage, must be monitored constantly to provide better services. Accordingly, we categorize infrastructure monitoring into components and list the parameters associated with each. Our monitoring list helps system administrators select monitoring software for their systems. During development of new monitoring software, the proposed list helps developers select functionality rather than specify requirements from scratch.
Citations: 1
GPU accelerated ray launching for high-fidelity virtual test drives of VANET applications
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237048
M. Schiller, A. Knoll, M. Mocker, T. Eibert
Due to the complexity of Vehicular Ad Hoc Networks, future driving assistance systems need to be validated through virtual test drives in a simulated environment. An accurate modeling of the vehicle-to-vehicle communication channel is crucial to enable a precise evaluation of such network-aware applications. Since existing ray-based methods cause long computation times, a new parallel GPU-based ray-launching simulation method is presented. The algorithmic improvements allow a high utilization of the GPU computing power, which results in significantly faster simulations while achieving high accuracy. The validation of the simulation results against real-world measurements showed a high level of agreement.
Citations: 16
Parallel Branch-and-Bound using private IVM-based work stealing on Xeon Phi MIC coprocessor
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237067
N. Melab, Rudi Leroy, M. Mezmaz, D. Tuyttens
Many combinatorial optimization problems are modeled in practice as permutation-based ones. We have recently proposed a new data structure called IVM dedicated to those problems. IVM is memory efficient in terms of size and management time for solving large permutation problems using the Branch-and-Bound (B&B) algorithm. We believe that those memory properties make IVM well-suited for the Many Integrated Cores (MIC) architecture. This paper deals with the parallel design and implementation of the B&B algorithm on MIC architectures using private IVM-based work stealing. The proposed approach has been evaluated extensively on an Intel Xeon Phi 5110P using several instances of the Flow-Shop scheduling permutation problem. The reported results show that the IVM-based work stealing approach is about 10 times faster than the linked list traditionally used for parallel B&B.
Citations: 6
Power variation aware Configuration Adviser for scalable HPC schedulers
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237023
H. Shoukourian, T. Wilde, A. Auweter, A. Bode
Efficient scheduling is crucial for time- and cost-effective utilization of compute resources, especially for high-end systems. A variety of factors need to be considered in scheduling decisions. Power variation across the compute resources of homogeneous large-scale systems has not been considered so far. This paper discusses the impact of power variation on parallel application scheduling. It addresses the problem of finding the optimal resource configuration for a given application that will minimize the amount of consumed energy, under pre-defined constraints on application execution time and instantaneous average power consumption. This paper presents an efficient algorithm to do so, which also considers the existing power diversity among the compute nodes (which is also modified at different operating CPU frequencies) of a given homogeneous High Performance Computing system. Based on this algorithm, the paper presents a plug-in, referred to as Configuration Adviser, which operates on top of a given resource management and scheduling system and advises on the energy-wise optimal resource configuration for a given application, such that execution with that configuration adheres to the specified execution time and power consumption constraints. The main goal of this plug-in is to enhance the current resource management and scheduling tools for the support of power capping for future Exascale systems, where a data center might not be able to provide cooling or electrical power for system peak consumption but only for the expected power bands.
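The selection problem stated in this abstract (minimize energy subject to limits on execution time and average power) can be illustrated with a minimal sketch. It is not the paper's algorithm: it is a plain exhaustive search over a hand-written candidate list, and the `Config` type, the `advise` function, and all numbers are hypothetical placeholders for whatever predictions a real scheduler plug-in would supply.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Config:
    """Hypothetical candidate configuration with predicted behaviour."""
    nodes: int
    freq_ghz: float
    runtime_s: float    # predicted execution time
    avg_power_w: float  # predicted average power during the run

    @property
    def energy_j(self) -> float:
        # Energy is average power integrated over the run time.
        return self.runtime_s * self.avg_power_w


def advise(configs: Iterable[Config],
           max_runtime_s: float,
           max_power_w: float) -> Optional[Config]:
    """Return the feasible configuration with the lowest energy, or None."""
    feasible = [c for c in configs
                if c.runtime_s <= max_runtime_s and c.avg_power_w <= max_power_w]
    return min(feasible, key=lambda c: c.energy_j, default=None)


# Illustrative numbers only; they are not measurements from the paper.
candidates = [
    Config(nodes=64, freq_ghz=2.7, runtime_s=100.0, avg_power_w=20000.0),
    Config(nodes=64, freq_ghz=2.0, runtime_s=130.0, avg_power_w=14000.0),
    Config(nodes=128, freq_ghz=2.0, runtime_s=70.0, avg_power_w=28000.0),
    Config(nodes=32, freq_ghz=2.7, runtime_s=190.0, avg_power_w=10000.0),
]
best = advise(candidates, max_runtime_s=150.0, max_power_w=25000.0)
```

With these placeholder values the 128-node candidate exceeds the power cap and the 32-node candidate exceeds the time limit, so the search settles on the 64-node, 2.0 GHz entry. Accounting for node-to-node power variation, as the abstract describes, would mean predicting `avg_power_w` per concrete node set rather than per node count.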
Citations: 16
Efficient Asian option pricing with CUDA
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237103
Artur Yuzhanin, I. Gankevich, E. Stepanov, V. Korkhov
This paper considers Monte Carlo methods for Asian option pricing, among them pricing methods based on a path integral and on a partial differential equation. Simulation algorithms running sequentially on the CPU and algorithms running in parallel on the GPU using CUDA are analyzed and compared.
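For orientation, a sequential CPU baseline for Monte Carlo Asian option pricing might look like the sketch below. It assumes an arithmetic-average Asian call under geometric Brownian motion, which the abstract does not specify, and it is not the authors' code; the function name and parameters are illustrative.

```python
import math
import random


def asian_call_mc(s0, strike, rate, sigma, maturity, steps, paths, seed=0):
    """Monte Carlo price of an arithmetic-average Asian call (GBM assumed)."""
    rng = random.Random(seed)
    dt = maturity / steps
    drift = (rate - 0.5 * sigma ** 2) * dt
    vol = sigma * math.sqrt(dt)
    payoff_sum = 0.0
    for _ in range(paths):
        price = s0
        running_sum = 0.0
        for _ in range(steps):
            # Exact GBM update over one time step.
            price *= math.exp(drift + vol * rng.gauss(0.0, 1.0))
            running_sum += price
        # Payoff depends on the average price along the path.
        payoff_sum += max(running_sum / steps - strike, 0.0)
    # Discount the mean payoff back to today.
    return math.exp(-rate * maturity) * payoff_sum / paths


estimate = asian_call_mc(s0=100.0, strike=100.0, rate=0.05, sigma=0.2,
                         maturity=1.0, steps=16, paths=2000, seed=1)
```

Each simulated path is independent of the others, which is why this kind of loop is a natural candidate for GPU parallelization: in a CUDA version the outer loop over paths would typically be distributed across threads, followed by a reduction of the payoffs.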
Citations: 0
Car2x with software defined networks, network functions virtualization and supercomputers technical and scientific preparations for the Amsterdam Arena telecoms fieldlab
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237111
R. Meijer, R. Cushing, C. D. Laat, P. Jackson, S. Klous, R. Koning, M. Makkes, A. Meerwijk
In the invited talk “Car2x with SDN, NFV and supercomputers” we report on how our past work with SDN [1, 2] allows the design of a smart mobility fieldlab in the huge parking lot of the Amsterdam Arena. We explain how we can engineer and test software that handles the complex conditions of the Car2X case. The talk starts by describing the engineering challenges that developers of smart car telecommunications and computing infrastructures face. We concentrate on the development of software defined networks (SDN) that support smart cars optimally and securely over a heterogeneous, dynamic and developing ICT infrastructure. The goal here is to enable smart cars to profit maximally from any bit of information available from fixed and moving objects as well as persons. For example, in a low-tech situation, the lack of other options means that one has to fall back to GPRS to download only traffic jam locations. In contrast, in an advanced telecom environment, video streams from multiple cars are transmitted via 5G pico cells to computers a few milliseconds away. These computers fuse the video information to generate a local traffic model. In an ultimate situation, cars use all the communications infrastructures that are available, including the numerous WiFi hotspots, all generations of mobile telecommunications, the developing car-to-car communications technologies and even the smart phone of a passing person. SDN technologies deal with the complexities of such a communication environment.
Citations: 1
Investigation of DVFS based dynamic reliability management for chip multiprocessors
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237093
M. Moghaddam, A. Yamamoto, Cristinel Ababei
We investigate dynamic voltage and frequency scaling (DVFS) as a mechanism for dynamic reliability management (DRM) of chip multiprocessors (CMPs). The proposed DRM scheme operates as a control technique whose objective is to drive the operation of the CMP such that reliability changes towards a desired target. While the chip multiprocessor is continuously monitored and reliability is estimated in real time, the voltage and frequency of different cores in the CMP are dynamically adjusted such that reliability converges towards the target. When the temperature of cores increases and thus reliability degrades, the proposed DRM scheme selectively throttles the frequency of the cores with the highest temperature. This, in turn, leads to lower power dissipation in those cores, whose temperature decreases, thereby improving reliability. We leverage existing simulation and estimation tools to develop the proposed DRM scheme. Simulation results show that the proposed DRM scheme provides an effective way to trade off reliability and performance.
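The control idea described in this abstract can be sketched as a single decision step. This is an illustration under assumptions, not the paper's controller: the function `drm_step`, the discrete frequency levels, and the rule for raising frequency again when reliability exceeds the target are all hypothetical; only the "throttle the hottest core when reliability degrades" rule comes from the abstract.

```python
from typing import List, Sequence


def drm_step(freqs: Sequence[float],
             temps: Sequence[float],
             reliability: float,
             target: float,
             levels: Sequence[float]) -> List[float]:
    """One control step: return the new per-core frequencies.

    `levels` lists the allowed frequencies in ascending order and every
    entry of `freqs` is assumed to be one of them.
    """
    new_freqs = list(freqs)
    cores = range(len(new_freqs))
    if reliability < target:
        # From the abstract: throttle the core with the highest temperature.
        hottest = max(cores, key=lambda i: temps[i])
        idx = levels.index(new_freqs[hottest])
        if idx > 0:
            new_freqs[hottest] = levels[idx - 1]
    elif reliability > target:
        # Assumption (not in the abstract): recover performance on the
        # coolest core when there is reliability headroom.
        coolest = min(cores, key=lambda i: temps[i])
        idx = levels.index(new_freqs[coolest])
        if idx < len(levels) - 1:
            new_freqs[coolest] = levels[idx + 1]
    return new_freqs


levels = [1.0, 1.5, 2.0]  # GHz, illustrative values
throttled = drm_step([2.0, 2.0, 2.0], [70.0, 85.0, 60.0],
                     reliability=0.90, target=0.95, levels=levels)
```

In a real system this step would run periodically, with `reliability` coming from an online estimator fed by temperature and voltage readings, so that repeated steps move reliability towards the target.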
Citations: 12
234 scheduling of 3-2 and 2-1 eliminations for parallel image compositing using non-power-of-two number of processes
Pub Date : 2015-07-20 DOI: 10.1109/HPCSim.2015.7237071
J. Nonaka, K. Ono, M. Fujita
Binary-Swap is a parallel image compositing algorithm based on recursive vector halving and distance doubling, and works efficiently when the number of processes is exactly a power of two (2^n). Several power-of-two converting approaches for Binary-Swap have been proposed. Among them, the Telescope method, based on the Binary Blocks algorithm, has been shown as the most promising approach. The Telescope method decomposes an entire set of processes into blocks of power-of-two size and merges the smaller blocks into larger blocks in a stepwise fashion. This block merging process corresponds to the communication and computational overhead of the conversion, and since it can only merge one block per stage, it becomes inefficient as the number of binary blocks increases. In this paper, we focus on a single-stage conversion method using the 3-2 and 2-1 elimination approaches. The original scheduling method, proposed by Rabenseifner et al., is limited to an odd number of processes since it always schedules a single 3-2 elimination per conversion. Taking into consideration that the 3-2 elimination can be optimized on modern HPC systems, which can overlap the communication and computation, we propose 234 Scheduling for scheduling multiple 3-2 eliminations per conversion. The multiple 3-2 elimination scheduling enlarges the application range by enabling its use on an even number of processes. We evaluated 234 Scheduling applied to Binary-Swap on the K computer, which is a modern parallel HPC system, and confirmed its effectiveness.
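The block decomposition that the abstract attributes to the Telescope method corresponds to the binary representation of the process count, which the following sketch makes concrete. It does not implement 234 Scheduling itself; the function names are illustrative, and the elimination count assumes that each 3-2 or 2-1 elimination removes exactly one process from further compositing.

```python
from typing import List


def binary_blocks(num_processes: int) -> List[int]:
    """Split a process count into power-of-two block sizes, largest first."""
    if num_processes < 1:
        raise ValueError("need at least one process")
    blocks = []
    bit = 1 << (num_processes.bit_length() - 1)
    while bit:
        if num_processes & bit:
            blocks.append(bit)
        bit >>= 1
    return blocks


def telescope_merge_stages(num_processes: int) -> int:
    """Merge stages needed when one block is merged per stage."""
    return len(binary_blocks(num_processes)) - 1


def eliminations_needed(num_processes: int) -> int:
    """Processes to remove to reach the largest power of two not above the count.

    Assumes every 3-2 or 2-1 elimination reduces the count by exactly one.
    """
    return num_processes - (1 << (num_processes.bit_length() - 1))


blocks = binary_blocks(13)
```

For 13 processes the blocks are 8, 4 and 1, so a one-block-per-stage merge needs two stages, whereas a single-stage conversion would have to schedule five eliminations at once to reach 8 processes. The number of merge stages grows with the number of set bits in the process count, which is the inefficiency the abstract points to.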
Citations: 5