
2014 International Conference on High Performance Computing & Simulation (HPCS): Latest Papers

Adsorption and Electronic Excitation of Water on TiO2 (110): Calculation of High-Dimensional Potential Energy Surfaces
Pub Date : 2015-01-01 DOI: 10.1007/978-3-319-10810-0_14
Jan Mitschker, T. Klüner
Pages: 191-203
Citations: 2
Real time robotic arm control using hand gestures
Pub Date : 2014-12-01 DOI: 10.1109/ICHPCA.2014.7045349
B. GaneshChoudhary, B. ChethanRam
In this paper we propose a way to implement a human-computer interface entirely electronically (without mechanical sensors). The idea is to replace older techniques for controlling a robotic arm, such as joysticks and buttons, with a more intuitive one, i.e., controlling the robotic arm by hand motion or gesture. We present an approach that achieves this using image processing with a web camera. We detect the key features of the hand, namely the fingers, by computational-geometry calculations, enabling real-time interaction between hand gestures and the robot. Our system can accurately locate fingers even when the forearm is in view. The system also tolerates a certain rotation of the palm and forearm, which increases freedom of use in palm-center estimation.
Pages: 1-3
Citations: 17
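The abstract above does not say which computational-geometry calculation is used to find the fingers. One common choice, assumed here purely for illustration, is to take the convex hull of the segmented hand contour and treat hull vertices far from the palm center as fingertip candidates. The sketch below uses Andrew's monotone-chain hull on a hand-made contour; `convex_hull`, `fingertip_candidates`, the contour points, and the distance threshold are all hypothetical and not taken from the paper.

```python
from math import hypot


def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive means a counter-clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other one.
    return lower[:-1] + upper[:-1]


def fingertip_candidates(contour, palm_center, min_dist):
    """Hull vertices at least `min_dist` away from the (assumed known) palm center."""
    cx, cy = palm_center
    return [p for p in convex_hull(contour)
            if hypot(p[0] - cx, p[1] - cy) >= min_dist]


# Toy contour (assumed data): a square palm, three fingertips, two valleys.
contour = [(0, 0), (6, 0), (6, 4), (0, 4),   # palm corners
           (1, 9), (3, 10), (5, 9),          # fingertips
           (2, 5), (4, 5)]                   # valleys between fingers
print(fingertip_candidates(contour, palm_center=(3, 2), min_dist=6.0))
# [(5, 9), (3, 10), (1, 9)]
```

The valleys between fingers fall inside the hull and are discarded automatically; a real pipeline would still need hand segmentation and palm-center estimation before this step.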
Steering simulations on high performance computing resources
Pub Date : 2014-09-22 DOI: 10.1109/HPCSim.2014.6903801
Junyi Han
Computational Steering (CS) of numerical simulations has been developed over the last three decades. While it has succeeded in some of its chief aims, the uptake and impact of CS have not been as great as anticipated. This paper aims to investigate the reasons for this and, from this analysis, to provide an enhanced CS framework, taking into account both modern developments in end-user devices and changes in the architecture of very large High Performance Computing (HPC) systems (supercomputers). We also consider the impact on CS of the recent interest in Dynamic Data Driven Application Systems (DDDAS) and Cyber-Physical Systems (CPS). As the beginning phase of our research, we present a general-purpose framework that provides CS and HPC as Web services to widen their uptake and use. Key advantages of this approach include reusability, modularity, real-time 3D web visualization, and enabling users to access CS and HPC services on portable devices.
Pages: 1008-1010
Citations: 2
Determining map partitioning to accelerate wind field calculation
Pub Date : 2014-09-22 DOI: 10.1109/HPCSim.2014.6903674
Gemma Sanjuan, C. Brun, T. Margalef, A. Cortés
Wind speed and direction are parameters that affect forest fire propagation dramatically, so an accurate estimate of these parameters is crucial for predicting fire propagation precisely. WindNinja is a wind field simulator that can easily be coupled to a forest fire propagation simulator such as FARSITE. However, wind field simulators present two main drawbacks: they take too much time to compute the wind field, and they require a lot of memory. A map partitioning strategy has therefore been developed to compute partial wind field maps that can be aggregated afterwards. Each map part can be computed in parallel, and the amount of memory required is available in a single node. In this work, a methodology to determine the most adequate map partitioning is presented. The map part shape, map part size, amount of overlap, and number of parts have been studied with respect to execution time and effects on wind field estimation. The results are based on extensive experimentation and are validated with real-case scenarios.
Pages: 96-103
Citations: 9
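To make the partitioning parameters in the abstract above concrete, the sketch below splits a raster map into rectangular parts that overlap their neighbours by a fixed number of cells. It only illustrates part count, part size, and overlap; the function name `partition_map` and the uniform-grid scheme are assumptions, not the methodology of the paper and not part of WindNinja.

```python
def partition_map(n_rows, n_cols, parts_y, parts_x, overlap):
    """Split an n_rows x n_cols grid into parts_y x parts_x overlapping tiles.

    Each tile is (row_start, row_stop, col_start, col_stop) with half-open
    ranges. Tiles are extended by `overlap` cells on every side and clipped
    at the map border, so only interior edges actually overlap.
    """
    if parts_y < 1 or parts_x < 1 or overlap < 0:
        raise ValueError("parts must be >= 1 and overlap must be >= 0")

    def bounds(n, parts):
        # Core (non-overlapping) edges, as equal in size as integer division allows.
        edges = [i * n // parts for i in range(parts + 1)]
        return [(max(0, edges[i] - overlap), min(n, edges[i + 1] + overlap))
                for i in range(parts)]

    rows = bounds(n_rows, parts_y)
    cols = bounds(n_cols, parts_x)
    return [(r0, r1, c0, c1) for (r0, r1) in rows for (c0, c1) in cols]


# A 100 x 80 map cut into 2 x 2 parts with a 5-cell overlap.
tiles = partition_map(100, 80, parts_y=2, parts_x=2, overlap=5)
computed_cells = sum((r1 - r0) * (c1 - c0) for r0, r1, c0, c1 in tiles)
print(tiles[0], computed_cells)
# (0, 55, 0, 45) 9900
```

In this toy case the four parts cover 9900 cells for an 8000-cell map, which shows the trade-off the abstract mentions: more overlap costs extra computation per part while the parts themselves stay small enough for a single node.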
Parallel 3D deterministic particle transport on Intel MIC architecture
Pub Date : 2014-09-22 DOI: 10.1109/HPCSim.2014.6903685
Qinglin Wang, Zuocheng Xing, Jie Liu, X. Qiang, Chunye Gong, Jiang Jiang
Single-node computation speed is essential in large-scale parallel solutions of particle transport problems. The Intel Many Integrated Core (MIC) architecture supports more than 200 hardware threads as well as 512-bit double-precision floating-point vector operations. In this paper, we use the native model of MIC to parallelize the simulation of one-energy-group, time-independent, deterministic discrete ordinates particle transport in 3D Cartesian geometry (Sweep3D). The implementation uses both the hardware threads and the vector units of MIC to efficiently exploit multi-level parallelism in the discrete ordinates method while keeping good data locality. Our optimized implementation is verified on the target MIC and provides up to 1.99 times speedup over the original MPI code on an Intel Xeon E5-2660 CPU when flux fixup is off. Compared with the prior implementation on an NVIDIA Tesla M2050 GPU, a speedup of up to 1.23 times is obtained. In addition, the differences between the implementations on MIC and GPU are discussed.
Pages: 186-192
Citations: 4
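The abstract above does not detail how parallelism is extracted from the sweep. A standard observation about discrete-ordinates sweeps, assumed here for illustration, is that for a direction travelling toward +x, +y, +z each cell depends only on its three upstream neighbours, so all cells on a diagonal plane i + j + k = const are independent of one another. The sketch below groups cells into such wavefronts and runs a toy recurrence over them; `wavefronts`, `sweep`, and the averaging update are hypothetical and are not the Sweep3D kernel.

```python
from itertools import product


def wavefronts(nx, ny, nz):
    """Group the cells of an nx x ny x nz grid by diagonal plane i + j + k."""
    planes = [[] for _ in range(nx + ny + nz - 2)]
    for i, j, k in product(range(nx), range(ny), range(nz)):
        planes[i + j + k].append((i, j, k))
    return planes


def sweep(nx, ny, nz, source=1.0):
    """Toy sweep for one direction (+x, +y, +z) with zero inflow at the boundary.

    Each cell adds `source` to the mean of its three upstream neighbours.
    """
    flux = {}
    for plane in wavefronts(nx, ny, nz):
        # Cells in one plane read only from earlier planes, so this inner loop
        # is the part that could be spread over threads or vector lanes.
        for i, j, k in plane:
            upstream = (flux.get((i - 1, j, k), 0.0)
                        + flux.get((i, j - 1, k), 0.0)
                        + flux.get((i, j, k - 1), 0.0))
            flux[(i, j, k)] = source + upstream / 3.0
    return flux


planes = wavefronts(2, 2, 2)
print([len(p) for p in planes])
# [1, 3, 3, 1]
```

The plane sizes show why the available parallelism ramps up and down during a sweep: the first and last wavefronts contain a single cell, while the middle ones are the widest.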
Task-optimized cable-actuated planar parallel manipulator architecture and its concurrent implementation
Pub Date : 2014-07-21 DOI: 10.1109/HPCSim.2014.6903684
J. Pickard, J. A. Carretero, V. Bhavsar
This work presents an initial framework for an efficient new technique for obtaining task-optimized parallel manipulators with the aid of parallel computing through OpenMP directives. A cable-driven parallel manipulator is an architecture whose actuated limbs are cables. All of the cables must remain in constant positive tension to constrain the motion of the moving end-effector. A Differential Evolution algorithm is applied to optimize the topology and actuator specifications of a cable-driven parallel manipulator. The algorithm's intrinsic parallelism is exploited using OpenMP directives to evaluate the manipulator's associated reachable and wrench workspaces. The results show that this algorithm is effective at obtaining a task-optimized architecture for the cable-driven parallel manipulator. The parallel implementation is shown to improve the algorithm's performance, with a speedup of 7.4 times using ten cores on the Atlantic Computational Excellence Network (ACEnet) Fundy compute resource, which utilizes Parallel Sun x4600 and x2200 AMD Opteron (dual-core) clusters.
Pages: 178-185
Citations: 2
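The abstract above names Differential Evolution but gives no variant or parameters. As a rough illustration, the sketch below implements the classic DE/rand/1/bin scheme with synchronous generations on a toy objective. The sphere function stands in for the manipulator workspace metric, which the abstract does not specify, and `differential_evolution`, `f`, `cr`, the population size, and the bounds are all assumed values. The evaluations run serially here; they are independent within a generation, which is the kind of loop that OpenMP-style parallelism targets.

```python
import random


def differential_evolution(objective, bounds, pop_size=20, f=0.8, cr=0.9,
                           generations=200, seed=0):
    """Minimise `objective` over box `bounds` with DE/rand/1/bin (pop_size >= 4)."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    cost = [objective(x) for x in pop]

    def make_trial(i):
        # Mutation: combine three distinct members other than the target i.
        a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
        j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
        trial = []
        for d in range(dim):
            if d == j_rand or rng.random() < cr:  # binomial crossover
                lo, hi = bounds[d]
                value = pop[a][d] + f * (pop[b][d] - pop[c][d])
                trial.append(min(max(value, lo), hi))  # clip to the box
            else:
                trial.append(pop[i][d])
        return trial

    for _ in range(generations):
        trials = [make_trial(i) for i in range(pop_size)]
        # Independent evaluations: the candidate loop for parallel execution.
        trial_costs = [objective(t) for t in trials]
        for i in range(pop_size):
            if trial_costs[i] <= cost[i]:  # greedy one-to-one selection
                pop[i], cost[i] = trials[i], trial_costs[i]

    best = min(range(pop_size), key=lambda i: cost[i])
    return pop[best], cost[best]


def sphere(x):
    """Toy objective with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best_x, best_cost = differential_evolution(sphere, [(-5.0, 5.0), (-5.0, 5.0)])
```

For scale, the reported speedup of 7.4 on ten cores corresponds to a parallel efficiency of 7.4 / 10 = 74%.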
A performance evaluation of TopHat RNA sequences alignment tool on openstack-based cloud environments
Pub Date : 2014-07-21 DOI: 10.1109/HPCSim.2014.6903708
L. Foschini, Alessandro Pernafini, Antonio Corradi, M. Rosati, Alessandro Federico, G. Fiameni
Despite their great promise, current Cloud offerings have not been fully exploited for the management of Next-Generation Sequencing technologies. In fact, while dynamic resource allocation is typically required to ensure efficient and effective usage of Cloud resources, Cloud providers have to deal with complex services, usually treated as black boxes; hence, estimating the maximum number of resources that could improve service execution is a big challenge. This paper explores the benefits of Cloud deployment when operating a processor-hungry RNA alignment tool. The goal is to show the advantages of the virtualized and Cloud-aware approach compared to a typical bare-metal deployment. Extensive results demonstrate that our approach is a viable first step toward easing deployment and improving run-time service scaling.
Pages: 358-365
Citations: 4
Enabling hydrodynamics solver for efficient parallel simulations
Pub Date : 2014-07-21 DOI: 10.1109/HPCSim.2014.6903770
R. Broglia, S. Zaghi, R. Muscari, F. Salvadore
In this paper we present the parallel solver χnavis, a general-purpose solver for Computational Fluid Dynamics (CFD). The solver is based on the finite volume discretization of the unsteady incompressible Navier-Stokes equations; its main features include a level set approach to handle free surface flows and a dynamical overlapping grids approach, which makes it possible to deal with bodies in relative motion. The baseline code features a hybrid MPI/OpenMP parallelization, proven to scale when running on the order of hundreds of cores (i.e., Tier-1 platforms). This paper deals with the latest developments aimed at extending the capabilities of the χnavis software to exploit modern parallel architectures. Scalability properties are demonstrated for different cases. As examples of application, the computation of the flow field around a submarine in prescribed oscillatory motion and of the surface flow around a catamaran in steady drift advancement are presented.
Pages: 803-810
Citations: 19
Analyzing the impact of programming models for efficient communication overlap in high-speed networks
Pub Date : 2014-07-21 DOI: 10.1109/HPCSim.2014.6903689
G. Utrera, Marisa Gil, X. Martorell
Exascale applications in civil engineering, simulation, and other fields of current research make intensive use of large sparse matrices. A characteristic of these matrices is the difficulty of balancing communication and computation, so that even when these two phases are overlapped the application does not achieve good overall scalability and instead suffers a loss of performance. Some proposals have been presented to diminish this drawback, based on the hybrid use of programming models, with MPI as the communication basis and threads for computation (mainly OpenMP, but also Cilk, CUDA, or OpenCL) to adapt to new heterogeneous platforms. In this work, we evaluate the impact of providing task-based parallelism instead of fork-join parallelism. As regards communication, the appearance of faster networks with specific optimizations and internal protocol characteristics makes it appealing to analyze and evaluate the influence of these networks on execution performance. We evaluate our results on two different communication networks: 10 Gigabit Ethernet and InfiniBand. For our evaluations we run the miniFE mini-application of the Mantevo benchmark suite on a homogeneous supercomputer platform based on Intel Sandy Bridge processors. Experimental results show how network behavior can affect performance and how it can be managed via task-based models: compared with a hybrid MPI/OpenMP version that overlaps communication and computation, our task-based MPI/OmpSs proposal obtains up to 60% improvement.
Pages: 218-225
Citations: 3
BitTorrent vulnerability to free riders: Root causes analysis
Pub Date : 2014-07-21 DOI: 10.1109/HPCSim.2014.6903794
Farag Azzedin, Mohammed Onimisi Yahaya
BitTorrent has gained popularity as a file sharing environment. The concept of sharing assumes that every peer in the environment contributes. However, the action of not sharing negates the fundamental concepts of BitTorrent. In this paper, we investigate the free riding phenomenon in BitTorrent. Through simulation experiments, we study in detail BitTorrent's vulnerability to free riders.
Pages: 978-984
Citations: 0