首页 > 最新文献

2012 SC Companion: High Performance Computing, Networking Storage and Analysis最新文献

英文 中文
TDPSS: A Scalable Time Domain Power System Simulator for Dynamic Security Assessment TDPSS:用于动态安全评估的可扩展时域电力系统模拟器
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.51
S. Khaitan, J. McCalley
Simulation plays a very crucial role to model, study and experiment with any design innovation proposed in the power systems. Since mathematical modeling of power systems leads to tens of thousands of stiff DAEs (differential and algebraic equations), the design of power system simulators involve exercising a trade-off between the simulation speed and modeling accuracy. Lack of efficient and detailed simulators forces the designers to experiment their techniques with small test systems and hence, the results obtained from such experiments may not be representative of the results obtained using real-life power systems. In this paper, we present TDPSS, a high speed time domain power system simulator for dynamic security assessment. TDPSS has been designed using object-oriented programming framework, and thus, it is modular and extensible. By offering a variety of models of power system components and fast numerical algorithms, it provides the user with the flexibility to experiment with different design options in an efficient manner. We discuss the design of TDPSS to give insights into the simulation infrastructure and also discuss the areas where TDPSS can be extended for parallel contingency analysis. We also validate it against the commercial power system simulators, namely PSSE and DSA Tools. Further, we compare the simulation speed of TPDSS for different numerical algorithms. The results have shown that TDPSS is accurate and also outperforms the commonly used commercial simulator PSSE in terms of its computational efficiency.
仿真对于电力系统中任何设计创新的建模、研究和实验都起着至关重要的作用。由于电力系统的数学建模导致成千上万的刚性DAEs(微分方程和代数方程),因此电力系统模拟器的设计涉及在仿真速度和建模精度之间进行权衡。由于缺乏高效和详细的模拟器,设计人员不得不在小型测试系统上试验他们的技术,因此,从这些实验中获得的结果可能不能代表使用实际电力系统获得的结果。本文提出了一种用于电力系统动态安全评估的高速时域仿真器TDPSS。TDPSS采用面向对象的编程框架进行设计,具有模块化和可扩展性。通过提供各种电力系统组件模型和快速数值算法,它为用户提供了灵活的实验,以有效的方式不同的设计方案。我们讨论了TDPSS的设计,以深入了解仿真基础设施,并讨论了可以扩展TDPSS进行并行偶然性分析的领域。我们还对商业电力系统模拟器,即PSSE和DSA工具进行了验证。此外,我们比较了不同数值算法下TPDSS的仿真速度。结果表明,TDPSS是精确的,并且在计算效率方面优于常用的商用模拟器PSSE。
{"title":"TDPSS: A Scalable Time Domain Power System Simulator for Dynamic Security Assessment","authors":"S. Khaitan, J. McCalley","doi":"10.1109/SC.Companion.2012.51","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.51","url":null,"abstract":"Simulation plays a very crucial role to model, study and experiment with any design innovation proposed in the power systems. Since mathematical modeling of power systems leads to tens of thousands of stiff DAEs (differential and algebraic equations), the design of power system simulators involve exercising a trade-off between the simulation speed and modeling accuracy. Lack of efficient and detailed simulators forces the designers to experiment their techniques with small test systems and hence, the results obtained from such experiments may not be representative of the results obtained using real-life power systems. In this paper, we present TDPSS, a high speed time domain power system simulator for dynamic security assessment. TDPSS has been designed using object-oriented programming framework, and thus, it is modular and extensible. By offering a variety of models of power system components and fast numerical algorithms, it provides the user with the flexibility to experiment with different design options in an efficient manner. We discuss the design of TDPSS to give insights into the simulation infrastructure and also discuss the areas where TDPSS can be extended for parallel contingency analysis. We also validate it against the commercial power system simulators, namely PSSE and DSA Tools. Further, we compare the simulation speed of TPDSS for different numerical algorithms. The results have shown that TDPSS is accurate and also outperforms the commonly used commercial simulator PSSE in terms of its computational efficiency.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"14 1","pages":"323-332"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80103585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
In-Situ Feature Tracking and Visualization of a Temporal Mixing Layer 时间混合层的原位特征跟踪与可视化
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.335
E. Duque, Daniel E. Hiepler, S. Legensky, C. Stone
The flow field for a temporal mixing layer was analyzed by solving the Navier-Stokes equations via a Large Eddy Simulation method, LESLIE3D, and then visualizing and post-processing the resulting flow features by utilizing the prototype visualization and CFD data analysis software system Intelligent In-Situ Feature Detection, Tracking and Visualization for Turbulent Flow Simulations (IFDT). The system utilizes volume rendering with an Intelligent Adaptive Transfer Function that allows the user to train the visualization system to highlight flow features such as turbulent vortices. A feature extractor based upon a Prediction-Correction method then tracks and extracts the flow features and determines the statistics of features over time. The method executes In-Situ with the flow solver via a Python Interface Framework to avoid the overhead of saving data to file. The movie submitted for this visualization showcase highlights the visualization of the flow such as the formation of vortex features, vortex breakdown, the onset of turbulence and then fully mixed conditions.
通过大涡模拟方法LESLIE3D求解Navier-Stokes方程,分析时间混合层的流场,然后利用原型可视化和CFD数据分析软件系统湍流模拟智能原位特征检测、跟踪和可视化(IFDT)对得到的流场特征进行可视化和后处理。该系统利用具有智能自适应传递函数的体积渲染,允许用户训练可视化系统来突出显示湍流漩涡等流动特征。然后,基于预测-校正方法的特征提取器跟踪和提取流特征并确定特征随时间的统计量。该方法通过Python接口框架与流求解器一起原位执行,以避免将数据保存到文件的开销。为这个可视化展示提交的电影突出了流动的可视化,如漩涡特征的形成,漩涡破裂,湍流的开始,然后是完全混合的条件。
{"title":"In-Situ Feature Tracking and Visualization of a Temporal Mixing Layer","authors":"E. Duque, Daniel E. Hiepler, S. Legensky, C. Stone","doi":"10.1109/SC.Companion.2012.335","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.335","url":null,"abstract":"The flow field for a temporal mixing layer was analyzed by solving the Navier-Stokes equations via a Large Eddy Simulation method, LESLIE3D, and then visualizing and post-processing the resulting flow features by utilizing the prototype visualization and CFD data analysis software system Intelligent In-Situ Feature Detection, Tracking and Visualization for Turbulent Flow Simulations (IFDT). The system utilizes volume rendering with an Intelligent Adaptive Transfer Function that allows the user to train the visualization system to highlight flow features such as turbulent vortices. A feature extractor based upon a Prediction-Correction method then tracks and extracts the flow features and determines the statistics of features over time. The method executes In-Situ with the flow solver via a Python Interface Framework to avoid the overhead of saving data to file. The movie submitted for this visualization showcase highlights the visualization of the flow such as the formation of vortex features, vortex breakdown, the onset of turbulence and then fully mixed conditions.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"1 1","pages":"1593-1593"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88864747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Achieving design targets by stochastic car crash simulations 通过随机碰撞模拟实现设计目标
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.350
T. Yasuki
{"title":"Achieving design targets by stochastic car crash simulations","authors":"T. Yasuki","doi":"10.1109/SC.Companion.2012.350","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.350","url":null,"abstract":"","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"89 1","pages":"1923-1941"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78223865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oh, $#*@! Exascale! The Effect of Emerging Architectures on Scientific Discovery 哦,$ # * @ !Exascale !新兴架构对科学发现的影响
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.38
K. Moreland
The predictions for exascale computing are dire. Although we have benefited from a consistent supercomputer architecture design, even across manufacturers, for well over a decade, recent trends indicate that future high-performance computers will have different hardware structure and programming models to which software must adapt. This paper provides an informal discussion on the ways in which changes in high-performance computing architecture will profoundly affect the scalability of our current generation of scientific visualization and analysis codes and how we must adapt our applications, workflows, and attitudes to continue our success at exascale computing.
对百亿亿次计算的预测是可怕的。尽管十多年来我们一直受益于一致的超级计算机架构设计,即使是在不同的制造商之间,但最近的趋势表明,未来的高性能计算机将具有不同的硬件结构和编程模型,软件必须适应这些结构和模型。本文非正式地讨论了高性能计算架构的变化将如何深刻地影响我们当前一代科学可视化和分析代码的可伸缩性,以及我们必须如何调整我们的应用程序、工作流和态度,以继续我们在百亿亿级计算上的成功。
{"title":"Oh, $#*@! Exascale! The Effect of Emerging Architectures on Scientific Discovery","authors":"K. Moreland","doi":"10.1109/SC.Companion.2012.38","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.38","url":null,"abstract":"The predictions for exascale computing are dire. Although we have benefited from a consistent supercomputer architecture design, even across manufacturers, for well over a decade, recent trends indicate that future high-performance computers will have different hardware structure and programming models to which software must adapt. This paper provides an informal discussion on the ways in which changes in high-performance computing architecture will profoundly affect the scalability of our current generation of scientific visualization and analysis codes and how we must adapt our applications, workflows, and attitudes to continue our success at exascale computing.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"6 1","pages":"224-231"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72823698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A Hybrid Scheduling Approach for Scalable Heterogeneous Hadoop Systems 可扩展异构Hadoop系统的混合调度方法
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.155
Aysan Rasooli Oskooei, D. Down
The scalability of Cloud infrastructures has significantly increased their applicability. Hadoop, which works based on a MapReduce model, provides for efficient processing of Big Data. This solution is being used widely by most Cloud providers. Hadoop schedulers are critical elements for providing desired performance levels. A scheduler assigns MapReduce tasks to Hadoop resources. There is a considerable challenge to schedule the growing number of tasks and resources in a scalable manner. Moreover, the potential heterogeneous nature of deployed Hadoop systems tends to increase this challenge. This paper analyzes the performance of widely used Hadoop schedulers including FIFO and Fair sharing and compares them with the COSHH (Classification and Optimization based Scheduler for Heterogeneous Hadoop) scheduler, which has been developed by the authors. Based on our insights, a hybrid solution is introduced, which selects appropriate scheduling algorithms for scalable and heterogeneous Hadoop systems with respect to the number of incoming jobs and available resources.
云基础设施的可伸缩性大大提高了它们的适用性。Hadoop基于MapReduce模型,提供了对大数据的高效处理。这种解决方案被大多数云提供商广泛使用。Hadoop调度器是提供所需性能级别的关键元素。调度程序将MapReduce任务分配给Hadoop资源。以可伸缩的方式安排越来越多的任务和资源是一个相当大的挑战。此外,已部署Hadoop系统的潜在异构特性往往会增加这一挑战。分析了先进先出(FIFO)和公平共享(Fair sharing)等Hadoop调度程序的性能,并与作者开发的COSHH (Classification and Optimization based Scheduler for Heterogeneous Hadoop)调度程序进行了比较。基于我们的见解,介绍了一种混合解决方案,它根据传入作业和可用资源的数量为可扩展和异构Hadoop系统选择适当的调度算法。
{"title":"A Hybrid Scheduling Approach for Scalable Heterogeneous Hadoop Systems","authors":"Aysan Rasooli Oskooei, D. Down","doi":"10.1109/SC.Companion.2012.155","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.155","url":null,"abstract":"The scalability of Cloud infrastructures has significantly increased their applicability. Hadoop, which works based on a MapReduce model, provides for efficient processing of Big Data. This solution is being used widely by most Cloud providers. Hadoop schedulers are critical elements for providing desired performance levels. A scheduler assigns MapReduce tasks to Hadoop resources. There is a considerable challenge to schedule the growing number of tasks and resources in a scalable manner. Moreover, the potential heterogeneous nature of deployed Hadoop systems tends to increase this challenge. This paper analyzes the performance of widely used Hadoop schedulers including FIFO and Fair sharing and compares them with the COSHH (Classification and Optimization based Scheduler for Heterogeneous Hadoop) scheduler, which has been developed by the authors. Based on our insights, a hybrid solution is introduced, which selects appropriate scheduling algorithms for scalable and heterogeneous Hadoop systems with respect to the number of incoming jobs and available resources.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"39 1","pages":"1284-1291"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72843764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 76
Executing Optimized Irregular Applications Using Task Graphs within Existing Parallel Models 在现有并行模型中使用任务图执行优化的不规则应用程序
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.43
Christopher D. Krieger, M. Strout, J. Roelofs, A. Bajwa
Many sparse or irregular scientific computations are memory bound and benefit from locality improving optimizations such as blocking or tiling. These optimizations result in asynchronous parallelism that can be represented by arbitrary task graphs. Unfortunately, most popular parallel programming models with the exception of Threading Building Blocks (TBB) do not directly execute arbitrary task graphs. In this paper, we compare the programming and execution of arbitrary task graphs qualitatively and quantitatively in TBB, the OpenMP doall model, the OpenMP 3.0 task model, and Cilk Plus. We present performance and scalability results for 8 and 40 core shared memory systems on a sparse matrix iterative solver and a molecular dynamics benchmark.
许多稀疏或不规则的科学计算都是内存受限的,并受益于局部性改进的优化,如阻塞或平铺。这些优化产生异步并行性,可以用任意任务图表示。不幸的是,除了线程构建块(TBB)之外,大多数流行的并行编程模型都不能直接执行任意任务图。本文对TBB、OpenMP doall模型、OpenMP 3.0任务模型和Cilk Plus中任意任务图的编程和执行进行了定性和定量的比较。我们在稀疏矩阵迭代求解器和分子动力学基准上给出了8核和40核共享内存系统的性能和可扩展性结果。
{"title":"Executing Optimized Irregular Applications Using Task Graphs within Existing Parallel Models","authors":"Christopher D. Krieger, M. Strout, J. Roelofs, A. Bajwa","doi":"10.1109/SC.Companion.2012.43","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.43","url":null,"abstract":"Many sparse or irregular scientific computations are memory bound and benefit from locality improving optimizations such as blocking or tiling. These optimizations result in asynchronous parallelism that can be represented by arbitrary task graphs. Unfortunately, most popular parallel programming models with the exception of Threading Building Blocks (TBB) do not directly execute arbitrary task graphs. In this paper, we compare the programming and execution of arbitrary task graphs qualitatively and quantitatively in TBB, the OpenMP doall model, the OpenMP 3.0 task model, and Cilk Plus. We present performance and scalability results for 8 and 40 core shared memory systems on a sparse matrix iterative solver and a molecular dynamics benchmark.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"40 1","pages":"261-268"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86921617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Neural Circuit Simulation of Hodgkin-Huxley Type Neurons Toward Peta Scale Computers 面向Peta级计算机的霍奇金-赫胥黎型神经元的神经回路模拟
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.314
Daisuke Miyamoto, T. Kazawa, R. Kanzaki
We ported and optimized simulation environment "NEURON" on K computer to simulate a insect brain as multi-compartment Hodgkin-Huxley type model. To use SIMD units of SPARC64VIIIfx (CPU of K computer), we exchanged the order of the compartment loop and the ion channel loop and apply sector caches. These tuning improved single core performance 340 MFLOPS/core to 1560 MFLOPS/core (about 10% efficiency).Spike exchange method of gNEURONh (MPI_Allgather) demands large amount of time in case of 10,000 cores or more and simple asynchronous point-to-point method (MPI_Isend) is not effective either, because of a large number of function calls and long distance of interconnect pathway. To tackle these problems, we adopted MPI/OpenMP hybrid parallelization to reduce interconnect communications and we developed a program to optimize location of neurons on calculation nodes in the 3D torus network. As a these results, we obtained 187 TFLOPS with 196,608 CPU cores.
将仿真环境“NEURON”移植并优化到K计算机上,以多室霍奇金-赫胥黎模型模拟昆虫脑。为了使用SPARC64VIIIfx (K计算机的CPU)的SIMD单元,我们交换了隔室环路和离子通道环路的顺序,并应用扇区缓存。这些调优将单核性能从340 MFLOPS/核提高到1560 MFLOPS/核(效率约为10%)。gNEURONh的尖峰交换方法(MPI_Allgather)在10000核以上的情况下需要大量的时间,简单的异步点对点方法(MPI_Isend)也不有效,因为函数调用量大,互连路径距离长。为了解决这些问题,我们采用MPI/OpenMP混合并行化来减少互连通信,并开发了一个程序来优化三维环面网络中计算节点上神经元的位置。根据这些结果,我们获得了187 TFLOPS和196,608个CPU内核。
{"title":"Neural Circuit Simulation of Hodgkin-Huxley Type Neurons Toward Peta Scale Computers","authors":"Daisuke Miyamoto, T. Kazawa, R. Kanzaki","doi":"10.1109/SC.Companion.2012.314","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.314","url":null,"abstract":"We ported and optimized simulation environment \"NEURON\" on K computer to simulate a insect brain as multi-compartment Hodgkin-Huxley type model. To use SIMD units of SPARC64VIIIfx (CPU of K computer), we exchanged the order of the compartment loop and the ion channel loop and apply sector caches. These tuning improved single core performance 340 MFLOPS/core to 1560 MFLOPS/core (about 10% efficiency).Spike exchange method of gNEURONh (MPI_Allgather) demands large amount of time in case of 10,000 cores or more and simple asynchronous point-to-point method (MPI_Isend) is not effective either, because of a large number of function calls and long distance of interconnect pathway. To tackle these problems, we adopted MPI/OpenMP hybrid parallelization to reduce interconnect communications and we developed a program to optimize location of neurons on calculation nodes in the 3D torus network. As a these results, we obtained 187 TFLOPS with 196,608 CPU cores.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"160 1","pages":"1541-1541"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87053432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Improved Real-Time Computation Engine for a Dispatcher Training Center of the European Transmission Network 欧洲输电网调度员培训中心改进的实时计算引擎
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.52
B. Haut, Francois Bouchez, F. Villella
Dispatcher Training Simulators (DTS) are fundamental tools used by Transmission System Operators (TSO's) and Distribution System Operators (DSO's) around the world to train their dispatchers on frequent or uncommon situations. DTS widely used are generally based on simplified models for the system dynamics (static modelling or very simplified dynamics) and are limited to small/medium systems due to constraints in computational performance. Taking into account fast dynamics on large system is a real challenge for the simulation engine of a DTS. Indeed, in order to represent effectively the reaction of the power system, the simulation must be carried out very close to real-time. The PEGASE project addressed this challenge on the European Transmission Network (ETN). Many different algorithms were investigated and several have been implemented in a prototype based on FAST, a full-dynamic DTS simulation engine developed by Tractebel Engineering S.A. and integrated in the Energy Management System (EMS) of several TSO's. This paper describes the considered algorithmic improvements and presents numerical results (both in terms of accuracy and efficiency) obtained on a representation of the whole ETN.
调度员培训模拟器(DTS)是世界各地输电系统运营商(TSO)和配电系统运营商(DSO)使用的基本工具,用于培训他们的调度员处理常见或不常见的情况。广泛使用的DTS通常基于简化的系统动力学模型(静态建模或非常简化的动力学),由于计算性能的限制,仅限于中小型系统。考虑大型系统的快速动态是DTS仿真引擎面临的真正挑战。事实上,为了有效地表示电力系统的反应,必须进行非常接近实时的仿真。PEGASE项目在欧洲输电网络(ETN)上解决了这一挑战。研究了许多不同的算法,其中一些已经在基于FAST的原型中实现,FAST是由Tractebel Engineering S.A.开发的全动态DTS仿真引擎,并集成在几个TSO的能源管理系统(EMS)中。本文描述了所考虑的算法改进,并给出了在整个ETN的表示上获得的数值结果(精度和效率方面)。
{"title":"Improved Real-Time Computation Engine for a Dispatcher Training Center of the European Transmission Network","authors":"B. Haut, Francois Bouchez, F. Villella","doi":"10.1109/SC.Companion.2012.52","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.52","url":null,"abstract":"Dispatcher Training Simulators (DTS) are fundamental tools used by Transmission System Operators (TSO's) and Distribution System Operators (DSO's) around the world to train their dispatchers on frequent or uncommon situations. DTS widely used are generally based on simplified models for the system dynamics (static modelling or very simplified dynamics) and are limited to small/medium systems due to constraints in computational performance. Taking into account fast dynamics on large system is a real challenge for the simulation engine of a DTS. Indeed, in order to represent effectively the reaction of the power system, the simulation must be carried out very close to real-time. The PEGASE project addressed this challenge on the European Transmission Network (ETN). Many different algorithms were investigated and several have been implemented in a prototype based on FAST, a full-dynamic DTS simulation engine developed by Tractebel Engineering S.A. and integrated in the Energy Management System (EMS) of several TSO's. This paper describes the considered algorithmic improvements and presents numerical results (both in terms of accuracy and efficiency) obtained on a representation of the whole ETN.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"54 1","pages":"333-340"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77781127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Improving Data Analysis Performance for High-Performance Computing with Integrating Statistical Metadata in Scientific Datasets 集成科学数据集中的统计元数据,提高高性能计算的数据分析性能
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.156
Jialin Liu, Yong Chen
Scientific datasets and libraries, such as HDF5, ADIOS, and NetCDF, have been used widely in many data intensive applications. These libraries have their special file formats and I/O functions to provide efficient access to large datasets. When the data size keeps increasing, these high level I/O libraries face new challenges. Recent studies have started to utilize database techniques such as indexing and subsetting, and data reorganization to manage the increasing datasets. In this work, we present a new approach to boost the data analysis performance, namely Fast Analysis with Statistical Metadata (FASM), via data subsetting and integrating a small amount of statistics into the original datasets. The added statistical information illustrates the data shape and provides knowledge of the data distribution; therefore the original I/O libraries can utilize these statistical metadata to perform fast queries and analyses. The proposed FASM approach is currently evaluated with the PnetCDF on Lustre file systems, but can also be implemented with other scientific libraries. The FASM can potentially lead to a new dataset design and can have an impact on big data analysis.
科学数据集和库,如HDF5、ADIOS和NetCDF,已广泛应用于许多数据密集型应用。这些库具有其特殊的文件格式和I/O函数,以提供对大型数据集的有效访问。当数据量不断增加时,这些高级I/O库面临着新的挑战。近年来的研究开始利用索引和子集、数据重组等数据库技术来管理不断增加的数据集。在这项工作中,我们提出了一种新的方法来提高数据分析性能,即快速分析统计元数据(FASM),通过数据子集和集成少量的统计数据到原始数据集中。所添加的统计信息说明了数据形状并提供了有关数据分布的知识;因此,原始I/O库可以利用这些统计元数据来执行快速查询和分析。提出的FASM方法目前在Lustre文件系统上使用PnetCDF进行评估,但也可以与其他科学库一起实现。FASM可能会导致新的数据集设计,并可能对大数据分析产生影响。
{"title":"Improving Data Analysis Performance for High-Performance Computing with Integrating Statistical Metadata in Scientific Datasets","authors":"Jialin Liu, Yong Chen","doi":"10.1109/SC.Companion.2012.156","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.156","url":null,"abstract":"Scientific datasets and libraries, such as HDF5, ADIOS, and NetCDF, have been used widely in many data intensive applications. These libraries have their special file formats and I/O functions to provide efficient access to large datasets. When the data size keeps increasing, these high level I/O libraries face new challenges. Recent studies have started to utilize database techniques such as indexing and subsetting, and data reorganization to manage the increasing datasets. In this work, we present a new approach to boost the data analysis performance, namely Fast Analysis with Statistical Metadata (FASM), via data subsetting and integrating a small amount of statistics into the original datasets. The added statistical information illustrates the data shape and provides knowledge of the data distribution; therefore the original I/O libraries can utilize these statistical metadata to perform fast queries and analyses. The proposed FASM approach is currently evaluated with the PnetCDF on Lustre file systems, but can also be implemented with other scientific libraries. The FASM can potentially lead to a new dataset design and can have an impact on big data analysis.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"50 1","pages":"1292-1295"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87796451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Poster: MPACK 0.7.0: Multiple Precision Version of BLAS and LAPACK 海报:MPACK 0.7.0: BLAS和LAPACK的多精度版本
Pub Date : 2012-11-10 DOI: 10.1109/SC.Companion.2012.183
Maho Nakata
We are interested in the accuracy of linear algebra operations; accuracy of the solution of linear equation, eigenvalue and eigenvectors of some matrices, etc. This is a reason for we have been developing the MPACK. The MPACK consists of MBLAS and MLAPACK, multiple precision version of BLAS and LAPACK, respectively. Features of MPACK are: (i) based on LAPACK 3.x, (ii) to provide a reference implementation and or API (iii) written in C++, rewrite from FORTRAN77 (iv) supports GMP, MPFR, DD/QD and binary128 as multiple precision arithmetic library and (v) portable. Current version of MPACK is 0.7.0 and it supports 76 MBLAS routines and 100 MLAPACK routines. Matrix-matrix multiplication routine has been accelerated using NVIDIA C2050 GPU.
我们感兴趣的是线性代数运算的准确性;线性方程解的精度,某些矩阵的特征值和特征向量等。这就是我们一直在开发MPACK的原因。MPACK由MBLAS和MLAPACK组成,分别是BLAS和LAPACK的多精度版本。MPACK的特点是:(1)基于LAPACK 3。x, (ii)提供参考实现和/或API (iii)用c++编写,从FORTRAN77重写(iv)支持GMP, MPFR, DD/QD和binary128作为多精度算术库和(v)可移植。当前版本的MPACK是0.7.0,它支持76个MBLAS例程和100个MLAPACK例程。矩阵-矩阵乘法例程已使用NVIDIA C2050 GPU加速。
{"title":"Poster: MPACK 0.7.0: Multiple Precision Version of BLAS and LAPACK","authors":"Maho Nakata","doi":"10.1109/SC.Companion.2012.183","DOIUrl":"https://doi.org/10.1109/SC.Companion.2012.183","url":null,"abstract":"We are interested in the accuracy of linear algebra operations; accuracy of the solution of linear equation, eigenvalue and eigenvectors of some matrices, etc. This is a reason for we have been developing the MPACK. The MPACK consists of MBLAS and MLAPACK, multiple precision version of BLAS and LAPACK, respectively. Features of MPACK are: (i) based on LAPACK 3.x, (ii) to provide a reference implementation and or API (iii) written in C++, rewrite from FORTRAN77 (iv) supports GMP, MPFR, DD/QD and binary128 as multiple precision arithmetic library and (v) portable. Current version of MPACK is 0.7.0 and it supports 76 MBLAS routines and 100 MLAPACK routines. Matrix-matrix multiplication routine has been accelerated using NVIDIA C2050 GPU.","PeriodicalId":6346,"journal":{"name":"2012 SC Companion: High Performance Computing, Networking Storage and Analysis","volume":"26 1","pages":"1353-1353"},"PeriodicalIF":0.0,"publicationDate":"2012-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81879453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2012 SC Companion: High Performance Computing, Networking Storage and Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1