CCF Transactions on High Performance Computing最新文献

英文中文

DCU-CHK: checkpointing for large-scale CPU-DCU heterogeneous computing systems DCU-CHK：大规模 CPU-DCU 异构计算系统的检查点功能

IF 0.9 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2024-01-07 DOI: 10.1007/s42514-023-00178-4

Jie Jia, Xinyuan Lin, Fang Lin, Yi Liu

引用次数: 0

HiRM: Hierarchical resource management for earth system models on many-core clusters HiRM：多核集群上地球系统模型的分级资源管理

IF 0.9 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2024-01-05 DOI: 10.1007/s42514-023-00176-6

Zhewen Xu, Xiaohui Wei, JieYun Hao, Jiale Li, Hongliang Li, Zhaohui Ding, Sicong Li

引用次数: 0

Extending OP2 framework to support portable parallel programming of complex applications 扩展 OP2 框架，支持复杂应用程序的可移植并行编程

IF 0.9 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-12-07 DOI: 10.1007/s42514-023-00174-8

Zongjing Chen, Kangjin Huang, Yonggang Che, Chuanfu Xu, Jian Zhang, Z. Dai, Ming Li

引用次数: 0

Leveraging simulation of high performance computing systems with node simulation using architecture simulator 利用架构模拟器对高性能计算系统进行节点仿真

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-11-13 DOI: 10.1007/s42514-023-00173-9

Fang Lin, Yi Liu, Xin Wang, Xueyan Gai

引用次数: 0

OneGraph: a cross-architecture framework for large-scale graph computing on GPUs based on oneAPI OneGraph:基于oneAPI的gpu大规模图计算跨架构框架

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-11-09 DOI: 10.1007/s42514-023-00172-w

Shiyang Li, Jingyu Zhu, Jiaxun Han, Yuting Peng, Zhuoran Wang, Xiaoli Gong, Gang Wang, Jin Zhang, Xuqiang Wang

引用次数: 0

BSPADMM: block splitting proximal ADMM for sparse representation with strong scalability BSPADMM:块分割近端ADMM，具有较强的可扩展性

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-10-07 DOI: 10.1007/s42514-023-00164-w

Yidong Chen, Jingshan Pan, Zidong Han, Yonghong Hu, Meng Guo, Zhonghua Lu

引用次数: 0

Conflict-aware workload co-execution on SX-aurora TSUBASA SX-aurora TSUBASA上的冲突感知工作负载协同执行

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-10-05 DOI: 10.1007/s42514-023-00171-x

Riku Nunokawa, Yoichi Shimomura, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa

Abstract NEC SX-Aurora TSUBASA (SX-AT) is the latest vector supercomputer, consisting of host processors called Vector Hosts (VHs) and vector processors called Vector Engines (VEs). The goal of this work is to simultaneously use both VHs and VEs to increase the resource utilization and improve the system throughput by co-executing more workloads. One difficulty is that performance interferences among VH and VE workloads could occur because they share some computing resources and potentially compete to use the same resource at the same time, so-called resource conflicts. To achieve efficient workload co-execution, first, this paper experimentally investigates the performance interference between a VH and a VE, when each of the two processors executes a different workload. It is empirically shown that the frequency of system calls from the VE workload could be a good indicator to predict if the co-execution could cause severe performance interference, even though monitoring system calls requires a huge runtime overhead and it is impractical to simply use it for decision making of co-execution. Then, this paper proposes a workload co-execution strategy based on a practical approach to identifying a pair of VE and VH workloads that could cause severe performance interferences. Our evaluation results clearly demonstrate that the system call frequency can be used to predict if the workload can affect the performance of another co-executing workload, and VH’s CPU load can be a good approximation of the system call frequency. The proposed approach based on the CPU loads could accurately identify a pair of workloads causing frequent resource conflicts, and thus reduce the risk of severe performance interferences between co-executing workloads on an SX-AT system, resulting in shorter makespan without significantly increasing the turn-around time.

NEC SX-Aurora TSUBASA (SX-AT)是最新的矢量超级计算机，由称为矢量主机(VHs)的主机处理器和称为矢量引擎(VEs)的矢量处理器组成。这项工作的目标是同时使用vh和ve，通过共同执行更多的工作负载来提高资源利用率和提高系统吞吐量。一个困难是，VH和VE工作负载之间可能出现性能干扰，因为它们共享一些计算资源，并可能同时竞争使用相同的资源，即所谓的资源冲突。为了实现高效的工作负载协同执行，本文首先通过实验研究了VH和VE在执行不同工作负载时的性能干扰。经验表明，来自VE工作负载的系统调用的频率可能是预测共同执行是否会导致严重性能干扰的一个很好的指标，尽管监视系统调用需要巨大的运行时开销，并且简单地将其用于共同执行的决策是不切实际的。然后，本文提出了一种基于实际方法的工作负载协同执行策略，以识别可能导致严重性能干扰的一对VE和VH工作负载。我们的评估结果清楚地表明，可以使用系统调用频率来预测工作负载是否会影响另一个协同执行工作负载的性能，并且VH的CPU负载可以很好地近似系统调用频率。所提出的基于CPU负载的方法可以准确地识别导致频繁资源冲突的一对工作负载，从而降低SX-AT系统上共同执行的工作负载之间严重性能干扰的风险，从而在不显著增加周转时间的情况下缩短makespan。

{"title":"Conflict-aware workload co-execution on SX-aurora TSUBASA","authors":"Riku Nunokawa, Yoichi Shimomura, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa","doi":"10.1007/s42514-023-00171-x","DOIUrl":"https://doi.org/10.1007/s42514-023-00171-x","url":null,"abstract":"Abstract NEC SX-Aurora TSUBASA (SX-AT) is the latest vector supercomputer, consisting of host processors called Vector Hosts (VHs) and vector processors called Vector Engines (VEs). The goal of this work is to simultaneously use both VHs and VEs to increase the resource utilization and improve the system throughput by co-executing more workloads. One difficulty is that performance interferences among VH and VE workloads could occur because they share some computing resources and potentially compete to use the same resource at the same time, so-called resource conflicts. To achieve efficient workload co-execution, first, this paper experimentally investigates the performance interference between a VH and a VE, when each of the two processors executes a different workload. It is empirically shown that the frequency of system calls from the VE workload could be a good indicator to predict if the co-execution could cause severe performance interference, even though monitoring system calls requires a huge runtime overhead and it is impractical to simply use it for decision making of co-execution. Then, this paper proposes a workload co-execution strategy based on a practical approach to identifying a pair of VE and VH workloads that could cause severe performance interferences. Our evaluation results clearly demonstrate that the system call frequency can be used to predict if the workload can affect the performance of another co-executing workload, and VH’s CPU load can be a good approximation of the system call frequency. The proposed approach based on the CPU loads could accurately identify a pair of workloads causing frequent resource conflicts, and thus reduce the risk of severe performance interferences between co-executing workloads on an SX-AT system, resulting in shorter makespan without significantly increasing the turn-around time.","PeriodicalId":29895,"journal":{"name":"CCF Transactions on High Performance Computing","volume":"440 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135480691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FILL: a heterogeneous resource scheduling system addressing the low throughput problem in GROMACS FILL:一个异构资源调度系统，解决GROMACS中的低吞吐量问题

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-09-23 DOI: 10.1007/s42514-023-00169-5

Yueyuan Zhou, ZiYi Ren, En Shao, Lixian Ma, Qiang Hu, Leping Wang, Guangming Tan

引用次数: 0

ConvDarts: a fast and exact convolutional algorithm selector for deep learning frameworks convdart:一个快速、精确的深度学习框架卷积算法选择器

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-09-20 DOI: 10.1007/s42514-023-00167-7

Lu Bai, Weixing Ji, Qinyuan Li, Xilai Yao, Wei Xin, Wanyi Zhu

引用次数: 0

Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920 用静态代码分析器揭示现代高性能计算处理器的性能瓶颈——以鲲鹏920为例

Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CCF Transactions on High Performance Computing

Pub Date : 2023-09-15 DOI: 10.1007/s42514-023-00160-0

Shaojie Tan, Qingcai Jiang, Zhenwei Cao, Xiaoyu Hao, Junshi Chen, Hong An

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

CCF Transactions on High Performance Computing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀