首页 > 最新文献

2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)最新文献

英文 中文
Offloading the Training of an I/O Access Pattern Detector to the Cloud 将I/O访问模式检测器的训练卸载到云端
C. Künas, M. Serpa, J. L. Bez, E. Padoin, P. Navaux
I/O operations are a bottleneck for numerous applications, so optimizing the performance of these operations is of paramount importance. Many techniques explore and apply optimizations to different layers of the I/O stack to improve performance. The difficulty that arises is that the workload changes constantly. So detecting access patterns correctly, at runtime, becomes essential for systems that seek to self-adjust their parameters. Furthermore, the I/O pattern detection techniques should represent minimal overhead and should be able to perform detection as quickly as possible. This paper approaches a machine learning technique for detecting the I/O access patterns and proposes offloading the local training workload to the cloud using a TPU accelerator. Such an approach does not interfere with classifier accuracy (reaching up to 99% accuracy). Still, it allows the training to be asynchronous, enabling the local machine to allocate its computing resources to scientific applications while the model is trained or updated in the cloud.
I/O操作是许多应用程序的瓶颈,因此优化这些操作的性能至关重要。许多技术对I/O堆栈的不同层进行探索和应用优化,以提高性能。出现的困难是工作量不断变化。因此,在运行时正确检测访问模式对于寻求自我调整其参数的系统至关重要。此外,I/O模式检测技术应该表示最小的开销,并且应该能够尽可能快地执行检测。本文探讨了一种用于检测I/O访问模式的机器学习技术,并建议使用TPU加速器将本地训练工作量卸载到云端。这种方法不会干扰分类器的准确率(达到99%的准确率)。尽管如此,它允许异步训练,使本地机器能够将其计算资源分配给科学应用程序,而模型在云中进行训练或更新。
{"title":"Offloading the Training of an I/O Access Pattern Detector to the Cloud","authors":"C. Künas, M. Serpa, J. L. Bez, E. Padoin, P. Navaux","doi":"10.1109/sbac-padw53941.2021.00013","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00013","url":null,"abstract":"I/O operations are a bottleneck for numerous applications, so optimizing the performance of these operations is of paramount importance. Many techniques explore and apply optimizations to different layers of the I/O stack to improve performance. The difficulty that arises is that the workload changes constantly. So detecting access patterns correctly, at runtime, becomes essential for systems that seek to self-adjust their parameters. Furthermore, the I/O pattern detection techniques should represent minimal overhead and should be able to perform detection as quickly as possible. This paper approaches a machine learning technique for detecting the I/O access patterns and proposes offloading the local training workload to the cloud using a TPU accelerator. Such an approach does not interfere with classifier accuracy (reaching up to 99% accuracy). Still, it allows the training to be asynchronous, enabling the local machine to allocate its computing resources to scientific applications while the model is trained or updated in the cloud.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130411305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Quantifying and detecting HPC resource wastage in cloud environments 量化和检测云环境下的HPC资源浪费
William F. C. Tavares, M. M. Assis, E. Borin
Among many details that users need to consider when using cloud computing, the care not to waste resources requires more attention by administrators and new users. When the application does not fully utilize the provisioned resource, the end-of-the-month bill is unnecessarily increased. Several studies have developed solutions to avoid wastage using predictive techniques. Nonetheless, these approaches require applications’ to have predictive behavior and depend on pre-executions or history data. To circumvent these limitations, we explore how a reactive solution can be used to detect and contain wastage. More specifically, we discuss several important issues that arise when quantifying resource wastage caused by HPC resource wastage on the cloud and propose a reactive strategy to quantify, detect, and contain resource wastage in this context. This solution is designed so that it can be applied in environments with expert and non-expert users with no prior knowledge about the applications.
在用户使用云计算时需要考虑的许多细节中,不浪费资源需要管理员和新用户更多地关注。当应用程序没有充分利用所提供的资源时,月末账单就会不必要地增加。一些研究已经开发了使用预测技术来避免浪费的解决方案。尽管如此,这些方法要求应用程序具有预测行为,并依赖于预执行或历史数据。为了规避这些限制,我们将探讨如何使用反应性解决方案来检测和控制浪费。更具体地说,我们讨论了在量化由云上的HPC资源浪费引起的资源浪费时出现的几个重要问题,并提出了在这种情况下量化、检测和控制资源浪费的反应策略。该解决方案的设计目的是使其可以应用于具有专家和非专家用户的环境中,而不需要事先了解应用程序。
{"title":"Quantifying and detecting HPC resource wastage in cloud environments","authors":"William F. C. Tavares, M. M. Assis, E. Borin","doi":"10.1109/sbac-padw53941.2021.00017","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00017","url":null,"abstract":"Among many details that users need to consider when using cloud computing, the care not to waste resources requires more attention by administrators and new users. When the application does not fully utilize the provisioned resource, the end-of-the-month bill is unnecessarily increased. Several studies have developed solutions to avoid wastage using predictive techniques. Nonetheless, these approaches require applications’ to have predictive behavior and depend on pre-executions or history data. To circumvent these limitations, we explore how a reactive solution can be used to detect and contain wastage. More specifically, we discuss several important issues that arise when quantifying resource wastage caused by HPC resource wastage on the cloud and propose a reactive strategy to quantify, detect, and contain resource wastage in this context. This solution is designed so that it can be applied in environments with expert and non-expert users with no prior knowledge about the applications.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132110534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
CLAP-Bot: a framework for automatic optimization of high-performance elastic applications on the Clouds CLAP-Bot:用于自动优化云上的高性能弹性应用程序的框架
O. Napoli, Gustavo Ciotto Pinton, E. Borin
The computational cloud has become notorious due to its business model, where the user only pays to use the system, with no acquisition or maintenance costs. However, cloud providers such as AWS EC2 and Google Computing Engine offer several virtual machine types making it difficult to choose which of them is most suitable to the user’s application and objective.In this work, we present CLAP-Bot, a system that automatically monitors and adjusts the computing infrastructure based on some recipe. CLAP-Bot is built over CLAP, allowing creating and managing computational clusters in different cloud providers. The recipe is a component that can read application metrics and execute a set of actions on the infrastructure. The application monitor is decoupled from the recipe, allowing it to be used transparently with different applications. We show how CLAP-Bot works by implementing three dynamic provisioning policies as recipes and evaluating them. Besides that, together with CLAP-Bot we also present CLAP-Bot-Sim, a discrete event simulator that allows modeling the use of a given recipe without the need to instantiate any virtual machine. CLAP-Bot-Sim also allows modeling dynamic events, such as virtual machine interruptions and instance price oscillation over time. We show that CLAP-Bot-Sim can accurately simulate the effects of recipes on the computing infrastructure and can easily be interchanged with CLAP-Bot.
计算云因其商业模式而臭名昭著,在这种模式下,用户只需支付使用系统的费用,无需购买或维护成本。然而,像AWS EC2和b谷歌Computing Engine这样的云提供商提供了几种虚拟机类型,因此很难选择哪种虚拟机最适合用户的应用程序和目标。在这项工作中,我们提出了CLAP-Bot,一个基于某些配方自动监控和调整计算基础设施的系统。CLAP- bot建立在CLAP之上,允许在不同的云提供商中创建和管理计算集群。配方是一个组件,它可以读取应用程序指标并在基础设施上执行一组操作。应用程序监视器与配方解耦,允许它透明地用于不同的应用程序。我们通过将三个动态供应策略实现为食谱并对其进行评估,来展示clip - bot是如何工作的。除此之外,与CLAP-Bot一起,我们还提出了CLAP-Bot- sim,这是一个离散事件模拟器,允许对给定配方的使用进行建模,而无需实例化任何虚拟机。CLAP-Bot-Sim还允许建模动态事件,例如虚拟机中断和实例价格随时间波动。我们证明了CLAP-Bot- sim可以准确地模拟食谱对计算基础设施的影响,并且可以很容易地与CLAP-Bot交换。
{"title":"CLAP-Bot: a framework for automatic optimization of high-performance elastic applications on the Clouds","authors":"O. Napoli, Gustavo Ciotto Pinton, E. Borin","doi":"10.1109/sbac-padw53941.2021.00015","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00015","url":null,"abstract":"The computational cloud has become notorious due to its business model, where the user only pays to use the system, with no acquisition or maintenance costs. However, cloud providers such as AWS EC2 and Google Computing Engine offer several virtual machine types making it difficult to choose which of them is most suitable to the user’s application and objective.In this work, we present CLAP-Bot, a system that automatically monitors and adjusts the computing infrastructure based on some recipe. CLAP-Bot is built over CLAP, allowing creating and managing computational clusters in different cloud providers. The recipe is a component that can read application metrics and execute a set of actions on the infrastructure. The application monitor is decoupled from the recipe, allowing it to be used transparently with different applications. We show how CLAP-Bot works by implementing three dynamic provisioning policies as recipes and evaluating them. Besides that, together with CLAP-Bot we also present CLAP-Bot-Sim, a discrete event simulator that allows modeling the use of a given recipe without the need to instantiate any virtual machine. CLAP-Bot-Sim also allows modeling dynamic events, such as virtual machine interruptions and instance price oscillation over time. We show that CLAP-Bot-Sim can accurately simulate the effects of recipes on the computing infrastructure and can easily be interchanged with CLAP-Bot.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126881927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WCC Program Committee WCC项目委员会
{"title":"WCC Program Committee","authors":"","doi":"10.1109/sbac-padw53941.2021.00009","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00009","url":null,"abstract":"","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131440401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selecting efficient VM types to train deep learning models on Amazon SageMaker 选择高效的虚拟机类型在Amazon SageMaker上训练深度学习模型
Rafael Keller Tesser, Alvaro Marques, E. Borin
The cloud has become a popular environment for running Deep Learning (DL) applications. Public cloud providers charge by the amount time the resources are actually used, with the price by hour depending on the configuration of the chosen cloud instance. Instances are usually provided in the form of a VM that gives access to a certain hardware configuration, and may also come with a pre-configured software environment. More advanced, and theoretically faster, VMs are usually more expensive, but may not necessarily provide the best performance for all applications. Therefore, in order to choose the best instance (or VM type), users must consider the relative performances (and consequent cost) of different VMs when running their specific target application. Taking this into account, we propose a model to estimate the relative performance and cost of training deep learning applications running in different VM instances. This model is built upon observations derived from the performance profile of executions of three different DL applications, on 12 different public cloud instances. We argue that this model is a valuable tool for cloud users looking for optimal VM types to train their deep learning applications on the cloud.
云已经成为运行深度学习(DL)应用程序的流行环境。公共云提供商按资源实际使用的时间收费,按小时收费取决于所选云实例的配置。实例通常以VM的形式提供,VM允许访问特定的硬件配置,也可能带有预配置的软件环境。更高级,理论上更快,vm通常更昂贵,但不一定能为所有应用程序提供最佳性能。因此,为了选择最佳实例(或虚拟机类型),用户必须在运行特定目标应用程序时考虑不同虚拟机的相对性能(以及相应的成本)。考虑到这一点,我们提出了一个模型来估计在不同VM实例中运行的训练深度学习应用程序的相对性能和成本。该模型建立在对12个不同公共云实例上执行三个不同DL应用程序的性能概况的观察基础之上。我们认为,对于云用户来说,这个模型是一个有价值的工具,可以帮助他们寻找最佳的虚拟机类型,在云上训练他们的深度学习应用程序。
{"title":"Selecting efficient VM types to train deep learning models on Amazon SageMaker","authors":"Rafael Keller Tesser, Alvaro Marques, E. Borin","doi":"10.1109/SBAC-PADW53941.2021.00014","DOIUrl":"https://doi.org/10.1109/SBAC-PADW53941.2021.00014","url":null,"abstract":"The cloud has become a popular environment for running Deep Learning (DL) applications. Public cloud providers charge by the amount time the resources are actually used, with the price by hour depending on the configuration of the chosen cloud instance. Instances are usually provided in the form of a VM that gives access to a certain hardware configuration, and may also come with a pre-configured software environment. More advanced, and theoretically faster, VMs are usually more expensive, but may not necessarily provide the best performance for all applications. Therefore, in order to choose the best instance (or VM type), users must consider the relative performances (and consequent cost) of different VMs when running their specific target application. Taking this into account, we propose a model to estimate the relative performance and cost of training deep learning applications running in different VM instances. This model is built upon observations derived from the performance profile of executions of three different DL applications, on 12 different public cloud instances. We argue that this model is a valuable tool for cloud users looking for optimal VM types to train their deep learning applications on the cloud.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132682223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Message from the WCC 2021 Workshop Chairs 来自2021年WCC工作坊主席的信息
{"title":"Message from the WCC 2021 Workshop Chairs","authors":"","doi":"10.1109/sbac-padw53941.2021.00008","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00008","url":null,"abstract":"","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123886278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An evaluation of Cassandra NoSQL database on a low-power cluster Cassandra NoSQL数据库在低功耗集群上的性能评估
Lucas B. da Silva, J. F. Lima
The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices, create challenges for cloud data systems in order to support huge datasets and very high request rates. NoSQL distributed databases such as Cassandra have been used for unstructured data storage and to increase horizontal scalability and high availability. In this paper, we evaluated Cassandra on a low-power low-cost cluster of commodity Single Board Computers (SBC). The cluster has 15 Raspberry Pi v3 nodes with Docker Swarm orchestration tool for Cassandra service deployment and ingress load balancing over SBCs. Experimental results demonstrated that hardware limitations impacted workload throughput, but read and write latencies were comparable to results from other works on high-end or virtualized platforms. Despite the observed limitations, the results show that a low-cost SBC cluster can support cloud serving goals such as scale-out, elasticity and high availability.
社交媒体、非常规网络技术、移动应用程序和物联网(IoT)设备的不断发展,为云数据系统支持庞大的数据集和非常高的请求率带来了挑战。NoSQL分布式数据库(如Cassandra)已被用于非结构化数据存储,并增加水平可伸缩性和高可用性。在本文中,我们在低功耗低成本的商用单板计算机(SBC)集群上对Cassandra进行了评估。集群有15个Raspberry Pi v3节点,使用Docker Swarm编排工具,用于Cassandra服务部署和sbc的入口负载均衡。实验结果表明,硬件限制会影响工作负载吞吐量,但读写延迟与高端平台或虚拟化平台上的其他工作的结果相当。尽管存在观察到的限制,但结果表明,低成本SBC集群可以支持云服务目标,如横向扩展、弹性和高可用性。
{"title":"An evaluation of Cassandra NoSQL database on a low-power cluster","authors":"Lucas B. da Silva, J. F. Lima","doi":"10.1109/sbac-padw53941.2021.00012","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00012","url":null,"abstract":"The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices, create challenges for cloud data systems in order to support huge datasets and very high request rates. NoSQL distributed databases such as Cassandra have been used for unstructured data storage and to increase horizontal scalability and high availability. In this paper, we evaluated Cassandra on a low-power low-cost cluster of commodity Single Board Computers (SBC). The cluster has 15 Raspberry Pi v3 nodes with Docker Swarm orchestration tool for Cassandra service deployment and ingress load balancing over SBCs. Experimental results demonstrated that hardware limitations impacted workload throughput, but read and write latencies were comparable to results from other works on high-end or virtualized platforms. Despite the observed limitations, the results show that a low-cost SBC cluster can support cloud serving goals such as scale-out, elasticity and high availability.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115131168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
WAMCA 2021 Program Committee WAMCA 2021项目委员会
{"title":"WAMCA 2021 Program Committee","authors":"","doi":"10.1109/sbac-padw53941.2021.00007","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00007","url":null,"abstract":"","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122298066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Memory Affinity Analysis of Scientific Applications on NUMA Platforms NUMA平台上科学应用的内存亲和性分析
Rafael Gauna Trindade, J. F. Lima, A. Charão
Understanding the underlying architecture is essential for scientific applications in general. An example of a computing environment is Non-Uniform Memory Access (NUMA) systems that enable a large amount of shared main memory. Nevertheless, NUMA systems can impose significant access latencies on data communications between distant memory nodes. Parallel applications with a naïve design may suffer significant performance penalties due to the lack of locality mechanisms. In this paper we present performance metrics on scientific applications to identify locality problems in NUMA systems and show data and thread mapping strategies to mitigate them. Our experiments were performed with four well-known scientific applications: CoMD, LBM, LULESH and Ondes3D. Experimental results demonstrate that scientific applications had significant locality problems and data and thread mapping strategies improved performance on all four applications.
一般来说,理解底层架构对于科学应用程序至关重要。计算环境的一个例子是支持大量共享主存的非统一内存访问(NUMA)系统。然而,NUMA系统可能会对远程内存节点之间的数据通信施加显著的访问延迟。由于缺乏局部性机制,采用naïve设计的并行应用程序可能会遭受严重的性能损失。在本文中,我们提出了科学应用程序的性能指标,以识别NUMA系统中的局部性问题,并展示了缓解这些问题的数据和线程映射策略。我们的实验使用了四种著名的科学应用程序:CoMD, LBM, LULESH和Ondes3D。实验结果表明,科学应用存在明显的局部性问题,数据和线程映射策略提高了这四种应用的性能。
{"title":"A Memory Affinity Analysis of Scientific Applications on NUMA Platforms","authors":"Rafael Gauna Trindade, J. F. Lima, A. Charão","doi":"10.1109/sbac-padw53941.2021.00011","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00011","url":null,"abstract":"Understanding the underlying architecture is essential for scientific applications in general. An example of a computing environment is Non-Uniform Memory Access (NUMA) systems that enable a large amount of shared main memory. Nevertheless, NUMA systems can impose significant access latencies on data communications between distant memory nodes. Parallel applications with a naïve design may suffer significant performance penalties due to the lack of locality mechanisms. In this paper we present performance metrics on scientific applications to identify locality problems in NUMA systems and show data and thread mapping strategies to mitigate them. Our experiments were performed with four well-known scientific applications: CoMD, LBM, LULESH and Ondes3D. Experimental results demonstrate that scientific applications had significant locality problems and data and thread mapping strategies improved performance on all four applications.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129509744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Cloud-Based Batch Processing System for Loosely-Coupled Applications 基于云的松耦合应用批处理系统
Raoni Matos Smaneoto, T. Pereira, F. Brasileiro
With the increased amount of data available for processing, and the increased need of processing this data, loosely-coupled batch applications have become very popular. Many batch applications require a high level of processing capacity, which leads us to the need of high performance computing infrastructures. This approach has been used for a long time, mainly for scientific purposes, and focused on the conventional environments for HPC, namely local clusters and supercomputers. The high-speed networks present in these systems are paramount for the execution of tightly-coupled scientific applications, but are wasted when executing loosely-coupled applications. Cloud infrastructures, on the other hand, provide a more appropriate infrastructure to support such loosely-coupled applications. Unfortunately, the user experience in cloud systems is completely different from that of conventional batch systems, mainly because the infrastructure needs to be deployed and subsequently released, to achieve the desired gains. In this paper we propose the architecture of a batch processing system that takes advantage of common features of cloud infrastructures to minimize cost and waiting time, while providing a user experience that is similar to conventional HPC systems.
随着可用于处理的数据量的增加,以及处理这些数据的需求的增加,松耦合批处理应用程序变得非常流行。许多批处理应用程序需要高水平的处理能力,这导致我们需要高性能的计算基础设施。这种方法已经使用了很长时间,主要用于科学目的,并且专注于HPC的传统环境,即本地集群和超级计算机。这些系统中的高速网络对于执行紧密耦合的科学应用程序至关重要,但是在执行松耦合的应用程序时就浪费了。另一方面,云基础设施提供了更合适的基础设施来支持这种松耦合的应用程序。不幸的是,云系统中的用户体验与传统的批处理系统完全不同,这主要是因为需要部署和随后发布基础设施,以实现预期的收益。在本文中,我们提出了一个批处理系统的架构,该架构利用云基础设施的共同特征来最小化成本和等待时间,同时提供类似于传统HPC系统的用户体验。
{"title":"A Cloud-Based Batch Processing System for Loosely-Coupled Applications","authors":"Raoni Matos Smaneoto, T. Pereira, F. Brasileiro","doi":"10.1109/sbac-padw53941.2021.00018","DOIUrl":"https://doi.org/10.1109/sbac-padw53941.2021.00018","url":null,"abstract":"With the increased amount of data available for processing, and the increased need of processing this data, loosely-coupled batch applications have become very popular. Many batch applications require a high level of processing capacity, which leads us to the need of high performance computing infrastructures. This approach has been used for a long time, mainly for scientific purposes, and focused on the conventional environments for HPC, namely local clusters and supercomputers. The high-speed networks present in these systems are paramount for the execution of tightly-coupled scientific applications, but are wasted when executing loosely-coupled applications. Cloud infrastructures, on the other hand, provide a more appropriate infrastructure to support such loosely-coupled applications. Unfortunately, the user experience in cloud systems is completely different from that of conventional batch systems, mainly because the infrastructure needs to be deployed and subsequently released, to achieve the desired gains. In this paper we propose the architecture of a batch processing system that takes advantage of common features of cloud infrastructures to minimize cost and waiting time, while providing a user experience that is similar to conventional HPC systems.","PeriodicalId":233108,"journal":{"name":"2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132849044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1