首页 > 最新文献

Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion最新文献

英文 中文
Blockchain as a Trusted Component in Cloud SLA Verification 区块链作为云SLA验证中的可信组件
Amir Teshome Wonjiga, S. Peisert, Louis Rilling, C. Morin
Migrating an application from local compute resources to commercial cloud resources involves giving up full control of the physical infrastructure, as the cloud service provider (CSP) is responsible for managing the physical infrastructure, including its security. The reliance of a tenant on a CSP can create a trust issue around whether the CSP is upholding its end of the bargain. CSPs acknowledge this and provide a guarantee through a Service Level Agreement (SLA). SLAs need to be verified for satisfaction of the defined objectives. To avoid raising the trust issue again, such a verification procedure needs to be unbiased and independently achievable by both tenants and CSPs without one relying on the other party. In this paper, we consider an SLA offered by the provider that guarantees the integrity of tenants' data, and propose to verify the SLA using an integrity checking method based on a distributed ledger. Our proposed method allows both CSPs and tenants to perform integrity checking without one party relying on the other. The method uses a blockchain as a distributed ledger to store evidence of data integrity. Assuming the ledger as a secure, trusted source of information, the evidence can be used to resolve conflicts between providers and tenants. In addition, we present a prototype implementation and an experimental evaluation to show the feasibility of our verification method and to measure the time overhead.
将应用程序从本地计算资源迁移到商业云资源涉及放弃对物理基础设施的完全控制,因为云服务提供商(CSP)负责管理物理基础设施,包括其安全性。租户对CSP的依赖可能会产生一个信任问题,围绕CSP是否坚持其交易的结束。csp承认这一点,并通过服务水平协议(SLA)提供保证。需要验证sla以满足所定义的目标。为了避免再次引发信任问题,这样的验证程序需要由租户和csp独立地实现,而不依赖于另一方。在本文中,我们考虑了供应商提供的保证租户数据完整性的SLA,并提出了使用基于分布式账本的完整性检查方法来验证SLA。我们提出的方法允许csp和租户执行完整性检查,而不需要一方依赖另一方。该方法使用区块链作为分布式分类账来存储数据完整性的证据。假设分类帐是一个安全的、可信的信息源,那么这些证据可以用来解决提供者和租户之间的冲突。此外,我们还提出了一个原型实现和一个实验评估,以显示我们的验证方法的可行性,并测量时间开销。
{"title":"Blockchain as a Trusted Component in Cloud SLA Verification","authors":"Amir Teshome Wonjiga, S. Peisert, Louis Rilling, C. Morin","doi":"10.1145/3368235.3368872","DOIUrl":"https://doi.org/10.1145/3368235.3368872","url":null,"abstract":"Migrating an application from local compute resources to commercial cloud resources involves giving up full control of the physical infrastructure, as the cloud service provider (CSP) is responsible for managing the physical infrastructure, including its security. The reliance of a tenant on a CSP can create a trust issue around whether the CSP is upholding its end of the bargain. CSPs acknowledge this and provide a guarantee through a Service Level Agreement (SLA). SLAs need to be verified for satisfaction of the defined objectives. To avoid raising the trust issue again, such a verification procedure needs to be unbiased and independently achievable by both tenants and CSPs without one relying on the other party. In this paper, we consider an SLA offered by the provider that guarantees the integrity of tenants' data, and propose to verify the SLA using an integrity checking method based on a distributed ledger. Our proposed method allows both CSPs and tenants to perform integrity checking without one party relying on the other. The method uses a blockchain as a distributed ledger to store evidence of data integrity. Assuming the ledger as a secure, trusted source of information, the evidence can be used to resolve conflicts between providers and tenants. In addition, we present a prototype implementation and an experimental evaluation to show the feasibility of our verification method and to measure the time overhead.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115300646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Applicability of Serverless Computing in Fog Computing Environments for IoT Scenarios 无服务器计算在物联网场景雾计算环境中的适用性
Marcel Großmann, Christos Ioannidis, D. Le
With new paradigms to deliver software, a programmer's utopia is close to becoming reality, where she only focuses on the realization of an application without messing around with infrastructural limitations and deployment considerations. Currently, this vision is supported by a paradigm shift of cloud providers' service models, where new abstraction layers enable in particular serverless computing. Besides, the Internet of Things (IoT) requires a shift from the cloud paradigms to a fog computing perspective, where the functionality of a system needs to be allocated in the cloud-to-fog-continuum. In this regard, we analyze the applicability of a Function as a Service (FaaS) framework on an IoT service platform - SensIoT, which actually monitors environmental factors. Additionally, we deliver functions to cheap, energy-efficient Single Board Computers, which nowadays rapidly emerge as nodes of the IoT. We evaluate our approach by analyzing the resource usages of a FaaS enabled SensIoT and give an outline whether the combination of serverless computing, fog computing, and the IoT is going to enable the era of cloudless computing.
有了交付软件的新范例,程序员的乌托邦即将成为现实,在那里他只关注应用程序的实现,而不受基础设施限制和部署考虑的干扰。目前,这一愿景得到了云提供商服务模型范式转变的支持,其中新的抽象层特别支持无服务器计算。此外,物联网(IoT)需要从云计算范式转变为雾计算视角,系统的功能需要在云到雾的连续体中进行分配。在这方面,我们分析了功能即服务(FaaS)框架在物联网服务平台- SensIoT上的适用性,该平台实际上监测环境因素。此外,我们还为廉价,节能的单板计算机提供功能,这些计算机如今迅速成为物联网的节点。我们通过分析支持FaaS的SensIoT的资源使用情况来评估我们的方法,并概述了无服务器计算、雾计算和物联网的结合是否会开启无云计算时代。
{"title":"Applicability of Serverless Computing in Fog Computing Environments for IoT Scenarios","authors":"Marcel Großmann, Christos Ioannidis, D. Le","doi":"10.1145/3368235.3368834","DOIUrl":"https://doi.org/10.1145/3368235.3368834","url":null,"abstract":"With new paradigms to deliver software, a programmer's utopia is close to becoming reality, where she only focuses on the realization of an application without messing around with infrastructural limitations and deployment considerations. Currently, this vision is supported by a paradigm shift of cloud providers' service models, where new abstraction layers enable in particular serverless computing. Besides, the Internet of Things (IoT) requires a shift from the cloud paradigms to a fog computing perspective, where the functionality of a system needs to be allocated in the cloud-to-fog-continuum. In this regard, we analyze the applicability of a Function as a Service (FaaS) framework on an IoT service platform - SensIoT, which actually monitors environmental factors. Additionally, we deliver functions to cheap, energy-efficient Single Board Computers, which nowadays rapidly emerge as nodes of the IoT. We evaluate our approach by analyzing the resource usages of a FaaS enabled SensIoT and give an outline whether the combination of serverless computing, fog computing, and the IoT is going to enable the era of cloudless computing.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"71 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114120275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Envisioning SLO-driven Service Selection in Multi-cloud Applications 设想多云应用中慢速驱动的服务选择
Abdessalam Elhabbash, Yehia El-khatib, G. Blair, Yuhui Lin, A. Barker, John Thomson
The current large selection of cloud instances that are functionally equivalent makes selecting the right cloud service a challenging decision. We envision a model driven engineering (MDE) approach to raise the level of abstraction for cloud service selection. One way to achieve this is through a domain specific language (DSL) for modelling the service level objectives (SLOs) and a brokerage system that utilises the SLO model to select services. However, this demands an understanding of the provider SLAs and the capabilities of the current cloud modelling languages (CMLs). This paper investigates the state-of-the-art for SLO support in both cloud providers SLAs and CMLs in order to identify the gaps for SLO support. We then outline research directions towards achieving the MDE-based cloud brokerage.
目前有大量功能相同的云实例可供选择,这使得选择正确的云服务成为一项具有挑战性的决策。我们设想了一种模型驱动工程(MDE)方法来提高云服务选择的抽象级别。实现这一目标的一种方法是使用领域特定语言(DSL)对服务水平目标(SLO)进行建模,并使用使用SLO模型选择服务的代理系统。然而,这需要了解提供商sla和当前云建模语言(cml)的功能。本文研究了云提供商sla和cml中最先进的SLO支持,以确定SLO支持的差距。然后,我们概述了实现基于mde的云经纪的研究方向。
{"title":"Envisioning SLO-driven Service Selection in Multi-cloud Applications","authors":"Abdessalam Elhabbash, Yehia El-khatib, G. Blair, Yuhui Lin, A. Barker, John Thomson","doi":"10.1145/3368235.3368831","DOIUrl":"https://doi.org/10.1145/3368235.3368831","url":null,"abstract":"The current large selection of cloud instances that are functionally equivalent makes selecting the right cloud service a challenging decision. We envision a model driven engineering (MDE) approach to raise the level of abstraction for cloud service selection. One way to achieve this is through a domain specific language (DSL) for modelling the service level objectives (SLOs) and a brokerage system that utilises the SLO model to select services. However, this demands an understanding of the provider SLAs and the capabilities of the current cloud modelling languages (CMLs). This paper investigates the state-of-the-art for SLO support in both cloud providers SLAs and CMLs in order to identify the gaps for SLO support. We then outline research directions towards achieving the MDE-based cloud brokerage.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124334075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Framework of Automation on Context-Aware Internet of Things (IoT) Systems 上下文感知物联网(IoT)系统的自动化框架
Hossein Chegini, Aniket Mahanti
An ever-increasing number of different types of objects are connecting to the Internet, and this phenomenon is called the Internet of Things(IoT). Processing the IoT generated data by Cloud Computing causes high latency. Fog Computing is a new motivation for resolving the latency issue, which is a hosting environment between the IoT and the Cloud layers. IoT applications are faced with three significant challenges: big data, device heterogeneity, and Fog resiliency. With the motivation of resolving the challenges, this proposal introduces a Microservice software framework for implementing automatic functions in the IoT-Fog-Cloud ecosystem. The proposed Microservice framework will also enable the development of IoT-based context-aware intelligent decision-making systems. We describe the functionality and contribution of each automatic function in the paper.
越来越多的不同类型的物体连接到互联网,这种现象被称为物联网(IoT)。通过云计算处理物联网生成的数据,时延高。雾计算是解决延迟问题的新动力,这是物联网和云层之间的托管环境。物联网应用面临三大挑战:大数据、设备异构和雾弹性。为了解决这些挑战,本提案引入了一个微服务软件框架,用于在物联网-雾云生态系统中实现自动功能。提出的微服务框架还将使基于物联网的上下文感知智能决策系统的开发成为可能。文中介绍了各个自动功能的功能和贡献。
{"title":"A Framework of Automation on Context-Aware Internet of Things (IoT) Systems","authors":"Hossein Chegini, Aniket Mahanti","doi":"10.1145/3368235.3368848","DOIUrl":"https://doi.org/10.1145/3368235.3368848","url":null,"abstract":"An ever-increasing number of different types of objects are connecting to the Internet, and this phenomenon is called the Internet of Things(IoT). Processing the IoT generated data by Cloud Computing causes high latency. Fog Computing is a new motivation for resolving the latency issue, which is a hosting environment between the IoT and the Cloud layers. IoT applications are faced with three significant challenges: big data, device heterogeneity, and Fog resiliency. With the motivation of resolving the challenges, this proposal introduces a Microservice software framework for implementing automatic functions in the IoT-Fog-Cloud ecosystem. The proposed Microservice framework will also enable the development of IoT-based context-aware intelligent decision-making systems. We describe the functionality and contribution of each automatic function in the paper.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"154 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117177649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Towards Self-Organized Load Distribution over Chaotic Resources 混沌资源下的自组织负载分配
Yong-Hyuk Moon, Yong-Ju Lee
This paper addresses a question of whether resources suffering nonlinear fluctuations can maintain their stability as a system expands for computing tasks in a distributed manner. To this end, we suggest that by evolving individual resources following the self-organized criticality of sandpile model, the whole load distribution system can reach a stable state after a small but extremely local overhead occurs, leading to lots of avalanches. The proposed load balancing approach is evaluated in terms of latency minimization.
本文解决了当系统以分布式方式扩展计算任务时,遭受非线性波动的资源是否能够保持其稳定性的问题。为此,我们建议,通过遵循沙堆模型的自组织临界性对单个资源进行演化,使整个负荷分配系统在发生小而极局部的开销导致大量雪崩后达到稳定状态。所提出的负载平衡方法是根据延迟最小化进行评估的。
{"title":"Towards Self-Organized Load Distribution over Chaotic Resources","authors":"Yong-Hyuk Moon, Yong-Ju Lee","doi":"10.1145/3368235.3369366","DOIUrl":"https://doi.org/10.1145/3368235.3369366","url":null,"abstract":"This paper addresses a question of whether resources suffering nonlinear fluctuations can maintain their stability as a system expands for computing tasks in a distributed manner. To this end, we suggest that by evolving individual resources following the self-organized criticality of sandpile model, the whole load distribution system can reach a stable state after a small but extremely local overhead occurs, leading to lots of avalanches. The proposed load balancing approach is evaluated in terms of latency minimization.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126813603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Serverless Computing and Cloud Function-based Applications 无服务器计算和基于云功能的应用
Josef Spillner
Serverless computing is a growing industry trend with corresponding rise in interest by scholars and tinkerers. Increasingly, open source and academic system prototypes are being proposed especially in relation with cloud, edge and fog computing among other distributed computing specialisations. Due to the strict separation between elastically scalable stateless microservices bound to stateful backend services prevalent in this computing paradigm, the resulting applications are inherently distributed with favourable characteristics such as elastic scalability and disposability. Still, software application developers are confronted with a multitude of different methods and tools to build, test and deploy their function-based applications in today's serverless ecosystems. The logical next step is therefore a methodical development approach with key enablers based on a classification of languages, tools, systems, system behaviours, patterns, pitfalls, application architectures, compositions and cloud services around the serverless application development process.
无服务器计算是一个不断发展的行业趋势,学者和修补者对此的兴趣也相应增加。越来越多的开源和学术系统原型被提出,特别是与其他分布式计算专业中的云计算、边缘计算和雾计算相关。由于在这种计算范式中,绑定到有状态后端服务的弹性可伸缩无状态微服务之间存在严格的分离,因此产生的应用程序本质上是分布式的,具有弹性可伸缩性和可处置性等有利特征。尽管如此,软件应用程序开发人员仍然面临着许多不同的方法和工具,以便在当今的无服务器生态系统中构建、测试和部署基于功能的应用程序。因此,合乎逻辑的下一步是一种基于围绕无服务器应用程序开发过程的语言、工具、系统、系统行为、模式、缺陷、应用程序体系结构、组合和云服务分类的有系统的开发方法。
{"title":"Serverless Computing and Cloud Function-based Applications","authors":"Josef Spillner","doi":"10.1145/3368235.3370269","DOIUrl":"https://doi.org/10.1145/3368235.3370269","url":null,"abstract":"Serverless computing is a growing industry trend with corresponding rise in interest by scholars and tinkerers. Increasingly, open source and academic system prototypes are being proposed especially in relation with cloud, edge and fog computing among other distributed computing specialisations. Due to the strict separation between elastically scalable stateless microservices bound to stateful backend services prevalent in this computing paradigm, the resulting applications are inherently distributed with favourable characteristics such as elastic scalability and disposability. Still, software application developers are confronted with a multitude of different methods and tools to build, test and deploy their function-based applications in today's serverless ecosystems. The logical next step is therefore a methodical development approach with key enablers based on a classification of languages, tools, systems, system behaviours, patterns, pitfalls, application architectures, compositions and cloud services around the serverless application development process.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126270409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Intelligent Price Alert System for Digital Assets - Cryptocurrencies 数字资产智能价格警报系统-加密货币
Sronglong Chhem, A. Anjum, Bilal Arshad
Cryptocurrency market is very volatile, trading prices for some tokens can experience a sudden spike up or downturn in a matter of minutes. As a result, traders are facing difficulty following with all the trading price movements unless they are monitoring them manually. Hence, we propose a real-time alert system for monitoring those trading prices, sending notifications to users if any target prices match or an anomaly occurs. We adopt a streaming platform as the backbone of our system. It can handle thousands of messages per second with low latency rate at an average of 19 seconds on our testing environment. Long-Short-Term-Memory (LSTM) model is used as an anomaly detector. We compare the impact of five different data normalisation approaches with LSTM model on Bitcoin price dataset. The result shows that decimal scaling produces only Mean Absolute Percentage Error (MAPE) of 8.4 per cent prediction error rate on daily price data, which is the best performance achieved compared to other observed methods. However, with one-minute price dataset, our model produces higher prediction error making it impractical to distinguish between normal and anomaly points of price movement.
加密货币市场非常不稳定,一些代币的交易价格可能在几分钟内突然上涨或下跌。因此,交易者很难跟随所有的交易价格变动,除非他们手动监控它们。因此,我们提出了一个实时警报系统来监控这些交易价格,如果任何目标价格匹配或异常发生,就会向用户发送通知。我们采用流媒体平台作为系统的主干。在我们的测试环境中,它可以以平均19秒的低延迟率每秒处理数千条消息。使用长短期记忆(LSTM)模型作为异常检测器。我们比较了五种不同的数据归一化方法与LSTM模型对比特币价格数据集的影响。结果表明,十进制缩放对每日价格数据的预测错误率仅为8.4%的平均绝对百分比误差(MAPE),与其他观察方法相比,这是取得的最佳性能。然而,对于一分钟价格数据集,我们的模型产生更高的预测误差,使得区分价格运动的正常点和异常点变得不切实际。
{"title":"Intelligent Price Alert System for Digital Assets - Cryptocurrencies","authors":"Sronglong Chhem, A. Anjum, Bilal Arshad","doi":"10.1145/3368235.3368874","DOIUrl":"https://doi.org/10.1145/3368235.3368874","url":null,"abstract":"Cryptocurrency market is very volatile, trading prices for some tokens can experience a sudden spike up or downturn in a matter of minutes. As a result, traders are facing difficulty following with all the trading price movements unless they are monitoring them manually. Hence, we propose a real-time alert system for monitoring those trading prices, sending notifications to users if any target prices match or an anomaly occurs. We adopt a streaming platform as the backbone of our system. It can handle thousands of messages per second with low latency rate at an average of 19 seconds on our testing environment. Long-Short-Term-Memory (LSTM) model is used as an anomaly detector. We compare the impact of five different data normalisation approaches with LSTM model on Bitcoin price dataset. The result shows that decimal scaling produces only Mean Absolute Percentage Error (MAPE) of 8.4 per cent prediction error rate on daily price data, which is the best performance achieved compared to other observed methods. However, with one-minute price dataset, our model produces higher prediction error making it impractical to distinguish between normal and anomaly points of price movement.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132202120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deadlock Detection for Concurrent Programs Using Resource Footprints 使用资源占用的并发程序的死锁检测
Sonam Sherpa, Abdi Vicenciodelmoral, Xinghui Zhao
Concurrency bugs are difficult to diagnose and fix, due to the nature of the bugs and how they manifest themselves during execution. Traditional approaches for diagnosing concurrency bugs attempt to reproduce the exact execution schedule which reveals the bug, resulting in high runtime overhead. In this paper, we present our work in identifying concurrency bugs using resource consumption footprints. This is based on the observation that resource access and consumption patterns are critical indications of the run-time behavior of concurrent software, and can be used as a powerful mechanism to guide the software debugging process. We demonstrate that monitoring resource footprints at runtime can effectively help detect software bugs. Specifically, for MPI programs, a simple SVM classifier can detect deadlocks with high accuracy using only the CPU usage patterns.
由于bug的性质以及它们在执行过程中表现出来的方式,并发错误很难诊断和修复。诊断并发性错误的传统方法试图重现显示错误的精确执行时间表,从而导致高运行时开销。在本文中,我们介绍了使用资源消耗足迹识别并发性错误的工作。这是基于这样一种观察,即资源访问和消费模式是并发软件运行时行为的关键指示,并且可以用作指导软件调试过程的强大机制。我们演示了在运行时监视资源占用可以有效地帮助检测软件错误。具体来说,对于MPI程序,一个简单的SVM分类器仅使用CPU使用模式就可以高精度地检测死锁。
{"title":"Deadlock Detection for Concurrent Programs Using Resource Footprints","authors":"Sonam Sherpa, Abdi Vicenciodelmoral, Xinghui Zhao","doi":"10.1145/3368235.3369370","DOIUrl":"https://doi.org/10.1145/3368235.3369370","url":null,"abstract":"Concurrency bugs are difficult to diagnose and fix, due to the nature of the bugs and how they manifest themselves during execution. Traditional approaches for diagnosing concurrency bugs attempt to reproduce the exact execution schedule which reveals the bug, resulting in high runtime overhead. In this paper, we present our work in identifying concurrency bugs using resource consumption footprints. This is based on the observation that resource access and consumption patterns are critical indications of the run-time behavior of concurrent software, and can be used as a powerful mechanism to guide the software debugging process. We demonstrate that monitoring resource footprints at runtime can effectively help detect software bugs. Specifically, for MPI programs, a simple SVM classifier can detect deadlocks with high accuracy using only the CPU usage patterns.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134638873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Opportunities and Challenges for Resource Management and Machine Learning Clusters 资源管理与机器学习集群的机遇与挑战
L. Chen
The practice of collecting big performance data has changed how infrastructure providers model and manage the system in the past decade. There is a methodology shift from domain-knowledge based white-box models, e.g., queueing [1] and simulation[2], to black-box data-driven models, e.g., machine learning. Such a game change for resource management from workload characterization[3], dependability prediction [4,5] to sprinting policy[6], can be seen from major IT infastructure providers, e.g., IBM and Google. While applying higher order deep neural networks show promises in predicting performance [4,5], the scalability of such an approach is often limited. A plethoral of prior work focus on deriving complex and highly accurate models, such as deep neural networks, overlooking the constraints of computation efficiency and the scalability. Their applicability on resource management problems of the production systems is thus hindered. A crucial aspect to derive accurate and scalable predictive performance models lies on leveraging the domain expertise, white-box models, and black-box models. Examples of scalable ticket management services from IBM [4] and predicting job failures [5] at Google. Model driven computation sprinting [6] dynamically scales the frequency and the allocation of computing cores based on grey box models which outperforms deep neural networks. Aforementioned case studies strongly argue for the importance of combing domain-driven and data-driven models At the same time, various of acceleration techniques are developed to reduce the computation overhead of (deep) machine learning models in small scale and isolated testbed. Managing the performance of clusters that are dominated by machine learning workloads remains challenging and calls for novel solutions. SlimML [9] accelerates the ML modeli training time by only processing critical data set at a slight cost of accuracy, whereas Dias [7] simultaneously explores the data dropping and frequency sprinting for ML clusters that support multiple priorities of different training workloads. Aforementioned studies point out the complexity of managing the accuracy-efficiency tradeoff of ML jobs in a cluster-like environment where jobs interfere each other via sharing the underlying resources and common data sets.
在过去十年中,收集大性能数据的做法已经改变了基础设施提供商对系统的建模和管理方式。从基于领域知识的白盒模型(如排队[1]和仿真[2])到黑盒数据驱动模型(如机器学习),方法学正在发生转变。从工作负载表征[3]、可靠性预测[4,5]到冲刺策略[6],这种资源管理的游戏改变可以从主要的IT基础设施提供商(例如IBM和Google)那里看到。虽然应用高阶深度神经网络在预测性能方面有希望[4,5],但这种方法的可扩展性通常是有限的。先前的大量工作集中于推导复杂和高精度的模型,例如深度神经网络,而忽略了计算效率和可扩展性的约束。因此,它们在生产系统资源管理问题上的适用性受到阻碍。获得准确和可伸缩的预测性能模型的一个关键方面在于利用领域专业知识、白盒模型和黑盒模型。IBM的可扩展票据管理服务[4]和Google的预测作业失败[5]的例子。模型驱动的计算冲刺[6]基于灰盒模型动态缩放计算核的频率和分配,优于深度神经网络。上述案例研究强烈地证明了结合领域驱动和数据驱动模型的重要性,同时,各种加速技术被开发出来,以减少(深度)机器学习模型在小规模和孤立的测试平台上的计算开销。管理由机器学习工作负载主导的集群的性能仍然具有挑战性,需要新的解决方案。SlimML[9]通过只处理关键数据集来加速ML模型的训练时间,而Dias[7]同时探索了支持不同训练工作负载的多个优先级的ML集群的数据下降和频率冲刺。上述研究指出了在类似集群的环境中管理ML作业的准确性和效率权衡的复杂性,在这种环境中,作业通过共享底层资源和公共数据集而相互干扰。
{"title":"Opportunities and Challenges for Resource Management and Machine Learning Clusters","authors":"L. Chen","doi":"10.1145/3368235.3369376","DOIUrl":"https://doi.org/10.1145/3368235.3369376","url":null,"abstract":"The practice of collecting big performance data has changed how infrastructure providers model and manage the system in the past decade. There is a methodology shift from domain-knowledge based white-box models, e.g., queueing [1] and simulation[2], to black-box data-driven models, e.g., machine learning. Such a game change for resource management from workload characterization[3], dependability prediction [4,5] to sprinting policy[6], can be seen from major IT infastructure providers, e.g., IBM and Google. While applying higher order deep neural networks show promises in predicting performance [4,5], the scalability of such an approach is often limited. A plethoral of prior work focus on deriving complex and highly accurate models, such as deep neural networks, overlooking the constraints of computation efficiency and the scalability. Their applicability on resource management problems of the production systems is thus hindered. A crucial aspect to derive accurate and scalable predictive performance models lies on leveraging the domain expertise, white-box models, and black-box models. Examples of scalable ticket management services from IBM [4] and predicting job failures [5] at Google. Model driven computation sprinting [6] dynamically scales the frequency and the allocation of computing cores based on grey box models which outperforms deep neural networks. Aforementioned case studies strongly argue for the importance of combing domain-driven and data-driven models At the same time, various of acceleration techniques are developed to reduce the computation overhead of (deep) machine learning models in small scale and isolated testbed. Managing the performance of clusters that are dominated by machine learning workloads remains challenging and calls for novel solutions. SlimML [9] accelerates the ML modeli training time by only processing critical data set at a slight cost of accuracy, whereas Dias [7] simultaneously explores the data dropping and frequency sprinting for ML clusters that support multiple priorities of different training workloads. Aforementioned studies point out the complexity of managing the accuracy-efficiency tradeoff of ML jobs in a cluster-like environment where jobs interfere each other via sharing the underlying resources and common data sets.","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125470556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
UCC/BDCAT'19 Poster Chairs Welcome Message UCC/BDCAT'19海报椅欢迎致辞
Kenichi Kourai, Evangelos Pournaras
{"title":"UCC/BDCAT'19 Poster Chairs Welcome Message","authors":"Kenichi Kourai, Evangelos Pournaras","doi":"10.1145/3368235.3368880","DOIUrl":"https://doi.org/10.1145/3368235.3368880","url":null,"abstract":"","PeriodicalId":166357,"journal":{"name":"Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122343216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing Companion
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1