2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)最新文献

英文中文

IoT Service Based on JointCloud Blockchain: The Case Study of Smart Traveling 基于JointCloud区块链的物联网服务——以智慧出行为例

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00036

Weili Chen, Mingjie Ma, Yongjian Ye, Zibin Zheng, Yuren Zhou

With the advancements in Internet technologies and Wireless Sensor Networks (WSN), a new era of the Internet of Things (IoT) is being realized. IoT produces a lot of information which can be used to improve the efficiency of our daily lives and provides advanced services in a wide range of application domains. However, the privacy and the data fusing problems remain major challenges, mainly due to the massive scale and distributed nature of IoT networks and the amount of data collected from IoT increasing at an exponential rate. Thus, a privacy-protected and inter-cloud data fusing platform is needed to the demand for data mining and analytic activities in IoT. In this paper, we propose such a platform based on JointCloud Blockchain and study a novel case of smart traveling based on the proposed platform.

随着互联网技术和无线传感器网络(WSN)的发展，物联网(IoT)的新时代正在实现。物联网产生大量的信息，这些信息可以用来提高我们日常生活的效率，并在广泛的应用领域提供先进的服务。然而，隐私和数据融合问题仍然是主要挑战，这主要是由于物联网网络的大规模和分布式性质以及从物联网收集的数据量以指数级速度增长。因此，需要一个隐私保护和云间数据融合的平台来满足物联网中数据挖掘和分析活动的需求。本文提出了基于JointCloud区块链的智能出行平台，并研究了基于该平台的智能出行新案例。

引用次数: 43

Comparing Container-Based Microservices and Workspace as a Service: Which One to Choose? 比较基于容器的微服务和作为服务的工作空间:选择哪一个?

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00040

Junming Ma, Bo An, Donggang Cao, Xiangqun Chen

The concept of microservices has gained increasing popularity since 2014. Almost during the same period, container technology keeps developing and is considered as an excellent way to build microservices-based applications. Mainstream public cloud vendors such as Amazon Web Services, Microsoft Azure, and Google Cloud Platform all provide users with container-based solutions to implementing microservices. Workspace as a Service (WaaS) proposed by An et al. is another approach which uses containers to serve users. Both container-based microservices and WaaS are used to effectively utilize cluster resources via maintaining a number of containers. In this paper, we compare the designing ideas and supporting platforms of these two approaches, which provides a perspective for cluster administrators and users to understand the scenarios where to use them and how to make an appropriate choice to meet their needs. We find that container-based microservices are more suitable for professional IT companies while WaaS fits education and research institutions better.

自2014年以来，微服务的概念越来越受欢迎。几乎在同一时期，容器技术不断发展，被认为是构建基于微服务的应用程序的一种极好的方式。主流的公共云供应商，如Amazon Web Services、Microsoft Azure和Google cloud Platform，都为用户提供基于容器的解决方案来实现微服务。An等人提出的工作空间即服务(WaaS)是另一种使用容器为用户服务的方法。基于容器的微服务和WaaS都可以通过维护大量容器来有效地利用集群资源。在本文中，我们比较了这两种方法的设计思想和支持平台，这为集群管理员和用户提供了一个视角，以了解在哪里使用它们以及如何做出适当的选择来满足他们的需求。我们发现基于容器的微服务更适合专业的IT公司，而WaaS更适合教育和研究机构。

引用次数: 1

Service-Oriented IoT Modeling and Its Deviation from Software Services 面向服务的物联网建模及其对软件服务的偏离

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00014

I. Yen, F. Bastani, Wei Zhu, Hessam Moeini, San-Yih Hwang, Yuqun Zhang

Service technologies have been widely applied to many application domains to facilitate rapid system composition and deployment. However, existing service models need to be enhanced in order to be used in Internet-of-Things (IoT). Also, due to the massive-scale, IoT service discovery and composition cannot be centralized. Existing discovery routing protocols for peer-to-peer systems have their shortcomings and need to be improved. In this paper, we analyze the differences between IoT services and software services and identify the requirements for designing IoT service models that are additional to software service models. We then discuss a service ontology model for the specification of IoT services. For IoT service discovery, we survey existing discovery routing approaches, including those for conventional peer-to-peer networks and for IoT systems and discuss the potential problems when used in IoT networks. Then, we discuss our approach, summarization and ontology coding, which greatly reduce the memory requirements of the routing protocols, for the IoT networks.

服务技术已广泛应用于许多应用领域，以促进系统的快速组合和部署。然而，为了在物联网(IoT)中使用，现有的服务模型需要得到增强。此外，由于规模庞大，物联网服务的发现和组合无法集中。现有的对等系统发现路由协议存在不足，需要进一步改进。在本文中，我们分析了物联网服务和软件服务之间的差异，并确定了设计软件服务模型之外的物联网服务模型的需求。然后，我们讨论了物联网服务规范的服务本体模型。对于物联网服务发现，我们调查了现有的发现路由方法，包括传统的点对点网络和物联网系统，并讨论了在物联网网络中使用时的潜在问题。然后，我们讨论了我们的方法，摘要和本体编码，大大降低了路由协议的内存需求，为物联网网络。

引用次数: 6

Intelligent Resource Scheduling at Scale: A Machine Learning Perspective 大规模智能资源调度:机器学习视角

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00025

Renyu Yang, Ouyang Xue, Yaofeng Chen, P. Townend, Jie Xu

Resource scheduling in a computing system addresses the problem of packing tasks with multi-dimensional resource requirements and non-functional constraints. The exhibited heterogeneity of workload and server characteristics in Cloud-scale or Internet-scale systems is adding further complexity and new challenges to the problem. Compared with,,,, existing solutions based on ad-hoc heuristics, Machine Learning (ML) has the potential to improve further the efficiency of resource management in large-scale systems. In this paper we,,,, will describe and discuss how ML could be used to understand automatically both workloads and environments, and to help to cope with scheduling-related challenges such as consolidating co-located workloads, handling resource requests, guaranteeing application’s QoSs, and mitigating tailed stragglers. We will introduce a generalized ML-based solution to large-scale resource scheduling and demonstrate its effectiveness through a case study that deals with performance-centric node classification and straggler mitigation. We believe that an MLbased method will help to achieve architectural optimization and efficiency improvement.

计算系统中的资源调度解决了具有多维资源需求和非功能约束的任务打包问题。在云规模或互联网规模的系统中，工作负载和服务器特征的异构性进一步增加了问题的复杂性和新的挑战。与，，，，现有的基于特设启发式的解决方案相比，机器学习(ML)具有进一步提高大规模系统中资源管理效率的潜力。在本文中，我们，，，，将描述和讨论如何使用ML来自动理解工作负载和环境，并帮助应对与调度相关的挑战，例如整合共存的工作负载，处理资源请求，保证应用程序的qos，以及减少尾部掉线。我们将介绍一种基于广义机器学习的大规模资源调度解决方案，并通过一个案例研究展示其有效性，该案例研究处理以性能为中心的节点分类和掉线者缓解。我们相信，基于机器学习的方法将有助于实现架构的优化和效率的提高。

{"title":"Intelligent Resource Scheduling at Scale: A Machine Learning Perspective","authors":"Renyu Yang, Ouyang Xue, Yaofeng Chen, P. Townend, Jie Xu","doi":"10.1109/SOSE.2018.00025","DOIUrl":"https://doi.org/10.1109/SOSE.2018.00025","url":null,"abstract":"Resource scheduling in a computing system addresses the problem of packing tasks with multi-dimensional resource requirements and non-functional constraints. The exhibited heterogeneity of workload and server characteristics in Cloud-scale or Internet-scale systems is adding further complexity and new challenges to the problem. Compared with,,,, existing solutions based on ad-hoc heuristics, Machine Learning (ML) has the potential to improve further the efficiency of resource management in large-scale systems. In this paper we,,,, will describe and discuss how ML could be used to understand automatically both workloads and environments, and to help to cope with scheduling-related challenges such as consolidating co-located workloads, handling resource requests, guaranteeing application’s QoSs, and mitigating tailed stragglers. We will introduce a generalized ML-based solution to large-scale resource scheduling and demonstrate its effectiveness through a case study that deals with performance-centric node classification and straggler mitigation. We believe that an MLbased method will help to achieve architectural optimization and efficiency improvement.","PeriodicalId":414464,"journal":{"name":"2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133638362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 26

Cuckoo Migration: Self Migration on JointCloud Using New Hardware Features 杜鹃迁移:在JointCloud上使用新的硬件特性进行自我迁移

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00033

Ruifeng Liu, Zeyu Mi

Abstract—With the growing number of cloud providers and the increasing market of cloud computing, it’s more and more necessary to realize the cooperation and aggregation of clouds. JointCloud is a framework aiming at facilitating the consociation of various clouds on the market. And the JointCloud Collaboration Environment (JCCE) is the ideal environment for global cloud providers. To realize the clouds cooperation, migration is one of the most important issues that have to be considered. Live migration has always been one of the major primitive operations of virtualization and has been discussed for long. Traditional works deal the migration mainly by using a host-driven migration method, the majority work of which is dominated by the hypervisor. However, as the age of cloud aggregation comes, traditional methods show their defects. In cloud aggregation environment, cloud providers may refuse to supply the migration service to grasp their customers, or the hypervisors are heterogeneous on the two sides of migration. Those problems raise challenges to traditional host-driven methods. In this paper, we propose Cuckoo Migration, a new self- migration method using Intel new hardware feature. We leverage a special processor function, VMFUNC, to create a two-EPT architecture for the guest VM, so that the guest has a mirror memory space, which can be used as the duplication of memory. This paper mainly introduces how we build a two-EPT architecture for the guest and discusses how we leverage such architecture to do our self-migration.

摘要:随着云提供商数量的不断增加和云计算市场的不断扩大，实现云之间的协作和聚合变得越来越有必要。JointCloud是一个框架，旨在促进市场上各种云的结合。JointCloud协作环境(JCCE)是全球云提供商的理想环境。为了实现云的协作，迁移是必须考虑的最重要的问题之一。实时迁移一直是虚拟化的主要基本操作之一，并且已经讨论了很长时间。传统的工作主要是通过使用主机驱动的迁移方法来处理迁移，其中大部分工作由管理程序主导。然而，随着云聚合时代的到来，传统的方法显示出其缺陷。在云聚合环境中，云提供商可能拒绝提供迁移服务以把握其客户，或者管理程序在迁移的两端是异构的。这些问题对传统的宿主驱动方法提出了挑战。本文提出了一种基于Intel新硬件特性的自迁移算法Cuckoo Migration。我们利用一个特殊的处理器函数VMFUNC为客户虚拟机创建一个双ept架构，这样客户虚拟机就有一个镜像内存空间，可以用作内存的复制。本文主要介绍了我们如何为客户机构建一个two-EPT体系结构，并讨论了我们如何利用这种体系结构进行自迁移。

{"title":"Cuckoo Migration: Self Migration on JointCloud Using New Hardware Features","authors":"Ruifeng Liu, Zeyu Mi","doi":"10.1109/SOSE.2018.00033","DOIUrl":"https://doi.org/10.1109/SOSE.2018.00033","url":null,"abstract":"Abstract—With the growing number of cloud providers and the increasing market of cloud computing, it’s more and more necessary to realize the cooperation and aggregation of clouds. JointCloud is a framework aiming at facilitating the consociation of various clouds on the market. And the JointCloud Collaboration Environment (JCCE) is the ideal environment for global cloud providers. To realize the clouds cooperation, migration is one of the most important issues that have to be considered. Live migration has always been one of the major primitive operations of virtualization and has been discussed for long. Traditional works deal the migration mainly by using a host-driven migration method, the majority work of which is dominated by the hypervisor. However, as the age of cloud aggregation comes, traditional methods show their defects. In cloud aggregation environment, cloud providers may refuse to supply the migration service to grasp their customers, or the hypervisors are heterogeneous on the two sides of migration. Those problems raise challenges to traditional host-driven methods. In this paper, we propose Cuckoo Migration, a new self- migration method using Intel new hardware feature. We leverage a special processor function, VMFUNC, to create a two-EPT architecture for the guest VM, so that the guest has a mirror memory space, which can be used as the duplication of memory. This paper mainly introduces how we build a two-EPT architecture for the guest and discusses how we leverage such architecture to do our self-migration.","PeriodicalId":414464,"journal":{"name":"2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116177208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparing Imperative and Declarative Process Models with Flow Dependencies 比较具有流依赖关系的命令式和声明式流程模型

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00017

M. Baumann

The field of process model similarity matching is well examined for imperative process models like BPMN models, Petri nets, or EPCs. For the recently upcoming declarative process models, generally providing more flexibility than imperative models, however, there is a lack of comparison methods. Along with their advantage of providing more flexibility, declarative process models have a disadvantage in comprehending the models, especially the models' behavior. To overcome this problem, a comparison of imperative and declarative models is reasonable to check whether the declarative model represents a desired behavior which is easier to express and validate in an imperative notation. The work at hand provides a method based on flow dependencies, abstracting from the modeling type, for comparing two process models. It uses not only information about control-flow, but also data-based dependencies between process activities.

流程模型相似性匹配领域对命令式流程模型(如BPMN模型、Petri网或epc)进行了很好的研究。对于最近出现的声明性流程模型，通常比命令式模型提供更多的灵活性，但是缺乏比较方法。声明性流程模型具有提供更多灵活性的优点，但在理解模型(尤其是模型的行为)方面存在缺点。为了克服这个问题，比较命令式模型和声明式模型是合理的，可以检查声明式模型是否代表了更容易用命令式表示法表达和验证的期望行为。手头的工作提供了一种基于流依赖关系的方法，从建模类型中抽象出来，用于比较两个流程模型。它不仅使用有关控制流的信息，还使用流程活动之间基于数据的依赖关系。

引用次数: 2

VTDL: A Notation for Data Stream Processing Applications 数据流处理应用的符号

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00019

C. Hochreiner, Matteo Nardelli, Bernhard Knasmüller, Stefan Schulte, S. Dustdar

The continuing growth of the Internet of Things (IoT) requires established stream processing engines (SPEs) to cope with new challenges, like the geographic distribution of IoT sensors and clouds hosting the SPEs. These challenges obligate SPEs to support distributed stream processing across different geographic locations which also require a new approach on how data stream processing topologies are defined. In this paper, we identify required features for next-generation SPEs and introduce the Vienna Topology Description Language (VTDL). This language is specifically designed to address challenges for next-generation SPEs and proposes several novel aspects compared to existing topology description concepts. To assess not only the feasibility but also the reduced management overhead due to the VTDL, we evaluate the VTDL within the VISP stream processing ecosystem and show that the usage of the VTDL approach results in a management time reduction of up to 18 times.

物联网(IoT)的持续增长需要建立流处理引擎(spe)来应对新的挑战，例如物联网传感器的地理分布和托管spe的云。这些挑战要求spe支持跨不同地理位置的分布式流处理，这也需要一种新的方法来定义数据流处理拓扑。在本文中，我们确定了下一代spe所需的特性，并介绍了维也纳拓扑描述语言(VTDL)。这种语言是专门为解决下一代spe的挑战而设计的，与现有的拓扑描述概念相比，它提出了几个新颖的方面。为了不仅评估可行性，而且评估由于VTDL而减少的管理开销，我们在VISP流处理生态系统中评估了VTDL，并表明使用VTDL方法可将管理时间减少18倍。

引用次数: 1

Overcoming Security Challenges in Microservice Architectures 克服微服务架构中的安全挑战

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00011

T. Yarygina, A. H. Bagge

The microservice architectural style is an emerging trend in software engineering that allows building highly scalable and flexible systems. However, current state of the art provides only limited insight into the particular security concerns of microservice system. With this paper, we seek to unravel some of the mysteries surrounding microservice security by: providing a taxonomy of microservices security; assessing the security implications of the microservice architecture; and surveying related contemporary solutions, among others Docker Swarm and Netflix security decisions. We offer two important insights. On one hand, microservice security is a multi-faceted problem that requires a layered security solution that is not available out of the box at the moment. On the other hand, if these security challenges are solved, microservice architectures can improve security; their inherent properties of loose coupling, isolation, diversity, and fail fast all contribute to the increased robustness of a system. To address the lack of security guidelines this paper describes the design and implementation of a simple security framework for microservices that can be leveraged by practitioners. Proof-of-concept evaluation results show that the performance overhead of the security mechanisms is around 11%.

微服务架构风格是软件工程中的一种新兴趋势，它允许构建高度可伸缩和灵活的系统。然而，目前的技术水平对微服务系统的特定安全问题提供了有限的见解。在本文中，我们试图通过以下方式解开围绕微服务安全的一些谜团:提供微服务安全的分类;评估微服务架构的安全影响;并调查了相关的当代解决方案，其中包括Docker Swarm和Netflix的安全决策。我们提供了两个重要的见解。一方面，微服务安全是一个多方面的问题，需要一个分层的安全解决方案，而这个解决方案目前还无法开箱即用。另一方面，如果这些安全挑战得到解决，微服务架构可以提高安全性;它们固有的松耦合、隔离、多样性和快速故障特性都有助于提高系统的健壮性。为了解决缺乏安全指南的问题，本文描述了一个简单的微服务安全框架的设计和实现，从业者可以利用这个框架。概念验证评估结果表明，安全机制的性能开销约为11%。

{"title":"Overcoming Security Challenges in Microservice Architectures","authors":"T. Yarygina, A. H. Bagge","doi":"10.1109/SOSE.2018.00011","DOIUrl":"https://doi.org/10.1109/SOSE.2018.00011","url":null,"abstract":"The microservice architectural style is an emerging trend in software engineering that allows building highly scalable and flexible systems. However, current state of the art provides only limited insight into the particular security concerns of microservice system. With this paper, we seek to unravel some of the mysteries surrounding microservice security by: providing a taxonomy of microservices security; assessing the security implications of the microservice architecture; and surveying related contemporary solutions, among others Docker Swarm and Netflix security decisions. We offer two important insights. On one hand, microservice security is a multi-faceted problem that requires a layered security solution that is not available out of the box at the moment. On the other hand, if these security challenges are solved, microservice architectures can improve security; their inherent properties of loose coupling, isolation, diversity, and fail fast all contribute to the increased robustness of a system. To address the lack of security guidelines this paper describes the design and implementation of a simple security framework for microservices that can be leveraged by practitioners. Proof-of-concept evaluation results show that the performance overhead of the security mechanisms is around 11%.","PeriodicalId":414464,"journal":{"name":"2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115320554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 73

An Ensemble Signature-Based Approach for Performance Diagnosis in Big Data Platform 基于集成签名的大数据平台性能诊断方法

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00022

H. Kou, Pengfei Chen

The big data platform always suffers from performance problems due to internal impairments (e.g. software bugs) and external impairments (e.g. resource hog). And the situation is exacerbated by the properties of velocity, variety and volume (3Vs) of big data. To recovery the system from performance anomaly, the first step is to find the root causes. In this paper, we propose a novel signature-based performance diagnosis approach to rapidly pinpoint the root causes of performance problems in big data platforms. The performance diagnosis is formalized as a pattern recognition problem. We leverage Maximum Information Criterion (MIC) to express the invariant relationships amongst the performance metrics in the normal state. Each performance problem occurred in the big data platform is signified by a unique binary vector named signature, which consists of a set of violations of MIC invariants. The signatures of multiple performance problems form a signature database. If the Key Performance Indicator (KPI) of the big data application exhibits model drift, our approach can identify the real culprits by retrieving the root causes which have similar signatures to the current performance problem. Moreover, considering the diversity of big data applications, we establish an ensemble approach to treat each application separately. The experiment evaluations in a controlled big data platform show that our approach can pinpoint the real culprits of performance problems in an average 84% precision and 87% recall when one fault occurs, which is better than several state-of-the-art approaches.

大数据平台由于内部缺陷(如软件漏洞)和外部缺陷(如资源占用)而导致性能问题。大数据的速度、种类和数量(3Vs)的特性加剧了这种情况。要从性能异常中恢复系统，首先要找到根本原因。在本文中，我们提出了一种新的基于签名的性能诊断方法，以快速查明大数据平台中性能问题的根本原因。性能诊断被形式化为一个模式识别问题。我们利用最大信息标准(MIC)来表达正常状态下性能指标之间的不变关系。大数据平台中出现的每一个性能问题都用一个唯一的二进制向量来表示，这个向量被称为签名，它由一组违反MIC不变量的行为组成。多个性能问题的签名组成一个签名库。如果大数据应用的关键绩效指标(KPI)表现出模型漂移，我们的方法可以通过检索与当前性能问题具有相似特征的根本原因来识别真正的罪魁祸首。此外，考虑到大数据应用的多样性，我们建立了一个集成的方法来单独对待每个应用。在受控大数据平台上的实验评估表明，我们的方法能够以平均84%的准确率和87%的召回率找出性能问题的真正原因，优于几种最先进的方法。

{"title":"An Ensemble Signature-Based Approach for Performance Diagnosis in Big Data Platform","authors":"H. Kou, Pengfei Chen","doi":"10.1109/SOSE.2018.00022","DOIUrl":"https://doi.org/10.1109/SOSE.2018.00022","url":null,"abstract":"The big data platform always suffers from performance problems due to internal impairments (e.g. software bugs) and external impairments (e.g. resource hog). And the situation is exacerbated by the properties of velocity, variety and volume (3Vs) of big data. To recovery the system from performance anomaly, the first step is to find the root causes. In this paper, we propose a novel signature-based performance diagnosis approach to rapidly pinpoint the root causes of performance problems in big data platforms. The performance diagnosis is formalized as a pattern recognition problem. We leverage Maximum Information Criterion (MIC) to express the invariant relationships amongst the performance metrics in the normal state. Each performance problem occurred in the big data platform is signified by a unique binary vector named signature, which consists of a set of violations of MIC invariants. The signatures of multiple performance problems form a signature database. If the Key Performance Indicator (KPI) of the big data application exhibits model drift, our approach can identify the real culprits by retrieving the root causes which have similar signatures to the current performance problem. Moreover, considering the diversity of big data applications, we establish an ensemble approach to treat each application separately. The experiment evaluations in a controlled big data platform show that our approach can pinpoint the real culprits of performance problems in an average 84% precision and 87% recall when one fault occurs, which is better than several state-of-the-art approaches.","PeriodicalId":414464,"journal":{"name":"2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124371670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

DwarfGC: A Space-Efficient and Crash-Consistent Garbage Collector in NVM for Cloud Computing DwarfGC:用于云计算的NVM中空间高效且崩溃一致的垃圾收集器

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

Pub Date : 2018-03-26 DOI: 10.1109/SOSE.2018.00032

Heting Li, Mingyu Wu

Emerging cloud computing arouses need for large-scale data processing which in turn promises vigorous developments on big data platforms running on Java Virtual Machine (JVM), such as Hadoop, Spark and Flink. Storing a large amount of data in memory allows those platforms to benefit from satisfying performance and powerful memory management and garbage collection service in Java. Non-volatile memory (NVM) provides nonvolatility, byte-addressable and fast access speed characteristics and thus becomes a superior alternative for volatile memory utilizing in future cloud system and Java world. This paper presents a recoverable garbage collector named DwarfGC to manage Java objects in NVM so as to ensure crash consistency and durability. DwarfGC persists heap-related metadata into NVM at the beginning of GC and relies on it for recovery. The metadata is stored in a space-efficient fashion but incurring little time overhead.

新兴的云计算引发了对大规模数据处理的需求，这也使得Hadoop、Spark、Flink等基于Java虚拟机(JVM)的大数据平台蓬勃发展。在内存中存储大量数据可以使这些平台受益于Java中令人满意的性能和强大的内存管理和垃圾收集服务。非易失性内存(NVM)提供了非易失性、字节可寻址和快速访问速度的特性，因此成为未来云系统和Java世界中易失性内存的最佳选择。本文提出了一种可恢复的垃圾收集器，名为DwarfGC，用于管理NVM中的Java对象，以确保崩溃一致性和持久性。在GC开始时，DwarfGC将与堆相关的元数据保存到NVM中，并依赖它进行恢复。元数据以节省空间的方式存储，但产生很少的时间开销。

引用次数: 1

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2018 IEEE Symposium on Service-Oriented System Engineering (SOSE)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀