首页 > 最新文献

2020 IEEE International Conference on Smart Data Services (SMDS)最新文献

英文 中文
Machine Learning based User QoE Evaluation for Video Streaming over Mobile Network 基于机器学习的移动网络视频流用户QoE评价
Pub Date : 2020-10-01 DOI: 10.1109/SMDS49396.2020.00010
Yanhong Zhu, Tao Sun, Qin Li, Lu Lu, Xiaodong Duan, Weiyuan Li
Quality of experience (QoE) serves as a direct evaluation of users' experience in mobile video transmission and is critical to ensure good network service. Although many efforts have been made to predict QoE based on network parameters of the user terminal equipment, it is difficult to predict QoE based on Quality of Service (QoS) offered by the network servers. In this paper, a machine learning based QoE evaluation method is proposed to evaluate user QoE in real-time by analyzing the QoS characteristics for mobile video transmission. For this purpose, we construct a large-scale dataset by collecting more than 300 thousand pieces of metrics data with two kinds of key quality indicators (KQIs) describing the QoE and 91 key performance indicators (KPIs) describing the QoS. A two-process feature subset selection (FSS) method consisting of single parameter pre-FSS and multi-parameter FSS is then proposed to find the KPIs related to KQIs. An Extra-Trees model is finally developed to learn the relationships between the KPIs and KQIs. By employing machine learning and data analytics on network data with the data-driven framework, the proposed method can predict the user QoE according to the QoS of network servers. The results prove that our proposed method can outperform other state-of-the-art approaches.
体验质量(Quality of experience, QoE)是对用户移动视频传输体验的直接评价,是保证良好网络服务的关键。虽然基于用户终端设备的网络参数来预测QoE已经做了很多努力,但是基于网络服务器提供的服务质量(QoS)来预测QoE是很困难的。本文通过分析移动视频传输的QoS特性,提出了一种基于机器学习的QoE评价方法,对用户的QoE进行实时评价。为此,我们通过收集30多万条指标数据构建了一个大规模数据集,其中包括描述QoE的两种关键质量指标(kqi)和描述QoS的91种关键绩效指标(kpi)。然后提出了一种由单参数预特征子集选择和多参数特征子集选择组成的两过程特征子集选择(FSS)方法来寻找与kpi相关的kpi。最后建立了一个Extra-Trees模型来学习kpi和kqi之间的关系。该方法在数据驱动框架下对网络数据进行机器学习和数据分析,根据网络服务器的QoS预测用户的QoE。结果证明,我们提出的方法可以优于其他最先进的方法。
{"title":"Machine Learning based User QoE Evaluation for Video Streaming over Mobile Network","authors":"Yanhong Zhu, Tao Sun, Qin Li, Lu Lu, Xiaodong Duan, Weiyuan Li","doi":"10.1109/SMDS49396.2020.00010","DOIUrl":"https://doi.org/10.1109/SMDS49396.2020.00010","url":null,"abstract":"Quality of experience (QoE) serves as a direct evaluation of users' experience in mobile video transmission and is critical to ensure good network service. Although many efforts have been made to predict QoE based on network parameters of the user terminal equipment, it is difficult to predict QoE based on Quality of Service (QoS) offered by the network servers. In this paper, a machine learning based QoE evaluation method is proposed to evaluate user QoE in real-time by analyzing the QoS characteristics for mobile video transmission. For this purpose, we construct a large-scale dataset by collecting more than 300 thousand pieces of metrics data with two kinds of key quality indicators (KQIs) describing the QoE and 91 key performance indicators (KPIs) describing the QoS. A two-process feature subset selection (FSS) method consisting of single parameter pre-FSS and multi-parameter FSS is then proposed to find the KPIs related to KQIs. An Extra-Trees model is finally developed to learn the relationships between the KPIs and KQIs. By employing machine learning and data analytics on network data with the data-driven framework, the proposed method can predict the user QoE according to the QoS of network servers. The results prove that our proposed method can outperform other state-of-the-art approaches.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129777692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
2020 IEEE International Conference on Smart Data Services (SMDS) SMDS 2020 2020 IEEE智能数据服务国际会议(SMDS
Pub Date : 2020-10-01 DOI: 10.1109/smds49396.2020.00004
Lixin Gao, Guang Cheng, M. Sapino
Message from the SERVICES 2020 Steering Committee Chair viii Message from the SERVICES 2020 Symposia General Chair ix Welcome Message from Congress 2020 General Chairs x Message from the SERVICES 2020 Program Chairs in Chief xii Message from the SERVICES 2020 Technical Committee Chair on Services Computing of IEEE Computer Society xiii Welcome Message from the SERVICES 2020 Women in Services Computing Symposium Chair xiv Symposium on Women in Services Computing Program xv SERVICES 2020 Steering Committee xvii SERVICES 2020 Program Committee xxi Message from the SMDS 2020 Chairs xxii SMDS 2020 Organizing Committee xxiii
2020年服务指导委员会主席致词viii 2020年服务专题会议主席致词ix 2020年大会主席致词x 2020年服务项目首席主席致词xii IEEE计算机学会2020年服务计算技术委员会主席致词xiii 2020年服务计算女性研讨会主席致词第十四届服务计算女性研讨会第十五届服务2020指导会议SMDS 2020主席致辞SMDS 2020组委会
{"title":"2020 IEEE International Conference on Smart Data Services (SMDS) SMDS 2020","authors":"Lixin Gao, Guang Cheng, M. Sapino","doi":"10.1109/smds49396.2020.00004","DOIUrl":"https://doi.org/10.1109/smds49396.2020.00004","url":null,"abstract":"Message from the SERVICES 2020 Steering Committee Chair viii Message from the SERVICES 2020 Symposia General Chair ix Welcome Message from Congress 2020 General Chairs x Message from the SERVICES 2020 Program Chairs in Chief xii Message from the SERVICES 2020 Technical Committee Chair on Services Computing of IEEE Computer Society xiii Welcome Message from the SERVICES 2020 Women in Services Computing Symposium Chair xiv Symposium on Women in Services Computing Program xv SERVICES 2020 Steering Committee xvii SERVICES 2020 Program Committee xxi Message from the SMDS 2020 Chairs xxii SMDS 2020 Organizing Committee xxiii","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121168998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2020 IEEE International Conference on Smart Data Services 2020 IEEE智能数据服务国际会议
Pub Date : 2020-10-01 DOI: 10.1109/smds49396.2020.00002
{"title":"2020 IEEE International Conference on Smart Data Services","authors":"","doi":"10.1109/smds49396.2020.00002","DOIUrl":"https://doi.org/10.1109/smds49396.2020.00002","url":null,"abstract":"","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127012848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geolocation using GAT with Multiview Learning 使用GAT与多视图学习的地理定位
Pub Date : 2020-10-01 DOI: 10.1109/SMDS49396.2020.00017
Zhan Wang, Chunyang Ye, Hui Zhou
Information in social networks plays an important role in many fields such as event detection, disaster warning, etc. However, due to the lack of geographic metadata, the information is often unusable. Therefore, the geolocation using social network data has gradually become a hot research topic. Existing methods mainly use textual contents, and thus poorly exploit the available data, especially the hidden information in the link. To address this issue, we propose two Multiview learning models, M-GAT and M-GCN, based on the Graph Attention and Graph Convolution Network to fuse both the text and link information. By extracting the text features from multiple angles to extend the feature space, our models achieve the best results on the baseline dataset. The visual display of representations collected from a hidden layer illustrates the validity of our models. Experiments on different feature combination show the effectiveness of our proposal.
社交网络中的信息在事件检测、灾害预警等诸多领域发挥着重要作用。然而,由于缺乏地理元数据,这些信息往往是不可用的。因此,利用社交网络数据进行地理定位逐渐成为研究的热点。现有的方法主要使用文本内容,对可用数据尤其是链接中的隐藏信息的挖掘效果较差。为了解决这一问题,我们提出了两种基于图注意和图卷积网络的多视图学习模型M-GAT和M-GCN,以融合文本和链接信息。通过从多个角度提取文本特征来扩展特征空间,我们的模型在基线数据集上获得了最好的结果。从隐藏层收集的表示的可视化显示说明了我们模型的有效性。不同特征组合的实验证明了该方法的有效性。
{"title":"Geolocation using GAT with Multiview Learning","authors":"Zhan Wang, Chunyang Ye, Hui Zhou","doi":"10.1109/SMDS49396.2020.00017","DOIUrl":"https://doi.org/10.1109/SMDS49396.2020.00017","url":null,"abstract":"Information in social networks plays an important role in many fields such as event detection, disaster warning, etc. However, due to the lack of geographic metadata, the information is often unusable. Therefore, the geolocation using social network data has gradually become a hot research topic. Existing methods mainly use textual contents, and thus poorly exploit the available data, especially the hidden information in the link. To address this issue, we propose two Multiview learning models, M-GAT and M-GCN, based on the Graph Attention and Graph Convolution Network to fuse both the text and link information. By extracting the text features from multiple angles to extend the feature space, our models achieve the best results on the baseline dataset. The visual display of representations collected from a hidden layer illustrates the validity of our models. Experiments on different feature combination show the effectiveness of our proposal.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"158 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134348908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2020 IEEE International Conference on Smart Data Services SMDS 2020 2020 IEEE智能数据服务国际会议SMDS 2020
Pub Date : 2020-10-01 DOI: 10.1109/smds49396.2020.00001
{"title":"2020 IEEE International Conference on Smart Data Services SMDS 2020","authors":"","doi":"10.1109/smds49396.2020.00001","DOIUrl":"https://doi.org/10.1109/smds49396.2020.00001","url":null,"abstract":"","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"267 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133889737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Utber: Utilizing Fine-Grained Entity Types to Relation Extraction with Distant Supervision 利用细粒度实体类型进行远程监督的关系提取
Pub Date : 2020-10-01 DOI: 10.1109/SMDS49396.2020.00015
Chengmin Wu, Lei Chen
Recently, much effort has been paid to relation extraction during the construction of large ontological knowledge bases (KBs). However, most of the traditional relation extraction systems rely on human-annotated data for training, which requires expensive human effort. Therefore, Distant supervision is proposed to assist the creation of large amounts of labeled data. By this method, an existing KB is heuristically aligned to texts, and the alignment data are treated as training data. Nevertheless, the noise in the training data may cause two serious problems. First, the heuristic label alignment may fail and cause the wrong label problem. Second, the existing statistical models are applied to ad-hoc features, and hence perform poorly due to the dynamic features of noisy data. To address these two problems, in this paper, we propose a novel framework for automatic relation extraction from unstructured text corpora. Specifically, to solve the first problem, we propose a fine-grained entity typing technique to filter wrong data by choosing positive entity type pairs and conduct joint instance-type selection over bag of instances. To solve the second problem, instead of directly defining manually crafted features, we propose a deep neural architecture with attention mechanism to automatically learn positive and negative instance features. Extensive experiments on real-world datasets demonstrate that our method outperforms the competitive state-of-the-art techniques in terms of effectiveness.
在大型本体知识库(KBs)的构建过程中,关系抽取是近年来研究的热点。然而,大多数传统的关系提取系统依赖于人工标注的数据进行训练,这需要耗费大量的人力。因此,提出了远程监督来协助创建大量标记数据。通过该方法,将现有知识库启发式地与文本对齐,并将对齐数据作为训练数据。然而,训练数据中的噪声可能会导致两个严重的问题。首先,启发式标签对齐可能会失败并导致错误的标签问题。其次,由于噪声数据的动态特性,现有的统计模型应用于ad-hoc特征,因此性能不佳。为了解决这两个问题,本文提出了一种从非结构化文本语料库中自动提取关系的新框架。具体来说,为了解决第一个问题,我们提出了一种细粒度实体类型技术,通过选择正实体类型对来过滤错误数据,并对实例包进行联合实例类型选择。为了解决第二个问题,我们提出了一种带有注意机制的深度神经结构来自动学习正面和负面实例特征,而不是直接定义手工制作的特征。在真实世界数据集上的大量实验表明,我们的方法在有效性方面优于竞争最先进的技术。
{"title":"Utber: Utilizing Fine-Grained Entity Types to Relation Extraction with Distant Supervision","authors":"Chengmin Wu, Lei Chen","doi":"10.1109/SMDS49396.2020.00015","DOIUrl":"https://doi.org/10.1109/SMDS49396.2020.00015","url":null,"abstract":"Recently, much effort has been paid to relation extraction during the construction of large ontological knowledge bases (KBs). However, most of the traditional relation extraction systems rely on human-annotated data for training, which requires expensive human effort. Therefore, Distant supervision is proposed to assist the creation of large amounts of labeled data. By this method, an existing KB is heuristically aligned to texts, and the alignment data are treated as training data. Nevertheless, the noise in the training data may cause two serious problems. First, the heuristic label alignment may fail and cause the wrong label problem. Second, the existing statistical models are applied to ad-hoc features, and hence perform poorly due to the dynamic features of noisy data. To address these two problems, in this paper, we propose a novel framework for automatic relation extraction from unstructured text corpora. Specifically, to solve the first problem, we propose a fine-grained entity typing technique to filter wrong data by choosing positive entity type pairs and conduct joint instance-type selection over bag of instances. To solve the second problem, instead of directly defining manually crafted features, we propose a deep neural architecture with attention mechanism to automatically learn positive and negative instance features. Extensive experiments on real-world datasets demonstrate that our method outperforms the competitive state-of-the-art techniques in terms of effectiveness.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124009023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MY-AIR: A Personalized Air-quality Information Service MY-AIR:个性化空气质量信息服务
Pub Date : 2020-10-01 DOI: 10.1109/SMDS49396.2020.00020
Jane Lin, Ouri E. Wolfson
This paper describes an information service that personalizes air pollution monitoring by considering the fine grained user location, her microenvironment, and her activity. Personalization is obtained by integrating a large number of information sources including the Environmental Protection Agency (EPA) monitoring stations, traffic, weather, portable air pollution data from sensors carried by a small fraction of the population, smartphone sensors, vehicle sensors data captured via on-board diagnostics.
本文描述了一种信息服务,该服务通过考虑细粒度用户位置、微环境和活动来个性化空气污染监测。个性化是通过整合大量信息源获得的,包括环境保护局(EPA)监测站、交通、天气、一小部分人携带的便携式空气污染数据、智能手机传感器、通过车载诊断捕获的车辆传感器数据。
{"title":"MY-AIR: A Personalized Air-quality Information Service","authors":"Jane Lin, Ouri E. Wolfson","doi":"10.1109/SMDS49396.2020.00020","DOIUrl":"https://doi.org/10.1109/SMDS49396.2020.00020","url":null,"abstract":"This paper describes an information service that personalizes air pollution monitoring by considering the fine grained user location, her microenvironment, and her activity. Personalization is obtained by integrating a large number of information sources including the Environmental Protection Agency (EPA) monitoring stations, traffic, weather, portable air pollution data from sensors carried by a small fraction of the population, smartphone sensors, vehicle sensors data captured via on-board diagnostics.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130799462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Message from the Program Chairs in Chief 来自项目主席的信息
Pub Date : 2020-10-01 DOI: 10.1109/COMPSAC.2016.14
H. Sato, M. Matskin, W. Claycomb
It is our great pleasure to welcome you to the 2020 edition of the IEEE World Congress on Services. This year edition represents an important milestone as the Congress is held virtually, although preserving a strong organizational and scientific link to Beijing, where the Congress was scheduled to take place. The success of the Congress recognizes the strong research communities around the world that focus on foundations, systems, methodologies, and applications of computing-based services. This is a field that over the years has evolved and expanded to encompass new areas, including edge computing, IoT, and smart data services.
我们非常高兴地欢迎您参加2020年IEEE世界服务大会。今年的大会是一个重要的里程碑,因为大会是在网上举行的,尽管它与北京保持着组织和科学上的紧密联系,大会原定在北京举行。大会的成功认可了世界各地强大的研究界对基于计算的服务的基础、系统、方法和应用的关注。这是一个多年来不断发展和扩展的领域,包括边缘计算、物联网和智能数据服务等新领域。
{"title":"Message from the Program Chairs in Chief","authors":"H. Sato, M. Matskin, W. Claycomb","doi":"10.1109/COMPSAC.2016.14","DOIUrl":"https://doi.org/10.1109/COMPSAC.2016.14","url":null,"abstract":"It is our great pleasure to welcome you to the 2020 edition of the IEEE World Congress on Services. This year edition represents an important milestone as the Congress is held virtually, although preserving a strong organizational and scientific link to Beijing, where the Congress was scheduled to take place. The success of the Congress recognizes the strong research communities around the world that focus on foundations, systems, methodologies, and applications of computing-based services. This is a field that over the years has evolved and expanded to encompass new areas, including edge computing, IoT, and smart data services.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129910392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Copyright 版权
Pub Date : 2020-10-01 DOI: 10.1109/smds49396.2020.00003
{"title":"Copyright","authors":"","doi":"10.1109/smds49396.2020.00003","DOIUrl":"https://doi.org/10.1109/smds49396.2020.00003","url":null,"abstract":"","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126707433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable and Hybrid Ensemble-Based Causality Discovery 基于可扩展和混合集成的因果关系发现
Pub Date : 2020-10-01 DOI: 10.1109/SMDS49396.2020.00016
Pei Guo, Achuna Ofonedu, Jianwu Wang
Causality discovery mines cause-effect relationships among different variables of a system and has been widely used in many disciplines including climatology and neuroscience. To discover causal relationships, many data-driven causality discovery methods, e.g., Granger causality, PCMCI and Dynamic Bayesian Network, have been proposed. Many of these causality discovery approaches mine time series data and generate a directed causality graph where each graph edge denotes a cause-effect relationship between the two connected graph nodes. Our benchmarking of different causality discovery approaches with real-world climate data shows these approaches often generate quite different causality results with the same input dataset due to their internal learning mechanism differences. Meanwhile, there are ever-increasing available data in virtually every discipline, which makes it more and more difficult to use existing causality discovery algorithms to produce causality results within reasonable time. To address these two challenges, this paper utilizes data partitioning and ensemble techniques, and proposes a two-phase hybrid causality ensemble framework. The framework first conducts phase 1 data ensemble for partitioned data and then conducts phase 2 algorithm ensemble from data ensemble results. To achieve scalability, we further parallelize the ensemble approaches via the Spark big data analytics engine. Our experiments show that our proposed approaches achieve good accuracy through ensemble and high scalability through data-parallelization in distributed computing environments.
因果关系发现挖掘系统中不同变量之间的因果关系,已广泛应用于许多学科,包括气候学和神经科学。为了发现因果关系,人们提出了许多数据驱动的因果关系发现方法,如格兰杰因果关系、PCMCI和动态贝叶斯网络。许多因果关系发现方法挖掘时间序列数据并生成有向因果图,其中每个图边表示两个连接图节点之间的因果关系。我们对不同因果关系发现方法与现实世界气候数据的基准测试表明,由于内部学习机制的差异,这些方法通常会在相同的输入数据集上产生完全不同的因果关系结果。同时,几乎每个学科的可用数据都在不断增加,这使得使用现有的因果关系发现算法在合理的时间内产生因果关系结果变得越来越困难。为了解决这两个问题,本文利用数据划分和集成技术,提出了一个两阶段混合因果关系集成框架。框架首先对分区数据进行第一阶段的数据集成,然后根据数据集成结果进行第二阶段的算法集成。为了实现可扩展性,我们通过Spark大数据分析引擎进一步并行化集成方法。实验表明,本文提出的方法在分布式计算环境下通过集成获得了良好的精度,并通过数据并行化获得了较高的可扩展性。
{"title":"Scalable and Hybrid Ensemble-Based Causality Discovery","authors":"Pei Guo, Achuna Ofonedu, Jianwu Wang","doi":"10.1109/SMDS49396.2020.00016","DOIUrl":"https://doi.org/10.1109/SMDS49396.2020.00016","url":null,"abstract":"Causality discovery mines cause-effect relationships among different variables of a system and has been widely used in many disciplines including climatology and neuroscience. To discover causal relationships, many data-driven causality discovery methods, e.g., Granger causality, PCMCI and Dynamic Bayesian Network, have been proposed. Many of these causality discovery approaches mine time series data and generate a directed causality graph where each graph edge denotes a cause-effect relationship between the two connected graph nodes. Our benchmarking of different causality discovery approaches with real-world climate data shows these approaches often generate quite different causality results with the same input dataset due to their internal learning mechanism differences. Meanwhile, there are ever-increasing available data in virtually every discipline, which makes it more and more difficult to use existing causality discovery algorithms to produce causality results within reasonable time. To address these two challenges, this paper utilizes data partitioning and ensemble techniques, and proposes a two-phase hybrid causality ensemble framework. The framework first conducts phase 1 data ensemble for partitioned data and then conducts phase 2 algorithm ensemble from data ensemble results. To achieve scalability, we further parallelize the ensemble approaches via the Spark big data analytics engine. Our experiments show that our proposed approaches achieve good accuracy through ensemble and high scalability through data-parallelization in distributed computing environments.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121495453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2020 IEEE International Conference on Smart Data Services (SMDS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1