首页 > 最新文献

2020 IEEE International Conference on Services Computing (SCC)最新文献

英文 中文
Data Provenance for Complex Event Processing Invoking Composition of Services 调用服务组合的复杂事件处理的数据来源
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00027
Malik Khalfallah, P. Ghodous
Data provenance is a fundamental concept in scientific experimentation in general and complex event processing (CEP) in particular. For accurate determination and visualization of data provenance, efficient and user-friendly mechanisms are needed. Research in CEP optimization and visual notations can help in this process. This paper presents the extension of an optimized CEP framework to respond to data provenance requests. The extension consists in enriching the formal representation of execution plans of CEP queries to make them provenance-aware. These provenance-aware execution plans are then queried to generate a visual representation of the provenance data. We present the implementation of this framework and then its deployment and the associated evaluation in the context of an industrial use case.
数据来源是科学实验中的一个基本概念,特别是复杂事件处理(CEP)。为了准确地确定和可视化数据来源,需要有效和用户友好的机制。对CEP优化和可视化符号的研究可以帮助这一过程。本文提出了一个优化的CEP框架的扩展,以响应数据来源请求。扩展包括丰富CEP查询的执行计划的正式表示,使它们能够识别来源。然后查询这些感知来源的执行计划,以生成来源数据的可视化表示。我们将介绍该框架的实现,然后在一个工业用例的上下文中介绍其部署和相关的评估。
{"title":"Data Provenance for Complex Event Processing Invoking Composition of Services","authors":"Malik Khalfallah, P. Ghodous","doi":"10.1109/SCC49832.2020.00027","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00027","url":null,"abstract":"Data provenance is a fundamental concept in scientific experimentation in general and complex event processing (CEP) in particular. For accurate determination and visualization of data provenance, efficient and user-friendly mechanisms are needed. Research in CEP optimization and visual notations can help in this process. This paper presents the extension of an optimized CEP framework to respond to data provenance requests. The extension consists in enriching the formal representation of execution plans of CEP queries to make them provenance-aware. These provenance-aware execution plans are then queried to generate a visual representation of the provenance data. We present the implementation of this framework and then its deployment and the associated evaluation in the context of an industrial use case.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131234986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
App Competition Matters: How to Identify Your Competitor Apps? 应用竞争问题:如何识别竞争对手应用?
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00055
Md. Kafil Uddin, Qiang He, Jun Han, C. Chua
App stores, such as Google Play and Apple Store, contain abundance of information, including descriptions and reviews of various apps. They provide valuable information for app developers to learn about their own apps as well as similar apps for improving their apps, e.g., adding popular features or removing unpopular features. A place to start this learning process is to identify the competitor apps of a given target app in terms of their features, popularity as well as other relevant aspects. Given the large number of apps and the large amount of information available about these apps, it is a very challenging task for app developers to effectively and efficiently identify competitor apps. In this paper, we introduce a novel approach for identifying the competitor apps of a given target app, which includes three major components. Firstly, we identify the factors that characterise the competition between apps. Then, based on the identified competition factors, we extract and process the relevant information from the app store about the target app and the apps in the same app category. Finally, we cluster the apps based on their similarity across all the competition factors and identify those apps that are in the same cluster as the target app as its competitor apps. We evaluate our approach by comparing its results with corresponding search results in (1) Google Trends and (2) Google Search. The results show that our approach is effective in identifying competitor apps.
谷歌Play和Apple Store等应用商店包含大量信息,包括各种应用的描述和评论。它们为应用程序开发人员提供了有价值的信息,帮助他们了解自己的应用程序以及类似的应用程序,以改进他们的应用程序,例如,添加流行的功能或删除不受欢迎的功能。学习这个过程的起点是,根据特定目标应用的功能、受欢迎程度和其他相关方面,确定其竞争对手。考虑到应用的数量和大量的可用信息,对于应用开发者来说,有效地识别竞争对手应用是一项非常具有挑战性的任务。在本文中,我们介绍了一种新的方法来识别给定目标应用程序的竞争对手应用程序,它包括三个主要组成部分。首先,我们确定了应用程序之间竞争的特征因素。然后,基于识别出的竞争因素,我们从应用商店中提取和处理目标应用和同一应用类别中的应用的相关信息。最后,我们根据这些应用在所有竞争因素中的相似性对它们进行聚类,并识别出那些与目标应用和竞争对手应用在同一集群中的应用。我们通过将其结果与(1)谷歌Trends和(2)谷歌search中的相应搜索结果进行比较来评估我们的方法。结果表明,我们的方法在识别竞争对手应用程序方面是有效的。
{"title":"App Competition Matters: How to Identify Your Competitor Apps?","authors":"Md. Kafil Uddin, Qiang He, Jun Han, C. Chua","doi":"10.1109/SCC49832.2020.00055","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00055","url":null,"abstract":"App stores, such as Google Play and Apple Store, contain abundance of information, including descriptions and reviews of various apps. They provide valuable information for app developers to learn about their own apps as well as similar apps for improving their apps, e.g., adding popular features or removing unpopular features. A place to start this learning process is to identify the competitor apps of a given target app in terms of their features, popularity as well as other relevant aspects. Given the large number of apps and the large amount of information available about these apps, it is a very challenging task for app developers to effectively and efficiently identify competitor apps. In this paper, we introduce a novel approach for identifying the competitor apps of a given target app, which includes three major components. Firstly, we identify the factors that characterise the competition between apps. Then, based on the identified competition factors, we extract and process the relevant information from the app store about the target app and the apps in the same app category. Finally, we cluster the apps based on their similarity across all the competition factors and identify those apps that are in the same cluster as the target app as its competitor apps. We evaluate our approach by comparing its results with corresponding search results in (1) Google Trends and (2) Google Search. The results show that our approach is effective in identifying competitor apps.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128142236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Analytics on Health of Mobile Software Ecosystem Based on the Internal Operating Mechanism 基于内部运行机制的移动软件生态系统健康度分析
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00042
Jianmao Xiao, Shizhan Chen, Shiping Chen, Chao Gao, Hongyue Wu, Xiao Xue, Zhiyong Feng
In addition to publishing and downloading mobile apps, Mobile App Store (MAS) has become the most important ecosystem on mobile smart devices, i.e., Mobile Software Ecosystem (MSECO). However, most of the existing work focus on the analysis of a single entity in MSECO, and rarely analyze the interaction between the entities (such as users, developers, etc.) in MSECO as well as the comprehensive effect of each entity to the entire ecosystem health. In this paper, we propose a method based on computational experiments to simulate and analyze the complex and dynamic interaction of various entities and, how these entities impact on health of MSECO. Firstly, a requirement-driven MSECO model named R-MSECO is established to break down the entire MSECO, which includes user-app interaction, developers-requirements interaction and macro-control of mobile platform three sub-models. Secondly, the health measurement method is proposed to measure the health of MSECO. Finally, we simulate the impact of the dynamic interaction of various entities on the health of MSECO based on computational experiments. The experimental results show that our method is helpful for mobile users to better understanding of the current state of MSECO as well as can provide a reference for developers and platform managers to make development and operation decisions, which is of great significance for the continued healthy development of MSECO.
除了发布和下载移动应用程序之外,移动应用商店(MAS)已经成为移动智能设备上最重要的生态系统,即移动软件生态系统(MSECO)。然而,现有的工作大多侧重于对MSECO中单个实体的分析,很少分析MSECO中各个实体(如用户、开发者等)之间的相互作用以及各个实体对整个生态系统健康的综合影响。本文提出了一种基于计算实验的方法来模拟和分析各种实体之间复杂的动态相互作用,以及这些实体如何影响MSECO的健康。首先,建立需求驱动的MSECO模型R-MSECO,分解整个MSECO,包括用户-应用交互、开发人员-需求交互和移动平台宏观控制三个子模型。其次,提出了健康测量方法来衡量MSECO的健康状况。最后,在计算实验的基础上,模拟了各种实体的动态交互作用对MSECO健康状况的影响。实验结果表明,该方法有助于移动用户更好地了解MSECO的现状,为开发人员和平台管理者进行开发和运营决策提供参考,对MSECO的持续健康发展具有重要意义。
{"title":"Analytics on Health of Mobile Software Ecosystem Based on the Internal Operating Mechanism","authors":"Jianmao Xiao, Shizhan Chen, Shiping Chen, Chao Gao, Hongyue Wu, Xiao Xue, Zhiyong Feng","doi":"10.1109/SCC49832.2020.00042","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00042","url":null,"abstract":"In addition to publishing and downloading mobile apps, Mobile App Store (MAS) has become the most important ecosystem on mobile smart devices, i.e., Mobile Software Ecosystem (MSECO). However, most of the existing work focus on the analysis of a single entity in MSECO, and rarely analyze the interaction between the entities (such as users, developers, etc.) in MSECO as well as the comprehensive effect of each entity to the entire ecosystem health. In this paper, we propose a method based on computational experiments to simulate and analyze the complex and dynamic interaction of various entities and, how these entities impact on health of MSECO. Firstly, a requirement-driven MSECO model named R-MSECO is established to break down the entire MSECO, which includes user-app interaction, developers-requirements interaction and macro-control of mobile platform three sub-models. Secondly, the health measurement method is proposed to measure the health of MSECO. Finally, we simulate the impact of the dynamic interaction of various entities on the health of MSECO based on computational experiments. The experimental results show that our method is helpful for mobile users to better understanding of the current state of MSECO as well as can provide a reference for developers and platform managers to make development and operation decisions, which is of great significance for the continued healthy development of MSECO.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132050067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Research of Link Prediction in Knowledge Graph based on Distance Constraint 基于距离约束的知识图链接预测研究
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00018
Li Wei, Fangfang Liu
Large-scale knowledge graphs have a lot of hidden knowledge which has not been discovered, so the link prediction of the knowledge graph is an important topic. Translation models represented by TransE are the well-researched algorithms of link prediction. They project the entities and the relations in the knowledge graphs into some continuous vector spaces, and adjust the vector representations of the relations and the entities according to each piece of knowledge. However, in the case of a non-1-to-1 relationship, multiple entity vectors will compete for the same coordinate position in the space. Aiming at this problem, this paper proposes an improved method. By imposing a distance constraint on the competitive entities of a non-1-to-1 relationship, we can narrow the differences between them. Each entity will consider the other competitive entities while adapting itself to fit a triplet, so as to reach the status that each competitive entity is close to the coordinate point of the competition as a whole. Distance constraint can be applied to the existing translation models as a means of optimization. Experiments are conducted on the datasets: FB15K and WN18, and the experimental results show that the method we proposed is effective.
大规模知识图中存在大量未被发现的隐性知识,因此知识图的链接预测是一个重要的课题。以TransE为代表的翻译模型是研究比较成熟的链接预测算法。它们将知识图中的实体和关系投影到一些连续的向量空间中,并根据每条知识调整关系和实体的向量表示。然而,在非1对1关系的情况下,多个实体向量将竞争空间中的相同坐标位置。针对这一问题,本文提出了一种改进的方法。通过对非1对1关系的竞争实体施加距离约束,我们可以缩小它们之间的差异。每个竞争主体在适应一个三元组的同时会考虑其他竞争主体,从而达到每个竞争主体作为一个整体接近于竞争的坐标点的状态。距离约束可以作为一种优化的手段应用于现有的翻译模型。在FB15K和WN18数据集上进行了实验,实验结果表明我们提出的方法是有效的。
{"title":"The Research of Link Prediction in Knowledge Graph based on Distance Constraint","authors":"Li Wei, Fangfang Liu","doi":"10.1109/SCC49832.2020.00018","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00018","url":null,"abstract":"Large-scale knowledge graphs have a lot of hidden knowledge which has not been discovered, so the link prediction of the knowledge graph is an important topic. Translation models represented by TransE are the well-researched algorithms of link prediction. They project the entities and the relations in the knowledge graphs into some continuous vector spaces, and adjust the vector representations of the relations and the entities according to each piece of knowledge. However, in the case of a non-1-to-1 relationship, multiple entity vectors will compete for the same coordinate position in the space. Aiming at this problem, this paper proposes an improved method. By imposing a distance constraint on the competitive entities of a non-1-to-1 relationship, we can narrow the differences between them. Each entity will consider the other competitive entities while adapting itself to fit a triplet, so as to reach the status that each competitive entity is close to the coordinate point of the competition as a whole. Distance constraint can be applied to the existing translation models as a means of optimization. Experiments are conducted on the datasets: FB15K and WN18, and the experimental results show that the method we proposed is effective.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127914920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
desc2tag: A Reinforcement Learning Approach to Mashup Tag Recommendation desc2tag: Mashup标签推荐的强化学习方法
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00073
R. Anarfi, Benjamin A. Kwapong, K. K. Fletcher
Tags are critical sources of data for search, browsing and information retrieval. Manual selection of tags, over the years, have not been very effective. This paper introduces an approach to automatic mashup tag recommendation, based on reinforcement learning (RL). Our RL approach is able to carry out effective exploratory actions to automatically extract the and recommend tags for mashups. We perform experiments to evaluate our proposed method. Results from our experiments show that, the recommended mashup tags improve performance on the information retrieval task.
标签是搜索、浏览和信息检索的重要数据来源。多年来,手工选择标签并不是很有效。介绍了一种基于强化学习(RL)的混搭标签自动推荐方法。我们的RL方法能够执行有效的探索性操作来自动提取和推荐mashup的标记。我们通过实验来评估我们提出的方法。实验结果表明,推荐的mashup标记提高了信息检索任务的性能。
{"title":"desc2tag: A Reinforcement Learning Approach to Mashup Tag Recommendation","authors":"R. Anarfi, Benjamin A. Kwapong, K. K. Fletcher","doi":"10.1109/SCC49832.2020.00073","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00073","url":null,"abstract":"Tags are critical sources of data for search, browsing and information retrieval. Manual selection of tags, over the years, have not been very effective. This paper introduces an approach to automatic mashup tag recommendation, based on reinforcement learning (RL). Our RL approach is able to carry out effective exploratory actions to automatically extract the and recommend tags for mashups. We perform experiments to evaluate our proposed method. Results from our experiments show that, the recommended mashup tags improve performance on the information retrieval task.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128768808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-factor-based Motion Detection for Server Rack Doors Left Open 基于多因素的服务器机架门敞开运动检测
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00067
Ruriko Kudo, Yasuharu Katsuno, Fumiko Satoh
It is crucial to ensure that server rack doors are properly closed in data centers in order to prevent serious dangers caused by the doors suddenly opening in disasters and causing accidents that hurt maintenance workers and damage facilities. In this paper, we propose multi-factor-based motion detection for server rack doors that have been left open. Our approach recognizes the status of server rack doors on the basis of maintenance workers’ motion and prevents a worker from forgetting to close the doors without using any additional devices, only smartphones, which maintenance engineers already carry for work.
在数据中心,确保服务器机架的门关闭是至关重要的,以防止在灾难中突然打开的门造成严重的危险,并造成伤害维护人员和损坏设施的事故。在本文中,我们提出了基于多因素的运动检测服务器机架门已经打开。我们的方法可以根据维修工人的动作识别服务器机架门的状态,并防止工人忘记关门,而不需要使用任何额外的设备,只有智能手机,维修工程师已经带着智能手机上班了。
{"title":"Multi-factor-based Motion Detection for Server Rack Doors Left Open","authors":"Ruriko Kudo, Yasuharu Katsuno, Fumiko Satoh","doi":"10.1109/SCC49832.2020.00067","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00067","url":null,"abstract":"It is crucial to ensure that server rack doors are properly closed in data centers in order to prevent serious dangers caused by the doors suddenly opening in disasters and causing accidents that hurt maintenance workers and damage facilities. In this paper, we propose multi-factor-based motion detection for server rack doors that have been left open. Our approach recognizes the status of server rack doors on the basis of maintenance workers’ motion and prevents a worker from forgetting to close the doors without using any additional devices, only smartphones, which maintenance engineers already carry for work.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116813922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Reference Method for Performance Evaluation in Big Data Architectures 大数据架构中性能评估的参考方法
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00044
Wictor Souza Martins, B. Kuehne, Rafael Ferreira Sobrinho, F. Preti
This paper presents a reference method for performance evaluation in Big Data architectures, called by Improvement Method for Big Data Architectures (IMBDA) aiming to increase the performance, and consequently raising the quality of service provided. The method will contribute to small businesses and startups that have limited financial re-sources (impossible to invest in market solutions). The proposed approach considers the relationship of the processes in a data processing flow to find possible bottlenecks and optimization points. To this end, IMBDA collects system logs to compose functional metrics (e.g., processing time) and non-functional metrics (e.g., CPU and memory utilization, and other cloud computing infrastructure resources). The system stores these metrics in an external data analysis tool that investigates the correlation of performance between processes. The reference method applies to the architecture of a Big Data application, which provides solutions in fleet logistics. With the use of IMBDA, it was possible to identify performance bottlenecks, allowing the reconfiguration of the architecture to increase service quality at the lowest possible cost.
本文提出了一种大数据架构性能评估的参考方法,称为IMBDA (Improvement method for Big Data architectures),旨在提高大数据架构的性能,从而提高所提供的服务质量。该方法将有助于小型企业和创业公司,有有限的财政资源(不可能投资于市场解决方案)。该方法考虑了数据处理流中各过程之间的关系,以发现可能的瓶颈和优化点。为此,IMBDA收集系统日志,以组成功能指标(例如,处理时间)和非功能指标(例如,CPU和内存利用率,以及其他云计算基础设施资源)。系统将这些指标存储在外部数据分析工具中,该工具用于调查进程之间性能的相关性。参考方法适用于大数据应用的架构,为车队物流提供解决方案。通过使用IMBDA,可以识别性能瓶颈,从而允许对体系结构进行重新配置,以尽可能低的成本提高服务质量。
{"title":"A Reference Method for Performance Evaluation in Big Data Architectures","authors":"Wictor Souza Martins, B. Kuehne, Rafael Ferreira Sobrinho, F. Preti","doi":"10.1109/SCC49832.2020.00044","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00044","url":null,"abstract":"This paper presents a reference method for performance evaluation in Big Data architectures, called by Improvement Method for Big Data Architectures (IMBDA) aiming to increase the performance, and consequently raising the quality of service provided. The method will contribute to small businesses and startups that have limited financial re-sources (impossible to invest in market solutions). The proposed approach considers the relationship of the processes in a data processing flow to find possible bottlenecks and optimization points. To this end, IMBDA collects system logs to compose functional metrics (e.g., processing time) and non-functional metrics (e.g., CPU and memory utilization, and other cloud computing infrastructure resources). The system stores these metrics in an external data analysis tool that investigates the correlation of performance between processes. The reference method applies to the architecture of a Big Data application, which provides solutions in fleet logistics. With the use of IMBDA, it was possible to identify performance bottlenecks, allowing the reconfiguration of the architecture to increase service quality at the lowest possible cost.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117026327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bringing Semantics to Support Ocean FAIR Data Services with Ontologies 引入语义支持海洋公平数据服务与本体
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00011
Xiaoli Ren, Xiaoyong Li, Kefeng Deng, Kaijun Ren, Aolong Zhou, Junqiang Song
With the increasing attention to ocean and the development of data-intensive sciences, a large amount of ocean data has been acquired by various observing platforms and sensors, which poses new challenges to data management and utilization. Typically, nowadays we target to move ocean data management toward the FAIR principles of being findable, accessible, interoperable, and reusable. However, the data produced and managed by different organizations with wide diversity, various structures and increasing volume make it hard to be FAIR, and one of the most critical reason is the lack of unified data representation and publication methods. In this paper, we propose novel techniques to try to solve the problem by introducing semantics with ontologies. Specifically, we first propose a unified semantic model named OEDO to represent ocean data by defining the concepts of ocean observing field, specifying the relations between the concepts, and describing the properties with ocean metadata. Then, we further optimize the state-of-the-art quick service query list (QSQL) data structure, by extending the domain concepts with WordNet to improve data discovery. Moreover, based on the OEDO model and the optimized QSQL, we propose an ocean data service publishing method called DOLP to improve data discovery and data access. Finally, we conduct extensive experiments to demonstrate the effectiveness and efficiency of our proposals.
随着人们对海洋的日益关注和数据密集型科学的发展,各种观测平台和传感器获取了大量的海洋数据,这对数据的管理和利用提出了新的挑战。通常,现在我们的目标是将海洋数据管理转向可查找、可访问、可互操作和可重用的FAIR原则。然而,不同组织产生和管理的数据种类繁多,结构多样,数量不断增加,使得数据难以做到公平,其中最关键的原因之一是缺乏统一的数据表示和发布方法。在本文中,我们提出了一种新的技术,试图通过引入带有本体的语义来解决这个问题。具体来说,我们首先通过定义海洋观测场的概念,明确概念之间的关系,并用海洋元数据描述其属性,提出了一个统一的语义模型OEDO来表示海洋数据。然后,我们进一步优化了最先进的快速服务查询列表(QSQL)数据结构,通过扩展WordNet的领域概念来改进数据发现。此外,基于OEDO模型和优化后的QSQL,提出了一种海洋数据服务发布方法——DOLP,以提高数据发现和数据访问能力。最后,我们进行了大量的实验来证明我们的建议的有效性和效率。
{"title":"Bringing Semantics to Support Ocean FAIR Data Services with Ontologies","authors":"Xiaoli Ren, Xiaoyong Li, Kefeng Deng, Kaijun Ren, Aolong Zhou, Junqiang Song","doi":"10.1109/SCC49832.2020.00011","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00011","url":null,"abstract":"With the increasing attention to ocean and the development of data-intensive sciences, a large amount of ocean data has been acquired by various observing platforms and sensors, which poses new challenges to data management and utilization. Typically, nowadays we target to move ocean data management toward the FAIR principles of being findable, accessible, interoperable, and reusable. However, the data produced and managed by different organizations with wide diversity, various structures and increasing volume make it hard to be FAIR, and one of the most critical reason is the lack of unified data representation and publication methods. In this paper, we propose novel techniques to try to solve the problem by introducing semantics with ontologies. Specifically, we first propose a unified semantic model named OEDO to represent ocean data by defining the concepts of ocean observing field, specifying the relations between the concepts, and describing the properties with ocean metadata. Then, we further optimize the state-of-the-art quick service query list (QSQL) data structure, by extending the domain concepts with WordNet to improve data discovery. Moreover, based on the OEDO model and the optimized QSQL, we propose an ocean data service publishing method called DOLP to improve data discovery and data access. Finally, we conduct extensive experiments to demonstrate the effectiveness and efficiency of our proposals.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127161240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Suitability-based Task Assignment in Crowdsourcing Markets 众包市场中基于适用性的任务分配
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00054
Pengwei Wang, Zhen Chen, Zhaohui Zhang
Crowdsourcing web services has received much attention in recent years, which has been widely used in many fields, and provides important human computing services for the rapid development of AI. Crowdsourcing knowledge acquisition is one of the most important applications, which includes a series of work such as image annotation and picture classification. However, due to the differences in the difficulty of these tasks and the uncertainty of workers, how to make a reasonable task assignment while ensuring the completion of tasks becomes a big challenge. To this end, we introduce WordNet external knowledge base to help determine the difficulty of picture classification tasks. We refer to the e-sports rank mechanism and use dynamic update strategy to assess the actual ability of workers. A novel criterion affinity based on the weighted Euclidean distance with penalty factor is proposed to measure the suitability between tasks and workers. On this basis, the Kuhn-Munkres (KM) algorithm is used to solve the weighted bipartite graph matching problem. Through comparative experiments, the effectiveness of our proposed method is verified.
众包web服务近年来备受关注,广泛应用于多个领域,为人工智能的快速发展提供了重要的人工计算服务。众包知识获取是其中最重要的应用之一,它包括图像标注和图像分类等一系列工作。然而,由于这些任务难度的差异和工作者的不确定性,如何在保证任务完成的同时进行合理的任务分配成为一个很大的挑战。为此,我们引入了WordNet外部知识库来帮助确定图像分类任务的难度。我们借鉴电子竞技排名机制,采用动态更新策略来评估工作人员的实际能力。提出了一种基于带惩罚因子的加权欧几里得距离的标准亲和度来衡量任务与工人之间的适宜性。在此基础上,采用Kuhn-Munkres (KM)算法求解加权二部图匹配问题。通过对比实验,验证了该方法的有效性。
{"title":"Suitability-based Task Assignment in Crowdsourcing Markets","authors":"Pengwei Wang, Zhen Chen, Zhaohui Zhang","doi":"10.1109/SCC49832.2020.00054","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00054","url":null,"abstract":"Crowdsourcing web services has received much attention in recent years, which has been widely used in many fields, and provides important human computing services for the rapid development of AI. Crowdsourcing knowledge acquisition is one of the most important applications, which includes a series of work such as image annotation and picture classification. However, due to the differences in the difficulty of these tasks and the uncertainty of workers, how to make a reasonable task assignment while ensuring the completion of tasks becomes a big challenge. To this end, we introduce WordNet external knowledge base to help determine the difficulty of picture classification tasks. We refer to the e-sports rank mechanism and use dynamic update strategy to assess the actual ability of workers. A novel criterion affinity based on the weighted Euclidean distance with penalty factor is proposed to measure the suitability between tasks and workers. On this basis, the Kuhn-Munkres (KM) algorithm is used to solve the weighted bipartite graph matching problem. Through comparative experiments, the effectiveness of our proposed method is verified.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124447303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Novel Data-to-Text Generation Model with Transformer Planning and a Wasserstein Auto-Encoder 一种具有变压器规划和Wasserstein自编码器的新型数据到文本生成模型
Pub Date : 2020-11-01 DOI: 10.1109/SCC49832.2020.00051
Xiaohong Xu, T. He, Huazhen Wang
Existing methods for data-to-text generation have difficulty producing diverse texts with low duplication rates. In this paper, we propose a novel data-to-text generation model with Transformer planning and a Wasserstein auto-encoder, which can convert constructed data to coherent and diverse text. This model possesses the following features: Transformer is first used to generate the data planning sequence of the target text content (each sequence is a subset of the input items that can be covered by a sentence), and then the Wasserstein Auto-Encoder(WAE) and a deep neural network are employed to establish the global latent variable space of the model. Second, text generation is performed through a hierarchical structure that takes the data planning sequence, global latent variables, and context of the generated sentences as conditions. Furthermore, to achieve diversity of text expression, a decoder is developed that combines the neural network with the WAE. The experimental results show that this model can achieve higher evaluation scores than the existing baseline models in terms of the diversity metrics of text generation and the duplication rate.
现有的数据到文本生成方法难以产生低重复率的多种文本。在本文中,我们提出了一种新的数据到文本生成模型,该模型具有Transformer规划和Wasserstein自编码器,可以将构造的数据转换为连贯和多样的文本。该模型具有以下特点:首先使用Transformer生成目标文本内容的数据规划序列(每个序列是一个句子可以涵盖的输入项的子集),然后使用Wasserstein Auto-Encoder(WAE)和深度神经网络建立模型的全局潜在变量空间。其次,文本生成通过分层结构执行,该结构以数据规划序列、全局潜在变量和生成句子的上下文为条件。此外,为了实现文本表达的多样性,开发了一种将神经网络与WAE相结合的解码器。实验结果表明,该模型在文本生成的多样性指标和重复率方面都比现有的基线模型获得了更高的评价分数。
{"title":"A Novel Data-to-Text Generation Model with Transformer Planning and a Wasserstein Auto-Encoder","authors":"Xiaohong Xu, T. He, Huazhen Wang","doi":"10.1109/SCC49832.2020.00051","DOIUrl":"https://doi.org/10.1109/SCC49832.2020.00051","url":null,"abstract":"Existing methods for data-to-text generation have difficulty producing diverse texts with low duplication rates. In this paper, we propose a novel data-to-text generation model with Transformer planning and a Wasserstein auto-encoder, which can convert constructed data to coherent and diverse text. This model possesses the following features: Transformer is first used to generate the data planning sequence of the target text content (each sequence is a subset of the input items that can be covered by a sentence), and then the Wasserstein Auto-Encoder(WAE) and a deep neural network are employed to establish the global latent variable space of the model. Second, text generation is performed through a hierarchical structure that takes the data planning sequence, global latent variables, and context of the generated sentences as conditions. Furthermore, to achieve diversity of text expression, a decoder is developed that combines the neural network with the WAE. The experimental results show that this model can achieve higher evaluation scores than the existing baseline models in terms of the diversity metrics of text generation and the duplication rate.","PeriodicalId":274909,"journal":{"name":"2020 IEEE International Conference on Services Computing (SCC)","volume":"333 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122140862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2020 IEEE International Conference on Services Computing (SCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1