首页 > 最新文献

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)最新文献

英文 中文
Compositional Recurrent Neural Networks for Chinese Short Text Classification 中文短文本分类的组合递归神经网络
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0029
Yujun Zhou, Bo Xu, Jiaming Xu, Lei Yang, Changliang Li, Bo Xu
Word segmentation is the first step in Chinese natural language processing, and the error caused by word segmentation can be transmitted to the whole system. In order to reduce the impact of word segmentation and improve the overall performance of Chinese short text classification system, we propose a hybrid model of character-level and word-level features based on recurrent neural network (RNN) with long short-term memory (LSTM). By integrating character-level feature into word-level feature, the missing semantic information by the error of word segmentation will be constructed, meanwhile the wrong semantic relevance will be reduced. The final feature representation is that it suppressed the error of word segmentation in the case of maintaining most of the semantic features of the sentence. The whole model is finally trained end-to-end with supervised Chinese short text classification task. Results demonstrate that the proposed model in this paper is able to represent Chinese short text effectively, and the performances of 32-class and 5-class categorization outperform some remarkable methods.
分词是汉语自然语言处理的第一步,分词产生的误差可以传递到整个系统。为了减少分词的影响,提高中文短文本分类系统的整体性能,提出了一种基于长短时记忆递归神经网络(RNN)的字符级和词级特征混合模型。通过将字符级特征与词级特征相结合,构建分词错误所缺失的语义信息,同时减少错误的语义关联。最后的特征表示是在保持句子大部分语义特征的情况下,抑制了分词的错误。最后通过监督中文短文本分类任务对整个模型进行端到端训练。结果表明,本文提出的模型能够有效地表示中文短文本,32类分类和5类分类的性能优于一些显著的方法。
{"title":"Compositional Recurrent Neural Networks for Chinese Short Text Classification","authors":"Yujun Zhou, Bo Xu, Jiaming Xu, Lei Yang, Changliang Li, Bo Xu","doi":"10.1109/WI.2016.0029","DOIUrl":"https://doi.org/10.1109/WI.2016.0029","url":null,"abstract":"Word segmentation is the first step in Chinese natural language processing, and the error caused by word segmentation can be transmitted to the whole system. In order to reduce the impact of word segmentation and improve the overall performance of Chinese short text classification system, we propose a hybrid model of character-level and word-level features based on recurrent neural network (RNN) with long short-term memory (LSTM). By integrating character-level feature into word-level feature, the missing semantic information by the error of word segmentation will be constructed, meanwhile the wrong semantic relevance will be reduced. The final feature representation is that it suppressed the error of word segmentation in the case of maintaining most of the semantic features of the sentence. The whole model is finally trained end-to-end with supervised Chinese short text classification task. Results demonstrate that the proposed model in this paper is able to represent Chinese short text effectively, and the performances of 32-class and 5-class categorization outperform some remarkable methods.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"7 1","pages":"137-144"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90336560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
From Opinion Lexicons to Sentiment Classification of Tweets and Vice Versa: A Transfer Learning Approach 从观点词汇到推文的情感分类,反之亦然:一种迁移学习方法
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.29
Felipe Bravo-Marquez, E. Frank, B. Pfahringer
Message-level and word-level polarity classification are two popular tasks in Twitter sentiment analysis. They have been commonly addressed by training supervised models from labelled data. The main limitation of these models is the high cost of data annotation. Transferring existing labels from a related problem domain is one possible solution for this problem. In this paper, we propose a simple model for transferring sentiment labels from words to tweets and vice versa by representing both tweets and words using feature vectors residing in the same feature space. Tweets are represented by standard NLP features such as unigrams and part-of-speech tags. Words are represented by averaging the vectors of the tweets in which they occur. We evaluate our approach in two transfer learning problems: 1) training a tweet-level polarity classifier from a polarity lexicon, and 2) inducing a polarity lexicon from a collection of polarity-annotated tweets. Our results show that the proposed approach can successfully classify words and tweets after transfer.
消息级和词级极性分类是Twitter情感分析中的两个常用任务。它们通常通过从标记数据中训练监督模型来解决。这些模型的主要限制是数据注释的高成本。从相关问题领域转移现有标签是解决此问题的一种可能方法。在本文中,我们提出了一个简单的模型,通过使用驻留在相同特征空间中的特征向量表示tweet和单词,将情感标签从单词转移到tweet,反之亦然。推文由标准的NLP特征表示,如单字符和词性标记。单词是通过对它们出现的tweet的向量进行平均来表示的。我们在两个迁移学习问题中评估了我们的方法:1)从极性词典中训练推文级极性分类器,以及2)从极性注释的推文集合中诱导极性词典。实验结果表明,该方法可以成功地对迁移后的词和推文进行分类。
{"title":"From Opinion Lexicons to Sentiment Classification of Tweets and Vice Versa: A Transfer Learning Approach","authors":"Felipe Bravo-Marquez, E. Frank, B. Pfahringer","doi":"10.1109/WI.2016.29","DOIUrl":"https://doi.org/10.1109/WI.2016.29","url":null,"abstract":"Message-level and word-level polarity classification are two popular tasks in Twitter sentiment analysis. They have been commonly addressed by training supervised models from labelled data. The main limitation of these models is the high cost of data annotation. Transferring existing labels from a related problem domain is one possible solution for this problem. In this paper, we propose a simple model for transferring sentiment labels from words to tweets and vice versa by representing both tweets and words using feature vectors residing in the same feature space. Tweets are represented by standard NLP features such as unigrams and part-of-speech tags. Words are represented by averaging the vectors of the tweets in which they occur. We evaluate our approach in two transfer learning problems: 1) training a tweet-level polarity classifier from a polarity lexicon, and 2) inducing a polarity lexicon from a collection of polarity-annotated tweets. Our results show that the proposed approach can successfully classify words and tweets after transfer.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"52 1","pages":"145-152"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89923499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Managing Evolving Trust Policies within Open and Decentralized Communities 在开放和分散的社区中管理不断发展的信任政策
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0119
Reda Yaich
Online communities promise a new era of flexible and dynamic collaborations. However, these features also raise new security challenges, especially regarding how trust is managed. In this paper, we focus on situations wherein communities participants collaborate with each others via software agents that take trust decisions on their behalf based on policies. Due to the open and dynamic nature of Online Communities, participants can neither anticipate all possible interactions nor have foreknowledge of sensitive resources and potentially malicious partners. This makes the specification of trust policies complex and risky, especially for collective (i.e., community-level) policies, motivating the need for policies evolution. The aim of this paper is to introduce an approach in order to manage the evolution of trust policies within online communities. Our scenario allows any member of the community to trigger the evolution of the community-level policy and make the other members of the community converge towards it.
在线社区预示着一个灵活和动态协作的新时代。然而,这些特性也带来了新的安全挑战,特别是在如何管理信任方面。在本文中,我们关注社区参与者通过软件代理相互协作的情况,软件代理根据策略代表他们做出信任决策。由于在线社区的开放性和动态性,参与者既无法预测所有可能的交互,也无法预知敏感资源和潜在的恶意伙伴。这使得信任策略的规范变得复杂和有风险,特别是对于集体(即社区级)策略,从而激发了对策略演变的需求。本文的目的是介绍一种方法来管理在线社区中信任策略的演变。我们的场景允许社区的任何成员触发社区级政策的演变,并使社区的其他成员向它靠拢。
{"title":"Managing Evolving Trust Policies within Open and Decentralized Communities","authors":"Reda Yaich","doi":"10.1109/WI.2016.0119","DOIUrl":"https://doi.org/10.1109/WI.2016.0119","url":null,"abstract":"Online communities promise a new era of flexible and dynamic collaborations. However, these features also raise new security challenges, especially regarding how trust is managed. In this paper, we focus on situations wherein communities participants collaborate with each others via software agents that take trust decisions on their behalf based on policies. Due to the open and dynamic nature of Online Communities, participants can neither anticipate all possible interactions nor have foreknowledge of sensitive resources and potentially malicious partners. This makes the specification of trust policies complex and risky, especially for collective (i.e., community-level) policies, motivating the need for policies evolution. The aim of this paper is to introduce an approach in order to manage the evolution of trust policies within online communities. Our scenario allows any member of the community to trigger the evolution of the community-level policy and make the other members of the community converge towards it.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"10 1","pages":"668-673"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90154666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recommendation System Based on Prediction of User Preference Changes 基于用户偏好变化预测的推荐系统
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0036
Kenta Inuzuka, Tomonori Hayashi, T. Takagi
Time always exists in our lives and time data can easily be collected in a variety of applications. For example, when you purchase items online or click on an ad, the time at which you chose the item or clicked the ad is recorded. The analysis of time information can therefore be applied in various areas. It is important to note that user preferences change over time. For example, a person who watched animated TV shows in childhood will most likely switch to watching the news in adulthood. It is effective to incorporate such changes into recommender systems. In this paper, we propose an approach that predicts user preferences with consideration of preference changes by learning the order of purchase history in a recommender system. Our approach is composed of three steps. First, we obtain user features based on matrix factorization and purchasing time. Next, we use a Kalman filter to predict user preference vectors from user features. Finally, we generate a recommendation list, at which time we propose two types of recommendation methods using the predicted vectors. We then show through experiments using a real-world dataset that our approach outperforms competitive methods such as the first order Markov model.
时间一直存在于我们的生活中,时间数据可以很容易地收集到各种应用程序中。例如,当你在网上购买物品或点击广告时,你选择物品或点击广告的时间被记录下来。因此,时间信息的分析可以应用于各个领域。需要注意的是,用户偏好会随着时间而变化。例如,一个在童年时期看动画电视节目的人很可能在成年后转向看新闻。将这些变化纳入推荐系统是有效的。在本文中,我们提出了一种通过学习推荐系统中的购买历史顺序来预测用户偏好并考虑偏好变化的方法。我们的方法由三个步骤组成。首先,基于矩阵分解和购买时间获得用户特征;接下来,我们使用卡尔曼滤波从用户特征中预测用户偏好向量。最后,我们生成了一个推荐列表,同时我们利用预测的向量提出了两种推荐方法。然后,我们通过使用真实世界数据集的实验证明,我们的方法优于一阶马尔可夫模型等竞争方法。
{"title":"Recommendation System Based on Prediction of User Preference Changes","authors":"Kenta Inuzuka, Tomonori Hayashi, T. Takagi","doi":"10.1109/WI.2016.0036","DOIUrl":"https://doi.org/10.1109/WI.2016.0036","url":null,"abstract":"Time always exists in our lives and time data can easily be collected in a variety of applications. For example, when you purchase items online or click on an ad, the time at which you chose the item or clicked the ad is recorded. The analysis of time information can therefore be applied in various areas. It is important to note that user preferences change over time. For example, a person who watched animated TV shows in childhood will most likely switch to watching the news in adulthood. It is effective to incorporate such changes into recommender systems. In this paper, we propose an approach that predicts user preferences with consideration of preference changes by learning the order of purchase history in a recommender system. Our approach is composed of three steps. First, we obtain user features based on matrix factorization and purchasing time. Next, we use a Kalman filter to predict user preference vectors from user features. Finally, we generate a recommendation list, at which time we propose two types of recommendation methods using the predicted vectors. We then show through experiments using a real-world dataset that our approach outperforms competitive methods such as the first order Markov model.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"5 1","pages":"192-199"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74322285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A Research on Sentence Similarity for Question Answering System Based on Multi-feature Fusion 基于多特征融合的问答系统句子相似度研究
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0085
Haipeng Ruan, Yuan Li, Qinling Wang, Yu Liu
If just consider one feature of sentences to calculate sentences similarity, the performance of system is difficult to reach a satisfactory level. This paper presents a method of combining the features of semantic and structural to compute sentences similarity. It first discusses the methods of calculating the semantic similarity of sentences through word embedding and Tongyici Cilin. Next, it discusses the methods of calculating the morphological similarity and order similarity of sentences, and then combines the features through the neutral network to calculate the total similarity of the sentences. We include results from an evaluation of the system's performance and show that a combination of the features works better than any single approach.
如果只考虑句子的一个特征来计算句子的相似度,系统的性能很难达到令人满意的水平。本文提出了一种结合语义特征和结构特征计算句子相似度的方法。首先讨论了通过词嵌入和同义词林计算句子语义相似度的方法。其次,讨论了句子的形态相似度和顺序相似度的计算方法,然后通过神经网络将特征结合起来计算句子的总相似度。我们包括了对系统性能的评估结果,并表明组合这些特征比任何单一方法都更好。
{"title":"A Research on Sentence Similarity for Question Answering System Based on Multi-feature Fusion","authors":"Haipeng Ruan, Yuan Li, Qinling Wang, Yu Liu","doi":"10.1109/WI.2016.0085","DOIUrl":"https://doi.org/10.1109/WI.2016.0085","url":null,"abstract":"If just consider one feature of sentences to calculate sentences similarity, the performance of system is difficult to reach a satisfactory level. This paper presents a method of combining the features of semantic and structural to compute sentences similarity. It first discusses the methods of calculating the semantic similarity of sentences through word embedding and Tongyici Cilin. Next, it discusses the methods of calculating the morphological similarity and order similarity of sentences, and then combines the features through the neutral network to calculate the total similarity of the sentences. We include results from an evaluation of the system's performance and show that a combination of the features works better than any single approach.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"64 1","pages":"507-510"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76301879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Generic Framework to Predict Repeat Behavior of Customers Using Their Transaction History 使用交易历史预测客户重复行为的通用框架
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0072
Auon Haidar Kazmi, Gautam M. Shroff, P. Agarwal
There exists a class of problems in e-commerce and retail businesses where the shopping behavior of customers is analyzed in order to predict their repeat behavior for products or retail stores. This analysis plays a crucial role in advertisement budgeting, product placement and relevant customer targeting. Researchers have addressed this problem by using standard predictive models, which use ad hoc features. We propose a metamodel that abstracts the different dimensions of data present in transactional datasets. These dimensions can be customer, product, offer, target, marketplace and transactions. Our framework also has abstract functions for comprehensive feature set generation, and includes different machine learning algorithms to learn prediction model. Our framework works end-to-end from feature engineering to reporting repeat probabilities of customers for products (or marketplace, brand, website or storechain). Moreover, the predicted repeat behavior of customers for different products along with their transactional history is used by our offer optimization model i-Prescribe to suggest products to be offered to customers with the goal of maximizing the return on investment of given marketing budget. We prove that our abstract features work on two different data-challenge datasets, by sharing experimental results.
在电子商务和零售业务中存在一类问题,即通过分析顾客的购物行为来预测他们对产品或零售商店的重复行为。这种分析在广告预算、产品植入和相关的客户定位中起着至关重要的作用。研究人员通过使用标准的预测模型解决了这个问题,这些模型使用了特别的特征。我们提出了一个元模型,它抽象了事务数据集中存在的不同维度的数据。这些维度可以是客户、产品、报价、目标、市场和交易。我们的框架还具有用于综合特征集生成的抽象功能,并包含不同的机器学习算法来学习预测模型。我们的框架从特征工程到报告产品(或市场、品牌、网站或连锁店)客户重复概率的端到端工作。此外,我们的报价优化模型i- prescription利用预测的客户对不同产品的重复行为及其交易历史,以给定营销预算的投资回报率最大化为目标,向客户推荐产品。通过分享实验结果,我们证明了我们的抽象特征可以在两个不同的数据挑战数据集上工作。
{"title":"Generic Framework to Predict Repeat Behavior of Customers Using Their Transaction History","authors":"Auon Haidar Kazmi, Gautam M. Shroff, P. Agarwal","doi":"10.1109/WI.2016.0072","DOIUrl":"https://doi.org/10.1109/WI.2016.0072","url":null,"abstract":"There exists a class of problems in e-commerce and retail businesses where the shopping behavior of customers is analyzed in order to predict their repeat behavior for products or retail stores. This analysis plays a crucial role in advertisement budgeting, product placement and relevant customer targeting. Researchers have addressed this problem by using standard predictive models, which use ad hoc features. We propose a metamodel that abstracts the different dimensions of data present in transactional datasets. These dimensions can be customer, product, offer, target, marketplace and transactions. Our framework also has abstract functions for comprehensive feature set generation, and includes different machine learning algorithms to learn prediction model. Our framework works end-to-end from feature engineering to reporting repeat probabilities of customers for products (or marketplace, brand, website or storechain). Moreover, the predicted repeat behavior of customers for different products along with their transactional history is used by our offer optimization model i-Prescribe to suggest products to be offered to customers with the goal of maximizing the return on investment of given marketing budget. We prove that our abstract features work on two different data-challenge datasets, by sharing experimental results.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"263 1","pages":"449-452"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76416714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An Analysis of Main Solutions for the Automatic Construction of Ontologies from Text 文本本体自动构建的主要解决方案分析
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0074
R. Girardi
Ontologies are increasingly used by modern knowledge systems for representing and sharing knowledge. Supporting semantic processing, ontology-driven knowledge systems allow for more precise information interpretation, thus providing greater usability and effectiveness than traditional information systems. Manual construction of ontologies by domain experts and knowledge engineers is a costly task, therefore automatic and/or semi-automatic approaches to their development are needed, a field of research that is usually referred to as ontology learning and population. This is the main focus of this article which discusses main problems and corresponding solutions for the automated acquisition of each one of the components of an ontology (classes, properties, taxonomic and non-taxonomic relationships, axioms and instances).
现代知识系统越来越多地使用本体来表示和共享知识。本体驱动的知识系统支持语义处理,允许更精确的信息解释,从而提供比传统信息系统更高的可用性和有效性。领域专家和知识工程师手工构建本体是一项昂贵的任务,因此需要自动和/或半自动的方法来开发本体,这是一个通常被称为本体学习和人口的研究领域。这是本文的主要焦点,讨论了自动获取本体的每个组件(类、属性、分类和非分类关系、公理和实例)的主要问题和相应的解决方案。
{"title":"An Analysis of Main Solutions for the Automatic Construction of Ontologies from Text","authors":"R. Girardi","doi":"10.1109/WI.2016.0074","DOIUrl":"https://doi.org/10.1109/WI.2016.0074","url":null,"abstract":"Ontologies are increasingly used by modern knowledge systems for representing and sharing knowledge. Supporting semantic processing, ontology-driven knowledge systems allow for more precise information interpretation, thus providing greater usability and effectiveness than traditional information systems. Manual construction of ontologies by domain experts and knowledge engineers is a costly task, therefore automatic and/or semi-automatic approaches to their development are needed, a field of research that is usually referred to as ontology learning and population. This is the main focus of this article which discusses main problems and corresponding solutions for the automated acquisition of each one of the components of an ontology (classes, properties, taxonomic and non-taxonomic relationships, axioms and instances).","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"54 1","pages":"457-460"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78135871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensing Real-World Events Using Social Media Data and a Classification-Clustering Framework 使用社交媒体数据和分类聚类框架感知现实世界事件
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0039
Nasser Alsaedi, P. Burnap, O. Rana
In recent years, there has been increased interest in real-world event identification using data collected from social media, where theWeb enables the general public to post real-time reactions to terrestrial events - thereby acting as social sensors of terrestrial activity. Automatically extracting and categorizing activity from streamed data is a non-trivial task. To address this task, we present a novel event detection framework which comprises five main components: data collection, pre-processing, classification, online clustering and summarization. The integration between classification and clustering allows events to be detected - including "disruptive" events - incidents that threaten social safety and security, or could disrupt the social order. We evaluate our framework on a large-scale, real-world dataset from Twitter. We also compare our results to other leading approaches using Flickr MediaEval Event Detection Benchmark.
近年来,人们对利用从社交媒体收集的数据来识别现实世界的事件越来越感兴趣,在社交媒体上,网络使公众能够发布对地球事件的实时反应,从而充当地球活动的社会传感器。从流数据中自动提取和分类活动是一项重要的任务。为了解决这个问题,我们提出了一个新的事件检测框架,它包括五个主要部分:数据收集、预处理、分类、在线聚类和总结。将分类和聚类结合起来,可以检测到事件,包括“破坏性”事件,即威胁社会安全和保障或可能破坏社会秩序的事件。我们在来自Twitter的大规模真实数据集上评估我们的框架。我们还将我们的结果与使用Flickr MediaEval事件检测基准的其他领先方法进行了比较。
{"title":"Sensing Real-World Events Using Social Media Data and a Classification-Clustering Framework","authors":"Nasser Alsaedi, P. Burnap, O. Rana","doi":"10.1109/WI.2016.0039","DOIUrl":"https://doi.org/10.1109/WI.2016.0039","url":null,"abstract":"In recent years, there has been increased interest in real-world event identification using data collected from social media, where theWeb enables the general public to post real-time reactions to terrestrial events - thereby acting as social sensors of terrestrial activity. Automatically extracting and categorizing activity from streamed data is a non-trivial task. To address this task, we present a novel event detection framework which comprises five main components: data collection, pre-processing, classification, online clustering and summarization. The integration between classification and clustering allows events to be detected - including \"disruptive\" events - incidents that threaten social safety and security, or could disrupt the social order. We evaluate our framework on a large-scale, real-world dataset from Twitter. We also compare our results to other leading approaches using Flickr MediaEval Event Detection Benchmark.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"14 1","pages":"216-223"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79306731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Dynamic Allocation of Service Function Chains under Priority Dependency Constraint 优先依赖约束下业务功能链的动态分配
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0122
M. Masoud, Sanghoon Lee, S. Belkasim
Network functions virtualization is a new technology for the future internet that eliminates the dependency of the network function and the hardware requirement. The network functions virtualization provides a successful approach for meeting the increase in demand of the end-to-end (E2E) services with low operational and capital costs. Replacing the network specific purpose hardware (e.g. firewall) with a software implementation of the network functions in which a chain of Virtualized Network Functions (VNFs) can logically connect the end points and provide the desired network services. However, this approach is associated with the challenge of dynamically mapping the predefined VNFs onto the existing substrate network in an optimal way. In this paper, we propose a simple and effective approach for mapping the VNFs with the physical resources in a dynamic service request environment. The algorithm considers the priority dependency between the VNFs as a case of study, with the objective of minimizing the mapping blocking rate.
网络功能虚拟化是面向未来互联网的一项新技术,它消除了网络功能与硬件需求的依赖性。网络功能虚拟化以较低的运营成本和资金成本,成功地满足了端到端业务需求的增长。将网络专用硬件(例如防火墙)替换为网络功能的软件实现,其中虚拟网络功能链(VNFs)可以逻辑地连接端点并提供所需的网络服务。然而,这种方法面临着以最佳方式将预定义的VNFs动态映射到现有基板网络上的挑战。在本文中,我们提出了一种在动态服务请求环境中映射VNFs与物理资源的简单而有效的方法。该算法以VNFs之间的优先级依赖为研究对象,以最小化映射阻塞率为目标。
{"title":"Dynamic Allocation of Service Function Chains under Priority Dependency Constraint","authors":"M. Masoud, Sanghoon Lee, S. Belkasim","doi":"10.1109/WI.2016.0122","DOIUrl":"https://doi.org/10.1109/WI.2016.0122","url":null,"abstract":"Network functions virtualization is a new technology for the future internet that eliminates the dependency of the network function and the hardware requirement. The network functions virtualization provides a successful approach for meeting the increase in demand of the end-to-end (E2E) services with low operational and capital costs. Replacing the network specific purpose hardware (e.g. firewall) with a software implementation of the network functions in which a chain of Virtualized Network Functions (VNFs) can logically connect the end points and provide the desired network services. However, this approach is associated with the challenge of dynamically mapping the predefined VNFs onto the existing substrate network in an optimal way. In this paper, we propose a simple and effective approach for mapping the VNFs with the physical resources in a dynamic service request environment. The algorithm considers the priority dependency between the VNFs as a case of study, with the objective of minimizing the mapping blocking rate.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"6 1","pages":"684-688"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87637616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Learning Processes Based on Data Sources with Certainty Levels in Linked Open Data 基于关联开放数据中具有确定性级别的数据源的学习过程
Pub Date : 2016-10-01 DOI: 10.1109/WI.2016.0068
Jesse Xi Chen, M. Reformat, R. Yager
Linked Open Data (LOD) consists of numerous data stores that are highly interconnected. LOD stores use Resource Description Framework (RDF) as a data representation format. A graph-based nature of RDF brings an opportunity to develop new approaches for accumulating data from multiple sources characterized by different levels of confidence in them. Recently, a participatory learning mechanism has been extended to cope with RDF. It is an attractive way of integrating new pieces of information with already known ones. Further, it has been recognized that pieces of information describing entities can have a disjunctive or conjunctive form. This paper uses an RDF-based participatory learning process to aggregate information obtained from multiple data stores. This process provides mechanisms that determine overall certainty in combined data based on levels of confidence in already known pieces of information and new ones. The behavior of such a process used for integrating information equipped with different levels of uncertainty is presented, and a simple case study is included.
链接开放数据(LOD)由许多高度互连的数据存储组成。LOD存储使用资源描述框架(RDF)作为数据表示格式。RDF基于图的特性为开发新方法提供了机会,可以从具有不同置信度的多个数据源中积累数据。最近,一种参与式学习机制已经扩展到处理RDF。这是一种将新信息与已知信息整合在一起的有吸引力的方式。此外,人们已经认识到,描述实体的信息片段可以具有析取或连接形式。本文使用基于rdf的参与式学习过程来聚合从多个数据存储中获得的信息。这一过程提供了一种机制,可以根据对已知信息和新信息的置信度来确定组合数据的总体确定性。介绍了用于集成具有不同程度不确定性的信息的这种过程的行为,并包括一个简单的案例研究。
{"title":"Learning Processes Based on Data Sources with Certainty Levels in Linked Open Data","authors":"Jesse Xi Chen, M. Reformat, R. Yager","doi":"10.1109/WI.2016.0068","DOIUrl":"https://doi.org/10.1109/WI.2016.0068","url":null,"abstract":"Linked Open Data (LOD) consists of numerous data stores that are highly interconnected. LOD stores use Resource Description Framework (RDF) as a data representation format. A graph-based nature of RDF brings an opportunity to develop new approaches for accumulating data from multiple sources characterized by different levels of confidence in them. Recently, a participatory learning mechanism has been extended to cope with RDF. It is an attractive way of integrating new pieces of information with already known ones. Further, it has been recognized that pieces of information describing entities can have a disjunctive or conjunctive form. This paper uses an RDF-based participatory learning process to aggregate information obtained from multiple data stores. This process provides mechanisms that determine overall certainty in combined data based on levels of confidence in already known pieces of information and new ones. The behavior of such a process used for integrating information equipped with different levels of uncertainty is presented, and a simple case study is included.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"116 1","pages":"429-434"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86796078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1