Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics最新文献

英文中文

Concept-aware geographic information retrieval 概念感知地理信息检索

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106498

Noemi Mauro, L. Ardissono, Adriano Savoca

Textual queries are largely employed in information retrieval to let users specify search goals in a natural way. However, differences in user and system terminologies can challenge the identification of the user's information needs, and thus the generation of relevant results. We argue that the explicit management of ontological knowledge, and of the meaning of concepts (by integrating linguistic and encyclopaedic knowledge in the system ontology), can improve the analysis of search queries, because it enables a flexible identification of the topics the user is searching for, regardless of the adopted vocabulary. This paper proposes an information retrieval support model based on semantic concept identification. Starting from the recognition of the ontology concepts that the search query refers to, this model exploits the qualifiers specified in the query to select information items on the basis of possibly fine-grained features. Moreover, it supports query expansion and reformulation by suggesting the exploration of semantically similar concepts, as well as of concepts related to those referred in the query through thematic relations. A test on a data-set collected using the OnToMap Participatory GIS has shown that this approach provides accurate results.

文本查询主要用于信息检索，使用户能够以自然的方式指定搜索目标。然而，用户和系统术语的差异可能会对用户信息需求的识别造成挑战，从而影响相关结果的生成。我们认为，对本体知识和概念含义的明确管理(通过在系统本体中集成语言和百科知识)可以改善搜索查询的分析，因为它可以灵活地识别用户正在搜索的主题，而不管采用的词汇是什么。提出了一种基于语义概念识别的信息检索支持模型。该模型从识别搜索查询所引用的本体概念开始，利用查询中指定的限定符，根据可能的细粒度特征选择信息项。此外，它通过建议探索语义相似的概念以及通过主题关系与查询中引用的概念相关的概念来支持查询扩展和重新表述。对使用OnToMap参与式地理信息系统收集的数据集进行的测试表明，这种方法提供了准确的结果。

{"title":"Concept-aware geographic information retrieval","authors":"Noemi Mauro, L. Ardissono, Adriano Savoca","doi":"10.1145/3106426.3106498","DOIUrl":"https://doi.org/10.1145/3106426.3106498","url":null,"abstract":"Textual queries are largely employed in information retrieval to let users specify search goals in a natural way. However, differences in user and system terminologies can challenge the identification of the user's information needs, and thus the generation of relevant results. We argue that the explicit management of ontological knowledge, and of the meaning of concepts (by integrating linguistic and encyclopaedic knowledge in the system ontology), can improve the analysis of search queries, because it enables a flexible identification of the topics the user is searching for, regardless of the adopted vocabulary. This paper proposes an information retrieval support model based on semantic concept identification. Starting from the recognition of the ontology concepts that the search query refers to, this model exploits the qualifiers specified in the query to select information items on the basis of possibly fine-grained features. Moreover, it supports query expansion and reformulation by suggesting the exploration of semantically similar concepts, as well as of concepts related to those referred in the query through thematic relations. A test on a data-set collected using the OnToMap Participatory GIS has shown that this approach provides accurate results.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78438492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Brand key asset discovery via cluster-wise biased discriminant projection 品牌关键资产发现通过集群明智的偏见判别投影

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106516

Yang Liu, Zhonglei Gu, Tobey H. Ko, Jiming Liu

Accurate and effective discovery of a brand's key assets, namely, Key Opinion Leaders (KOLs) and potential customers, plays an essential role in marketing campaigns. In a massive online social network, brands are challenged with identifying a small portion of key assets over an enormous volume of irrelevant users, making the problem a highly imbalanced one. Moreover, having to deal with social media data that are usually high-dimensional, the task of brand key asset discovery can be immensely expensive yet inaccurate if the information are not processed efficiently to extract representative features from the original space prior to the learning process. To address the above issues, we propose a novel method dubbed Cluster-wise Biased Discriminant Projection (CBDP) to uncover the compact and informative features from users' data for brand key asset discovery. CBDP conducts a two-layer learning procedure. In the first layer, a Discriminant Clustering (DC) scheme is developed to partition the original dataset into clusters with maximum discriminant capacity. In the second layer, a Biased Discriminant Projection (BDP) algorithm is proposed and performed on each cluster to map the high-dimensional data to the low-dimensional subspace, where the discriminant information of classes with high importance/preference is preserved. A unified mapping function of CBDP is finally established by integrating these two layers. Experiments on both synthetic examples and a real-world brand key asset dataset validate the effectiveness of the proposed method.

准确有效地发现品牌的关键资产，即关键意见领袖(kol)和潜在客户，在营销活动中起着至关重要的作用。在一个庞大的在线社交网络中，品牌面临着在大量无关用户中识别一小部分关键资产的挑战，这使得问题变得高度不平衡。此外，由于必须处理通常是高维的社交媒体数据，如果在学习过程之前没有有效地处理信息以从原始空间中提取代表性特征，那么品牌关键资产发现的任务可能会非常昂贵且不准确。为了解决上述问题，我们提出了一种新的方法，称为聚类有偏差判别投影(CBDP)，从用户数据中揭示紧凑和信息丰富的特征，用于品牌关键资产发现。CBDP是一个两层学习过程。在第一层，提出了一种判别聚类(DC)方案，将原始数据集划分为具有最大判别能力的聚类。在第二层，提出了一种偏差判别投影(Biased Discriminant Projection, BDP)算法，并在每个聚类上执行该算法，将高维数据映射到低维子空间，在低维子空间中保留高重要性/偏好类的判别信息。将这两层进行整合，最终建立了统一的CBDP映射函数。在合成示例和真实品牌密钥资产数据集上的实验验证了所提方法的有效性。

{"title":"Brand key asset discovery via cluster-wise biased discriminant projection","authors":"Yang Liu, Zhonglei Gu, Tobey H. Ko, Jiming Liu","doi":"10.1145/3106426.3106516","DOIUrl":"https://doi.org/10.1145/3106426.3106516","url":null,"abstract":"Accurate and effective discovery of a brand's key assets, namely, Key Opinion Leaders (KOLs) and potential customers, plays an essential role in marketing campaigns. In a massive online social network, brands are challenged with identifying a small portion of key assets over an enormous volume of irrelevant users, making the problem a highly imbalanced one. Moreover, having to deal with social media data that are usually high-dimensional, the task of brand key asset discovery can be immensely expensive yet inaccurate if the information are not processed efficiently to extract representative features from the original space prior to the learning process. To address the above issues, we propose a novel method dubbed Cluster-wise Biased Discriminant Projection (CBDP) to uncover the compact and informative features from users' data for brand key asset discovery. CBDP conducts a two-layer learning procedure. In the first layer, a Discriminant Clustering (DC) scheme is developed to partition the original dataset into clusters with maximum discriminant capacity. In the second layer, a Biased Discriminant Projection (BDP) algorithm is proposed and performed on each cluster to map the high-dimensional data to the low-dimensional subspace, where the discriminant information of classes with high importance/preference is preserved. A unified mapping function of CBDP is finally established by integrating these two layers. Experiments on both synthetic examples and a real-world brand key asset dataset validate the effectiveness of the proposed method.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83391067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Improving the classification of events in tweets using semantic enrichment 使用语义充实改进tweets中的事件分类

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106435

Simone Aparecida Pinto Romero, Karin Becker

Contextual enrichment using external sources has been proposed as a means to deal with the poor textual contents of tweets for event classification. Related work performs contextual enrichment according to specific assumptions about the events. Furthermore, enrichment adds a significant amount of extra features, most of them with no discriminative contribution to the event classification task. In this paper, we propose an enrichment framework targeted at the classification of events in general, of which the key elements are: a) external enrichment using related web pages for extending the conceptual features contained within the tweets; b) semantic enrichment using the DBpedia to add related semantic features, and c) a pruning technique that selects the semantic features with discriminative potential. We compared the proposed approach against two distinct baselines based on textual features only and word embeddings, using seven different event datasets. Our experiments reveal that the proposed framework supports the classification of distinct event types, outperforming the textual baseline in 63.5% of the cases, and the word embeddings baseline in 96.5% of the cases.

使用外部源的上下文丰富被提出作为一种手段来处理推文的文本内容差的事件分类。相关工作根据对事件的特定假设进行上下文丰富。此外，浓缩增加了大量的额外特征，其中大多数对事件分类任务没有区别性贡献。在本文中，我们提出了一个针对一般事件分类的浓缩框架，其中的关键要素是:a)使用相关网页来扩展tweet中包含的概念特征的外部浓缩;b)使用DBpedia添加相关语义特征的语义丰富，以及c)选择具有判别潜力的语义特征的修剪技术。我们使用七个不同的事件数据集，将所提出的方法与仅基于文本特征和词嵌入的两个不同基线进行了比较。我们的实验表明，该框架支持不同事件类型的分类，在63.5%的情况下优于文本基线，在96.5%的情况下优于词嵌入基线。

{"title":"Improving the classification of events in tweets using semantic enrichment","authors":"Simone Aparecida Pinto Romero, Karin Becker","doi":"10.1145/3106426.3106435","DOIUrl":"https://doi.org/10.1145/3106426.3106435","url":null,"abstract":"Contextual enrichment using external sources has been proposed as a means to deal with the poor textual contents of tweets for event classification. Related work performs contextual enrichment according to specific assumptions about the events. Furthermore, enrichment adds a significant amount of extra features, most of them with no discriminative contribution to the event classification task. In this paper, we propose an enrichment framework targeted at the classification of events in general, of which the key elements are: a) external enrichment using related web pages for extending the conceptual features contained within the tweets; b) semantic enrichment using the DBpedia to add related semantic features, and c) a pruning technique that selects the semantic features with discriminative potential. We compared the proposed approach against two distinct baselines based on textual features only and word embeddings, using seven different event datasets. Our experiments reveal that the proposed framework supports the classification of distinct event types, outperforming the textual baseline in 63.5% of the cases, and the word embeddings baseline in 96.5% of the cases.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81887214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

An IoT approach for context-aware smart traffic management using ontology 使用本体实现上下文感知智能交通管理的物联网方法

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106499

Deepti Goel, S. Chaudhury, Hiranmay Ghosh

This paper exhibits a novel context-aware service framework for IoT based Smart Traffic Management using ontology to regulate smooth traffic flow in smart cities by analyzing real-time traffic environment. The proposed approach makes smarter use of transport networks to achieve objectives related to performance of transport system. This requires efficient traffic planning measures which relate to the actions designed to adjust the demand and capacity of the network in time and space by use of IoT technologies. The adoption of sensors and IoT devices in Smart Traffic System helps to capture the user's preferences and context information which can be in the form of travel time, weather conditions or real-life driving patterns. We have employed multimedia ontology to derive higher level descriptions of traffic conditions and vehicles from perceptual observation of traffic information which provides important grounds for our proposed IoT framework. The multimedia ontology encoded in Multimedia Web Ontology Language(MOWL) helps to define classes, properties, and structure of a possible traffic environment to provide insights across the transportation network. MOWL supports Dynamic Bayesian networks (DBN) to deal with time-series data and uncertainties linked with context observations which fits the definition of an intelligent IoT system. Thus, our proposed smart traffic framework aggregates information corresponding to traffic domain such as traffic videos captured using CCTV cameras and allows automatic prediction of dynamically changing situations which helps to make traffic authorities more responsive. We have illustrated use of our approach by utilizing contextual information, to assess real-time congestion situation on roads thus allowing to visualize planning services. Once the congestion situation is predicted, alternate congestion free routes which are in accordance with the coveted criteria are suggested that can be propagated through text-messages or e-mails to the users.

本文提出了一种新的基于物联网的智能交通管理服务框架，通过分析实时交通环境，利用本体来调节智慧城市的交通顺畅。提出的方法可以更智能地利用运输网络来实现与运输系统性能相关的目标。这需要有效的交通规划措施，这些措施涉及到通过使用物联网技术在时间和空间上调整网络需求和容量的行动。在智能交通系统中采用传感器和物联网设备有助于捕捉用户的偏好和上下文信息，这些信息可以以旅行时间、天气条件或现实驾驶模式的形式呈现。我们使用多媒体本体从对交通信息的感知观察中获得更高层次的交通状况和车辆描述，这为我们提出的物联网框架提供了重要依据。用多媒体Web本体语言(multimedia Web ontology Language, MOWL)编码的多媒体本体有助于定义可能的交通环境的类、属性和结构，从而提供跨交通网络的洞察。MOWL支持动态贝叶斯网络(DBN)来处理时间序列数据和与上下文观测相关的不确定性，符合智能物联网系统的定义。因此，我们提出的智能交通框架聚合了与交通领域相对应的信息，例如使用闭路电视摄像机捕获的交通视频，并允许自动预测动态变化的情况，这有助于交通管理部门做出更快的反应。我们通过使用上下文信息来说明我们的方法的使用，以评估道路上的实时拥堵情况，从而使规划服务可视化。一旦预测到拥堵情况，就会提出符合期望标准的备用无拥堵路线，并通过短信或电子邮件传播给用户。

{"title":"An IoT approach for context-aware smart traffic management using ontology","authors":"Deepti Goel, S. Chaudhury, Hiranmay Ghosh","doi":"10.1145/3106426.3106499","DOIUrl":"https://doi.org/10.1145/3106426.3106499","url":null,"abstract":"This paper exhibits a novel context-aware service framework for IoT based Smart Traffic Management using ontology to regulate smooth traffic flow in smart cities by analyzing real-time traffic environment. The proposed approach makes smarter use of transport networks to achieve objectives related to performance of transport system. This requires efficient traffic planning measures which relate to the actions designed to adjust the demand and capacity of the network in time and space by use of IoT technologies. The adoption of sensors and IoT devices in Smart Traffic System helps to capture the user's preferences and context information which can be in the form of travel time, weather conditions or real-life driving patterns. We have employed multimedia ontology to derive higher level descriptions of traffic conditions and vehicles from perceptual observation of traffic information which provides important grounds for our proposed IoT framework. The multimedia ontology encoded in Multimedia Web Ontology Language(MOWL) helps to define classes, properties, and structure of a possible traffic environment to provide insights across the transportation network. MOWL supports Dynamic Bayesian networks (DBN) to deal with time-series data and uncertainties linked with context observations which fits the definition of an intelligent IoT system. Thus, our proposed smart traffic framework aggregates information corresponding to traffic domain such as traffic videos captured using CCTV cameras and allows automatic prediction of dynamically changing situations which helps to make traffic authorities more responsive. We have illustrated use of our approach by utilizing contextual information, to assess real-time congestion situation on roads thus allowing to visualize planning services. Once the congestion situation is predicted, alternate congestion free routes which are in accordance with the coveted criteria are suggested that can be propagated through text-messages or e-mails to the users.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"173 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79565614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Mining ordinal data under human response uncertainty 人类响应不确定性下的有序数据挖掘

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106448

Sergej Sizov

Analysis and interpretation of collective feedback on ordinal scales is an important issue for several disciplines, including social sciences, recommender systems research, marketing, political science, and many others. A "reasonable" model is expected to provide an "explanation" of collective user behaviour. Many existing data mining approaches employ for this purpose probabilistic models, based on distributions and mixtures from a certain parametric family. In real life, users meet their decisions with considerable uncertainty. Its assessment and use in probabilistic models for better interpretation of collective feedback is the key concern of this paper. In doing so, we introduce approaches for gathering individual uncertainty, and discuss their viability and limitations. Consequently, we enrich state of the art response mining models (especially focused on discovery of latent user groups) with uncertainty knowledge, and demonstrate resulting advantages in systematic experiments with real users.

对于社会科学、推荐系统研究、市场营销、政治学等许多学科来说，分析和解释有序尺度上的集体反馈是一个重要问题。一个“合理的”模型有望为用户的集体行为提供一个“解释”。许多现有的数据挖掘方法为此目的采用基于某个参数族的分布和混合的概率模型。在现实生活中，用户在做出决定时存在相当大的不确定性。它的评估和使用的概率模型，以更好地解释集体反馈是本文的重点关注。在此过程中，我们介绍了收集个体不确定性的方法，并讨论了它们的可行性和局限性。因此，我们用不确定性知识丰富了最先进的响应挖掘模型(特别是关注潜在用户群体的发现)，并在与真实用户的系统实验中展示了由此产生的优势。

引用次数: 2

Adaptive training instance selection for cross-domain emotion identification 跨领域情感识别的自适应训练实例选择

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106457

Wenbo Wang, Lu Chen, Keke Chen, K. Thirunarayan, A. Sheth

This paper exploits a large number of self-labeled emotion tweets as the training data from the source domain to improve emotion identification in target domains (i.e., blogs and fairy tales), where there is a short supply of labeled data. Due to the noisy and ambiguous nature of self-labeled emotion training data, the existing domain adaptation methods that typically depend on high-quality labeled source-domain data do not work satisfactorily. This paper describes an adaptive source-domain training instance selection method to address the problem of noisy source-domain training data. The proposed approach can effectively identify the most informative training examples based on three carefully designed measures: consistency, diversity, and similarity. It uses an iterative method that consists of the following steps in each iteration: selecting informative samples from the source domain with the informativeness measures, merging with the target-domain training data, evaluating the performance of learned classifier for the target domain, and updating the informativeness measures for the next iteration. It stops until no new training instance is selected or in a designated number of iterations. Experiments show that our approach performs effectively for cross-domain emotion identification and consistently outperforms baseline approaches across four domains.

本文利用大量自标记的情感推文作为源域的训练数据，改进了标记数据不足的目标域(即博客和童话)的情感识别。由于自标记情感训练数据具有噪声和模糊性，现有的基于高质量标记源域数据的领域自适应方法效果不理想。针对源域训练数据中存在噪声的问题，提出了一种自适应源域训练实例选择方法。所提出的方法可以基于三个精心设计的度量:一致性、多样性和相似性，有效地识别信息最多的训练样例。它采用迭代方法，在每次迭代中包括以下步骤:从具有信息度量的源域中选择信息样本，与目标域训练数据合并，评估学习到的分类器在目标域的性能，更新下一次迭代的信息度量。它会停止，直到没有新的训练实例被选择或在指定的迭代次数。实验表明，我们的方法在跨领域情感识别方面表现有效，并且在四个领域中始终优于基线方法。

{"title":"Adaptive training instance selection for cross-domain emotion identification","authors":"Wenbo Wang, Lu Chen, Keke Chen, K. Thirunarayan, A. Sheth","doi":"10.1145/3106426.3106457","DOIUrl":"https://doi.org/10.1145/3106426.3106457","url":null,"abstract":"This paper exploits a large number of self-labeled emotion tweets as the training data from the source domain to improve emotion identification in target domains (i.e., blogs and fairy tales), where there is a short supply of labeled data. Due to the noisy and ambiguous nature of self-labeled emotion training data, the existing domain adaptation methods that typically depend on high-quality labeled source-domain data do not work satisfactorily. This paper describes an adaptive source-domain training instance selection method to address the problem of noisy source-domain training data. The proposed approach can effectively identify the most informative training examples based on three carefully designed measures: consistency, diversity, and similarity. It uses an iterative method that consists of the following steps in each iteration: selecting informative samples from the source domain with the informativeness measures, merging with the target-domain training data, evaluating the performance of learned classifier for the target domain, and updating the informativeness measures for the next iteration. It stops until no new training instance is selected or in a designated number of iterations. Experiments show that our approach performs effectively for cross-domain emotion identification and consistently outperforms baseline approaches across four domains.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"294 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77050350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

A proactive event-driven approach for dynamic QoS compliance in cloud of things 在物联网中实现动态QoS遵从的主动事件驱动方法

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3109431

Falak Nawaz, O. Hussain, N. Janjua, Elizabeth Chang

Cloud-of-things service providers use various descriptions languages to describe Quality of Service (QoS) attributes. However, existing modelling approaches provide support for modelling static QoS attributes only and lack features to model and reason with dynamic QoS attributes such as response time and availability. This paper presents an event-based approach for monitoring dynamic QoS values and their compliance by modelling the behavior of QoS attributes using an Event Calculus (EC) based framework. The logic based reasoning is then performed to proactively identify the possible QoS violations in future.

物云服务提供商使用各种描述语言来描述服务质量(QoS)属性。然而，现有的建模方法只支持对静态QoS属性进行建模，缺乏对响应时间和可用性等动态QoS属性进行建模和推理的功能。本文提出了一种基于事件的方法，通过使用基于事件演算(Event Calculus, EC)的框架对QoS属性的行为进行建模，从而监测动态QoS值及其遵从性。然后执行基于逻辑的推理，以主动识别将来可能的QoS违规。

引用次数: 7

Consensus-based ranking of wikipedia topics 基于共识的维基百科主题排名

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106529

Waleed Nema, Yinshan Tang

To improve the effectiveness of users' information seeking experience in interactive web search we hypothesize how people might be influenced when making relevance judgment decisions by introducing the Consensus Theory & Relevance Judgment Model (CT&M). This is combined with a practical path to assess the extent of difference between suggestions of current search engines versus user expectations. A user-centered, evidence-based, phenomenology approach is used to improve on Google PageRank (GPR) in two ways. The first by biasing GPR's equal navigation probability assumption using (f)actual usage stats as implicit user consensus which leads to the StatsRank (SR) algorithm. Secondly, we aggregate users' explicit ranking to derive Consensus Rank (CR) which is shown to predict individual user ranking significantly better than GPR and meta-search of modern search engines Google and Yahoo/Bing real-time. CT&M contextualizes CR, SR, and a live open online web experiment, called The Ranking Game, which is based on the August-2016 English Wikipedia corpus (12.7 million pages) and Page View Statistics for May to July 2016. Limiting this work to Wikipedia makes GPR topic-based since any Wikipedia page is focused on one topic. TREC's pooling is used to merge top 20 results from major search engines and present an alphabetized list for users' explicit ranking via drag and drop. The same platform captures implicit data for future research and can be used for controlled experiments. Our contributions are: CT&M, SR, CR, and the open online user feedback web experiment research platform.

为了提高交互式网络搜索中用户信息寻求体验的有效性，我们通过引入共识理论和关联判断模型(CT&M)来假设人们在做出关联判断决策时可能受到的影响。这与实际路径相结合，以评估当前搜索引擎的建议与用户期望之间的差异程度。以用户为中心，以证据为基础的现象学方法用于从两个方面提高Google PageRank (GPR)。第一个是通过使用(f)实际使用统计作为隐含用户共识来偏倚GPR的相等导航概率假设，从而导致StatsRank (SR)算法。其次，我们汇总用户的显式排名，得出共识排名(Consensus Rank, CR)，该排名预测个人用户排名的效果明显优于GPR和现代搜索引擎谷歌和雅虎/必应的实时元搜索。CT&M将CR、SR和一个名为“排名游戏”(The Ranking Game)的实时开放网络实验结合起来，该实验基于2016年8月至2016年8月的英文维基百科语料库(1270万页)和2016年5月至7月的页面浏览量统计数据。将这项工作限制在维基百科使GPR基于主题，因为任何维基百科页面都专注于一个主题。TREC的池用于合并来自主要搜索引擎的前20个结果，并通过拖放显示按字母顺序排列的列表，以便用户明确排名。同样的平台为未来的研究捕获隐含数据，并可用于控制实验。我们的贡献是:CT&M, SR, CR和开放的在线用户反馈网络实验研究平台。

{"title":"Consensus-based ranking of wikipedia topics","authors":"Waleed Nema, Yinshan Tang","doi":"10.1145/3106426.3106529","DOIUrl":"https://doi.org/10.1145/3106426.3106529","url":null,"abstract":"To improve the effectiveness of users' information seeking experience in interactive web search we hypothesize how people might be influenced when making relevance judgment decisions by introducing the Consensus Theory & Relevance Judgment Model (CT&M). This is combined with a practical path to assess the extent of difference between suggestions of current search engines versus user expectations. A user-centered, evidence-based, phenomenology approach is used to improve on Google PageRank (GPR) in two ways. The first by biasing GPR's equal navigation probability assumption using (f)actual usage stats as implicit user consensus which leads to the StatsRank (SR) algorithm. Secondly, we aggregate users' explicit ranking to derive Consensus Rank (CR) which is shown to predict individual user ranking significantly better than GPR and meta-search of modern search engines Google and Yahoo/Bing real-time. CT&M contextualizes CR, SR, and a live open online web experiment, called The Ranking Game, which is based on the August-2016 English Wikipedia corpus (12.7 million pages) and Page View Statistics for May to July 2016. Limiting this work to Wikipedia makes GPR topic-based since any Wikipedia page is focused on one topic. TREC's pooling is used to merge top 20 results from major search engines and present an alphabetized list for users' explicit ranking via drag and drop. The same platform captures implicit data for future research and can be used for controlled experiments. Our contributions are: CT&M, SR, CR, and the open online user feedback web experiment research platform.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75260427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Conjoint utilization of structured and unstructured information for planning interleaving deliberation in supply chains 供应链规划交错审议中结构化与非结构化信息的联合利用

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106545

N. Janjua, O. Hussain, Elizabeth Chang, S. M. Islam

Effective business planning requires seamless access and intelligent analysis of information in its totality to allow the business planner to gain enhanced critical business insights for decision support. Current business planning tools provide insights from structured business data (i.e. sales forecasts, customers and products data, inventory details) only and fail to take into account unstructured complementary information residing in contracts, reports, user's comments, emails etc. In this article, a planning support system is designed and developed that empower business planners to develop and revise business plans utilizing both structured data and unstructured information conjointly. This planning system activity model comprises of two steps. Firstly, a business planner develops a candidate plan using planning template. Secondly, the candidate plan is put forward to collaborating partners for its revision interleaving deliberation. Planning interleaving deliberation activity in the proposed framework enables collaborating planners to challenge both a decision and the thinking that underpins the decision in the candidate plan. The planning system is modeled using situation calculus and is validated through a prototype development.

有效的业务规划需要对信息进行无缝访问和智能分析，以使业务规划人员能够获得增强的关键业务洞察力，从而支持决策。目前的业务规划工具只提供结构化业务数据(即销售预测、客户和产品数据、库存细节)的见解，而没有考虑到驻留在合同、报告、用户评论、电子邮件等中的非结构化补充信息。在本文中，设计并开发了一个计划支持系统，使业务计划人员能够同时利用结构化数据和非结构化信息来开发和修改业务计划。这个计划系统活动模型包括两个步骤。首先，商业策划人员使用计划模板制定候选计划。其次，将候选方案提交合作伙伴进行修改审议;拟议框架中的规划交叉审议活动使协作规划者能够挑战候选计划中支持决策的决策和思维。利用情景演算对规划系统进行了建模，并通过原型开发进行了验证。

{"title":"Conjoint utilization of structured and unstructured information for planning interleaving deliberation in supply chains","authors":"N. Janjua, O. Hussain, Elizabeth Chang, S. M. Islam","doi":"10.1145/3106426.3106545","DOIUrl":"https://doi.org/10.1145/3106426.3106545","url":null,"abstract":"Effective business planning requires seamless access and intelligent analysis of information in its totality to allow the business planner to gain enhanced critical business insights for decision support. Current business planning tools provide insights from structured business data (i.e. sales forecasts, customers and products data, inventory details) only and fail to take into account unstructured complementary information residing in contracts, reports, user's comments, emails etc. In this article, a planning support system is designed and developed that empower business planners to develop and revise business plans utilizing both structured data and unstructured information conjointly. This planning system activity model comprises of two steps. Firstly, a business planner develops a candidate plan using planning template. Secondly, the candidate plan is put forward to collaborating partners for its revision interleaving deliberation. Planning interleaving deliberation activity in the proposed framework enables collaborating planners to challenge both a decision and the thinking that underpins the decision in the candidate plan. The planning system is modeled using situation calculus and is validated through a prototype development.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75556976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Fusing domain-specific data with general data for in-domain applications 将特定于领域的数据与域内应用程序的通用数据融合

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106473

An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen

This paper analyzes the lexical semantics of domain-specific terms based on various pre-trained specific domain and general domain word vectors, and addresses the semantic drift between domains. To capture lexical semantics in the specific domain, we propose a bridge mechanism to introduce domain-specific data into general data, and re-train word vectors. We find that even a small-scale fusion can result in the similar lexical semantics learned by using the large-scale domain-specific dataset. Experiments on sentiment analysis and outlier detection show that application of word embedding by the fusion dataset has the better performance than applications of word embeddings by pure large domain-specific and pure large general datasets. The simple, but effective methodology facilitates the domain adaptation of distributed word representations.

本文基于各种预训练的特定领域和一般领域词向量，分析了领域特定术语的词汇语义，并解决了领域之间的语义漂移问题。为了捕获特定领域的词汇语义，我们提出了一种桥接机制，将特定领域的数据引入到一般数据中，并重新训练词向量。我们发现，即使是小规模的融合也能产生与使用大规模特定领域数据集学习到的相似的词汇语义。情感分析和离群点检测实验表明，融合数据集的词嵌入应用比纯大型特定领域和纯大型通用数据集的词嵌入应用具有更好的性能。这种简单而有效的方法促进了分布式词表示的领域适应。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀