首页 > 最新文献

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics最新文献

英文 中文
Crowdsourcing worker development based on probabilistic task network 基于概率任务网络的众包工人开发
Masayuki Ashikawa, Takahiro Kawamura, Akihiko Ohsuga
Crowdsourcing platforms provide an attractive solution for processing numerous tasks at low cost. However, insufficient quality control remains a major concern. In the present study, we propose a grade-based training method for workers. Our training method utilizes probabilistic networks to estimate correlations between tasks based on workers' records for 18.5 million tasks and then allocates pre-learning tasks to the workers to raise the accuracy of target tasks according to the task correlations. In an experiment, the method automatically allocated 31 pre-learning task categories for 9 target task categories, and after the training of the pre-learning tasks, we confirmed that the accuracy of the target tasks was raised by 7.8 points on average. We thus confirmed that the task correlations can be estimated using a large amount of worker records, and that these are useful for the grade-based training of low-quality workers.
众包平台为低成本处理大量任务提供了一个有吸引力的解决方案。然而,质量控制不足仍然是一个主要问题。在本研究中,我们提出了一种基于等级的工人培训方法。我们的训练方法基于1850万个任务的工人记录,利用概率网络估计任务之间的相关性,然后根据任务相关性分配预学习任务给工人,以提高目标任务的准确性。在实验中,该方法为9个目标任务类别自动分配了31个预学习任务类别,经过预学习任务的训练,我们确认目标任务的准确率平均提高了7.8分。因此,我们证实了任务相关性可以使用大量的工人记录来估计,并且这些对于基于等级的低质量工人培训是有用的。
{"title":"Crowdsourcing worker development based on probabilistic task network","authors":"Masayuki Ashikawa, Takahiro Kawamura, Akihiko Ohsuga","doi":"10.1145/3106426.3106501","DOIUrl":"https://doi.org/10.1145/3106426.3106501","url":null,"abstract":"Crowdsourcing platforms provide an attractive solution for processing numerous tasks at low cost. However, insufficient quality control remains a major concern. In the present study, we propose a grade-based training method for workers. Our training method utilizes probabilistic networks to estimate correlations between tasks based on workers' records for 18.5 million tasks and then allocates pre-learning tasks to the workers to raise the accuracy of target tasks according to the task correlations. In an experiment, the method automatically allocated 31 pre-learning task categories for 9 target task categories, and after the training of the pre-learning tasks, we confirmed that the accuracy of the target tasks was raised by 7.8 points on average. We thus confirmed that the task correlations can be estimated using a large amount of worker records, and that these are useful for the grade-based training of low-quality workers.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90861267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Large-scale taxonomy induction using entity and word embeddings 使用实体和词嵌入的大规模分类归纳
Petar Ristoski, Stefano Faralli, Simone Paolo Ponzetto, Heiko Paulheim
Taxonomies are an important ingredient of knowledge organization, and serve as a backbone for more sophisticated knowledge representations in intelligent systems, such as formal ontologies. However, building taxonomies manually is a costly endeavor, and hence, automatic methods for taxonomy induction are a good alternative to build large-scale taxonomies. In this paper, we propose TIEmb, an approach for automatic unsupervised class subsumption axiom extraction from knowledge bases using entity and text embeddings. We apply the approach on the WebIsA database, a database of subsumption relations extracted from the large portion of the World Wide Web, to extract class hierarchies in the Person and Place domain.
分类法是知识组织的重要组成部分,是智能系统(如形式本体)中更复杂的知识表示的支柱。然而,手动构建分类法是一项代价高昂的工作,因此,用于分类法归纳的自动方法是构建大规模分类法的一个很好的替代方法。在本文中,我们提出了TIEmb,一种利用实体嵌入和文本嵌入从知识库中自动提取无监督类包含公理的方法。我们将该方法应用于WebIsA数据库(从万维网的大部分内容中提取的包含关系数据库),以提取Person和Place域中的类层次结构。
{"title":"Large-scale taxonomy induction using entity and word embeddings","authors":"Petar Ristoski, Stefano Faralli, Simone Paolo Ponzetto, Heiko Paulheim","doi":"10.1145/3106426.3106465","DOIUrl":"https://doi.org/10.1145/3106426.3106465","url":null,"abstract":"Taxonomies are an important ingredient of knowledge organization, and serve as a backbone for more sophisticated knowledge representations in intelligent systems, such as formal ontologies. However, building taxonomies manually is a costly endeavor, and hence, automatic methods for taxonomy induction are a good alternative to build large-scale taxonomies. In this paper, we propose TIEmb, an approach for automatic unsupervised class subsumption axiom extraction from knowledge bases using entity and text embeddings. We apply the approach on the WebIsA database, a database of subsumption relations extracted from the large portion of the World Wide Web, to extract class hierarchies in the Person and Place domain.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88200365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Affective prediction by collaborative chains in movie recommendation 协同链在电影推荐中的情感预测
Yong Zheng
Recommender systems have been successfully applied to alleviate the information overload and assist user's decision makings. Emotional states have been demonstrated as effective factors in recommender systems. However, how to collect or predict a user's emotional state becomes one of the challenges to build affective recommender systems. In this paper, we explore and compare different solutions to predict emotions to be applied in the recommendation process. More specifically, we propose an approach named as collaborative chains. It predicts emotional states in a collaborative way and additionally takes correlations among emotions into consideration. Our experimental results based on a movie rating data demonstrate the effectiveness of affective prediction by collaborative chains in movie recommendations.
推荐系统在缓解信息过载和辅助用户决策方面得到了成功的应用。情绪状态已被证明是推荐系统中的有效因素。然而,如何收集或预测用户的情绪状态成为构建情感推荐系统的挑战之一。在本文中,我们探索和比较了不同的解决方案来预测情绪,并将其应用于推荐过程。更具体地说,我们提出了一种称为协作链的方法。它以协作的方式预测情绪状态,并考虑到情绪之间的相关性。基于电影评分数据的实验结果证明了协同链在电影推荐中情感预测的有效性。
{"title":"Affective prediction by collaborative chains in movie recommendation","authors":"Yong Zheng","doi":"10.1145/3106426.3106535","DOIUrl":"https://doi.org/10.1145/3106426.3106535","url":null,"abstract":"Recommender systems have been successfully applied to alleviate the information overload and assist user's decision makings. Emotional states have been demonstrated as effective factors in recommender systems. However, how to collect or predict a user's emotional state becomes one of the challenges to build affective recommender systems. In this paper, we explore and compare different solutions to predict emotions to be applied in the recommendation process. More specifically, we propose an approach named as collaborative chains. It predicts emotional states in a collaborative way and additionally takes correlations among emotions into consideration. Our experimental results based on a movie rating data demonstrate the effectiveness of affective prediction by collaborative chains in movie recommendations.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85703981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
LCHI: multiple, overlapping local communities LCHI:多个重叠的当地社区
Moeen Farasat, J. Scripps
Local community finding algorithms are helpful for finding communities around a seed node especially when the network is large and a global method is too slow. Most local methods find only a single community or are required to be run several times over different seed nodes to create multiple communities. In this paper, we present a new algorithm, LCHI that finds multiple, overlapping communities around a single node. Examples and analyses are presented support the effectiveness of LCHI.
局部社区查找算法有助于在种子节点周围查找社区,特别是当网络较大且全局方法太慢时。大多数本地方法只能找到一个社区,或者需要在不同的种子节点上运行多次才能创建多个社区。在本文中,我们提出了一种新的算法LCHI,它可以在单个节点周围找到多个重叠的社区。实例和分析证明了LCHI的有效性。
{"title":"LCHI: multiple, overlapping local communities","authors":"Moeen Farasat, J. Scripps","doi":"10.1145/3106426.3106438","DOIUrl":"https://doi.org/10.1145/3106426.3106438","url":null,"abstract":"Local community finding algorithms are helpful for finding communities around a seed node especially when the network is large and a global method is too slow. Most local methods find only a single community or are required to be run several times over different seed nodes to create multiple communities. In this paper, we present a new algorithm, LCHI that finds multiple, overlapping communities around a single node. Examples and analyses are presented support the effectiveness of LCHI.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81726953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context suggestion: empirical evaluations vs user studies 背景建议:经验评价vs用户研究
Yong Zheng
Recommender System has been successfully applied to assist user's decision making by providing a list of recommended items. Context-aware recommender system additionally incorporates contexts (such as time and location) into the system to improve the recommendation performance. The development of context-aware recommender systems brings a new opportunity - context suggestion which refers to the task of recommending appropriate contexts to the users to improve user experience. In this paper, we explore the question whether user's contextual ratings can be reused to produce context suggestions. We propose two evaluation mechanisms for context suggestion, and empirically compare direct context predictions and indirect context suggestions based on a movie data that was collected from user studies. The experimental results reveal that indirect context suggestion works better than the direct context prediction, and tensor factorization is the best approach to produce context suggestions in our movie data.
推荐系统已经成功地应用于通过提供推荐项目列表来帮助用户决策。上下文感知推荐系统还将上下文(如时间和地点)纳入到系统中,以提高推荐性能。上下文感知推荐系统的发展带来了一个新的机遇——上下文建议,即向用户推荐合适的上下文以改善用户体验的任务。在本文中,我们探讨了用户的上下文评分是否可以被重用来产生上下文建议的问题。我们提出了两种情境建议的评估机制,并基于从用户研究中收集的电影数据对直接情境预测和间接情境建议进行了实证比较。实验结果表明,间接上下文建议比直接上下文预测效果更好,张量分解是我们的电影数据中生成上下文建议的最佳方法。
{"title":"Context suggestion: empirical evaluations vs user studies","authors":"Yong Zheng","doi":"10.1145/3106426.3106466","DOIUrl":"https://doi.org/10.1145/3106426.3106466","url":null,"abstract":"Recommender System has been successfully applied to assist user's decision making by providing a list of recommended items. Context-aware recommender system additionally incorporates contexts (such as time and location) into the system to improve the recommendation performance. The development of context-aware recommender systems brings a new opportunity - context suggestion which refers to the task of recommending appropriate contexts to the users to improve user experience. In this paper, we explore the question whether user's contextual ratings can be reused to produce context suggestions. We propose two evaluation mechanisms for context suggestion, and empirically compare direct context predictions and indirect context suggestions based on a movie data that was collected from user studies. The experimental results reveal that indirect context suggestion works better than the direct context prediction, and tensor factorization is the best approach to produce context suggestions in our movie data.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82718068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Large-scale readability analysis of privacy policies 隐私政策的大规模可读性分析
Benjamin Fabian, Tatiana Ermakova, Tino Lentz
Online privacy policies notify users of a Website how their personal information is collected, processed and stored. Against the background of rising privacy concerns, privacy policies seem to represent an influential instrument for increasing customer trust and loyalty. However, in practice, consumers seem to actually read privacy policies only in rare cases, possibly reflecting the common assumption stating that policies are hard to comprehend. By designing and implementing an automated extraction and readability analysis toolset that embodies a diversity of established readability measures, we present the first large-scale study that provides current empirical evidence on the readability of nearly 50,000 privacy policies of popular English-speaking Websites. The results empirically confirm that on average, current privacy policies are still hard to read. Furthermore, this study presents new theoretical insights for readability research, in particular, to what extent practical readability measures are correlated. Specifically, it shows the redundancy of several well-established readability metrics such as SMOG, RIX, LIX, GFI, FKG, ARI, and FRES, thus easing future choice making processes and comparisons between readability studies, as well as calling for research towards a readability measures framework. Moreover, a more sophisticated privacy policy extractor and analyzer as well as a solid policy text corpus for further research are provided.
在线隐私政策通知网站用户他们的个人信息是如何收集、处理和存储的。在日益关注隐私的背景下,隐私政策似乎是提高客户信任和忠诚度的一种有影响力的工具。然而,在实践中,消费者似乎只有在极少数情况下才会真正阅读隐私政策,这可能反映了一种普遍的假设,即政策很难理解。通过设计和实现一个包含多种已建立的可读性措施的自动提取和可读性分析工具集,我们提出了第一个大规模研究,该研究提供了近50,000个流行英语网站隐私政策可读性的当前经验证据。研究结果从经验上证实,平均而言,当前的隐私政策仍然难以阅读。此外,该研究为可读性研究提供了新的理论见解,特别是在实际可读性度量之间的关联程度。具体来说,它显示了几个已建立的可读性指标的冗余性,如SMOG、RIX、LIX、GFI、FKG、ARI和FRES,从而简化了未来的选择过程和可读性研究之间的比较,并呼吁对可读性测量框架进行研究。此外,本文还提供了一个更完善的隐私策略提取器和分析器,以及一个可靠的策略文本语料库,供进一步研究使用。
{"title":"Large-scale readability analysis of privacy policies","authors":"Benjamin Fabian, Tatiana Ermakova, Tino Lentz","doi":"10.1145/3106426.3106427","DOIUrl":"https://doi.org/10.1145/3106426.3106427","url":null,"abstract":"Online privacy policies notify users of a Website how their personal information is collected, processed and stored. Against the background of rising privacy concerns, privacy policies seem to represent an influential instrument for increasing customer trust and loyalty. However, in practice, consumers seem to actually read privacy policies only in rare cases, possibly reflecting the common assumption stating that policies are hard to comprehend. By designing and implementing an automated extraction and readability analysis toolset that embodies a diversity of established readability measures, we present the first large-scale study that provides current empirical evidence on the readability of nearly 50,000 privacy policies of popular English-speaking Websites. The results empirically confirm that on average, current privacy policies are still hard to read. Furthermore, this study presents new theoretical insights for readability research, in particular, to what extent practical readability measures are correlated. Specifically, it shows the redundancy of several well-established readability metrics such as SMOG, RIX, LIX, GFI, FKG, ARI, and FRES, thus easing future choice making processes and comparisons between readability studies, as well as calling for research towards a readability measures framework. Moreover, a more sophisticated privacy policy extractor and analyzer as well as a solid policy text corpus for further research are provided.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82807438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Entity oriented action recommendations for actionable knowledge graph generation 面向实体的可操作知识图谱生成的行动建议
Md. Mostafizur Rahman, A. Takasu
Popular search engines have recently utilized the power of knowledge graphs (KGs) to provide specific answers to queries in a direct way. Search engine result pages (SERPs) are expected to provide facts in response to queries that satisfy semantic meaning. This encourages researchers to propose more influential knowledge graph generation techniques. To achieve and advance the technologies related to actionable knowledge graph presentation, creating action recommendations (ARs) is an essential step and a relatively new research direction to nurture research on generating KGs that are optimized for facilitating an entity's actions. An action represents the physical or mental activity of an entity. For example, for the entity "Donald J. Trump", typical potential actions could be "won the US presidential election" or "targets US journalists". In this paper, we describe the generation of relevant action recommendations based on entity instance and entity type. We propose two models that employ different approaches. Our first model exploits semisupervised learning and we introduce entity context vector (ECV) as an entity's distinguishing features for capturing the context of entities to reveal the similarity between entities, grounded on the prominent word2vec model. The second model is a probabilistic approach based on the Naive Bayes Theorem. We extensively evaluate our proposed models. Our first model significantly outperforms probabilistic and supervised learning-based models.
流行的搜索引擎最近利用知识图(KGs)的力量,以直接的方式为查询提供特定的答案。期望搜索引擎结果页(serp)为满足语义的查询提供事实响应。这鼓励研究人员提出更有影响力的知识图谱生成技术。为了实现和推进与可操作的知识图谱表示相关的技术,创建行动建议(ARs)是一个必要的步骤,也是一个相对较新的研究方向,以促进生成优化的知识图谱,以促进实体的行动。动作代表一个实体的身体或精神活动。例如,对于实体“Donald J. Trump”,典型的潜在行动可能是“赢得美国总统大选”或“针对美国记者”。在本文中,我们描述了基于实体实例和实体类型的相关操作建议的生成。我们提出了采用不同方法的两个模型。我们的第一个模型利用了半监督学习,我们引入了实体上下文向量(ECV)作为实体的区分特征,用于捕获实体的上下文,以揭示实体之间的相似性,以著名的word2vec模型为基础。第二个模型是基于朴素贝叶斯定理的概率方法。我们广泛地评估我们提出的模型。我们的第一个模型明显优于基于概率和监督学习的模型。
{"title":"Entity oriented action recommendations for actionable knowledge graph generation","authors":"Md. Mostafizur Rahman, A. Takasu","doi":"10.1145/3106426.3106546","DOIUrl":"https://doi.org/10.1145/3106426.3106546","url":null,"abstract":"Popular search engines have recently utilized the power of knowledge graphs (KGs) to provide specific answers to queries in a direct way. Search engine result pages (SERPs) are expected to provide facts in response to queries that satisfy semantic meaning. This encourages researchers to propose more influential knowledge graph generation techniques. To achieve and advance the technologies related to actionable knowledge graph presentation, creating action recommendations (ARs) is an essential step and a relatively new research direction to nurture research on generating KGs that are optimized for facilitating an entity's actions. An action represents the physical or mental activity of an entity. For example, for the entity \"Donald J. Trump\", typical potential actions could be \"won the US presidential election\" or \"targets US journalists\". In this paper, we describe the generation of relevant action recommendations based on entity instance and entity type. We propose two models that employ different approaches. Our first model exploits semisupervised learning and we introduce entity context vector (ECV) as an entity's distinguishing features for capturing the context of entities to reveal the similarity between entities, grounded on the prominent word2vec model. The second model is a probabilistic approach based on the Naive Bayes Theorem. We extensively evaluate our proposed models. Our first model significantly outperforms probabilistic and supervised learning-based models.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80982480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Zero-shot human activity recognition via nonlinear compatibility based method 基于非线性兼容的零射击人体活动识别方法
Wei Wang, C. Miao, Shuji Hao
Human activity recognition aims to recognize human activities from sensor readings. Most of existing methods in this area can only recognize activities contained in training dataset. However, in practical applications, previously unseen activities are often encountered. In this paper, we propose a new zero-shot learning method to solve the problem of recognizing previously unseen activities. The proposed method learns a nonlinear compatibility function between feature space instances and semantic space prototypes. With this function, testing instances are classified to unseen activities with highest compatibility scores. To evaluate the effectiveness of the proposed method, we conduct extensive experiments on three public datasets. Experimental results show that our proposed method consistently outperforms state-of-the-art methods in human activity recognition problems.
人体活动识别的目的是通过传感器的读数来识别人体活动。该领域的现有方法大多只能识别训练数据集中包含的活动。然而,在实际应用中,经常会遇到以前看不见的活动。在本文中,我们提出了一种新的零射击学习方法来解决识别以前未见过的活动的问题。该方法学习了特征空间实例与语义空间原型之间的非线性兼容函数。使用此功能,测试实例被分类为具有最高兼容性分数的未见过的活动。为了评估所提出方法的有效性,我们在三个公共数据集上进行了广泛的实验。实验结果表明,我们提出的方法在人类活动识别问题上始终优于最先进的方法。
{"title":"Zero-shot human activity recognition via nonlinear compatibility based method","authors":"Wei Wang, C. Miao, Shuji Hao","doi":"10.1145/3106426.3106526","DOIUrl":"https://doi.org/10.1145/3106426.3106526","url":null,"abstract":"Human activity recognition aims to recognize human activities from sensor readings. Most of existing methods in this area can only recognize activities contained in training dataset. However, in practical applications, previously unseen activities are often encountered. In this paper, we propose a new zero-shot learning method to solve the problem of recognizing previously unseen activities. The proposed method learns a nonlinear compatibility function between feature space instances and semantic space prototypes. With this function, testing instances are classified to unseen activities with highest compatibility scores. To evaluate the effectiveness of the proposed method, we conduct extensive experiments on three public datasets. Experimental results show that our proposed method consistently outperforms state-of-the-art methods in human activity recognition problems.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78756399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
CEDAL: time-efficient detection of erroneous links in large-scale link repositories CEDAL:在大规模链接存储库中高效地检测错误链接
André Valdestilhas, Tommaso Soru, A. N. Ngomo
More than 500 million facts on the Linked Data Web are statements across knowledge bases. These links are of crucial importance for the Linked Data Web as they make a large number of tasks possible, including cross-ontology, question answering and federated queries. However, a large number of these links are erroneous and can thus lead to these applications producing absurd results. We present a time-efficient and complete approach for the detection of erroneous links for properties that are transitive. To this end, we make use of the semantics of URIs on the Data Web and combine it with an efficient graph partitioning algorithm. We then apply our algorithm to the LinkLion repository and show that we can analyze 19,200,114 links in 4.6 minutes. Our results show that at least 13% of the owl :sameAs links we considered are erroneous. In addition, our analysis of the provenance of links allows discovering agents and knowledge bases that commonly display poor linking. Our algorithm can be easily executed in parallel and on a GPU. We show that these implementations are up to two orders of magnitude faster than classical reasoners and a non-parallel implementation.
关联数据网上有超过5亿个事实是跨知识库的陈述。这些链接对于关联数据Web至关重要,因为它们使大量任务成为可能,包括跨本体、问答和联合查询。然而,大量这些链接是错误的,因此可能导致这些应用程序产生荒谬的结果。我们提出了一种省时和完整的方法来检测可传递属性的错误链接。为此,我们利用了Data Web上的uri语义,并将其与高效的图划分算法相结合。然后,我们将我们的算法应用到LinkLion存储库,并表明我们可以在4.6分钟内分析19,200,114个链接。我们的研究结果表明,至少有13%的猫头鹰:相同的链接是错误的。此外,我们对链接来源的分析允许发现通常显示不良链接的代理和知识库。我们的算法可以很容易地在GPU上并行执行。我们表明,这些实现比经典推理器和非并行实现快两个数量级。
{"title":"CEDAL: time-efficient detection of erroneous links in large-scale link repositories","authors":"André Valdestilhas, Tommaso Soru, A. N. Ngomo","doi":"10.1145/3106426.3106497","DOIUrl":"https://doi.org/10.1145/3106426.3106497","url":null,"abstract":"More than 500 million facts on the Linked Data Web are statements across knowledge bases. These links are of crucial importance for the Linked Data Web as they make a large number of tasks possible, including cross-ontology, question answering and federated queries. However, a large number of these links are erroneous and can thus lead to these applications producing absurd results. We present a time-efficient and complete approach for the detection of erroneous links for properties that are transitive. To this end, we make use of the semantics of URIs on the Data Web and combine it with an efficient graph partitioning algorithm. We then apply our algorithm to the LinkLion repository and show that we can analyze 19,200,114 links in 4.6 minutes. Our results show that at least 13% of the owl :sameAs links we considered are erroneous. In addition, our analysis of the provenance of links allows discovering agents and knowledge bases that commonly display poor linking. Our algorithm can be easily executed in parallel and on a GPU. We show that these implementations are up to two orders of magnitude faster than classical reasoners and a non-parallel implementation.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76784114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Inferring win-lose product network from user behavior 从用户行为推断产品的输赢网络
S. Iitsuka, Kazuya Kawakami, S. Hagiwara, T. Kawakami, Takayuki Hamada, Y. Matsuo
Various data mining techniques to extract product relations have been examined, especially in the context of building intelligent recommender systems. Most such techniques, however, specifically examine co-occurrences of browsed or purchased products on e-commerce websites, which provide little or no useful information related to the direct relation of superiority or the factor which forms that superiority. For marketers and product managers, understanding the competitive advantages of a given product is important to consolidate their product differentiation strategies. As described in this paper, we propose a win-lose relation, a new product relation analysis method that retrieves the superiority relation between competitive products in terms of product attractiveness. Our proposed method uses the difference between user browsing and purchasing behaviors, assuming that a purchased product is superior to products that are browsed but not purchased. We also propose superiority factor analysis to examine keywords that represent the superiority factor by mining product reviews. We evaluate our methods using an actual dataset from Zexy, the largest wedding portal website in Japan. Our experimental evaluation revealed that our proposed method can estimate actual user preferences observed from a user study using only log data. Results also show that our proposed method raises the accuracy of superiority factor extraction by around 17% by considering the win-lose relation of products.
各种提取产品关系的数据挖掘技术已经被研究,特别是在构建智能推荐系统的背景下。然而,大多数这样的技术专门检查电子商务网站上浏览或购买的产品的共同出现,这些网站提供很少或根本没有与优势的直接关系或形成优势的因素有关的有用信息。对于营销人员和产品经理来说,了解特定产品的竞争优势对于巩固他们的产品差异化策略非常重要。如本文所述,我们提出了一种新的产品关系分析方法——输赢关系,它从产品吸引力的角度来检索竞争产品之间的优势关系。我们提出的方法利用用户浏览和购买行为之间的差异,假设购买的产品优于浏览但未购买的产品。我们还提出了优势因子分析法,通过挖掘产品评论来检验代表优势因子的关键词。我们使用来自日本最大的婚礼门户网站Zexy的实际数据集来评估我们的方法。我们的实验评估表明,我们提出的方法可以仅使用日志数据从用户研究中观察到的实际用户偏好。结果还表明,通过考虑产品的输赢关系,我们提出的方法将优势因子提取的准确率提高了17%左右。
{"title":"Inferring win-lose product network from user behavior","authors":"S. Iitsuka, Kazuya Kawakami, S. Hagiwara, T. Kawakami, Takayuki Hamada, Y. Matsuo","doi":"10.1145/3106426.3106502","DOIUrl":"https://doi.org/10.1145/3106426.3106502","url":null,"abstract":"Various data mining techniques to extract product relations have been examined, especially in the context of building intelligent recommender systems. Most such techniques, however, specifically examine co-occurrences of browsed or purchased products on e-commerce websites, which provide little or no useful information related to the direct relation of superiority or the factor which forms that superiority. For marketers and product managers, understanding the competitive advantages of a given product is important to consolidate their product differentiation strategies. As described in this paper, we propose a win-lose relation, a new product relation analysis method that retrieves the superiority relation between competitive products in terms of product attractiveness. Our proposed method uses the difference between user browsing and purchasing behaviors, assuming that a purchased product is superior to products that are browsed but not purchased. We also propose superiority factor analysis to examine keywords that represent the superiority factor by mining product reviews. We evaluate our methods using an actual dataset from Zexy, the largest wedding portal website in Japan. Our experimental evaluation revealed that our proposed method can estimate actual user preferences observed from a user study using only log data. Results also show that our proposed method raises the accuracy of superiority factor extraction by around 17% by considering the win-lose relation of products.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76943515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1