Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics最新文献

英文中文

Paid review and paid writer detection 付费评论和付费作者检测

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106433

Man-Chun Ko, Hen-Hsen Huang, Hsin-Hsi Chen

There has been a surge in opinion-sharing in the public domain. Some opinions greatly influence our decisions, e.g., the choice of purchase. Malicious parties or individuals exploit social media by generating fake reviews for opinion manipulation. This paper aims to investigate the phenomenon of online paid restaurant reviews by bloggers. Our research provides an insight into some characteristics of paid reviews and their authors. We then explore a set of features based on our observations and detect paid reviews and paid bloggers using supervised machine learning techniques. Experimental results show the effectiveness of our approach.

公共领域的意见分享激增。有些意见会极大地影响我们的决定，例如，购买的选择。恶意团体或个人利用社交媒体制造虚假评论，操纵舆论。本文旨在调查博主在线付费餐厅评论现象。我们的研究为付费评论及其作者提供了一些特征。然后，我们根据我们的观察探索一组特征，并使用监督机器学习技术检测付费评论和付费博主。实验结果表明了该方法的有效性。

引用次数: 4

Enhancing crowd wisdom using measures of diversity computed from social media data 使用从社交媒体数据中计算的多样性措施来增强人群智慧

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106491

Shreyansh P. Bhatt, B. Minnery, Srikanth Nadella, B. Bullemer, V. Shalin, A. Sheth

"Wisdom of Crowds" (WoC) refers to a form of collective intelligence in which the aggregate judgment of a group of individuals is, in most instances, superior to that of any one group member. For a crowd to be wise, its members must possess diverse knowledge and viewpoints. Such diversity leads to uncorrelated judgment errors that cancel out in aggregate. Yet despite the fact that diversity is known to be an essential ingredient in WoC, little research aims to measure and exploit diversity in human social systems for the purpose of maximizing crowd intelligence. Here we quantify the diversity of a group of individuals through semantic analysis of their social media (Twitter) communications. Focusing on the domain of fantasy sports, we show that virtual crowds of fantasy team owners selected based on the diversity of their tweet content can outperform both non-diverse and randomly sampled crowds. Our results suggest a new approach for intelligent crowd assembly in which measures of diversity extracted from online social media communications can guide the selection of crowd members. These results have implications for numerous domains that utilize aggregated judgments - from consumer reviews, to econometrics, to geopolitical forecasting and intelligence analysis.

“群体智慧”(WoC)指的是一种集体智慧，在这种智慧中，在大多数情况下，一群人的总体判断优于任何一个群体成员的判断。一个群体要有智慧，它的成员必须拥有不同的知识和观点。这种多样性导致了不相关的判断错误，这些错误在总体上相互抵消。然而，尽管多样性被认为是WoC的重要组成部分，但很少有研究旨在衡量和利用人类社会系统中的多样性，以最大化群体智能。在这里，我们通过对社交媒体(Twitter)通信的语义分析来量化一组个体的多样性。聚焦于梦幻体育领域，我们证明了基于其tweet内容多样性选择的梦幻球队所有者的虚拟人群可以优于非多样性和随机抽样的人群。我们的研究结果提出了一种智能人群聚集的新方法，其中从在线社交媒体通信中提取的多样性措施可以指导人群成员的选择。这些结果对利用综合判断的许多领域都有影响——从消费者评论到计量经济学，再到地缘政治预测和情报分析。

{"title":"Enhancing crowd wisdom using measures of diversity computed from social media data","authors":"Shreyansh P. Bhatt, B. Minnery, Srikanth Nadella, B. Bullemer, V. Shalin, A. Sheth","doi":"10.1145/3106426.3106491","DOIUrl":"https://doi.org/10.1145/3106426.3106491","url":null,"abstract":"\"Wisdom of Crowds\" (WoC) refers to a form of collective intelligence in which the aggregate judgment of a group of individuals is, in most instances, superior to that of any one group member. For a crowd to be wise, its members must possess diverse knowledge and viewpoints. Such diversity leads to uncorrelated judgment errors that cancel out in aggregate. Yet despite the fact that diversity is known to be an essential ingredient in WoC, little research aims to measure and exploit diversity in human social systems for the purpose of maximizing crowd intelligence. Here we quantify the diversity of a group of individuals through semantic analysis of their social media (Twitter) communications. Focusing on the domain of fantasy sports, we show that virtual crowds of fantasy team owners selected based on the diversity of their tweet content can outperform both non-diverse and randomly sampled crowds. Our results suggest a new approach for intelligent crowd assembly in which measures of diversity extracted from online social media communications can guide the selection of crowd members. These results have implications for numerous domains that utilize aggregated judgments - from consumer reviews, to econometrics, to geopolitical forecasting and intelligence analysis.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88750037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Extracting attribute-value pairs from product specifications on the web 从web上的产品规格中提取属性值对

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106449

P. Petrovski, Christian Bizer

Comparison shopping portals integrate product offers from large numbers of e-shops in order to support consumers in their buying decisions. Product offers often consist of a title and a free-text product description, both describing product attributes that are considered relevant by the specific vendor. In addition, product offers might contain structured or semi-structured product specifications in the form of HTML tables and HTML lists. As product specifications often cover more product attributes than free-text descriptions, being able to extract attribute-value pairs from these specifications is a critical prerequisite for achieving good results in tasks such as product matching, product categorisation, faceted product search, and product recommendation. In this paper, we present an approach for extracting attribute-value pairs from product specifications on the Web. We use supervised learning to classify the HTML tables and HTML lists within a web page as product specification or not. In order to extract attribute-value pairs from the HTML fragments identified by the specification detector, we again use supervised learning to classify columns as attribute column or value column. Compared to DEXTER, the current state-of-the-art approach for extracting attribute-value pairs from product specifications, we introduce several new features for specification detection and support the extraction of attribute-value pairs from specifications having more than two columns. This allows us to improve the F-score up to 10% for extracting attribute-value pairs from tables and up to 3% for lists. In addition, we report the results of using duplicate-based schema matching to align the product attribute schemata of 32 different e-shops. This experiment confirms the suitability of duplicate-based schema matching for product data integration.

比较购物门户网站整合了大量电子商店提供的产品，以支持消费者的购买决策。产品报价通常由标题和自由文本产品描述组成，两者都描述了特定供应商认为相关的产品属性。此外，产品报价可能包含HTML表格和HTML列表形式的结构化或半结构化产品规范。由于产品规格说明通常比自由文本描述涵盖更多的产品属性，因此能够从这些规格说明中提取属性值对是在产品匹配、产品分类、分面产品搜索和产品推荐等任务中获得良好结果的关键先决条件。本文提出了一种从Web上的产品规格中提取属性值对的方法。我们使用监督学习将网页中的HTML表格和HTML列表分类为产品规范或非产品规范。为了从规范检测器识别的HTML片段中提取属性-值对，我们再次使用监督学习将列分类为属性列或值列。与目前用于从产品规格中提取属性值对的最先进的方法DEXTER相比，我们引入了几个用于规格检测的新特性，并支持从包含两列以上的规格中提取属性值对。这允许我们将从表中提取属性值对的F-score提高10%，从列表中提取属性值对的F-score提高3%。此外，我们报告了使用基于重复的模式匹配来对齐32个不同电子商店的产品属性模式的结果。实验验证了基于副本的模式匹配在产品数据集成中的适用性。

{"title":"Extracting attribute-value pairs from product specifications on the web","authors":"P. Petrovski, Christian Bizer","doi":"10.1145/3106426.3106449","DOIUrl":"https://doi.org/10.1145/3106426.3106449","url":null,"abstract":"Comparison shopping portals integrate product offers from large numbers of e-shops in order to support consumers in their buying decisions. Product offers often consist of a title and a free-text product description, both describing product attributes that are considered relevant by the specific vendor. In addition, product offers might contain structured or semi-structured product specifications in the form of HTML tables and HTML lists. As product specifications often cover more product attributes than free-text descriptions, being able to extract attribute-value pairs from these specifications is a critical prerequisite for achieving good results in tasks such as product matching, product categorisation, faceted product search, and product recommendation. In this paper, we present an approach for extracting attribute-value pairs from product specifications on the Web. We use supervised learning to classify the HTML tables and HTML lists within a web page as product specification or not. In order to extract attribute-value pairs from the HTML fragments identified by the specification detector, we again use supervised learning to classify columns as attribute column or value column. Compared to DEXTER, the current state-of-the-art approach for extracting attribute-value pairs from product specifications, we introduce several new features for specification detection and support the extraction of attribute-value pairs from specifications having more than two columns. This allows us to improve the F-score up to 10% for extracting attribute-value pairs from tables and up to 3% for lists. In addition, we report the results of using duplicate-based schema matching to align the product attribute schemata of 32 different e-shops. This experiment confirms the suitability of duplicate-based schema matching for product data integration.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"406 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84868417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 22

Social analytics framework for intelligent information systems based on a complex adaptive systems approach 基于复杂自适应系统方法的智能信息系统社会分析框架

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3109430

Somayeh Koohborfardhaghighi, J. Altmann, K. Tserpes

An employee profile record within a human resource management department includes information about the employee's past activities within the enterprise. These profile records are valuable sources of information for any enterprise. Using this information requires an intelligent enterprise information system. In this study, we emphasize the importance of having detailed analyses on the employees' knowledge base within an enterprise by applying dynamic social impact theory. We argue that the richer the knowledge base within an enterprise with respect to its human and social capital is, the more it can empower its employees to be creative and innovative during group works. We propose a framework for effectively modeling the ever-changing knowledge bases of big enterprises for delivering optimal and automated team composition techniques. Our discussions cover the complete pipeline from data management and knowledge modeling, via graph analysis, to decision support services.

人力资源管理部门的员工档案记录包含有关该员工过去在企业内活动的信息。这些概要文件记录对于任何企业来说都是有价值的信息来源。利用这些信息需要一个智能的企业信息系统。在本研究中，我们强调运用动态社会影响理论对企业内部员工知识库进行详细分析的重要性。我们认为，企业在人力资本和社会资本方面的知识基础越丰富，它就越能使员工在团队工作中发挥创造性和创新性。我们提出了一个框架，用于有效地为大企业不断变化的知识库建模，以提供最佳和自动化的团队组成技术。我们的讨论涵盖了从数据管理和知识建模，通过图形分析，到决策支持服务的完整管道。

引用次数: 1

Quantitative style analysis of Mo Yan and Zhang Wei's novels 莫言、张维小说的数量风格分析

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3109045

Yunxuan Li, Weiyun Ji, Dekuan Xu

Selecting 24 novels written by Mo Yan and Zhang Wei as corpus, This paper analyzed the stylistic features of Mo Yan and Zhang Wei's novels from the perspective of quantitative style. Features include the pauses in sentences, the relevance of context, the type/token ratio, the frequency of the word string, high-frequency words and text clustering. Through statistic analysis, it is found that Mo Yan and Zhang Wei's works have much in common, which are both very oral, creative and can use all kinds of linguistic materials. However, they are different from one another in the usage of sentence patterns and of words and in the attention of social life. Compared with Mo Yan's language features, Zhang Wei's is more changeable.

本文选取莫言和张伟的24部小说作为语料库，从数量文体的角度分析了莫言和张伟小说的文体特征。特征包括句子中的停顿、上下文的相关性、类型/标记比率、单词串的频率、高频单词和文本聚类。通过统计分析，我们发现莫言和张伟的作品有很多共同之处，都非常具有口语化、创造性，并且可以使用各种语言材料。然而，在句式、词语的使用以及对社会生活的关注等方面，两者都存在着差异。与莫言的语言特点相比，张伟的语言特点更加多变。

引用次数: 1

Matcher composition for identification of subsumption relations in ontology matching 本体匹配中包含关系识别的匹配器组合

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106503

A. Vennesland

Ontology matching is the process of identifying alignment between ontologies with the objective to facilitate interoperability and knowledge integration. A limitation of state of the art ontology matching systems is that the produced alignments usually only contain equivalence relations, while other relations, such as subsumption relations, are often neglected. Furthermore, an ontology matching system is normally composed of a set of differently tuned matching algorithms, but their appropriateness and order of execution typically require human judgement. The COMPOSE framework identifies subsumption relations automatically using a three-stage matcher composition process. These three processes are ontology analysis, matcher selection, and matcher combination. Within this framework we integrate existing techniques for all three processes with novel ones and evaluate the feasibility of the framework in an experiment involving six acknowledged ontologies.

本体匹配是识别本体之间是否对齐的过程，目的是促进互操作性和知识集成。现有本体匹配系统的一个局限是生成的对齐通常只包含等价关系，而其他关系，如包容关系，往往被忽略。此外，本体匹配系统通常由一组不同调优的匹配算法组成，但它们的适当性和执行顺序通常需要人工判断。COMPOSE框架使用一个分三阶段的匹配器组合过程自动识别包容关系。这三个过程分别是本体分析、匹配器选择和匹配器组合。在此框架内，我们将所有三个过程的现有技术与新技术相结合，并在涉及六个公认本体的实验中评估该框架的可行性。

引用次数: 7

Analysing the behaviour of online investors in times of geopolitical distress: a case study on war stocks 分析地缘政治危机时期网上投资者的行为:以战争股票为例

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106510

Pierpaolo Dondio, J. Usher

In this paper we analysed how the behavior of an online financial community changed in times of geopolitical crises. In particular, we studied the behaviour and communication patterns of online investors before and after a military geopolitical event. We selected a set of 23 key-events belonging to the 2003 US-led invasion of Iraq, the Arab Spring and the first period of the Ukraine crisis, and we restricted our study to a set of eight so called war stocks. We studied the resilience of the community to information shocks by comparing the community composition, its sentiment and users' communication networks before and after an event at different time intervals. We found how community reaction is governed by ordered patterns. Experimental evidence suggested how in the after-math of an event the community did not lose its information sharing functionality. Communication networks showed a higher in-degree Gini index, connectivity and a rich-club effect. Discussions developed around central users acting as hubs. These backbone users were present both before and after an event, their sentiment were less volatile than other users, and they were previously recognized as local experts of a specific stock. As a further evidence of community resilience, the equilibrium of all the indicators analysed were restored after two weeks.

在本文中，我们分析了网络金融社区的行为在地缘政治危机时期是如何变化的。特别是，我们研究了在线投资者在军事地缘政治事件前后的行为和沟通模式。我们选择了一组23个关键事件，分别属于2003年美国领导的入侵伊拉克、阿拉伯之春和乌克兰危机的第一阶段，我们将研究限制在一组8个所谓的战争库存中。我们通过比较不同时间间隔的社区构成、社区情绪和用户在事件前后的沟通网络，研究了社区对信息冲击的弹性。我们发现社区反应是如何受有序模式支配的。实验证据表明，在事件发生后，社区并没有失去其信息共享功能。通信网络表现出更高程度的基尼系数、连通性和富裕俱乐部效应。围绕充当集线器的中心用户展开了讨论。这些骨干用户在事件前后都在场，他们的情绪波动比其他用户小，并且他们之前被认为是特定股票的本地专家。作为社区恢复力的进一步证据，两周后，所有分析指标的平衡都恢复了。

{"title":"Analysing the behaviour of online investors in times of geopolitical distress: a case study on war stocks","authors":"Pierpaolo Dondio, J. Usher","doi":"10.1145/3106426.3106510","DOIUrl":"https://doi.org/10.1145/3106426.3106510","url":null,"abstract":"In this paper we analysed how the behavior of an online financial community changed in times of geopolitical crises. In particular, we studied the behaviour and communication patterns of online investors before and after a military geopolitical event. We selected a set of 23 key-events belonging to the 2003 US-led invasion of Iraq, the Arab Spring and the first period of the Ukraine crisis, and we restricted our study to a set of eight so called war stocks. We studied the resilience of the community to information shocks by comparing the community composition, its sentiment and users' communication networks before and after an event at different time intervals. We found how community reaction is governed by ordered patterns. Experimental evidence suggested how in the after-math of an event the community did not lose its information sharing functionality. Communication networks showed a higher in-degree Gini index, connectivity and a rich-club effect. Discussions developed around central users acting as hubs. These backbone users were present both before and after an event, their sentiment were less volatile than other users, and they were previously recognized as local experts of a specific stock. As a further evidence of community resilience, the equilibrium of all the indicators analysed were restored after two weeks.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"124 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91432665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Inferring your expertise from Twitter: combining multiple types of user activity 从Twitter推断你的专长:结合多种类型的用户活动

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106468

Yu Xu, Dong Zhou, S. Lawless

Understanding the expertise of users in social networking sites like Twitter is a key component for many applications such as user recommendation and talent seeking. A range of interactions between users on Twitter can provide important information that implicitly reflects a user's expertise. This paper proposes a learning model that tries to infer a user's topical expertise from Twitter using information such as tweets posted by the user and the characteristics of their followers. The model takes various types of user-related data from Twitter as input and considers their inference consistency in the process of learning. It aims to deliver accurate and effective inference results, even in cases where some types of data are missing for a user, e.g. the user has yet to post any tweets. The experiments reported in the paper were conducted on a large-scale Twitter dataset. Experimental results show that our model outperforms several baseline approaches and outperforms approaches which use only a single type of user data for inference.

了解Twitter等社交网站用户的专业知识是许多应用程序(如用户推荐和人才寻找)的关键组成部分。Twitter上用户之间的一系列互动可以提供隐含地反映用户专业知识的重要信息。本文提出了一个学习模型，该模型试图利用用户发布的推文及其追随者的特征等信息，从Twitter推断用户的主题专业知识。该模型将来自Twitter的各种类型的用户相关数据作为输入，并在学习过程中考虑其推理一致性。它旨在提供准确有效的推理结果，即使在某些类型的数据缺失的情况下，例如用户还没有发布任何tweet。论文中报道的实验是在一个大规模的Twitter数据集上进行的。实验结果表明，我们的模型优于几种基线方法，并且优于仅使用单一类型用户数据进行推理的方法。

引用次数: 4

Robust spread of cooperation by expectation-of-cooperation strategy with simple labeling method 基于简单标记法的合作期望策略的鲁棒合作扩展

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3106458

Tomoaki Otsuka, T. Sugawara

This paper proposes an interaction strategy called the extended expectation-of-cooperation (EEoC) that is intended to spread cooperative activities in prisoner's dilemma situations over an entire agent network. Recently developed computer and communications applications run on the network and interact with each other as delegates of the owners, so they often encounter social dilemma situations. To improve social efficiency, they are required to cooperate, but one-sided cooperation is meaningless and loses some payoff due to a rip-off by defecting agents. The concept of EEoC is that when agents encounter mutual cooperation, they continue to cooperate a few times with the desire to see the emergence of cooperative behavior in their neighbors. EEoC is easy to implement in computer systems. We experimentally show that EEoC can effectively spread cooperative activities in dilemma situations in complete, Erdös-Rényi, and regular networks. We also clarify the robustness against defecting agents and the limitation of the EEoC strategy.

本文提出了一种扩展合作期望(EEoC)的交互策略，旨在将囚徒困境下的合作活动扩展到整个代理网络中。最近开发的计算机和通信应用程序在网络上运行，并作为所有者的代表相互交互，因此经常遇到社会困境的情况。为了提高社会效率，他们需要合作，但单方面的合作是没有意义的，而且会因为叛逃的代理人的敲诈而失去一些回报。EEoC的概念是，当智能体遇到相互合作时，他们会继续合作几次，希望看到他们的邻居出现合作行为。EEoC在计算机系统中易于实现。实验表明，EEoC可以在完全网络、Erdös-Rényi网络和规则网络中有效地传播两难情境下的合作活动。我们还澄清了对叛变代理的鲁棒性和EEoC策略的局限性。

引用次数: 2

Data mining in IoT: data analysis for a new paradigm on the internet 物联网中的数据挖掘:互联网新范式的数据分析

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

Pub Date : 2017-08-23 DOI: 10.1145/3106426.3115866

Peter Wlodarczak, Mustafa A. Ally, J. Soar

This paper provides an overview on Data Mining (DM) technologies for the Internet of Things (IoT). IoT has become an active area of research, since IoT promises among other to improve quality of live and safety in Smart Cities, to make resource supply and waste management more efficient, and optimize traffic. DM is highly domain specific and depends on what is being mined for. For instance, if IoT is used to optimize traffic in a Smart City to reduce traffic jams and to find parking spaces quicker, different types of data needs to be collected and analysed from an eHealth solution, where IoT is used in a Smart Home to monitor the well being of patients or elderly people. IoT connects things that can collect numeric data from smart sensors, streaming data from cameras or route information on maps. Depending on the type of data, different techniques need to be adopted to analyse them. Also, many IoT applications analyse data from different devices and correlate them to make predictions about possible machine failures in production sites or looming emergency situations in Smart Buildings in a home security application. DM techniques need to handle the heterogeneity of IoT data, the large volumes of data and the speed at which they are produced. This paper explores the state of the art DM techniques for IoT.

本文概述了物联网(IoT)的数据挖掘(DM)技术。物联网已经成为一个活跃的研究领域，因为物联网有望改善智慧城市的生活质量和安全，提高资源供应和废物管理的效率，并优化交通。DM是高度特定于领域的，并且取决于所挖掘的内容。例如，如果使用物联网来优化智能城市的交通以减少交通拥堵并更快地找到停车位，则需要从电子健康解决方案收集和分析不同类型的数据，其中物联网用于智能家居以监测患者或老年人的健康状况。物联网连接的东西可以从智能传感器收集数字数据，从摄像头收集流数据或地图上的路线信息。根据数据的类型，需要采用不同的技术来分析它们。此外，许多物联网应用程序分析来自不同设备的数据，并将它们关联起来，以预测生产现场可能出现的机器故障或家庭安全应用中智能建筑中迫在眉睫的紧急情况。DM技术需要处理物联网数据的异构性、大量数据和生成速度。本文探讨了物联网DM技术的最新进展。

{"title":"Data mining in IoT: data analysis for a new paradigm on the internet","authors":"Peter Wlodarczak, Mustafa A. Ally, J. Soar","doi":"10.1145/3106426.3115866","DOIUrl":"https://doi.org/10.1145/3106426.3115866","url":null,"abstract":"This paper provides an overview on Data Mining (DM) technologies for the Internet of Things (IoT). IoT has become an active area of research, since IoT promises among other to improve quality of live and safety in Smart Cities, to make resource supply and waste management more efficient, and optimize traffic. DM is highly domain specific and depends on what is being mined for. For instance, if IoT is used to optimize traffic in a Smart City to reduce traffic jams and to find parking spaces quicker, different types of data needs to be collected and analysed from an eHealth solution, where IoT is used in a Smart Home to monitor the well being of patients or elderly people. IoT connects things that can collect numeric data from smart sensors, streaming data from cameras or route information on maps. Depending on the type of data, different techniques need to be adopted to analyse them. Also, many IoT applications analyse data from different devices and correlate them to make predictions about possible machine failures in production sites or looming emergency situations in Smart Buildings in a home security application. DM techniques need to handle the heterogeneity of IoT data, the large volumes of data and the speed at which they are produced. This paper explores the state of the art DM techniques for IoT.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85164134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀