首页 > 最新文献

The World Wide Web Conference最新文献

英文 中文
BotCamp: Bot-driven Interactions in Social Campaigns BotCamp:社交活动中机器人驱动的互动
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313420
Noor Abu-El-Rub, A. Mueen
Bots (i.e. automated accounts) involve in social campaigns typically for two obvious reasons: to inorganically sway public opinion and to build social capital exploiting the organic popularity of social campaigns. In the process, bots interact with each other and engage in human activities (e.g. likes, retweets, and following). In this work, we detect a large number of bots interested in politics. We perform multi-aspect (i.e. temporal, textual, and topographical) clustering of bots, and ensemble the clusters to identify campaigns of bots. We observe similarity among the bots in a campaign in various aspects such as temporal correlation, sentimental alignment, and topical grouping. However, we also discover bots compete in gaining attention from humans and occasionally engage in arguments. We classify such bot interactions in two primary groups: agreeing (i.e. positive) and disagreeing (i.e. negative) interactions and develop an automatic interaction classifier to discover novel interactions among bots participating in social campaigns.
机器人(即自动账户)参与社交活动通常有两个明显的原因:无机地影响公众舆论,利用社交活动的有机人气建立社会资本。在这个过程中,机器人彼此互动并参与人类活动(例如点赞、转发和关注)。在这项工作中,我们检测到大量对政治感兴趣的机器人。我们执行机器人的多方面(即时间,文本和地形)聚类,并集成聚类以识别机器人的活动。我们观察到活动中机器人之间在时间相关性、情感一致性和主题分组等各个方面的相似性。然而,我们也发现机器人在吸引人类注意力方面存在竞争,偶尔还会参与争论。我们将这种机器人交互分为两大类:同意(即积极)和不同意(即消极)交互,并开发了一个自动交互分类器来发现参与社交活动的机器人之间的新交互。
{"title":"BotCamp: Bot-driven Interactions in Social Campaigns","authors":"Noor Abu-El-Rub, A. Mueen","doi":"10.1145/3308558.3313420","DOIUrl":"https://doi.org/10.1145/3308558.3313420","url":null,"abstract":"Bots (i.e. automated accounts) involve in social campaigns typically for two obvious reasons: to inorganically sway public opinion and to build social capital exploiting the organic popularity of social campaigns. In the process, bots interact with each other and engage in human activities (e.g. likes, retweets, and following). In this work, we detect a large number of bots interested in politics. We perform multi-aspect (i.e. temporal, textual, and topographical) clustering of bots, and ensemble the clusters to identify campaigns of bots. We observe similarity among the bots in a campaign in various aspects such as temporal correlation, sentimental alignment, and topical grouping. However, we also discover bots compete in gaining attention from humans and occasionally engage in arguments. We classify such bot interactions in two primary groups: agreeing (i.e. positive) and disagreeing (i.e. negative) interactions and develop an automatic interaction classifier to discover novel interactions among bots participating in social campaigns.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87529042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Understanding the Evolution of Mobile App Ecosystems: A Longitudinal Measurement Study of Google Play 理解手机应用生态系统的演变:基于Google Play的纵向测量研究
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313611
Haoyu Wang, Hao Li, Yao Guo
The continuing expansion of mobile app ecosystems has attracted lots of efforts from the research community. However, although a large number of research studies have focused on analyzing the corpus of mobile apps and app markets, little is known at a comprehensive level on the evolution of mobile app ecosystems. Because the mobile app ecosystem is continuously evolving over time, understanding the dynamics of app ecosystems could provide unique insights that cannot be achieved through studying a single static snapshot. In this paper, we seek to shed light on the dynamics of mobile app ecosystems. Based on 5.3 million app records (with both app metadata and apks) collected from three snapshots of Google Play over more than three years, we conduct the first study on the evolution of app ecosystems from different aspects. Our results suggest that although the overall ecosystem shows promising progress in regard of app popularity, user ratings, permission usage and privacy policy declaration, there still exists a considerable number of unsolved issues including malicious apps, update issues, third-party tracking threats, improper app promotion behaviors, and spamming/malicious developers. Our study shows that understanding the evolution of mobile app ecosystems can help developers make better decision on developing and releasing apps, provide insights for app markets to identifying misbehaviors, and help mobile users to choose desired apps.
移动应用生态系统的持续扩展吸引了研究社区的大量努力。然而,尽管大量研究都集中在分析手机应用和应用市场的语料库上,但对手机应用生态系统的全面演变却知之甚少。由于移动应用生态系统是不断发展的,了解应用生态系统的动态可以提供独特的见解,这是通过研究单个静态快照无法实现的。在本文中,我们试图阐明移动应用生态系统的动态。基于三年多来从Google Play的三个快照中收集的530万个应用记录(包括应用元数据和apk),我们从不同的角度对应用生态系统的演变进行了首次研究。我们的研究结果表明,尽管整个生态系统在应用受欢迎程度、用户评分、权限使用和隐私政策声明方面取得了可喜的进展,但仍存在大量未解决的问题,包括恶意应用、更新问题、第三方跟踪威胁、不当应用推广行为以及垃圾邮件/恶意开发者。我们的研究表明,了解手机应用生态系统的演变可以帮助开发者在开发和发布应用时做出更好的决策,为应用市场识别不良行为提供洞见,并帮助手机用户选择自己想要的应用。
{"title":"Understanding the Evolution of Mobile App Ecosystems: A Longitudinal Measurement Study of Google Play","authors":"Haoyu Wang, Hao Li, Yao Guo","doi":"10.1145/3308558.3313611","DOIUrl":"https://doi.org/10.1145/3308558.3313611","url":null,"abstract":"The continuing expansion of mobile app ecosystems has attracted lots of efforts from the research community. However, although a large number of research studies have focused on analyzing the corpus of mobile apps and app markets, little is known at a comprehensive level on the evolution of mobile app ecosystems. Because the mobile app ecosystem is continuously evolving over time, understanding the dynamics of app ecosystems could provide unique insights that cannot be achieved through studying a single static snapshot. In this paper, we seek to shed light on the dynamics of mobile app ecosystems. Based on 5.3 million app records (with both app metadata and apks) collected from three snapshots of Google Play over more than three years, we conduct the first study on the evolution of app ecosystems from different aspects. Our results suggest that although the overall ecosystem shows promising progress in regard of app popularity, user ratings, permission usage and privacy policy declaration, there still exists a considerable number of unsolved issues including malicious apps, update issues, third-party tracking threats, improper app promotion behaviors, and spamming/malicious developers. Our study shows that understanding the evolution of mobile app ecosystems can help developers make better decision on developing and releasing apps, provide insights for app markets to identifying misbehaviors, and help mobile users to choose desired apps.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"77 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89010778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Variational Session-based Recommendation Using Normalizing Flows 使用规范化流的基于会话的可变推荐
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313615
Fan Zhou, Zijing Wen, Kunpeng Zhang, Goce Trajcevski, Ting Zhong
We present a novel generative Session-Based Recommendation (SBR) framework, called VAriational SEssion-based Recommendation (VASER) - a non-linear probabilistic methodology allowing Bayesian inference for flexible parameter estimation of sequential recommendations. Instead of directly applying extended Variational AutoEncoders (VAE) to SBR, the proposed method introduces normalizing flows to estimate the probabilistic posterior, which is more effective than the agnostic presumed prior approximation used in existing deep generative recommendation approaches. VASER explores soft attention mechanism to upweight the important clicks in a session. We empirically demonstrate that the proposed model significantly outperforms several state-of-the-art baselines, including the recently-proposed RNN/VAE-based approaches on real-world datasets.
我们提出了一种新的基于生成会话的推荐(SBR)框架,称为变分会话推荐(VASER) -一种非线性概率方法,允许贝叶斯推理对顺序推荐进行灵活的参数估计。该方法不是直接将扩展变分自编码器(VAE)应用于SBR,而是引入归一化流来估计概率后验,比现有深度生成推荐方法中使用的不可知论假设先验近似更有效。VASER探索软注意机制,以增加会话中重要点击的权重。我们的经验证明,所提出的模型显著优于几种最先进的基线,包括最近提出的基于RNN/ vae的方法在现实世界数据集上。
{"title":"Variational Session-based Recommendation Using Normalizing Flows","authors":"Fan Zhou, Zijing Wen, Kunpeng Zhang, Goce Trajcevski, Ting Zhong","doi":"10.1145/3308558.3313615","DOIUrl":"https://doi.org/10.1145/3308558.3313615","url":null,"abstract":"We present a novel generative Session-Based Recommendation (SBR) framework, called VAriational SEssion-based Recommendation (VASER) - a non-linear probabilistic methodology allowing Bayesian inference for flexible parameter estimation of sequential recommendations. Instead of directly applying extended Variational AutoEncoders (VAE) to SBR, the proposed method introduces normalizing flows to estimate the probabilistic posterior, which is more effective than the agnostic presumed prior approximation used in existing deep generative recommendation approaches. VASER explores soft attention mechanism to upweight the important clicks in a session. We empirically demonstrate that the proposed model significantly outperforms several state-of-the-art baselines, including the recently-proposed RNN/VAE-based approaches on real-world datasets.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"63 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84828258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Mobile App Risk Ranking via Exclusive Sparse Coding 基于排他性稀疏编码的手机应用风险排名
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313589
Deguang Kong, Lei Cen
To improve mobile application (App for short) user experience, it is very important to inform the users about the apps' privacy risk levels. To address the challenge of incorporating the heterogeneous feature indicators (such as app permissions, user review, developers' description and ads library) into the risk ranking model, we formalize the app risk ranking problem as an exclusive sparse coding optimization problem by taking advantage of features from different modalities via the maximization of the feature consistency and enhancement of feature diversity. We propose an efficient iterative re-weighted method to solve the resultant optimization problem, the convergence of which can be rigorously proved. The extensive experiments demonstrate the consistent performance improvement using the real-world mobile application datasets (totally 13786 apps, 37966 descriptions, 10557681 user reviews and 200 ad libraries).
为了改善移动应用程序(简称App)的用户体验,告知用户应用程序的隐私风险等级是非常重要的。为了解决将异构特征指标(如应用权限、用户评论、开发者描述和广告库)纳入风险排序模型的挑战,我们通过最大化特征一致性和增强特征多样性来利用不同模式的特征,将应用风险排序问题形式化为排他稀疏编码优化问题。我们提出了一种有效的迭代重加权方法来求解结果优化问题,并严格证明了该方法的收敛性。广泛的实验证明了使用实际移动应用程序数据集(总共13786个应用程序,37966个描述,10557681个用户评论和200个广告库)的一致性能改进。
{"title":"Mobile App Risk Ranking via Exclusive Sparse Coding","authors":"Deguang Kong, Lei Cen","doi":"10.1145/3308558.3313589","DOIUrl":"https://doi.org/10.1145/3308558.3313589","url":null,"abstract":"To improve mobile application (App for short) user experience, it is very important to inform the users about the apps' privacy risk levels. To address the challenge of incorporating the heterogeneous feature indicators (such as app permissions, user review, developers' description and ads library) into the risk ranking model, we formalize the app risk ranking problem as an exclusive sparse coding optimization problem by taking advantage of features from different modalities via the maximization of the feature consistency and enhancement of feature diversity. We propose an efficient iterative re-weighted method to solve the resultant optimization problem, the convergence of which can be rigorously proved. The extensive experiments demonstrate the consistent performance improvement using the real-world mobile application datasets (totally 13786 apps, 37966 descriptions, 10557681 user reviews and 200 ad libraries).","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"64 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79573366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fine-grained Type Inference in Knowledge Graphs via Probabilistic and Tensor Factorization Methods 基于概率和张量分解方法的知识图的细粒度类型推断
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313597
A. Moniruzzaman, R. Nayak, Maolin Tang, Thirunavukarasu Balasubramaniam
Knowledge Graphs (KGs) have been proven to be incredibly useful for enriching semantic Web search results and allowing queries with a well-defined result set. In recent years much attention has been given to the task of inferring missing facts based on existing facts in a KG. Approaches have also been proposed for inferring types of entities, however these are successful in common types such as 'Person', 'Movie', or 'Actor'. There is still a large gap, however, in the inference of fine-grained types which are highly important for exploring specific lists and collections within web search. Generally there are also relatively fewer observed instances of fine-grained types present to train in KGs, and this poses challenges for the development of effective approaches. In order to address the issue, this paper proposes a new approach to the fine-grained type inference problem. This new approach is explicitly modeled for leveraging domain knowledge and utilizing additional data outside KG, that improves performance in fine-grained type inference. Further improvements in efficiency are achieved by extending the model to probabilistic inference based on entity similarity and typed class classification. We conduct extensive experiments on type triple classification and entity prediction tasks on Freebase FB15K benchmark dataset. The experiment results show that the proposed model outperforms the state-of-the-art approaches for type inference in KG, and achieves high performance results in many-to-one relation in predicting tail for KG completion task.
知识图(Knowledge Graphs, KGs)已被证明在丰富语义Web搜索结果和允许使用定义良好的结果集进行查询方面非常有用。近年来,基于现有事实推断缺失事实的任务受到了广泛关注。还提出了推断实体类型的方法,但是这些方法在诸如“人物”、“电影”或“演员”等常见类型中是成功的。然而,在细粒度类型的推断方面仍然存在很大的差距,细粒度类型对于在web搜索中探索特定的列表和集合非常重要。一般来说,在kg中进行训练的细粒度类型的观察实例也相对较少,这对开发有效方法提出了挑战。为了解决这一问题,本文提出了一种新的方法来解决细粒度类型推理问题。这种新方法被显式建模,以利用领域知识和KG之外的其他数据,从而提高细粒度类型推断的性能。通过将模型扩展到基于实体相似性和类型化类分类的概率推理,进一步提高了效率。我们在Freebase FB15K基准数据集上对类型三重分类和实体预测任务进行了广泛的实验。实验结果表明,本文提出的模型优于现有的KG类型推理方法,在多对一关系下对KG完成任务的尾部预测取得了较好的结果。
{"title":"Fine-grained Type Inference in Knowledge Graphs via Probabilistic and Tensor Factorization Methods","authors":"A. Moniruzzaman, R. Nayak, Maolin Tang, Thirunavukarasu Balasubramaniam","doi":"10.1145/3308558.3313597","DOIUrl":"https://doi.org/10.1145/3308558.3313597","url":null,"abstract":"Knowledge Graphs (KGs) have been proven to be incredibly useful for enriching semantic Web search results and allowing queries with a well-defined result set. In recent years much attention has been given to the task of inferring missing facts based on existing facts in a KG. Approaches have also been proposed for inferring types of entities, however these are successful in common types such as 'Person', 'Movie', or 'Actor'. There is still a large gap, however, in the inference of fine-grained types which are highly important for exploring specific lists and collections within web search. Generally there are also relatively fewer observed instances of fine-grained types present to train in KGs, and this poses challenges for the development of effective approaches. In order to address the issue, this paper proposes a new approach to the fine-grained type inference problem. This new approach is explicitly modeled for leveraging domain knowledge and utilizing additional data outside KG, that improves performance in fine-grained type inference. Further improvements in efficiency are achieved by extending the model to probabilistic inference based on entity similarity and typed class classification. We conduct extensive experiments on type triple classification and entity prediction tasks on Freebase FB15K benchmark dataset. The experiment results show that the proposed model outperforms the state-of-the-art approaches for type inference in KG, and achieves high performance results in many-to-one relation in predicting tail for KG completion task.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"30 8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89482188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Learning How to Correct a Knowledge Base from the Edit History 学习如何从编辑历史中更正知识库
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313584
Thomas Pellissier Tanon, Camille Bourgaux, Fabian M. Suchanek
The curation of a knowledge base is a crucial but costly task. In this work, we propose to take advantage of the edit history of the knowledge base in order to learn how to correct constraint violations. Our method is based on rule mining, and uses the edits that solved some violations in the past to infer how to solve similar violations in the present. The experimental evaluation of our method on Wikidata shows significant improvements over baselines.
管理知识库是一项至关重要但代价高昂的任务。在这项工作中,我们建议利用知识库的编辑历史来学习如何纠正约束违规。我们的方法是基于规则挖掘,利用过去解决了一些违规的编辑来推断如何解决当前类似的违规。我们的方法在Wikidata上的实验评估显示出比基线有显著的改进。
{"title":"Learning How to Correct a Knowledge Base from the Edit History","authors":"Thomas Pellissier Tanon, Camille Bourgaux, Fabian M. Suchanek","doi":"10.1145/3308558.3313584","DOIUrl":"https://doi.org/10.1145/3308558.3313584","url":null,"abstract":"The curation of a knowledge base is a crucial but costly task. In this work, we propose to take advantage of the edit history of the knowledge base in order to learn how to correct constraint violations. Our method is based on rule mining, and uses the edits that solved some violations in the past to infer how to solve similar violations in the present. The experimental evaluation of our method on Wikidata shows significant improvements over baselines.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90811709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
How Serendipity Improves User Satisfaction with Recommendations? A Large-Scale User Evaluation Serendipity如何通过推荐提高用户满意度?大规模用户评估
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313469
Li Chen, Y. Yang, Ningxia Wang, Keping Yang, Quan Yuan
Recommendation serendipity is being increasingly recognized as being equally important as the other beyond-accuracy objectives (such as novelty and diversity), in eliminating the “filter bubble” phenomenon of the traditional recommender systems. However, little work has empirically verified the effects of serendipity on increasing user satisfaction and behavioral intention. In this paper, we report the results of a large-scale user survey (involving over 3,000 users) conducted in an industrial mobile e-commerce setting. The study has identified the significant causal relationships from novelty, unexpectedness, relevance, and timeliness to serendipity, and from serendipity to user satisfaction and purchase intention. Moreover, our findings reveal that user curiosity plays a moderating role in strengthening the relationships from novelty to serendipity and from serendipity to satisfaction. Our third contribution lies in the comparison of several recommender algorithms, which demonstrates the significant improvements of the serendipity-oriented algorithm over the relevance- and novelty-oriented approaches in terms of user perceptions. We finally discuss the implications of this experiment, which include the feasibility of developing a more precise metric for measuring recommendation serendipity, and the potential benefit of a curiosity-based personalized serendipity strategy for recommender systems.
人们越来越认识到,在消除传统推荐系统的“过滤泡沫”现象方面,推荐的偶然性与其他超越准确性的目标(如新颖性和多样性)同样重要。然而,很少有实证研究证实意外发现对提高用户满意度和行为意愿的影响。在本文中,我们报告了在工业移动电子商务环境中进行的大规模用户调查(涉及3000多名用户)的结果。该研究确定了新颖性、意外性、相关性和及时性与意外发现之间的重要因果关系,以及意外发现与用户满意度和购买意愿之间的因果关系。此外,我们的研究结果表明,用户的好奇心在加强从新奇到意外发现和从意外发现到满足的关系中起着调节作用。我们的第三个贡献在于几种推荐算法的比较,这表明在用户感知方面,面向偶然性的算法比面向相关性和新颖性的方法有显著的改进。我们最后讨论了这个实验的意义,包括开发一个更精确的度量推荐意外发现的可行性,以及基于好奇心的个性化推荐系统意外发现策略的潜在好处。
{"title":"How Serendipity Improves User Satisfaction with Recommendations? A Large-Scale User Evaluation","authors":"Li Chen, Y. Yang, Ningxia Wang, Keping Yang, Quan Yuan","doi":"10.1145/3308558.3313469","DOIUrl":"https://doi.org/10.1145/3308558.3313469","url":null,"abstract":"Recommendation serendipity is being increasingly recognized as being equally important as the other beyond-accuracy objectives (such as novelty and diversity), in eliminating the “filter bubble” phenomenon of the traditional recommender systems. However, little work has empirically verified the effects of serendipity on increasing user satisfaction and behavioral intention. In this paper, we report the results of a large-scale user survey (involving over 3,000 users) conducted in an industrial mobile e-commerce setting. The study has identified the significant causal relationships from novelty, unexpectedness, relevance, and timeliness to serendipity, and from serendipity to user satisfaction and purchase intention. Moreover, our findings reveal that user curiosity plays a moderating role in strengthening the relationships from novelty to serendipity and from serendipity to satisfaction. Our third contribution lies in the comparison of several recommender algorithms, which demonstrates the significant improvements of the serendipity-oriented algorithm over the relevance- and novelty-oriented approaches in terms of user perceptions. We finally discuss the implications of this experiment, which include the feasibility of developing a more precise metric for measuring recommendation serendipity, and the potential benefit of a curiosity-based personalized serendipity strategy for recommender systems.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91147246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 77
XFake: Explainable Fake News Detector with Visualizations XFake:可解释的假新闻检测器与可视化
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3314119
Fan Yang, Shiva K. Pentyala, Sina Mohseni, Mengnan Du, Hao Yuan, Rhema Linder, E. Ragan, Shuiwang Ji, Xia Hu
In this demo paper, we present the XFake system, an explainable fake news detector that assists end-users to identify news credibility. To effectively detect and interpret the fakeness of news items, we jointly consider both attributes (e.g., speaker) and statements. Specifically, MIMIC, ATTN and PERT frameworks are designed, where MIMIC is built for attribute analysis, ATTN is for statement semantic analysis and PERT is for statement linguistic analysis. Beyond the explanations extracted from the designed frameworks, relevant supporting examples as well as visualization are further provided to facilitate the interpretation. Our implemented system is demonstrated on a real-world dataset crawled from PolitiFact1, where thousands of verified political news have been collected.
在这篇演示论文中,我们介绍了XFake系统,这是一个可解释的假新闻检测器,可帮助最终用户识别新闻的可信度。为了有效地检测和解释新闻的真实性,我们共同考虑了属性(例如,说话者)和陈述。具体来说,设计了MIMIC、ATTN和PERT框架,其中MIMIC用于属性分析,ATTN用于语句语义分析,PERT用于语句语言分析。除了从设计框架中提取的解释之外,还提供了相关的支持示例和可视化,以方便解释。我们实现的系统在从PolitiFact1抓取的真实数据集上进行了演示,该数据集收集了数千条经过验证的政治新闻。
{"title":"XFake: Explainable Fake News Detector with Visualizations","authors":"Fan Yang, Shiva K. Pentyala, Sina Mohseni, Mengnan Du, Hao Yuan, Rhema Linder, E. Ragan, Shuiwang Ji, Xia Hu","doi":"10.1145/3308558.3314119","DOIUrl":"https://doi.org/10.1145/3308558.3314119","url":null,"abstract":"In this demo paper, we present the XFake system, an explainable fake news detector that assists end-users to identify news credibility. To effectively detect and interpret the fakeness of news items, we jointly consider both attributes (e.g., speaker) and statements. Specifically, MIMIC, ATTN and PERT frameworks are designed, where MIMIC is built for attribute analysis, ATTN is for statement semantic analysis and PERT is for statement linguistic analysis. Beyond the explanations extracted from the designed frameworks, relevant supporting examples as well as visualization are further provided to facilitate the interpretation. Our implemented system is demonstrated on a real-world dataset crawled from PolitiFact1, where thousands of verified political news have been collected.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89730950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 89
A Graph is Worth a Thousand Words: Telling Event Stories using Timeline Summarization Graphs 一张图表胜过千言万语:使用时间线总结图表讲述事件故事
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313396
Jeffery Ansah, Lin Liu, Wei Kang, Selasi Kwashie, Jixue Li, Jiuyong Li
Story timeline summarization is widely used by analysts, law enforcement agencies, and policymakers for content presentation, story-telling, and other data-driven decision-making applications. Recent advancements in web technologies have rendered social media sites such as Twitter and Facebook as a viable platform for discovering evolving stories and trending events for story timeline summarization. However, a timeline summarization structure that models complex evolving stories by tracking event evolution to identify different themes of a story and generate a coherent structure that is easy for users to understand is yet to be explored. In this paper, we propose StoryGraph, a novel graph timeline summarization structure that is capable of identifying the different themes of a story. By using high penalty metrics that leverage user network communities, temporal proximity, and the semantic context of the events, we construct coherent paths and generate structural timeline summaries to tell the story of how events evolve over time. We performed experiments on real-world datasets to show the prowess of StoryGraph. StoryGraph outperforms existing models and produces accurate timeline summarizations. As a key finding, we discover that user network communities increase coherence leading to the generation of consistent summary structures.
故事时间轴摘要被分析师、执法机构和决策者广泛用于内容呈现、故事讲述和其他数据驱动的决策应用。最近网络技术的进步使得Twitter和Facebook等社交媒体网站成为一个可行的平台,可以发现不断发展的故事和趋势事件,以便对故事时间轴进行总结。然而,一种时间线总结结构,通过跟踪事件演变来模拟复杂的故事演变,以识别故事的不同主题,并生成易于用户理解的连贯结构,还有待探索。在本文中,我们提出了StoryGraph,这是一种新颖的图形时间线总结结构,能够识别故事的不同主题。通过使用利用用户网络社区、时间接近度和事件语义上下文的高惩罚指标,我们构建了连贯的路径,并生成结构化的时间轴摘要,以讲述事件如何随时间演变的故事。我们在真实世界的数据集上进行了实验,以展示StoryGraph的强大功能。StoryGraph优于现有的模型,并产生准确的时间轴摘要。作为一个重要的发现,我们发现用户网络社区增加了连贯性,从而产生一致的摘要结构。
{"title":"A Graph is Worth a Thousand Words: Telling Event Stories using Timeline Summarization Graphs","authors":"Jeffery Ansah, Lin Liu, Wei Kang, Selasi Kwashie, Jixue Li, Jiuyong Li","doi":"10.1145/3308558.3313396","DOIUrl":"https://doi.org/10.1145/3308558.3313396","url":null,"abstract":"Story timeline summarization is widely used by analysts, law enforcement agencies, and policymakers for content presentation, story-telling, and other data-driven decision-making applications. Recent advancements in web technologies have rendered social media sites such as Twitter and Facebook as a viable platform for discovering evolving stories and trending events for story timeline summarization. However, a timeline summarization structure that models complex evolving stories by tracking event evolution to identify different themes of a story and generate a coherent structure that is easy for users to understand is yet to be explored. In this paper, we propose StoryGraph, a novel graph timeline summarization structure that is capable of identifying the different themes of a story. By using high penalty metrics that leverage user network communities, temporal proximity, and the semantic context of the events, we construct coherent paths and generate structural timeline summaries to tell the story of how events evolve over time. We performed experiments on real-world datasets to show the prowess of StoryGraph. StoryGraph outperforms existing models and produces accurate timeline summarizations. As a key finding, we discover that user network communities increase coherence leading to the generation of consistent summary structures.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75122053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Characterizing Speed and Scale of Cryptocurrency Discussion Spread on Reddit 表征加密货币讨论的速度和规模在Reddit上传播
Pub Date : 2019-05-13 DOI: 10.1145/3308558.3313702
M. Glenski, Emily Saldanha, Svitlana Volkova
Cryptocurrencies are a novel and disruptive technology that has prompted a new approach to how currencies work in the modern economy. As such, online discussions related to cryptocurrencies often go beyond posts about the technology and underlying architecture of the various coins, to subjective speculations of price fluctuations and predictions. Furthermore, online discussions, potentially driven by foreign adversaries, criminals or hackers, can have a significant impact on our economy and national security if spread at scale. This paper is the first to qualitatively measure and contrast discussion growth about three popular cryptocurrencies with key distinctions in motivation, usage, and implementation - Bitcoin, Ethereum, and Monero on Reddit. More specifically, we measure how discussions relevant to these coins spread in online social environments - how deep and how wide they go, how long they last, how many people they reach, etc. More importantly, we compare user behavior patterns between the focused community of the official coin subreddits and the general community across Reddit as a whole. Our Reddit sample covers three years of data between 2015 and 2018 and includes a time period of a record high Bitcoin price rise.1 Our results demonstrate that while the largest discussions on Reddit are focused on Bitcoin, posts about Monero (a cryptocurrency often used by criminals for illegal transactions on the Dark Web2) start discussions that are typically longer and wider. Bitcoin posts trigger subsequent discussion more immediately but Monero posts are more likely to trigger a longer lasting discussion. We find that moderately subjective posts across all three coins trigger larger, longer, and more viral discussion cascades within both focused and general communities on Reddit. Our analysis aims to bring the awareness to online discussion spread relevant to cryptocurrencies in addition to informing models for forecasting cryptocurrency price that rely on discussions in social media.
加密货币是一种新颖的颠覆性技术,它促使人们对货币在现代经济中的运作方式产生了新的看法。因此,与加密货币相关的在线讨论往往超出了关于各种硬币的技术和底层架构的帖子,而是对价格波动和预测的主观猜测。此外,可能由外国对手、犯罪分子或黑客推动的网上讨论,如果大规模传播,可能对我们的经济和国家安全产生重大影响。本文是第一个定性地衡量和对比关于三种流行的加密货币的讨论增长的文章,它们在动机、使用和实现方面具有关键区别——Reddit上的比特币、以太坊和门罗币。更具体地说,我们衡量与这些硬币相关的讨论如何在网络社交环境中传播——它们传播的深度和广度、持续的时间、影响的人数等等。更重要的是,我们比较了官方硬币子Reddit的重点社区和整个Reddit的普通社区之间的用户行为模式。我们的Reddit样本涵盖了2015年至2018年的三年数据,其中包括比特币价格创历史新高的时期我们的研究结果表明,虽然Reddit上最大的讨论集中在比特币上,但关于门罗币(一种犯罪分子经常在暗网进行非法交易的加密货币)的帖子通常会引发更长、更广泛的讨论。比特币帖子更容易引发后续讨论,但门罗币帖子更有可能引发更持久的讨论。我们发现,在所有三种货币中,适度主观的帖子都会在Reddit的重点社区和普通社区中引发更大、更长、更病毒式的讨论。我们的分析旨在提高人们对与加密货币相关的在线讨论传播的认识,并为依赖于社交媒体讨论的加密货币价格预测模型提供信息。
{"title":"Characterizing Speed and Scale of Cryptocurrency Discussion Spread on Reddit","authors":"M. Glenski, Emily Saldanha, Svitlana Volkova","doi":"10.1145/3308558.3313702","DOIUrl":"https://doi.org/10.1145/3308558.3313702","url":null,"abstract":"Cryptocurrencies are a novel and disruptive technology that has prompted a new approach to how currencies work in the modern economy. As such, online discussions related to cryptocurrencies often go beyond posts about the technology and underlying architecture of the various coins, to subjective speculations of price fluctuations and predictions. Furthermore, online discussions, potentially driven by foreign adversaries, criminals or hackers, can have a significant impact on our economy and national security if spread at scale. This paper is the first to qualitatively measure and contrast discussion growth about three popular cryptocurrencies with key distinctions in motivation, usage, and implementation - Bitcoin, Ethereum, and Monero on Reddit. More specifically, we measure how discussions relevant to these coins spread in online social environments - how deep and how wide they go, how long they last, how many people they reach, etc. More importantly, we compare user behavior patterns between the focused community of the official coin subreddits and the general community across Reddit as a whole. Our Reddit sample covers three years of data between 2015 and 2018 and includes a time period of a record high Bitcoin price rise.1 Our results demonstrate that while the largest discussions on Reddit are focused on Bitcoin, posts about Monero (a cryptocurrency often used by criminals for illegal transactions on the Dark Web2) start discussions that are typically longer and wider. Bitcoin posts trigger subsequent discussion more immediately but Monero posts are more likely to trigger a longer lasting discussion. We find that moderately subjective posts across all three coins trigger larger, longer, and more viral discussion cascades within both focused and general communities on Reddit. Our analysis aims to bring the awareness to online discussion spread relevant to cryptocurrencies in addition to informing models for forecasting cryptocurrency price that rely on discussions in social media.","PeriodicalId":23013,"journal":{"name":"The World Wide Web Conference","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75257896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
期刊
The World Wide Web Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1