首页 > 最新文献

Proceedings of the 13th International Conference on Web Search and Data Mining最新文献

英文 中文
Adversarial Machine Learning in Recommender Systems (AML-RecSys) 推荐系统中的对抗性机器学习(AML-RecSys)
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371877
Yashar Deldjoo, T. D. Noia, Felice Antonio Merra
Recommender systems (RS) are an integral part of many online services aiming to provide an enhanced user-oriented experience. Machine learning (ML) models are nowadays broadly adopted in modern state-of-the-art approaches to recommendation, which are typically trained to maximize a user-centred utility (e.g., user satisfaction) or a business-oriented one (e.g., profitability or sales increase). They work under the main assumption that users' historical feedback can serve as proper ground-truth for model training and evaluation. However, driven by the success in the ML community, recent advances show that state-of-the-art recommendation approaches such as matrix factorization (MF) models or the ones based on deep neural networks can be vulnerable to adversarial perturbations applied on the input data. These adversarial samples can impede the ability for training high-quality MF models and can put the driven success of these approaches at high risk. As a result, there is a new paradigm of secure training for RS that takes into account the presence of adversarial samples into the recommendation process. We present adversarial machine learning in Recommender Systems (AML-RecSys), which concerns the study of effective ML techniques in RS to fight against an adversarial component. AML-RecSys has been proposed in two main fashions within the RS literature: (i) adversarial regularization, which attempts to combat against adversarial perturbation added to input data or model parameters of a RS and, (ii) generative adversarial network (GAN)-based models, which adopt a generative process to train powerful ML models. We discuss a theoretical framework to unify the two above models, which is performed via a minimax game between an adversarial component and a discriminator. Furthermore, we explore various examples illustrating the successful application of AML to solve various RS tasks. Finally, we present a global taxonomy/overview of the academic literature based on several identified dimensions, namely (i) research goals and challenges, (ii) application domains and (iii) technical overview.
推荐系统(RS)是许多在线服务不可或缺的一部分,旨在提供增强的面向用户的体验。如今,机器学习(ML)模型被广泛应用于现代最先进的推荐方法中,这些方法通常被训练为最大化以用户为中心的效用(例如,用户满意度)或以业务为导向的效用(例如,盈利能力或销售增长)。它们的主要假设是,用户的历史反馈可以作为模型训练和评估的正确基础。然而,在机器学习社区成功的推动下,最近的进展表明,最先进的推荐方法,如矩阵分解(MF)模型或基于深度神经网络的推荐方法,可能容易受到应用于输入数据的对抗性扰动的影响。这些对抗性样本可能会阻碍训练高质量MF模型的能力,并可能使这些方法的成功处于高风险之中。因此,在推荐过程中考虑到对抗性样本的存在,出现了一种新的RS安全训练范例。我们在推荐系统(AML-RecSys)中提出了对抗性机器学习,它涉及在RS中研究有效的机器学习技术来对抗对抗性组件。AML-RecSys在RS文献中以两种主要方式提出:(i)对抗性正则化,试图对抗添加到RS输入数据或模型参数中的对抗性扰动;(ii)基于生成对抗网络(GAN)的模型,采用生成过程来训练强大的ML模型。我们讨论了一个统一上述两个模型的理论框架,该框架通过对抗组件和鉴别器之间的极大极小博弈来实现。此外,我们探讨了各种例子,说明AML成功应用于解决各种RS任务。最后,我们根据几个确定的维度,即(i)研究目标和挑战,(ii)应用领域和(iii)技术概述,对学术文献进行了全球分类/概述。
{"title":"Adversarial Machine Learning in Recommender Systems (AML-RecSys)","authors":"Yashar Deldjoo, T. D. Noia, Felice Antonio Merra","doi":"10.1145/3336191.3371877","DOIUrl":"https://doi.org/10.1145/3336191.3371877","url":null,"abstract":"Recommender systems (RS) are an integral part of many online services aiming to provide an enhanced user-oriented experience. Machine learning (ML) models are nowadays broadly adopted in modern state-of-the-art approaches to recommendation, which are typically trained to maximize a user-centred utility (e.g., user satisfaction) or a business-oriented one (e.g., profitability or sales increase). They work under the main assumption that users' historical feedback can serve as proper ground-truth for model training and evaluation. However, driven by the success in the ML community, recent advances show that state-of-the-art recommendation approaches such as matrix factorization (MF) models or the ones based on deep neural networks can be vulnerable to adversarial perturbations applied on the input data. These adversarial samples can impede the ability for training high-quality MF models and can put the driven success of these approaches at high risk. As a result, there is a new paradigm of secure training for RS that takes into account the presence of adversarial samples into the recommendation process. We present adversarial machine learning in Recommender Systems (AML-RecSys), which concerns the study of effective ML techniques in RS to fight against an adversarial component. AML-RecSys has been proposed in two main fashions within the RS literature: (i) adversarial regularization, which attempts to combat against adversarial perturbation added to input data or model parameters of a RS and, (ii) generative adversarial network (GAN)-based models, which adopt a generative process to train powerful ML models. We discuss a theoretical framework to unify the two above models, which is performed via a minimax game between an adversarial component and a discriminator. Furthermore, we explore various examples illustrating the successful application of AML to solve various RS tasks. Finally, we present a global taxonomy/overview of the academic literature based on several identified dimensions, namely (i) research goals and challenges, (ii) application domains and (iii) technical overview.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128934142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Balanced Influence Maximization in Attributed Social Network Based on Sampling 基于抽样的属性社会网络平衡影响最大化
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371833
Mingkai Lin, Wenzhong Li, Sanglu Lu
Influence maximization in social networks is the problem of finding a set of seed nodes in the network that maximizes the spread of influence under certain information prorogation model, which has become an important topic in social network analysis. In this paper, we show that conventional influence maximization algorithms cause uneven spread of influence among different attribute groups in social networks, which could lead to severer bias in public opinion dissemination and viral marketing. We formulate the balanced influence maximization problem to address the trade-off between influence maximization and attribute balance, and propose a sampling based solution to solve the problem efficiently. To avoid full network exploration, we first propose an attribute-based (AB) sampling method to sample attributed social networks with respect to preserving network structural properties and attribute proportion among user groups. Then we propose an attributed-based reverse influence sampling (AB-RIS) algorithm to select seed nodes from the sampled graph. The proposed AB-RIS algorithm runs efficiently with guaranteed accuracy, and achieves the trade-off between influence maximization and attribute balance. Extensive experiments based on four real-world social network datasets show that AB-RIS significantly outperforms the state-of-the-art approaches in balanced influence maximization.
社交网络中的影响力最大化问题是在一定的信息延拓模型下,在网络中找到一组影响传播最大化的种子节点,成为社会网络分析中的一个重要课题。在本文中,我们发现传统的影响力最大化算法导致社交网络中不同属性群体的影响力传播不均匀,这可能导致民意传播和病毒式营销的严重偏见。为了解决影响最大化和属性平衡之间的权衡,我们提出了平衡影响最大化问题,并提出了一种基于采样的解决方案来有效地解决问题。为了避免对整个网络进行探索,我们首先提出了一种基于属性(AB)的采样方法来对具有属性的社交网络进行采样,同时保留了网络的结构属性和属性在用户群体中的比例。然后,我们提出了一种基于属性的反向影响采样(AB-RIS)算法,从采样图中选择种子节点。所提出的AB-RIS算法在保证精度的前提下高效运行,实现了影响最大化和属性平衡之间的权衡。基于四个真实社会网络数据集的广泛实验表明,AB-RIS在平衡影响最大化方面显着优于最先进的方法。
{"title":"Balanced Influence Maximization in Attributed Social Network Based on Sampling","authors":"Mingkai Lin, Wenzhong Li, Sanglu Lu","doi":"10.1145/3336191.3371833","DOIUrl":"https://doi.org/10.1145/3336191.3371833","url":null,"abstract":"Influence maximization in social networks is the problem of finding a set of seed nodes in the network that maximizes the spread of influence under certain information prorogation model, which has become an important topic in social network analysis. In this paper, we show that conventional influence maximization algorithms cause uneven spread of influence among different attribute groups in social networks, which could lead to severer bias in public opinion dissemination and viral marketing. We formulate the balanced influence maximization problem to address the trade-off between influence maximization and attribute balance, and propose a sampling based solution to solve the problem efficiently. To avoid full network exploration, we first propose an attribute-based (AB) sampling method to sample attributed social networks with respect to preserving network structural properties and attribute proportion among user groups. Then we propose an attributed-based reverse influence sampling (AB-RIS) algorithm to select seed nodes from the sampled graph. The proposed AB-RIS algorithm runs efficiently with guaranteed accuracy, and achieves the trade-off between influence maximization and attribute balance. Extensive experiments based on four real-world social network datasets show that AB-RIS significantly outperforms the state-of-the-art approaches in balanced influence maximization.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122084493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Debiasing Word Embeddings from Sentiment Associations in Names 从人名的情感关联中去除词嵌入的偏见
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371779
C. Hube, Maximilian Idahl, B. Fetahu
Word embeddings, trained through models like skip-gram, have shown to be prone to capturing the biases from the training corpus, e.g. gender bias. Such biases are unwanted as they spill in downstream tasks, thus, leading to discriminatory behavior. In this work, we address the problem of prior sentiment associated with names in word embeddings where for a given name representation (e.g. "Smith"), a sentiment classifier will categorize it as either positive or negative. We propose DebiasEmb, a skip-gram based word embedding approach that, for a given oracle sentiment classification model, will debias the name representations, such that they cannot be associated with either positive or negative sentiment. Evaluation on standard word embedding benchmarks and a downstream analysis show that our approach is able to maintain a high quality of embeddings and at the same time mitigate sentiment bias in name embeddings.
通过skip-gram等模型训练的词嵌入,已经显示出容易从训练语料库中捕获偏见,例如性别偏见。这种偏见是不希望的,因为它们会溢出到下游任务中,从而导致歧视行为。在这项工作中,我们解决了词嵌入中与名称相关的先验情感问题,其中对于给定的名称表示(例如:“史密斯”),情感分类器会将其分类为积极或消极。我们提出了DebiasEmb,这是一种基于跳过图的词嵌入方法,对于给定的oracle情感分类模型,它将去偏向名称表示,这样它们就不能与积极或消极的情感相关联。对标准词嵌入基准的评估和下游分析表明,我们的方法能够保持高质量的嵌入,同时减轻名称嵌入中的情感偏差。
{"title":"Debiasing Word Embeddings from Sentiment Associations in Names","authors":"C. Hube, Maximilian Idahl, B. Fetahu","doi":"10.1145/3336191.3371779","DOIUrl":"https://doi.org/10.1145/3336191.3371779","url":null,"abstract":"Word embeddings, trained through models like skip-gram, have shown to be prone to capturing the biases from the training corpus, e.g. gender bias. Such biases are unwanted as they spill in downstream tasks, thus, leading to discriminatory behavior. In this work, we address the problem of prior sentiment associated with names in word embeddings where for a given name representation (e.g. \"Smith\"), a sentiment classifier will categorize it as either positive or negative. We propose DebiasEmb, a skip-gram based word embedding approach that, for a given oracle sentiment classification model, will debias the name representations, such that they cannot be associated with either positive or negative sentiment. Evaluation on standard word embedding benchmarks and a downstream analysis show that our approach is able to maintain a high quality of embeddings and at the same time mitigate sentiment bias in name embeddings.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124726550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Toward Activity Discovery in the Personal Web 面向个人网络中的活动发现
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371828
Tara Safavi, Adam Fourney, Robert B Sim, Marcin Juraszek, Shane Williams, Ned Friend, Danai Koutra, Paul N. Bennett
Individuals' personal information collections (their emails, files, appointments, web searches, contacts, etc) offer a wealth of insights into the organization and structure of their everyday lives. In this paper we address the task of learning representations of personal information items to capture individuals' ongoing activities, such as projects and tasks: Such representations can be used in activity-centric applications like personal assistants, email clients, and productivity tools to help people better manage their data and time. We propose a graph-based approach that leverages the inherent interconnected structure of personal information collections, and derive efficient, exact techniques to incrementally update representations as new data arrive. We demonstrate the strengths of our graph-based representations against competitive baselines in a novel intrinsic rating task and an extrinsic recommendation task.
个人信息的收集(他们的电子邮件、文件、约会、网络搜索、联系人等)为了解他们日常生活的组织和结构提供了丰富的见解。在本文中,我们解决了学习个人信息项的表示以捕获个人正在进行的活动(如项目和任务)的任务:这种表示可以用于以活动为中心的应用程序,如个人助理、电子邮件客户端和生产力工具,以帮助人们更好地管理他们的数据和时间。我们提出了一种基于图的方法,该方法利用个人信息集合固有的相互关联结构,并派生出高效、精确的技术,在新数据到达时增量更新表示。在一个新的内在评价任务和一个外在推荐任务中,我们展示了基于图的表示相对于竞争基线的优势。
{"title":"Toward Activity Discovery in the Personal Web","authors":"Tara Safavi, Adam Fourney, Robert B Sim, Marcin Juraszek, Shane Williams, Ned Friend, Danai Koutra, Paul N. Bennett","doi":"10.1145/3336191.3371828","DOIUrl":"https://doi.org/10.1145/3336191.3371828","url":null,"abstract":"Individuals' personal information collections (their emails, files, appointments, web searches, contacts, etc) offer a wealth of insights into the organization and structure of their everyday lives. In this paper we address the task of learning representations of personal information items to capture individuals' ongoing activities, such as projects and tasks: Such representations can be used in activity-centric applications like personal assistants, email clients, and productivity tools to help people better manage their data and time. We propose a graph-based approach that leverages the inherent interconnected structure of personal information collections, and derive efficient, exact techniques to incrementally update representations as new data arrive. We demonstrate the strengths of our graph-based representations against competitive baselines in a novel intrinsic rating task and an extrinsic recommendation task.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123121542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
AutoBlock
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371813
Wei Zhang, Hao Wei, Bunyamin Sisman, Xin Dong, Christos Faloutsos, Davd Page
Entity matching seeks to identify data records over one or multiple data sources that refer to the same real-world entity. Virtually every entity matching task on large datasets requires blocking, a step that reduces the number of record pairs to be matched. However, most of the traditional blocking methods are learning-free and key-based, and their successes are largely built on laborious human effort in cleaning data and designing blocking keys. In this paper, we propose AutoBlock, a novel hands-off blocking framework for entity matching, based on similarity-preserving representation learning and nearest neighbor search. Our contributions include: (a) Automation: AutoBlock frees users from laborious data cleaning and blocking key tuning. (b) Scalability: AutoBlock has a sub-quadratic total time complexity and can be easily deployed for millions of records. (c) Effectiveness: AutoBlock outperforms a wide range of competitive baselines on multiple large-scale, real-world datasets, especially when datasets are dirty and/or unstructured.
{"title":"AutoBlock","authors":"Wei Zhang, Hao Wei, Bunyamin Sisman, Xin Dong, Christos Faloutsos, Davd Page","doi":"10.1145/3336191.3371813","DOIUrl":"https://doi.org/10.1145/3336191.3371813","url":null,"abstract":"Entity matching seeks to identify data records over one or multiple data sources that refer to the same real-world entity. Virtually every entity matching task on large datasets requires blocking, a step that reduces the number of record pairs to be matched. However, most of the traditional blocking methods are learning-free and key-based, and their successes are largely built on laborious human effort in cleaning data and designing blocking keys. In this paper, we propose AutoBlock, a novel hands-off blocking framework for entity matching, based on similarity-preserving representation learning and nearest neighbor search. Our contributions include: (a) Automation: AutoBlock frees users from laborious data cleaning and blocking key tuning. (b) Scalability: AutoBlock has a sub-quadratic total time complexity and can be easily deployed for millions of records. (c) Effectiveness: AutoBlock outperforms a wide range of competitive baselines on multiple large-scale, real-world datasets, especially when datasets are dirty and/or unstructured.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117025444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-End Deep Reinforcement Learning based Recommendation with Supervised Embedding 基于监督嵌入的端到端深度强化学习推荐
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371858
Feng Liu, Huifeng Guo, Xutao Li, Ruiming Tang, Yunming Ye, Xiuqiang He
The research of reinforcement learning (RL) based recommendation method has become a hot topic in recommendation community, due to the recent advance in interactive recommender systems. The existing RL recommendation approaches can be summarized into a unified framework with three components, namely embedding component (EC), state representation component (SRC) and policy component (PC). We find that EC cannot be nicely trained with the other two components simultaneously. Previous studies bypass the obstacle through a pre-training and fixing strategy, which makes their approaches unlike a real end-to-end fashion. More importantly, such pre-trained and fixed EC suffers from two inherent drawbacks: (1) Pre-trained and fixed embeddings are unable to model evolving preference of users and item correlations in the dynamic environment; (2) Pre-training is inconvenient in the industrial applications. To address the problem, in this paper, we propose an End-to-end Deep Reinforcement learning based Recommendation framework (EDRR). In this framework, a supervised learning signal is carefully designed for smoothing the update gradients to EC, and three incorporating ways are introduced and compared. To the best of our knowledge, we are the first to address the training compatibility between the three components in RL based recommendations. Extensive experiments are conducted on three real-world datasets, and the results demonstrate the proposed EDRR effectively achieves the end-to-end training purpose for both policy-based and value-based RL models, and delivers better performance than state-of-the-art methods.
随着交互式推荐系统的发展,基于强化学习(RL)的推荐方法的研究成为了推荐界的热点。现有的RL推荐方法可以概括为一个统一的框架,包含三个组成部分,即嵌入组件(embedded component, EC)、状态表示组件(state representation component, SRC)和策略组件(policy component, PC)。我们发现电子商务不能很好地与其他两个组成部分同时训练。之前的研究通过预先训练和修复策略绕过了这个障碍,这使得他们的方法与真正的端到端方式不同。更重要的是,这种预训练和固定的电子商务存在两个固有的缺陷:(1)预训练和固定的嵌入无法模拟动态环境中用户偏好和物品相关性的演变;(2)预训练在工业应用中不方便。为了解决这个问题,在本文中,我们提出了一个基于端到端深度强化学习的推荐框架(EDRR)。在该框架中,精心设计了一个监督学习信号,使更新梯度平滑到EC,并介绍了三种合并方法并进行了比较。据我们所知,我们是第一个解决基于强化学习的建议中三个组件之间训练兼容性的人。在三个真实数据集上进行了大量的实验,结果表明,所提出的EDRR有效地实现了基于策略和基于价值的RL模型的端到端训练目的,并且提供了比现有方法更好的性能。
{"title":"End-to-End Deep Reinforcement Learning based Recommendation with Supervised Embedding","authors":"Feng Liu, Huifeng Guo, Xutao Li, Ruiming Tang, Yunming Ye, Xiuqiang He","doi":"10.1145/3336191.3371858","DOIUrl":"https://doi.org/10.1145/3336191.3371858","url":null,"abstract":"The research of reinforcement learning (RL) based recommendation method has become a hot topic in recommendation community, due to the recent advance in interactive recommender systems. The existing RL recommendation approaches can be summarized into a unified framework with three components, namely embedding component (EC), state representation component (SRC) and policy component (PC). We find that EC cannot be nicely trained with the other two components simultaneously. Previous studies bypass the obstacle through a pre-training and fixing strategy, which makes their approaches unlike a real end-to-end fashion. More importantly, such pre-trained and fixed EC suffers from two inherent drawbacks: (1) Pre-trained and fixed embeddings are unable to model evolving preference of users and item correlations in the dynamic environment; (2) Pre-training is inconvenient in the industrial applications. To address the problem, in this paper, we propose an End-to-end Deep Reinforcement learning based Recommendation framework (EDRR). In this framework, a supervised learning signal is carefully designed for smoothing the update gradients to EC, and three incorporating ways are introduced and compared. To the best of our knowledge, we are the first to address the training compatibility between the three components in RL based recommendations. Extensive experiments are conducted on three real-world datasets, and the results demonstrate the proposed EDRR effectively achieves the end-to-end training purpose for both policy-based and value-based RL models, and delivers better performance than state-of-the-art methods.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124216403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
ConvERSe'20: The WSDM 2020 Workshop on Conversational Systems for E-Commerce Recommendations and Search ConvERSe'20: WSDM 2020关于电子商务推荐和搜索会话系统的研讨会
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371882
Eugene Agichtein, Dilek Z. Hakkani-Tür, S. Kallumadi, S. Malmasi
Conversational systems have improved dramatically recently, and are receiving increasing attention in academic literature. These systems are also becoming adapted in E-Commerce due to increased integration of E-Commerce search and recommendation source with virtual assistants such as Alexa, Siri, and Google assistant. However, significant research challenges remain spanning areas of dialogue systems, spoken natural language processing, human-computer interaction, and search and recommender systems, which all are exacerbated with demanding requirements of E-Commerce. The purpose of this workshop is to bring together researchers and practitioners in the areas of conversational systems, human-computer interaction, information retrieval, and recommender systems. Bringing diverse research areas together into a single workshop would accelerate progress on adapting conversation systems to the E-Commerce domain, to set a research agenda, to examine how to build and share data sets, and to establish common evaluation metrics and benchmarks to drive research progress.
近年来,会话系统得到了极大的改进,并在学术文献中受到越来越多的关注。由于电子商务搜索和推荐源与虚拟助手(如Alexa、Siri和谷歌助手)的集成增加,这些系统也开始适应电子商务。然而,重大的研究挑战仍然跨越对话系统、口语自然语言处理、人机交互、搜索和推荐系统等领域,这些都随着电子商务的要求而加剧。本次研讨会的目的是将对话系统、人机交互、信息检索和推荐系统领域的研究人员和实践者聚集在一起。将不同的研究领域汇集到一个研讨会中,将加速使对话系统适应电子商务领域、制定研究议程、研究如何建立和共享数据集以及建立共同的评估指标和基准以推动研究进展等方面的进展。
{"title":"ConvERSe'20: The WSDM 2020 Workshop on Conversational Systems for E-Commerce Recommendations and Search","authors":"Eugene Agichtein, Dilek Z. Hakkani-Tür, S. Kallumadi, S. Malmasi","doi":"10.1145/3336191.3371882","DOIUrl":"https://doi.org/10.1145/3336191.3371882","url":null,"abstract":"Conversational systems have improved dramatically recently, and are receiving increasing attention in academic literature. These systems are also becoming adapted in E-Commerce due to increased integration of E-Commerce search and recommendation source with virtual assistants such as Alexa, Siri, and Google assistant. However, significant research challenges remain spanning areas of dialogue systems, spoken natural language processing, human-computer interaction, and search and recommender systems, which all are exacerbated with demanding requirements of E-Commerce. The purpose of this workshop is to bring together researchers and practitioners in the areas of conversational systems, human-computer interaction, information retrieval, and recommender systems. Bringing diverse research areas together into a single workshop would accelerate progress on adapting conversation systems to the E-Commerce domain, to set a research agenda, to examine how to build and share data sets, and to establish common evaluation metrics and benchmarks to drive research progress.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123012160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Fast Item Ranking under Neural Network based Measures 基于神经网络测度的快速项目排序
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371830
Shulong Tan, Zhixin Zhou, Zhao-Ying Xu, Ping Li
Recently, plenty of neural network based recommendation models have demonstrated their strength in modeling complicated relationships between heterogeneous objects (i.e., users and items). However, the applications of these fine trained recommendation models are limited to the off-line manner or the re-ranking procedure (on a pre-filtered small subset of items), due to their time-consuming computations. Fast item ranking under learned neural network based ranking measures is largely still an open question. In this paper, we formulate ranking under neural network based measures as a generic ranking task, Optimal Binary Function Search (OBFS), which does not make strong assumptions for the ranking measures. We first analyze limitations of existing fast ranking methods (e.g., ANN search) and explain why they are not applicable for OBFS. Further, we propose a flexible graph-based solution for it, Binary Function Search on Graph (BFSG). It can achieve approximate optimal efficiently, with accessible conditions. Experiments demonstrate effectiveness and efficiency of the proposed method, in fast item ranking under typical neural network based measures.
近年来,大量基于神经网络的推荐模型在建模异构对象(即用户和物品)之间的复杂关系方面表现出了强大的实力。然而,由于计算时间长,这些经过良好训练的推荐模型的应用仅限于离线方式或重新排序过程(在预先过滤的小项目子集上)。基于学习神经网络的排序方法下的快速排序在很大程度上仍然是一个悬而未决的问题。在本文中,我们将基于神经网络的度量下的排序表述为一个通用的排序任务,即最优二叉函数搜索(OBFS),它对排序度量没有很强的假设。我们首先分析了现有快速排序方法(例如,ANN搜索)的局限性,并解释了为什么它们不适用于OBFS。进一步,我们提出了一种灵活的基于图的解决方案,即图上二进制函数搜索(BFSG)。在可达条件下,它能有效地达到近似最优。实验证明了该方法在典型的基于神经网络的指标下快速排序的有效性和有效性。
{"title":"Fast Item Ranking under Neural Network based Measures","authors":"Shulong Tan, Zhixin Zhou, Zhao-Ying Xu, Ping Li","doi":"10.1145/3336191.3371830","DOIUrl":"https://doi.org/10.1145/3336191.3371830","url":null,"abstract":"Recently, plenty of neural network based recommendation models have demonstrated their strength in modeling complicated relationships between heterogeneous objects (i.e., users and items). However, the applications of these fine trained recommendation models are limited to the off-line manner or the re-ranking procedure (on a pre-filtered small subset of items), due to their time-consuming computations. Fast item ranking under learned neural network based ranking measures is largely still an open question. In this paper, we formulate ranking under neural network based measures as a generic ranking task, Optimal Binary Function Search (OBFS), which does not make strong assumptions for the ranking measures. We first analyze limitations of existing fast ranking methods (e.g., ANN search) and explain why they are not applicable for OBFS. Further, we propose a flexible graph-based solution for it, Binary Function Search on Graph (BFSG). It can achieve approximate optimal efficiently, with accessible conditions. Experiments demonstrate effectiveness and efficiency of the proposed method, in fast item ranking under typical neural network based measures.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127942032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Investigating Examination Behavior in Mobile Search 调查移动搜索中的检查行为
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371797
Yukun Zheng, Jiaxin Mao, Yiqun Liu, M. Sanderson, Min Zhang, Shaoping Ma
Examination is one of the most important user interactions in Web search. A number of works studied examination behavior in Web search and helped researchers better understand how users allocate their attention on search engine result pages (SERPs). Compared to desktop search, mobile search has a number of differences such as fewer results on the screen. These differences bring in mobile-specific factors affecting users' examination behavior. However, there still lacks research on users' attention allocation mechanism via viewports in mobile search. Therefore, we design a lab-based study to collect user's rich interaction behavior in mobile search. Based on the collected data, we first analyze how users examine SERPs and allocate their attention to heterogeneous results. Then we investigate the effect of mobile-specific factors and other common factors on users allocating attention. Finally, we apply the findings of user attention allocation from the user study into click model construction efforts, which significantly improves the state-of-the-art click model. Our work brings insights into a better understanding of users' interaction patterns in mobile search and may benefit other mobile search-related research.
检查是Web搜索中最重要的用户交互之一。许多研究工作研究了网络搜索中的检查行为,并帮助研究人员更好地理解用户如何在搜索引擎结果页面(serp)上分配他们的注意力。与桌面搜索相比,移动搜索有很多不同之处,比如屏幕上的搜索结果更少。这些差异带来了影响用户考试行为的手机特有因素。然而,对于移动搜索中用户通过视口分配注意力的机制,目前还缺乏相关研究。因此,我们设计了一个基于实验室的研究来收集用户在移动搜索中的丰富交互行为。基于收集到的数据,我们首先分析了用户如何检查serp并将他们的注意力分配到异构结果上。然后,我们研究了移动特定因素和其他常见因素对用户注意力分配的影响。最后,我们将用户注意力分配的研究结果应用到点击模型的构建工作中,这大大提高了当前的点击模型。我们的工作为更好地理解移动搜索中的用户交互模式提供了见解,并可能对其他移动搜索相关的研究有益。
{"title":"Investigating Examination Behavior in Mobile Search","authors":"Yukun Zheng, Jiaxin Mao, Yiqun Liu, M. Sanderson, Min Zhang, Shaoping Ma","doi":"10.1145/3336191.3371797","DOIUrl":"https://doi.org/10.1145/3336191.3371797","url":null,"abstract":"Examination is one of the most important user interactions in Web search. A number of works studied examination behavior in Web search and helped researchers better understand how users allocate their attention on search engine result pages (SERPs). Compared to desktop search, mobile search has a number of differences such as fewer results on the screen. These differences bring in mobile-specific factors affecting users' examination behavior. However, there still lacks research on users' attention allocation mechanism via viewports in mobile search. Therefore, we design a lab-based study to collect user's rich interaction behavior in mobile search. Based on the collected data, we first analyze how users examine SERPs and allocate their attention to heterogeneous results. Then we investigate the effect of mobile-specific factors and other common factors on users allocating attention. Finally, we apply the findings of user attention allocation from the user study into click model construction efforts, which significantly improves the state-of-the-art click model. Our work brings insights into a better understanding of users' interaction patterns in mobile search and may benefit other mobile search-related research.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126872121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Time Interval Aware Self-Attention for Sequential Recommendation 时序推荐的时间间隔感知自注意
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371786
Jiacheng Li, Yujie Wang, Julian McAuley
Sequential recommender systems seek to exploit the order of users' interactions, in order to predict their next action based on the context of what they have done recently. Traditionally, Markov Chains(MCs), and more recently Recurrent Neural Networks (RNNs) and Self Attention (SA) have proliferated due to their ability to capture the dynamics of sequential patterns. However a simplifying assumption made by most of these models is to regard interaction histories as ordered sequences, without regard for the time intervals between each interaction (i.e., they model the time-order but not the actual timestamp). In this paper, we seek to explicitly model the timestamps of interactions within a sequential modeling framework to explore the influence of different time intervals on next item prediction. We propose TiSASRec (Time Interval aware Self-attention based sequential recommendation), which models both the absolute positions of items as well as the time intervals between them in a sequence. Extensive empirical studies show the features of TiSASRec under different settings and compare the performance of self-attention with different positional encodings. Furthermore, experimental results show that our method outperforms various state-of-the-art sequential models on both sparse and dense datasets and different evaluation metrics.
顺序推荐系统试图利用用户交互的顺序,以便根据他们最近所做的事情来预测他们的下一步行动。传统上,马尔可夫链(MCs),以及最近的循环神经网络(rnn)和自我注意(SA)由于能够捕捉序列模式的动态而得到了广泛的应用。然而,这些模型中的大多数都做了一个简化的假设,即将交互历史视为有序序列,而不考虑每次交互之间的时间间隔(即,它们建模的是时间顺序,而不是实际的时间戳)。在本文中,我们试图在顺序建模框架内明确建模交互的时间戳,以探索不同时间间隔对下一个项目预测的影响。我们提出了TiSASRec(基于时间间隔感知的自注意顺序推荐),它既对项目的绝对位置建模,也对它们在序列中的时间间隔建模。大量的实证研究显示了TiSASRec在不同设置下的特征,并比较了不同位置编码下的自注意表现。此外,实验结果表明,我们的方法在稀疏和密集数据集以及不同的评估指标上都优于各种最先进的序列模型。
{"title":"Time Interval Aware Self-Attention for Sequential Recommendation","authors":"Jiacheng Li, Yujie Wang, Julian McAuley","doi":"10.1145/3336191.3371786","DOIUrl":"https://doi.org/10.1145/3336191.3371786","url":null,"abstract":"Sequential recommender systems seek to exploit the order of users' interactions, in order to predict their next action based on the context of what they have done recently. Traditionally, Markov Chains(MCs), and more recently Recurrent Neural Networks (RNNs) and Self Attention (SA) have proliferated due to their ability to capture the dynamics of sequential patterns. However a simplifying assumption made by most of these models is to regard interaction histories as ordered sequences, without regard for the time intervals between each interaction (i.e., they model the time-order but not the actual timestamp). In this paper, we seek to explicitly model the timestamps of interactions within a sequential modeling framework to explore the influence of different time intervals on next item prediction. We propose TiSASRec (Time Interval aware Self-attention based sequential recommendation), which models both the absolute positions of items as well as the time intervals between them in a sequence. Extensive empirical studies show the features of TiSASRec under different settings and compare the performance of self-attention with different positional encodings. Furthermore, experimental results show that our method outperforms various state-of-the-art sequential models on both sparse and dense datasets and different evaluation metrics.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128274823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 348
期刊
Proceedings of the 13th International Conference on Web Search and Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1