首页 > 最新文献

Proceedings of the 13th International Conference on Web Search and Data Mining最新文献

英文 中文
Time to Shop for Valentine's Day: Shopping Occasions and Sequential Recommendation in E-commerce 为情人节购物的时间:电子商务中的购物场合和顺序推荐
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371836
Jianling Wang, Raphael Louca, D. Hu, Caitlin Cellier, James Caverlee, Liangjie Hong
Currently, most sequence-based recommendation models aim to predict a user's next actions (e.g. next purchase) based on their past actions. These models either capture users' intrinsic preference (e.g. a comedy lover, or a fan of fantasy) from their long-term behavior patterns or infer their current needs by emphasizing recent actions. However, in e-commerce, intrinsic user behavior may be shifted by occasions such as birthdays, anniversaries, or gifting celebrations (Valentine's Day or Mother's Day), leading to purchases that deviate from long-term preferences and are not related to recent actions. In this work, we propose a novel next-item recommendation system which models a user's default, intrinsic preference, as well as two different kinds of occasion-based signals that may cause users to deviate from their normal behavior. More specifically, this model is novel in that it: (1) captures a personal occasion signal using an attention layer that models reoccurring occasions specific to that user (e.g. a birthday); (2) captures a global occasion signal using an attention layer that models seasonal or reoccurring occasions for many users (e.g. Christmas); (3) balances the user's intrinsic preferences with the personal and global occasion signals for different users at different timestamps with a gating layer. We explore two real-world e-commerce datasets (Amazon and Etsy) and show that the proposed model outperforms state-of-the-art models by 7.62% and 6.06% in predicting users' next purchase.
目前,大多数基于序列的推荐模型旨在根据用户过去的行为预测用户的下一步行为(例如下一次购买)。这些模型要么从用户的长期行为模式中捕捉他们的内在偏好(例如,喜剧爱好者或幻想迷),要么通过强调最近的行为来推断他们当前的需求。然而,在电子商务中,用户的内在行为可能会因生日、纪念日或礼物庆祝活动(情人节或母亲节)等场合而改变,从而导致购买偏离长期偏好,与最近的行为无关。在这项工作中,我们提出了一个新的下一项推荐系统,该系统模拟了用户的默认、内在偏好,以及两种不同类型的基于场合的信号,这些信号可能导致用户偏离他们的正常行为。更具体地说,这个模型是新颖的,因为它:(1)使用一个关注层捕获个人场合信号,该关注层对该用户特定的重复出现的场合(例如生日)进行建模;(2)使用关注层捕获全局场合信号,该关注层为许多用户(例如圣诞节)模拟季节性或重复发生的场合;(3)利用门控层平衡不同用户在不同时间戳的个人和全局场合信号与用户的内在偏好。我们探索了两个现实世界的电子商务数据集(亚马逊和Etsy),并表明所提出的模型在预测用户下一次购买方面比最先进的模型高出7.62%和6.06%。
{"title":"Time to Shop for Valentine's Day: Shopping Occasions and Sequential Recommendation in E-commerce","authors":"Jianling Wang, Raphael Louca, D. Hu, Caitlin Cellier, James Caverlee, Liangjie Hong","doi":"10.1145/3336191.3371836","DOIUrl":"https://doi.org/10.1145/3336191.3371836","url":null,"abstract":"Currently, most sequence-based recommendation models aim to predict a user's next actions (e.g. next purchase) based on their past actions. These models either capture users' intrinsic preference (e.g. a comedy lover, or a fan of fantasy) from their long-term behavior patterns or infer their current needs by emphasizing recent actions. However, in e-commerce, intrinsic user behavior may be shifted by occasions such as birthdays, anniversaries, or gifting celebrations (Valentine's Day or Mother's Day), leading to purchases that deviate from long-term preferences and are not related to recent actions. In this work, we propose a novel next-item recommendation system which models a user's default, intrinsic preference, as well as two different kinds of occasion-based signals that may cause users to deviate from their normal behavior. More specifically, this model is novel in that it: (1) captures a personal occasion signal using an attention layer that models reoccurring occasions specific to that user (e.g. a birthday); (2) captures a global occasion signal using an attention layer that models seasonal or reoccurring occasions for many users (e.g. Christmas); (3) balances the user's intrinsic preferences with the personal and global occasion signals for different users at different timestamps with a gating layer. We explore two real-world e-commerce datasets (Amazon and Etsy) and show that the proposed model outperforms state-of-the-art models by 7.62% and 6.06% in predicting users' next purchase.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129014895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Think like a Human: Constructing Cognitive-oriented Retrieval Model for Web Search 像人一样思考:构建面向认知的网络搜索检索模型
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3372180
Xiangsheng Li
Existing retrieval models have gained much success, however, we have to admit that the models work in a rather different manner than how humans make relevance judgments. To bridge the gap between practical user behavior and retrieval model, it is essential to construct cognitive-oriented retrieval model. The cognitive process in IR includes two important aspects: searching and reading. Searching is the user behaviors interacted with retrieval system such as query formulation and click while reading is the information seeking behavior in a specific landing page or document. We plan to better understand user cognitive behaviors in these two aspects by conducting a lab-based user study. More importantly, the heuristics drawn from cognitive behaviors are then incorporated into retrieval models.
现有的检索模型已经取得了很大的成功,然而,我们必须承认,这些模型的工作方式与人类做出相关判断的方式相当不同。为了弥补实际用户行为与检索模型之间的差距,有必要构建面向认知的检索模型。IR的认知过程包括两个重要方面:搜索和阅读。搜索是用户与检索系统交互的查询制定、点击等行为,而阅读是用户在特定的登陆页面或文档中寻找信息的行为。我们计划通过开展基于实验室的用户研究,更好地了解用户在这两个方面的认知行为。更重要的是,从认知行为中得出的启发式然后被纳入检索模型。
{"title":"Think like a Human: Constructing Cognitive-oriented Retrieval Model for Web Search","authors":"Xiangsheng Li","doi":"10.1145/3336191.3372180","DOIUrl":"https://doi.org/10.1145/3336191.3372180","url":null,"abstract":"Existing retrieval models have gained much success, however, we have to admit that the models work in a rather different manner than how humans make relevance judgments. To bridge the gap between practical user behavior and retrieval model, it is essential to construct cognitive-oriented retrieval model. The cognitive process in IR includes two important aspects: searching and reading. Searching is the user behaviors interacted with retrieval system such as query formulation and click while reading is the information seeking behavior in a specific landing page or document. We plan to better understand user cognitive behaviors in these two aspects by conducting a lab-based user study. More importantly, the heuristics drawn from cognitive behaviors are then incorporated into retrieval models.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127920337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OpenNIR: A Complete Neural Ad-Hoc Ranking Pipeline OpenNIR:一个完整的神经自组织排序管道
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371864
Sean MacAvaney
With the growing popularity of neural approaches for ad-hoc ranking, there is a need for tools that can effectively reproduce prior results and ease continued research by supporting current state-of-the-art approaches. Although several excellent neural ranking tools exist, none offer an easy end-to-end ad-hoc neural raking pipeline. A complete pipeline is particularly important for ad-hoc ranking because there are numerous parameter settings that have a considerable effect on the ultimate performance yet often are under-reported in current work (e.g., initial ranking settings, re-ranking threshold, training sampling strategy, etc.). In this work, I present a complete ad-hoc neural ranking pipeline which addresses these shortcomings: OpenNIR. The pipeline is easy to use (a single command will download required data, train, and evaluate a model), yet highly configurable, allowing for continued work in areas that are understudied. Aside from the core pipeline, the software also includes several bells and whistles that make use of components of the pipeline, such as performance benchmarking and tuning of unsupervised ranker parameters for fair comparisons against traditional baselines. The pipeline and these capabilities are demonstrated. The code is available, and contributions are welcome.
随着用于特别排序的神经方法的日益普及,需要能够有效地重现先前结果并通过支持当前最先进的方法来简化继续研究的工具。虽然存在一些优秀的神经排序工具,但没有一个提供简单的端到端特设神经排序管道。一个完整的流水线对于临时排序尤其重要,因为有许多参数设置对最终性能有相当大的影响,但在当前的工作中往往没有得到充分的报告(例如,初始排序设置、重新排序阈值、训练抽样策略等)。在这项工作中,我提出了一个完整的特设神经排序管道来解决这些缺点:OpenNIR。该管道易于使用(只需一个命令即可下载所需的数据、训练和评估模型),并且高度可配置,允许在未充分研究的领域继续工作。除了核心管道之外,该软件还包括一些利用管道组件的附加功能,例如性能基准测试和调整无监督排名参数,以便与传统基线进行公平比较。演示了管道和这些功能。代码是可用的,欢迎贡献。
{"title":"OpenNIR: A Complete Neural Ad-Hoc Ranking Pipeline","authors":"Sean MacAvaney","doi":"10.1145/3336191.3371864","DOIUrl":"https://doi.org/10.1145/3336191.3371864","url":null,"abstract":"With the growing popularity of neural approaches for ad-hoc ranking, there is a need for tools that can effectively reproduce prior results and ease continued research by supporting current state-of-the-art approaches. Although several excellent neural ranking tools exist, none offer an easy end-to-end ad-hoc neural raking pipeline. A complete pipeline is particularly important for ad-hoc ranking because there are numerous parameter settings that have a considerable effect on the ultimate performance yet often are under-reported in current work (e.g., initial ranking settings, re-ranking threshold, training sampling strategy, etc.). In this work, I present a complete ad-hoc neural ranking pipeline which addresses these shortcomings: OpenNIR. The pipeline is easy to use (a single command will download required data, train, and evaluate a model), yet highly configurable, allowing for continued work in areas that are understudied. Aside from the core pipeline, the software also includes several bells and whistles that make use of components of the pipeline, such as performance benchmarking and tuning of unsupervised ranker parameters for fair comparisons against traditional baselines. The pipeline and these capabilities are demonstrated. The code is available, and contributions are welcome.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126528571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments 在线控制实验结果评估中的挑战、最佳实践和陷阱
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371871
Somit Gupta, Xiaolin Shi, Pavel A. Dmitriev, Xin Fu, Avijit Mukherjee
A/B Testing is the gold standard to estimate the causal relationship between a change in a product and its impact on key outcome measures. It is widely used in the industry to test changes ranging from simple copy change or UI change to more complex changes like using machine learning models to personalize user experience. The key aspect of A/B testing is evaluation of experiment results. Designing the right set of metrics - correct outcome measures, data quality indicators, guardrails that prevent harm to business, and a comprehensive set of supporting metrics to understand the "why" behind the key movements is the #1 challenge practitioners face when trying to scale their experimentation program [11, 14]. On the technical side, improving sensitivity of experiment metrics is a hard problem and an active research area, with large practical implications as more and more small and medium size businesses are trying to adopt A/B testing and suffer from insufficient power. In this tutorial we will discuss challenges, best practices, and pitfalls in evaluating experiment results, focusing on both lessons learned and practical guidelines as well as open research questions. A version of this tutorial was also present at KDD 2019 [23]. It was attended by around 150 participants.
A/B测试是评估产品变更及其对关键结果度量的影响之间因果关系的黄金标准。它在行业中广泛用于测试更改,从简单的副本更改或UI更改到更复杂的更改,如使用机器学习模型来个性化用户体验。A/B测试的关键是评估实验结果。设计一套正确的度量标准——正确的结果度量、数据质量指标、防止对业务造成损害的防护措施,以及一套全面的支持性度量标准,以理解关键动作背后的“原因”,这是从业者在尝试扩展实验计划时面临的首要挑战[11,14]。在技术方面,提高实验指标的灵敏度是一个难题,也是一个活跃的研究领域,随着越来越多的中小型企业尝试采用a /B测试,并受到功率不足的困扰,提高实验指标的灵敏度具有很大的实际意义。在本教程中,我们将讨论评估实验结果的挑战,最佳实践和陷阱,重点是经验教训和实践指南以及开放的研究问题。本教程的一个版本也出现在KDD 2019上[23]。约有150人参加。
{"title":"Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments","authors":"Somit Gupta, Xiaolin Shi, Pavel A. Dmitriev, Xin Fu, Avijit Mukherjee","doi":"10.1145/3336191.3371871","DOIUrl":"https://doi.org/10.1145/3336191.3371871","url":null,"abstract":"A/B Testing is the gold standard to estimate the causal relationship between a change in a product and its impact on key outcome measures. It is widely used in the industry to test changes ranging from simple copy change or UI change to more complex changes like using machine learning models to personalize user experience. The key aspect of A/B testing is evaluation of experiment results. Designing the right set of metrics - correct outcome measures, data quality indicators, guardrails that prevent harm to business, and a comprehensive set of supporting metrics to understand the \"why\" behind the key movements is the #1 challenge practitioners face when trying to scale their experimentation program [11, 14]. On the technical side, improving sensitivity of experiment metrics is a hard problem and an active research area, with large practical implications as more and more small and medium size businesses are trying to adopt A/B testing and suffer from insufficient power. In this tutorial we will discuss challenges, best practices, and pitfalls in evaluating experiment results, focusing on both lessons learned and practical guidelines as well as open research questions. A version of this tutorial was also present at KDD 2019 [23]. It was attended by around 150 participants.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134433527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Temporal Pattern of Retweet(s) Help to Maximize Information Diffusion in Twitter 推文的时间模式有助于推特信息传播的最大化
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3372181
Ayan Kumar Bhowmick
Twitter is currently a popular microblogging platform for spread of information by users in the form of tweet messages. Such tweets are shared with followers of the seed user who may reshare it with their own set of followers. Long chain of such retweets form cascades. For effective diffusion of information through such Twitter cascades, we identify two different objectives based on using temporal sequence of retweets. Firstly, we aim to infer the structure of influence trees of Twitter cascades, denoting the who-influenced-whom relationship among retweeting users in the cascade, that can play a significant role in identifying critical paths in the network for information dissemination. The constructed trees closely resemble ground truth influence trees of empirical cascades with high retweet count. Secondly, we propose a fast and efficient algorithm for detection of influential users by identifying anchor nodes from temporal retweet sequence. Identification of such a diverse set of influential users enable a faster diffusion of tweets to a large and diverse population, when targeted as seeds thereby maximizing the influence spread, facilitating several applications including viral marketing, disease control and news dissemination.
Twitter是目前流行的微博平台,用户可以通过tweet消息的形式传播信息。这些推文与种子用户的追随者分享,这些追随者可能会与他们自己的追随者分享。这种转发的长链形成了级联。为了通过这样的Twitter级联有效地传播信息,我们根据转发的时间序列确定了两个不同的目标。首先,我们旨在推断Twitter级联的影响树结构,表示级联中转发用户之间的谁-受影响-谁关系,这对于识别网络中信息传播的关键路径具有重要作用。所构建的树与高转发数的经验级联的基础真值影响树非常相似。其次,我们提出了一种快速有效的算法,通过从时间转发序列中识别锚节点来检测有影响力的用户。识别这样一组不同的有影响力的用户,可以使推文更快地传播到大量不同的人群,当目标作为种子时,从而最大限度地扩大影响力传播,促进包括病毒营销、疾病控制和新闻传播在内的几种应用。
{"title":"Temporal Pattern of Retweet(s) Help to Maximize Information Diffusion in Twitter","authors":"Ayan Kumar Bhowmick","doi":"10.1145/3336191.3372181","DOIUrl":"https://doi.org/10.1145/3336191.3372181","url":null,"abstract":"Twitter is currently a popular microblogging platform for spread of information by users in the form of tweet messages. Such tweets are shared with followers of the seed user who may reshare it with their own set of followers. Long chain of such retweets form cascades. For effective diffusion of information through such Twitter cascades, we identify two different objectives based on using temporal sequence of retweets. Firstly, we aim to infer the structure of influence trees of Twitter cascades, denoting the who-influenced-whom relationship among retweeting users in the cascade, that can play a significant role in identifying critical paths in the network for information dissemination. The constructed trees closely resemble ground truth influence trees of empirical cascades with high retweet count. Secondly, we propose a fast and efficient algorithm for detection of influential users by identifying anchor nodes from temporal retweet sequence. Identification of such a diverse set of influential users enable a faster diffusion of tweets to a large and diverse population, when targeted as seeds thereby maximizing the influence spread, facilitating several applications including viral marketing, disease control and news dissemination.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114685736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Metrics, User Models, and Satisfaction 指标、用户模型和满意度
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371799
A. Wicaksono, Alistair Moffat
User satisfaction is an important factor when evaluating search systems, and hence a good metric should give rise to scores that have a strong positive correlation with user satisfaction ratings. A metric should also correspond to a plausible user model, and hence provide a tangible manifestation of how users interact with search rankings. Recent work has focused on metrics whose user models accurately portray the behavior of search engine users. Here we investigate whether those same metrics then also correlate with user satisfaction. We carry out experiments using various classes of metrics, and confirm through the lens of the C/W/L framework that the metrics with user models that reflect typical behavior also tend to be the metrics that correlate well with user satisfaction ratings.
在评估搜索系统时,用户满意度是一个重要的因素,因此一个好的度量应该产生与用户满意度评级有强烈正相关的分数。指标还应该与合理的用户模型相对应,从而提供用户如何与搜索排名交互的有形表现。最近的工作集中在用户模型准确描述搜索引擎用户行为的度量上。在这里,我们调查这些相同的指标是否也与用户满意度相关。我们使用各种类型的指标进行实验,并通过C/W/L框架的镜头确认,反映典型行为的用户模型指标也往往是与用户满意度评级密切相关的指标。
{"title":"Metrics, User Models, and Satisfaction","authors":"A. Wicaksono, Alistair Moffat","doi":"10.1145/3336191.3371799","DOIUrl":"https://doi.org/10.1145/3336191.3371799","url":null,"abstract":"User satisfaction is an important factor when evaluating search systems, and hence a good metric should give rise to scores that have a strong positive correlation with user satisfaction ratings. A metric should also correspond to a plausible user model, and hence provide a tangible manifestation of how users interact with search rankings. Recent work has focused on metrics whose user models accurately portray the behavior of search engine users. Here we investigate whether those same metrics then also correlate with user satisfaction. We carry out experiments using various classes of metrics, and confirm through the lens of the C/W/L framework that the metrics with user models that reflect typical behavior also tend to be the metrics that correlate well with user satisfaction ratings.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114793102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Enhancing Re-finding Behavior with External Memories for Personalized Search 利用外部记忆增强个性化搜索的重新发现行为
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371794
Yujia Zhou, Zhicheng Dou, Ji-rong Wen
The goal of personalized search is to tailor the document ranking list to meet user's individual needs. Previous studies showed users usually look for the information that has been searched before. This is called re-finding behavior which is widely explored in existing personalized search approaches. However, most existing methods for identifying re-finding behavior focus on simple lexical similarities between queries. In this paper, we propose to construct memory networks (MN) to support the identification of more complex re-finding behavior. Specifically, incorporating semantic information, we devise two external memories to make an expansion of re-finding based on the query and the document respectively. We further design an intent memory to recognize session-based re-finding behavior. Endowed with these memory networks, we can build a fine-grained user model dynamically based on the current query and documents, and use the model to re-rank the results. Experimental results show the significant improvement of our model compared with traditional methods.
个性化搜索的目标是定制文档排名列表,以满足用户的个性化需求。之前的研究表明,用户通常会寻找之前搜索过的信息。这被称为重新发现行为,在现有的个性化搜索方法中得到了广泛的探索。然而,大多数现有的识别重新查找行为的方法只关注查询之间的简单词汇相似性。在本文中,我们提出构建记忆网络(MN)来支持识别更复杂的重新发现行为。具体来说,结合语义信息,我们设计了两个外部存储器,分别在查询和文档的基础上扩展重新查找。我们进一步设计了一个意图存储器来识别基于会话的重新查找行为。利用这些内存网络,我们可以基于当前查询和文档动态构建细粒度的用户模型,并使用该模型对结果进行重新排序。实验结果表明,与传统方法相比,我们的模型有了显著的改进。
{"title":"Enhancing Re-finding Behavior with External Memories for Personalized Search","authors":"Yujia Zhou, Zhicheng Dou, Ji-rong Wen","doi":"10.1145/3336191.3371794","DOIUrl":"https://doi.org/10.1145/3336191.3371794","url":null,"abstract":"The goal of personalized search is to tailor the document ranking list to meet user's individual needs. Previous studies showed users usually look for the information that has been searched before. This is called re-finding behavior which is widely explored in existing personalized search approaches. However, most existing methods for identifying re-finding behavior focus on simple lexical similarities between queries. In this paper, we propose to construct memory networks (MN) to support the identification of more complex re-finding behavior. Specifically, incorporating semantic information, we devise two external memories to make an expansion of re-finding based on the query and the document respectively. We further design an intent memory to recognize session-based re-finding behavior. Endowed with these memory networks, we can build a fine-grained user model dynamically based on the current query and documents, and use the model to re-rank the results. Experimental results show the significant improvement of our model compared with traditional methods.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114497326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
FAQAugmenter: Suggesting Questions for Enterprise FAQ Pages faqaugmentor:为企业FAQ页面提出问题建议
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371862
Ankush Chatterjee, Manish Gupta, Puneet Agrawal
Lack of comprehensive information on frequently asked questions (FAQ) web pages forces users to pose their questions on community question answering forums or contact businesses over slow media like emails or phone calls. This in turn often results into sub-optimal user experience and opportunity loss for businesses. While previous work focuses on FAQ mining and answering queries from FAQ pages, there is no work on verifying completeness or augmenting FAQ pages. We present a system, called FAQAugmenter, which given an FAQ web page, (1) harnesses signals from query logs and the web corpus to identify missing topics, and (2) suggests ranked list of questions for FAQ web page augmentation. Our experiments with FAQ pages from five enterprises each across three categories (banks, hospitals and airports) show that FAQAugmenter suggests high quality relevant questions. FAQAugmenter will contribute significantly not just in improving quality of FAQ web pages but also in turn improving quality of downstream applications like Microsoft QnA Maker.
在常见问题(FAQ)网页上缺乏全面的信息,迫使用户在社区问答论坛上提出问题,或者通过电子邮件或电话等缓慢的媒体与企业联系。这通常会导致次优的用户体验和企业的机会损失。虽然以前的工作侧重于FAQ挖掘和回答FAQ页面中的查询,但没有在验证完整性或增加FAQ页面方面进行工作。我们提出了一个名为FAQAugmenter的系统,它给出了一个FAQ网页,(1)利用查询日志和网络语料库的信号来识别缺失的主题,(2)建议FAQ网页增强的问题排序列表。我们对五家企业的FAQ页面进行了实验,这些企业分别来自三个类别(银行、医院和机场),结果表明FAQAugmenter提出了高质量的相关问题。FAQAugmenter不仅将显著提高FAQ网页的质量,还将提高下游应用程序(如Microsoft QnA Maker)的质量。
{"title":"FAQAugmenter: Suggesting Questions for Enterprise FAQ Pages","authors":"Ankush Chatterjee, Manish Gupta, Puneet Agrawal","doi":"10.1145/3336191.3371862","DOIUrl":"https://doi.org/10.1145/3336191.3371862","url":null,"abstract":"Lack of comprehensive information on frequently asked questions (FAQ) web pages forces users to pose their questions on community question answering forums or contact businesses over slow media like emails or phone calls. This in turn often results into sub-optimal user experience and opportunity loss for businesses. While previous work focuses on FAQ mining and answering queries from FAQ pages, there is no work on verifying completeness or augmenting FAQ pages. We present a system, called FAQAugmenter, which given an FAQ web page, (1) harnesses signals from query logs and the web corpus to identify missing topics, and (2) suggests ranked list of questions for FAQ web page augmentation. Our experiments with FAQ pages from five enterprises each across three categories (banks, hospitals and airports) show that FAQAugmenter suggests high quality relevant questions. FAQAugmenter will contribute significantly not just in improving quality of FAQ web pages but also in turn improving quality of downstream applications like Microsoft QnA Maker.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115737710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems 评估-行动-反思:对话系统和推荐系统之间的深度互动
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371769
Wenqiang Lei, Xiangnan He, Yisong Miao, Qingyun Wu, Richang Hong, Min-Yen Kan, Tat-Seng Chua
Recommender systems are embracing conversational technologies to obtain user preferences dynamically, and to overcome inherent limitations of their static models. A successful Conversational Recommender System (CRS) requires proper handling of interactions between conversation and recommendation. We argue that three fundamental problems need to be solved: 1) what questions to ask regarding item attributes, 2) when to recommend items, and 3) how to adapt to the users' online feedback. To the best of our knowledge, there lacks a unified framework that addresses these problems. In this work, we fill this missing interaction framework gap by proposing a new CRS framework named Estimation"Action" Reflection, or EAR, which consists of three stages to better converse with users. (1) Estimation, which builds predictive models to estimate user preference on both items and item attributes; (2) Action, which learns a dialogue policy to determine whether to ask attributes or recommend items, based on Estimation stage and conversation history; and (3) Reflection, which updates the recommender model when a user rejects the recommendations made by the Action stage. We present two conversation scenarios on binary and enumerated questions, and conduct extensive experiments on two datasets from Yelp and LastFM, for each scenario, respectively. Our experiments demonstrate significant improvements over the state-of-the-art method CRM [32], corresponding to fewer conversation turns and a higher level of recommendation hits.
推荐系统正在采用会话技术来动态获取用户偏好,并克服其静态模型的固有局限性。一个成功的会话推荐系统(CRS)需要正确处理会话和推荐之间的交互。我们认为需要解决三个基本问题:1)关于商品属性该问什么问题,2)何时推荐商品,以及3)如何适应用户的在线反馈。据我们所知,目前还缺乏解决这些问题的统一框架。在这项工作中,我们通过提出一个新的CRS框架来填补这个缺失的交互框架的空白,该框架名为评估“行动”反射,或EAR,它由三个阶段组成,以更好地与用户交谈。(1)估计,构建预测模型来估计用户对物品和物品属性的偏好;(2) Action,学习对话策略,根据estimate阶段和对话历史来决定是询问属性还是推荐项目;(3) Reflection,当用户拒绝Action阶段的推荐时,会更新推荐模型。我们提出了关于二进制和枚举问题的两个对话场景,并分别在来自Yelp和LastFM的两个数据集上对每个场景进行了广泛的实验。我们的实验表明,与最先进的CRM方法相比,该方法有了显著的改进[32],对应于更少的会话次数和更高的推荐点击率。
{"title":"Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems","authors":"Wenqiang Lei, Xiangnan He, Yisong Miao, Qingyun Wu, Richang Hong, Min-Yen Kan, Tat-Seng Chua","doi":"10.1145/3336191.3371769","DOIUrl":"https://doi.org/10.1145/3336191.3371769","url":null,"abstract":"Recommender systems are embracing conversational technologies to obtain user preferences dynamically, and to overcome inherent limitations of their static models. A successful Conversational Recommender System (CRS) requires proper handling of interactions between conversation and recommendation. We argue that three fundamental problems need to be solved: 1) what questions to ask regarding item attributes, 2) when to recommend items, and 3) how to adapt to the users' online feedback. To the best of our knowledge, there lacks a unified framework that addresses these problems. In this work, we fill this missing interaction framework gap by proposing a new CRS framework named Estimation\"Action\" Reflection, or EAR, which consists of three stages to better converse with users. (1) Estimation, which builds predictive models to estimate user preference on both items and item attributes; (2) Action, which learns a dialogue policy to determine whether to ask attributes or recommend items, based on Estimation stage and conversation history; and (3) Reflection, which updates the recommender model when a user rejects the recommendations made by the Action stage. We present two conversation scenarios on binary and enumerated questions, and conduct extensive experiments on two datasets from Yelp and LastFM, for each scenario, respectively. Our experiments demonstrate significant improvements over the state-of-the-art method CRM [32], corresponding to fewer conversation turns and a higher level of recommendation hits.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123673216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 183
ENTYFI
Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371808
C. Chu, Simon Razniewski, G. Weikum
Fiction and fantasy are archetypes of long-tail domains that lack comprehensive methods for automated language processing and knowledge extraction. We present ENTYFI, the first methodology for typing entities in fictional texts coming from books, fan communities or amateur writers. ENTYFI builds on 205 automatically induced high-quality type systems for popular fictional domains, and exploits the overlap and reuse of these fictional domains for fine-grained typing in previously unseen texts. ENTYFI comprises five steps: type system induction, domain relatedness ranking, mention detection, mention typing, and type consolidation. The recall-oriented typing module combines a supervised neural model, unsupervised Hearst-style and dependency patterns, and knowledge base lookups. The precision-oriented consolidation stage utilizes co-occurrence statistics in order to remove noise and to identify the most relevant types. Extensive experiments on newly seen fictional texts demonstrate the quality of ENTYFI.
{"title":"ENTYFI","authors":"C. Chu, Simon Razniewski, G. Weikum","doi":"10.1145/3336191.3371808","DOIUrl":"https://doi.org/10.1145/3336191.3371808","url":null,"abstract":"Fiction and fantasy are archetypes of long-tail domains that lack comprehensive methods for automated language processing and knowledge extraction. We present ENTYFI, the first methodology for typing entities in fictional texts coming from books, fan communities or amateur writers. ENTYFI builds on 205 automatically induced high-quality type systems for popular fictional domains, and exploits the overlap and reuse of these fictional domains for fine-grained typing in previously unseen texts. ENTYFI comprises five steps: type system induction, domain relatedness ranking, mention detection, mention typing, and type consolidation. The recall-oriented typing module combines a supervised neural model, unsupervised Hearst-style and dependency patterns, and knowledge base lookups. The precision-oriented consolidation stage utilizes co-occurrence statistics in order to remove noise and to identify the most relevant types. Extensive experiments on newly seen fictional texts demonstrate the quality of ENTYFI.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125585541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 13th International Conference on Web Search and Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1