ACM Transactions on Information Systems (TOIS)最新文献_第9页

Spoken Conversational Context Improves Query Auto-completion in Web Search 口语会话上下文改善了Web搜索中的查询自动完成

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3447875

T. Vuong, S. Andolina, Giulio Jacucci, Tuukka Ruotsalo

Web searches often originate from conversations in which people engage before they perform a search. Therefore, conversations can be a valuable source of context with which to support the search process. We investigate whether spoken input from conversations can be used as a context to improve query auto-completion. We model the temporal dynamics of the spoken conversational context preceding queries and use these models to re-rank the query auto-completion suggestions. Data were collected from a controlled experiment and comprised conversations among 12 participant pairs conversing about movies or traveling. Search query logs during the conversations were recorded and temporally associated with the conversations. We compared the effects of spoken conversational input in four conditions: a control condition without contextualization; an experimental condition with the model using search query logs; an experimental condition with the model using spoken conversational input; and an experimental condition with the model using both search query logs and spoken conversational input. We show the advantage of combining the spoken conversational context with the Web-search context for improved retrieval performance. Our results suggest that spoken conversations provide a rich context for supporting information searches beyond current user-modeling approaches.

网络搜索通常源于人们在执行搜索之前进行的对话。因此，对话可以成为支持搜索过程的有价值的上下文来源。我们研究了对话中的语音输入是否可以用作上下文来改进查询自动完成。我们对查询之前的口语会话上下文的时间动态建模，并使用这些模型对查询自动完成建议进行重新排序。数据收集自一项对照实验，包括12对参与者关于电影或旅行的对话。记录对话期间的搜索查询日志，并与对话临时关联。我们比较了四种情况下口语会话输入的效果:没有语境化的控制条件;实验条件下，该模型使用搜索查询日志;使用口语会话输入模型的实验条件并给出了该模型同时使用搜索查询日志和语音会话输入的实验条件。我们展示了将口语会话上下文与web搜索上下文相结合以提高检索性能的优势。我们的研究结果表明，口语会话为支持信息搜索提供了丰富的上下文，超出了当前的用户建模方法。

{"title":"Spoken Conversational Context Improves Query Auto-completion in Web Search","authors":"T. Vuong, S. Andolina, Giulio Jacucci, Tuukka Ruotsalo","doi":"10.1145/3447875","DOIUrl":"https://doi.org/10.1145/3447875","url":null,"abstract":"Web searches often originate from conversations in which people engage before they perform a search. Therefore, conversations can be a valuable source of context with which to support the search process. We investigate whether spoken input from conversations can be used as a context to improve query auto-completion. We model the temporal dynamics of the spoken conversational context preceding queries and use these models to re-rank the query auto-completion suggestions. Data were collected from a controlled experiment and comprised conversations among 12 participant pairs conversing about movies or traveling. Search query logs during the conversations were recorded and temporally associated with the conversations. We compared the effects of spoken conversational input in four conditions: a control condition without contextualization; an experimental condition with the model using search query logs; an experimental condition with the model using spoken conversational input; and an experimental condition with the model using both search query logs and spoken conversational input. We show the advantage of combining the spoken conversational context with the Web-search context for improved retrieval performance. Our results suggest that spoken conversations provide a rich context for supporting information searches beyond current user-modeling approaches.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"28 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83299643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

Meta-evaluation of Conversational Search Evaluation Metrics 会话搜索评价指标的元评价

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-27 DOI: 10.1145/3445029

Zeyang Liu, K. Zhou, Max L. Wilson

Conversational search systems, such as Google assistant and Microsoft Cortana, enable users to interact with search systems in multiple rounds through natural language dialogues. Evaluating such systems is very challenging, given that any natural language responses could be generated, and users commonly interact for multiple semantically coherent rounds to accomplish a search task. Although prior studies proposed many evaluation metrics, the extent of how those measures effectively capture user preference remain to be investigated. In this article, we systematically meta-evaluate a variety of conversational search metrics. We specifically study three perspectives on those metrics: (1) reliability: the ability to detect “actual” performance differences as opposed to those observed by chance; (2) fidelity: the ability to agree with ultimate user preference; and (3) intuitiveness: the ability to capture any property deemed important: adequacy, informativeness, and fluency in the context of conversational search. By conducting experiments on two test collections, we find that the performance of different metrics vary significantly across different scenarios, whereas consistent with prior studies, existing metrics only achieve weak correlation with ultimate user preference and satisfaction. METEOR is, comparatively speaking, the best existing single-turn metric considering all three perspectives. We also demonstrate that adapted session-based evaluation metrics can be used to measure multi-turn conversational search, achieving moderate concordance with user satisfaction. To our knowledge, our work establishes the most comprehensive meta-evaluation for conversational search to date.

会话式搜索系统，如Google assistant和Microsoft Cortana，使用户能够通过自然语言对话与搜索系统进行多轮交互。评估这样的系统是非常具有挑战性的，因为可以生成任何自然语言响应，并且用户通常会交互多个语义上连贯的回合来完成搜索任务。虽然先前的研究提出了许多评价指标，但这些措施如何有效地捕获用户偏好的程度仍有待调查。在本文中，我们系统地对各种会话搜索指标进行元评估。我们具体研究了这些指标的三个角度:(1)可靠性:检测“实际”性能差异的能力，而不是偶然观察到的差异;(2)保真度:符合最终用户偏好的能力;(3)直观性:捕捉任何被认为重要的属性的能力:在会话搜索的背景下，充分性、信息性和流畅性。通过对两个测试集的实验，我们发现不同指标在不同场景下的表现差异显著，而与先前的研究一致，现有指标与最终用户偏好和满意度仅实现弱相关性。流星是，相对而言，最好的单回合指标考虑到所有三个角度。我们还证明了适应的基于会话的评估指标可以用于测量多回合会话搜索，实现与用户满意度的适度一致性。据我们所知，我们的工作为会话搜索建立了迄今为止最全面的元评估。

{"title":"Meta-evaluation of Conversational Search Evaluation Metrics","authors":"Zeyang Liu, K. Zhou, Max L. Wilson","doi":"10.1145/3445029","DOIUrl":"https://doi.org/10.1145/3445029","url":null,"abstract":"Conversational search systems, such as Google assistant and Microsoft Cortana, enable users to interact with search systems in multiple rounds through natural language dialogues. Evaluating such systems is very challenging, given that any natural language responses could be generated, and users commonly interact for multiple semantically coherent rounds to accomplish a search task. Although prior studies proposed many evaluation metrics, the extent of how those measures effectively capture user preference remain to be investigated. In this article, we systematically meta-evaluate a variety of conversational search metrics. We specifically study three perspectives on those metrics: (1) reliability: the ability to detect “actual” performance differences as opposed to those observed by chance; (2) fidelity: the ability to agree with ultimate user preference; and (3) intuitiveness: the ability to capture any property deemed important: adequacy, informativeness, and fluency in the context of conversational search. By conducting experiments on two test collections, we find that the performance of different metrics vary significantly across different scenarios, whereas consistent with prior studies, existing metrics only achieve weak correlation with ultimate user preference and satisfaction. METEOR is, comparatively speaking, the best existing single-turn metric considering all three perspectives. We also demonstrate that adapted session-based evaluation metrics can be used to measure multi-turn conversational search, achieving moderate concordance with user satisfaction. To our knowledge, our work establishes the most comprehensive meta-evaluation for conversational search to date.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"1 1","pages":"1 - 42"},"PeriodicalIF":0.0,"publicationDate":"2021-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88116500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Anytime Ranking on Document-Ordered Indexes 文档排序索引的任何时间排序

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-18 DOI: 10.1145/3467890

J. Mackenzie, M. Petri, Alistair Moffat

Inverted indexes continue to be a mainstay of text search engines, allowing efficient querying of large document collections. While there are a number of possible organizations, document-ordered indexes are the most common, since they are amenable to various query types, support index updates, and allow for efficient dynamic pruning operations. One disadvantage with document-ordered indexes is that high-scoring documents can be distributed across the document identifier space, meaning that index traversal algorithms that terminate early might put search effectiveness at risk. The alternative is impact-ordered indexes, which primarily support top- disjunctions but also allow for anytime query processing, where the search can be terminated at any time, with search quality improving as processing latency increases. Anytime query processing can be used to effectively reduce high-percentile tail latency that is essential for operational scenarios in which a service level agreement (SLA) imposes response time requirements. In this work, we show how document-ordered indexes can be organized such that they can be queried in an anytime fashion, enabling strict latency control with effective early termination. Our experiments show that processing document-ordered topical segments selected by a simple score estimator outperforms existing anytime algorithms, and allows query runtimes to be accurately limited to comply with SLA requirements.

倒排索引仍然是文本搜索引擎的主流，它允许对大型文档集合进行高效查询。虽然有许多可能的组织，但文档顺序索引是最常见的，因为它们适用于各种查询类型，支持索引更新，并允许有效的动态修剪操作。文档有序索引的一个缺点是，高分文档可能分布在整个文档标识符空间中，这意味着过早终止的索引遍历算法可能会危及搜索效率。另一种选择是影响排序索引，它主要支持顶断，但也允许随时查询处理，其中搜索可以随时终止，随着处理延迟的增加，搜索质量也会提高。随时查询处理可用于有效地减少高百分位数的尾部延迟，这对于服务水平协议(SLA)施加响应时间需求的操作场景至关重要。在本文中，我们将展示如何组织按文档顺序排列的索引，以便可以随时查询它们，从而实现严格的延迟控制和有效的早期终止。我们的实验表明，处理由简单分数估计器选择的文档顺序主题片段优于现有的任何时间算法，并允许查询运行时间精确地限制以符合SLA要求。

{"title":"Anytime Ranking on Document-Ordered Indexes","authors":"J. Mackenzie, M. Petri, Alistair Moffat","doi":"10.1145/3467890","DOIUrl":"https://doi.org/10.1145/3467890","url":null,"abstract":"Inverted indexes continue to be a mainstay of text search engines, allowing efficient querying of large document collections. While there are a number of possible organizations, document-ordered indexes are the most common, since they are amenable to various query types, support index updates, and allow for efficient dynamic pruning operations. One disadvantage with document-ordered indexes is that high-scoring documents can be distributed across the document identifier space, meaning that index traversal algorithms that terminate early might put search effectiveness at risk. The alternative is impact-ordered indexes, which primarily support top- disjunctions but also allow for anytime query processing, where the search can be terminated at any time, with search quality improving as processing latency increases. Anytime query processing can be used to effectively reduce high-percentile tail latency that is essential for operational scenarios in which a service level agreement (SLA) imposes response time requirements. In this work, we show how document-ordered indexes can be organized such that they can be queried in an anytime fashion, enabling strict latency control with effective early termination. Our experiments show that processing document-ordered topical segments selected by a simple score estimator outperforms existing anytime algorithms, and allows query runtimes to be accurately limited to comply with SLA requirements.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"80 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83790065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

The Simpson’s Paradox in the Offline Evaluation of Recommendation Systems 推荐系统离线评价中的辛普森悖论

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-18 DOI: 10.1145/3458509

A. H. Jadidinejad, C. Macdonald, I. Ounis

Recommendation systems are often evaluated based on user’s interactions that were collected from an existing, already deployed recommendation system. In this situation, users only provide feedback on the exposed items and they may not leave feedback on other items since they have not been exposed to them by the deployed system. As a result, the collected feedback dataset that is used to evaluate a new model is influenced by the deployed system, as a form of closed loop feedback. In this article, we show that the typical offline evaluation of recommender systems suffers from the so-called Simpson’s paradox. Simpson’s paradox is the name given to a phenomenon observed when a significant trend appears in several different sub-populations of observational data but disappears or is even reversed when these sub-populations are combined together. Our in-depth experiments based on stratified sampling reveal that a very small minority of items that are frequently exposed by the deployed system plays a confounding factor in the offline evaluation of recommendation systems. In addition, we propose a novel evaluation methodology that takes into account the confounder, i.e., the deployed system’s characteristics. Using the relative comparison of many recommendation models as in the typical offline evaluation of recommender systems, and based on the Kendall rank correlation coefficient, we show that our proposed evaluation methodology exhibits statistically significant improvements of 14% and 40% on the examined open loop datasets (Yahoo! and Coat), respectively, in reflecting the true ranking of systems with an open loop (randomised) evaluation in comparison to the standard evaluation.

推荐系统通常基于从现有的、已经部署的推荐系统中收集的用户交互来评估。在这种情况下，用户只对公开的项目提供反馈，他们可能不会对其他项目留下反馈，因为部署的系统没有向他们公开这些项目。因此，作为闭环反馈的一种形式，用于评估新模型的收集的反馈数据集受到部署系统的影响。在本文中，我们展示了推荐系统的典型离线评估遭受所谓的辛普森悖论。辛普森悖论指的是一种现象，即在观测数据的几个不同的子种群中出现了一个显著的趋势，但当这些子种群组合在一起时，这个趋势就消失了，甚至出现了逆转。我们基于分层抽样的深入实验表明，部署系统经常暴露的极少数项目在推荐系统的离线评估中起着混淆因素的作用。此外，我们提出了一种新的评估方法，该方法考虑了混杂因素，即部署系统的特征。使用许多推荐模型的相对比较，就像在推荐系统的典型离线评估中一样，并基于肯德尔等级相关系数，我们表明，我们提出的评估方法在检查的开环数据集(Yahoo!和Coat)，分别反映了与标准评估相比，具有开环(随机)评估的系统的真实排名。

{"title":"The Simpson’s Paradox in the Offline Evaluation of Recommendation Systems","authors":"A. H. Jadidinejad, C. Macdonald, I. Ounis","doi":"10.1145/3458509","DOIUrl":"https://doi.org/10.1145/3458509","url":null,"abstract":"Recommendation systems are often evaluated based on user’s interactions that were collected from an existing, already deployed recommendation system. In this situation, users only provide feedback on the exposed items and they may not leave feedback on other items since they have not been exposed to them by the deployed system. As a result, the collected feedback dataset that is used to evaluate a new model is influenced by the deployed system, as a form of closed loop feedback. In this article, we show that the typical offline evaluation of recommender systems suffers from the so-called Simpson’s paradox. Simpson’s paradox is the name given to a phenomenon observed when a significant trend appears in several different sub-populations of observational data but disappears or is even reversed when these sub-populations are combined together. Our in-depth experiments based on stratified sampling reveal that a very small minority of items that are frequently exposed by the deployed system plays a confounding factor in the offline evaluation of recommendation systems. In addition, we propose a novel evaluation methodology that takes into account the confounder, i.e., the deployed system’s characteristics. Using the relative comparison of many recommendation models as in the typical offline evaluation of recommender systems, and based on the Kendall rank correlation coefficient, we show that our proposed evaluation methodology exhibits statistically significant improvements of 14% and 40% on the examined open loop datasets (Yahoo! and Coat), respectively, in reflecting the true ranking of systems with an open loop (randomised) evaluation in comparison to the standard evaluation.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"110 1","pages":"1 - 22"},"PeriodicalIF":0.0,"publicationDate":"2021-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87677459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 18

Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks 增强邻域选择引导的多关系图神经网络

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-16 DOI: 10.1145/3490181

Hao Peng, Ruitong Zhang, Yingtong Dou, Renyu Yang, Jingyi Zhang, Philip S. Yu

Graph Neural Networks (GNNs) have been widely used for the representation learning of various structured graph data, typically through message passing among nodes by aggregating their neighborhood information via different operations. While promising, most existing GNNs oversimplify the complexity and diversity of the edges in the graph and thus are inefficient to cope with ubiquitous heterogeneous graphs, which are typically in the form of multi-relational graph representations. In this article, we propose RioGNN, a novel Reinforced, recursive, and flexible neighborhood selection guided multi-relational Graph Neural Network architecture, to navigate complexity of neural network structures whilst maintaining relation-dependent representations. We first construct a multi-relational graph, according to the practical task, to reflect the heterogeneity of nodes, edges, attributes, and labels. To avoid the embedding over-assimilation among different types of nodes, we employ a label-aware neural similarity measure to ascertain the most similar neighbors based on node attributes. A reinforced relation-aware neighbor selection mechanism is developed to choose the most similar neighbors of a targeting node within a relation before aggregating all neighborhood information from different relations to obtain the eventual node embedding. Particularly, to improve the efficiency of neighbor selecting, we propose a new recursive and scalable reinforcement learning framework with estimable depth and width for different scales of multi-relational graphs. RioGNN can learn more discriminative node embedding with enhanced explainability due to the recognition of individual importance of each relation via the filtering threshold mechanism. Comprehensive experiments on real-world graph data and practical tasks demonstrate the advancements of effectiveness, efficiency, and the model explainability, as opposed to other comparative GNN models.

图神经网络(gnn)已被广泛用于各种结构化图数据的表示学习，通常是通过不同的操作聚合节点间的邻域信息来实现节点间的消息传递。虽然有前景，但大多数现有的gnn过于简化了图中边缘的复杂性和多样性，因此在处理普遍存在的异构图时效率低下，这些图通常以多关系图表示的形式出现。在这篇文章中，我们提出了RioGNN，一种新的增强的、递归的、灵活的邻域选择引导的多关系图神经网络架构，在保持关系依赖表示的同时导航神经网络结构的复杂性。我们首先根据实际任务构造一个多关系图，以反映节点、边、属性和标签的异质性。为了避免不同类型节点之间的嵌入过度同化，我们采用了基于节点属性的标签感知神经相似性度量来确定最相似的邻居。提出了一种增强的关系感知邻居选择机制，在聚合来自不同关系的所有邻居信息以获得最终的节点嵌入之前，选择关系中目标节点最相似的邻居。为了提高邻域选择的效率，针对不同尺度的多关系图，提出了一种新的深度和宽度可估计的递归可扩展强化学习框架。RioGNN通过过滤阈值机制识别每个关系的个体重要性，从而学习到更多的判别性节点嵌入，并增强了可解释性。在真实世界图形数据和实际任务上的综合实验表明，与其他比较GNN模型相比，该模型在有效性、效率和模型可解释性方面取得了进步。

{"title":"Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks","authors":"Hao Peng, Ruitong Zhang, Yingtong Dou, Renyu Yang, Jingyi Zhang, Philip S. Yu","doi":"10.1145/3490181","DOIUrl":"https://doi.org/10.1145/3490181","url":null,"abstract":"Graph Neural Networks (GNNs) have been widely used for the representation learning of various structured graph data, typically through message passing among nodes by aggregating their neighborhood information via different operations. While promising, most existing GNNs oversimplify the complexity and diversity of the edges in the graph and thus are inefficient to cope with ubiquitous heterogeneous graphs, which are typically in the form of multi-relational graph representations. In this article, we propose RioGNN, a novel Reinforced, recursive, and flexible neighborhood selection guided multi-relational Graph Neural Network architecture, to navigate complexity of neural network structures whilst maintaining relation-dependent representations. We first construct a multi-relational graph, according to the practical task, to reflect the heterogeneity of nodes, edges, attributes, and labels. To avoid the embedding over-assimilation among different types of nodes, we employ a label-aware neural similarity measure to ascertain the most similar neighbors based on node attributes. A reinforced relation-aware neighbor selection mechanism is developed to choose the most similar neighbors of a targeting node within a relation before aggregating all neighborhood information from different relations to obtain the eventual node embedding. Particularly, to improve the efficiency of neighbor selecting, we propose a new recursive and scalable reinforcement learning framework with estimable depth and width for different scales of multi-relational graphs. RioGNN can learn more discriminative node embedding with enhanced explainability due to the recognition of individual importance of each relation via the filtering threshold mechanism. Comprehensive experiments on real-world graph data and practical tasks demonstrate the advancements of effectiveness, efficiency, and the model explainability, as opposed to other comparative GNN models.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"47 1","pages":"1 - 46"},"PeriodicalIF":0.0,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85268155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 66

A Large-scale Analysis of Mixed Initiative in Information-Seeking Dialogues for Conversational Search 面向会话搜索的信息寻求对话混合主动性的大规模分析

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-14 DOI: 10.1145/3466796

Svitlana Vakulenko, E. Kanoulas, M. de Rijke

Conversational search is a relatively young area of research that aims at automating an information-seeking dialogue. In this article, we help to position it with respect to other research areas within conversational artificial intelligence (AI) by analysing the structural properties of an information-seeking dialogue. To this end, we perform a large-scale dialogue analysis of more than 150K transcripts from 16 publicly available dialogue datasets. These datasets were collected to inform different dialogue-based tasks including conversational search. We extract different patterns of mixed initiative from these dialogue transcripts and use them to compare dialogues of different types. Moreover, we contrast the patterns found in information-seeking dialogues that are being used for research purposes with the patterns found in virtual reference interviews that were conducted by professional librarians. The insights we provide (1) establish close relations between conversational search and other conversational AI tasks and (2) uncover limitations of existing conversational datasets to inform future data collection tasks.

会话搜索是一个相对年轻的研究领域，旨在实现信息搜索对话的自动化。在本文中，我们通过分析信息寻求对话的结构属性，帮助将其定位于会话人工智能(AI)中的其他研究领域。为此，我们对来自16个公开可用的对话数据集的超过150K个文本进行了大规模的对话分析。收集这些数据集是为了通知不同的基于对话的任务，包括对话搜索。我们从这些对话文本中提取不同的混合主动性模式，并用它们来比较不同类型的对话。此外，我们将用于研究目的的信息寻求对话中的模式与由专业图书馆员进行的虚拟参考访谈中的模式进行了对比。我们提供的见解(1)在会话搜索和其他会话AI任务之间建立密切关系;(2)揭示现有会话数据集的局限性，为未来的数据收集任务提供信息。

{"title":"A Large-scale Analysis of Mixed Initiative in Information-Seeking Dialogues for Conversational Search","authors":"Svitlana Vakulenko, E. Kanoulas, M. de Rijke","doi":"10.1145/3466796","DOIUrl":"https://doi.org/10.1145/3466796","url":null,"abstract":"Conversational search is a relatively young area of research that aims at automating an information-seeking dialogue. In this article, we help to position it with respect to other research areas within conversational artificial intelligence (AI) by analysing the structural properties of an information-seeking dialogue. To this end, we perform a large-scale dialogue analysis of more than 150K transcripts from 16 publicly available dialogue datasets. These datasets were collected to inform different dialogue-based tasks including conversational search. We extract different patterns of mixed initiative from these dialogue transcripts and use them to compare dialogues of different types. Moreover, we contrast the patterns found in information-seeking dialogues that are being used for research purposes with the patterns found in virtual reference interviews that were conducted by professional librarians. The insights we provide (1) establish close relations between conversational search and other conversational AI tasks and (2) uncover limitations of existing conversational datasets to inform future data collection tasks.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"26 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83263120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 21

Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge 情境化知识感知细心神经网络:用知识增强答案选择

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-04-12 DOI: 10.1145/3457533

Yang Deng, Yuexiang Xie, Yaliang Li, Min Yang, W. Lam, Ying Shen

Answer selection, which is involved in many natural language processing applications, such as dialog systems and question answering (QA), is an important yet challenging task in practice, since conventional methods typically suffer from the issues of ignoring diverse real-world background knowledge. In this article, we extensively investigate approaches to enhancing the answer selection model with external knowledge from knowledge graph (KG). First, we present a context-knowledge interaction learning framework, Knowledge-aware Neural Network, which learns the QA sentence representations by considering a tight interaction with the external knowledge from KG and the textual information. Then, we develop two kinds of knowledge-aware attention mechanism to summarize both the context-based and knowledge-based interactions between questions and answers. To handle the diversity and complexity of KG information, we further propose a Contextualized Knowledge-aware Attentive Neural Network, which improves the knowledge representation learning with structure information via a customized Graph Convolutional Network and comprehensively learns context-based and knowledge-based sentence representation via the multi-view knowledge-aware attention mechanism. We evaluate our method on four widely used benchmark QA datasets, including WikiQA, TREC QA, InsuranceQA, and Yahoo QA. Results verify the benefits of incorporating external knowledge from KG and show the robust superiority and extensive applicability of our method.

答案选择涉及到许多自然语言处理应用，如对话系统和问答(QA)，在实践中是一项重要但具有挑战性的任务，因为传统方法通常存在忽略各种现实世界背景知识的问题。在本文中，我们广泛地研究了利用知识图(KG)的外部知识来增强答案选择模型的方法。首先，我们提出了一个上下文-知识交互学习框架——知识感知神经网络，该框架通过考虑与KG外部知识和文本信息的紧密交互来学习QA句子表示。然后，我们建立了两种知识感知的注意机制来总结基于上下文和基于知识的问答互动。为了处理KG信息的多样性和复杂性，我们进一步提出了一种情境化的知识感知注意神经网络，该网络通过自定义的图卷积网络改进了基于结构信息的知识表示学习，并通过多视图知识感知注意机制综合学习基于情境和基于知识的句子表示。我们在四个广泛使用的基准QA数据集上评估了我们的方法，包括WikiQA, TREC QA, InsuranceQA和Yahoo QA。结果验证了从KG中吸收外部知识的好处，并显示了我们的方法的鲁棒性优势和广泛的适用性。

{"title":"Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge","authors":"Yang Deng, Yuexiang Xie, Yaliang Li, Min Yang, W. Lam, Ying Shen","doi":"10.1145/3457533","DOIUrl":"https://doi.org/10.1145/3457533","url":null,"abstract":"Answer selection, which is involved in many natural language processing applications, such as dialog systems and question answering (QA), is an important yet challenging task in practice, since conventional methods typically suffer from the issues of ignoring diverse real-world background knowledge. In this article, we extensively investigate approaches to enhancing the answer selection model with external knowledge from knowledge graph (KG). First, we present a context-knowledge interaction learning framework, Knowledge-aware Neural Network, which learns the QA sentence representations by considering a tight interaction with the external knowledge from KG and the textual information. Then, we develop two kinds of knowledge-aware attention mechanism to summarize both the context-based and knowledge-based interactions between questions and answers. To handle the diversity and complexity of KG information, we further propose a Contextualized Knowledge-aware Attentive Neural Network, which improves the knowledge representation learning with structure information via a customized Graph Convolutional Network and comprehensively learns context-based and knowledge-based sentence representation via the multi-view knowledge-aware attention mechanism. We evaluate our method on four widely used benchmark QA datasets, including WikiQA, TREC QA, InsuranceQA, and Yahoo QA. Results verify the benefits of incorporating external knowledge from KG and show the robust superiority and extensive applicability of our method.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"69 1","pages":"1 - 33"},"PeriodicalIF":0.0,"publicationDate":"2021-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85765862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Hierarchical Hyperedge Embedding-Based Representation Learning for Group Recommendation 基于分层超边缘嵌入的群组推荐表示学习

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-03-24 DOI: 10.1145/3457949

Lei Guo, Hongzhi Yin, Tong Chen, Xiangliang Zhang, Kai Zheng

Group recommendation aims to recommend items to a group of users. In this work, we study group recommendation in a particular scenario, namely occasional group recommendation, where groups are formed ad hoc and users may just constitute a group for the first time—that is, the historical group-item interaction records are highly limited. Most state-of-the-art works have addressed the challenge by aggregating group members’ personal preferences to learn the group representation. However, the representation learning for a group is most complex beyond the aggregation or fusion of group member representation, as the personal preferences and group preferences may be in different spaces and even orthogonal. In addition, the learned user representation is not accurate due to the sparsity of users’ interaction data. Moreover, the group similarity in terms of common group members has been overlooked, which, however, has the great potential to improve the group representation learning. In this work, we focus on addressing the aforementioned challenges in the group representation learning task, and devise a hierarchical hyperedge embedding-based group recommender, namely HyperGroup. Specifically, we propose to leverage the user-user interactions to alleviate the sparsity issue of user-item interactions, and design a graph neural network-based representation learning network to enhance the learning of individuals’ preferences from their friends’ preferences, which provides a solid foundation for learning groups’ preferences. To exploit the group similarity (i.e., overlapping relationships among groups) to learn a more accurate group representation from highly limited group-item interactions, we connect all groups as a network of overlapping sets (a.k.a. hypergraph), and treat the task of group preference learning as embedding hyperedges (i.e., user sets/groups) in a hypergraph, where an inductive hyperedge embedding method is proposed. To further enhance the group-level preference modeling, we develop a joint training strategy to learn both user-item and group-item interactions in the same process. We conduct extensive experiments on two real-world datasets, and the experimental results demonstrate the superiority of our proposed HyperGroup in comparison to the state-of-the-art baselines.

组推荐的目的是向一组用户推荐项目。在这项工作中，我们研究了一个特定场景下的群体推荐，即偶尔的群体推荐，其中群体是临时形成的，用户可能只是第一次组成一个群体，即历史的组项交互记录是高度有限的。大多数最先进的作品都通过聚合群体成员的个人偏好来学习群体表征来解决这一挑战。然而，群体表征学习的复杂性超出了群体成员表征的聚合或融合，因为个人偏好和群体偏好可能处于不同的空间，甚至是正交的。此外，由于用户交互数据的稀疏性，学习得到的用户表示并不准确。此外，群体相似度在共同群体成员方面被忽视了，但这对提高群体表征学习具有很大的潜力。在这项工作中，我们专注于解决小组表示学习任务中的上述挑战，并设计了一个基于分层超边缘嵌入的小组推荐器，即HyperGroup。具体来说，我们提出利用用户-用户交互来缓解用户-物品交互的稀疏性问题，并设计了一个基于图神经网络的表示学习网络，以增强个人从朋友的偏好中学习偏好，为学习群体的偏好提供坚实的基础。为了利用群体相似性(即群体之间的重叠关系)从高度有限的群体-项目交互中学习更准确的群体表示，我们将所有群体连接为重叠集网络(又称超图)，并将群体偏好学习任务视为嵌入超图中的超边(即用户集/组)，其中提出了一种归纳超边嵌入方法。为了进一步增强群体级偏好建模，我们开发了一种联合训练策略来学习用户-物品和群体-物品在同一过程中的交互。我们在两个真实世界的数据集上进行了广泛的实验，实验结果表明，与最先进的基线相比，我们提出的HyperGroup具有优势。

{"title":"Hierarchical Hyperedge Embedding-Based Representation Learning for Group Recommendation","authors":"Lei Guo, Hongzhi Yin, Tong Chen, Xiangliang Zhang, Kai Zheng","doi":"10.1145/3457949","DOIUrl":"https://doi.org/10.1145/3457949","url":null,"abstract":"Group recommendation aims to recommend items to a group of users. In this work, we study group recommendation in a particular scenario, namely occasional group recommendation, where groups are formed ad hoc and users may just constitute a group for the first time—that is, the historical group-item interaction records are highly limited. Most state-of-the-art works have addressed the challenge by aggregating group members’ personal preferences to learn the group representation. However, the representation learning for a group is most complex beyond the aggregation or fusion of group member representation, as the personal preferences and group preferences may be in different spaces and even orthogonal. In addition, the learned user representation is not accurate due to the sparsity of users’ interaction data. Moreover, the group similarity in terms of common group members has been overlooked, which, however, has the great potential to improve the group representation learning. In this work, we focus on addressing the aforementioned challenges in the group representation learning task, and devise a hierarchical hyperedge embedding-based group recommender, namely HyperGroup. Specifically, we propose to leverage the user-user interactions to alleviate the sparsity issue of user-item interactions, and design a graph neural network-based representation learning network to enhance the learning of individuals’ preferences from their friends’ preferences, which provides a solid foundation for learning groups’ preferences. To exploit the group similarity (i.e., overlapping relationships among groups) to learn a more accurate group representation from highly limited group-item interactions, we connect all groups as a network of overlapping sets (a.k.a. hypergraph), and treat the task of group preference learning as embedding hyperedges (i.e., user sets/groups) in a hypergraph, where an inductive hyperedge embedding method is proposed. To further enhance the group-level preference modeling, we develop a joint training strategy to learn both user-item and group-item interactions in the same process. We conduct extensive experiments on two real-world datasets, and the experimental results demonstrate the superiority of our proposed HyperGroup in comparison to the state-of-the-art baselines.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"17 1","pages":"1 - 27"},"PeriodicalIF":0.0,"publicationDate":"2021-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82777335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 44

Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots 历史很重要!基于多回合检索的聊天机器人个性化响应选择

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-03-17 DOI: 10.1145/3453183

Juntao Li, Chang Liu, Chongyang Tao, Zhangming Chan, Dongyan Zhao, Min Zhang, Rui Yan

Existing multi-turn context-response matching methods mainly concentrate on obtaining multi-level and multi-dimension representations and better interactions between context utterances and response. However, in real-place conversation scenarios, whether a response candidate is suitable not only counts on the given dialogue context but also other backgrounds, e.g., wording habits, user-specific dialogue history content. To fill the gap between these up-to-date methods and the real-world applications, we incorporate user-specific dialogue history into the response selection and propose a personalized hybrid matching network (PHMN). Our contributions are two-fold: (1) our model extracts personalized wording behaviors from user-specific dialogue history as extra matching information; (2) we perform hybrid representation learning on context-response utterances and explicitly incorporate a customized attention mechanism to extract vital information from context-response interactions so as to improve the accuracy of matching. We evaluate our model on two large datasets with user identification, i.e., personalized Ubuntu dialogue Corpus (P-Ubuntu) and personalized Weibo dataset (P-Weibo). Experimental results confirm that our method significantly outperforms several strong models by combining personalized attention, wording behaviors, and hybrid representation learning.

现有的多回合上下文-响应匹配方法主要集中在获取多层次、多维度的表征以及上下文话语与响应之间更好的交互。然而，在真实的对话场景中，一个候选者是否合适不仅取决于给定的对话上下文，还取决于其他背景，例如措辞习惯、用户特定的对话历史内容。为了填补这些最新方法与实际应用之间的差距，我们将用户特定的对话历史纳入响应选择，并提出了个性化的混合匹配网络(PHMN)。我们的贡献有两个方面:(1)我们的模型从用户特定的对话历史中提取个性化的措辞行为作为额外的匹配信息;(2)对语境-反应话语进行混合表征学习，明确引入自定义注意机制，从语境-反应交互中提取重要信息，提高匹配精度。我们在两个具有用户识别的大型数据集上评估我们的模型，即个性化Ubuntu对话语料库(P-Ubuntu)和个性化微博数据集(P-Weibo)。实验结果证实，我们的方法通过结合个性化注意力、措辞行为和混合表征学习，显著优于几种强大的模型。

{"title":"Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots","authors":"Juntao Li, Chang Liu, Chongyang Tao, Zhangming Chan, Dongyan Zhao, Min Zhang, Rui Yan","doi":"10.1145/3453183","DOIUrl":"https://doi.org/10.1145/3453183","url":null,"abstract":"Existing multi-turn context-response matching methods mainly concentrate on obtaining multi-level and multi-dimension representations and better interactions between context utterances and response. However, in real-place conversation scenarios, whether a response candidate is suitable not only counts on the given dialogue context but also other backgrounds, e.g., wording habits, user-specific dialogue history content. To fill the gap between these up-to-date methods and the real-world applications, we incorporate user-specific dialogue history into the response selection and propose a personalized hybrid matching network (PHMN). Our contributions are two-fold: (1) our model extracts personalized wording behaviors from user-specific dialogue history as extra matching information; (2) we perform hybrid representation learning on context-response utterances and explicitly incorporate a customized attention mechanism to extract vital information from context-response interactions so as to improve the accuracy of matching. We evaluate our model on two large datasets with user identification, i.e., personalized Ubuntu dialogue Corpus (P-Ubuntu) and personalized Weibo dataset (P-Weibo). Experimental results confirm that our method significantly outperforms several strong models by combining personalized attention, wording behaviors, and hybrid representation learning.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"6 1","pages":"1 - 25"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80269305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Semantic Models for the First-Stage Retrieval: A Comprehensive Review 第一阶段检索的语义模型:综述

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-03-08 DOI: 10.1145/3486250

Yinqiong Cai, Yixing Fan, Jiafeng Guo, Fei Sun, Ruqing Zhang, Xueqi Cheng

Multi-stage ranking pipelines have been a practical solution in modern search systems, where the first-stage retrieval is to return a subset of candidate documents and latter stages attempt to re-rank those candidates. Unlike re-ranking stages going through quick technique shifts over the past decades, the first-stage retrieval has long been dominated by classical term-based models. Unfortunately, these models suffer from the vocabulary mismatch problem, which may block re-ranking stages from relevant documents at the very beginning. Therefore, it has been a long-term desire to build semantic models for the first-stage retrieval that can achieve high recall efficiently. Recently, we have witnessed an explosive growth of research interests on the first-stage semantic retrieval models. We believe it is the right time to survey current status, learn from existing methods, and gain some insights for future development. In this article, we describe the current landscape of the first-stage retrieval models under a unified framework to clarify the connection between classical term-based retrieval methods, early semantic retrieval methods, and neural semantic retrieval methods. Moreover, we identify some open challenges and envision some future directions, with the hope of inspiring more research on these important yet less investigated topics.

多阶段排序管道在现代搜索系统中已经成为一种实用的解决方案，其中第一阶段检索是返回候选文档的子集，后一阶段尝试对这些候选文档重新排序。与过去几十年快速技术转换的重新排序阶段不同，第一阶段检索长期以来一直由经典的基于术语的模型主导。不幸的是，这些模型存在词汇表不匹配问题，这可能会在一开始就阻碍相关文档的重新排序阶段。因此，为第一阶段检索建立能够有效实现高召回率的语义模型一直是一个长期的愿望。近年来，人们对第一阶段语义检索模型的研究兴趣呈爆炸式增长。我们认为，现在正是审视现状、借鉴现有方法、为未来发展提供一些启示的好时机。在本文中，我们在一个统一的框架下描述了第一阶段检索模型的现状，以澄清经典的基于术语的检索方法、早期语义检索方法和神经语义检索方法之间的联系。此外，我们确定了一些开放的挑战，并设想了一些未来的方向，希望在这些重要但研究较少的主题上激发更多的研究。

{"title":"Semantic Models for the First-Stage Retrieval: A Comprehensive Review","authors":"Yinqiong Cai, Yixing Fan, Jiafeng Guo, Fei Sun, Ruqing Zhang, Xueqi Cheng","doi":"10.1145/3486250","DOIUrl":"https://doi.org/10.1145/3486250","url":null,"abstract":"Multi-stage ranking pipelines have been a practical solution in modern search systems, where the first-stage retrieval is to return a subset of candidate documents and latter stages attempt to re-rank those candidates. Unlike re-ranking stages going through quick technique shifts over the past decades, the first-stage retrieval has long been dominated by classical term-based models. Unfortunately, these models suffer from the vocabulary mismatch problem, which may block re-ranking stages from relevant documents at the very beginning. Therefore, it has been a long-term desire to build semantic models for the first-stage retrieval that can achieve high recall efficiently. Recently, we have witnessed an explosive growth of research interests on the first-stage semantic retrieval models. We believe it is the right time to survey current status, learn from existing methods, and gain some insights for future development. In this article, we describe the current landscape of the first-stage retrieval models under a unified framework to clarify the connection between classical term-based retrieval methods, early semantic retrieval methods, and neural semantic retrieval methods. Moreover, we identify some open challenges and envision some future directions, with the hope of inspiring more research on these important yet less investigated topics.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"183 1","pages":"1 - 42"},"PeriodicalIF":0.0,"publicationDate":"2021-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77597594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 68