ACM Transactions on Information Systems (TOIS)最新文献_第2页

Understanding the “Pathway” Towards a Searcher’s Learning Objective 了解通往搜索者学习目标的“路径”

ACM Transactions on Information Systems (TOIS)

Pub Date : 2022-01-12 DOI: 10.1145/3495222

Kelsey Urgo, Jaime Arguello

Search systems are often used to support learning-oriented goals. This trend has given rise to the “search-as-learning” movement, which proposes that search systems should be designed to support learning. To this end, an important research question is: How does a searcher’s type of learning objective (LO) influence their trajectory (or pathway) toward that objective? We report on a lab study (N = 36) in which participants gathered information to meet a specific type of LO. To characterize LOs and pathways, we leveraged Anderson and Krathwohl’s (A&K’s) taxonomy [3]. A&K’s taxonomy situates LOs at the intersection of two orthogonal dimensions: (1) cognitive process (CP) (remember, understand, apply, analyze, evaluate, and create) and (2) knowledge type (factual, conceptual, procedural, and metacognitive knowledge). Participants completed learning-oriented search tasks that varied along three CPs (apply, evaluate, and create) and three knowledge types (factual, conceptual, and procedural knowledge). A pathway is defined as a sequence of learning instances (e.g., subgoals) that were also each classified into cells from A&K’s taxonomy. Our study used a think-aloud protocol, and pathways were generated through a qualitative analysis of participants’ think-aloud comments and recorded screen activities. We investigate three research questions. First, in RQ1, we study the impact of the LO on pathway characteristics (e.g., pathway length). Second, in RQ2, we study the impact of the LO on the types of A&K cells traversed along the pathway. Third, in RQ3, we study common and uncommon transitions between A&K cells along pathways conditioned on the knowledge type of the objective. We discuss implications of our results for designing search systems to support learning.

搜索系统通常用于支持面向学习的目标。这一趋势引发了“搜索即学习”运动，该运动提出搜索系统应该被设计为支持学习。为此，一个重要的研究问题是:搜索者的学习目标(LO)类型如何影响他们实现该目标的轨迹(或途径)?我们报告了一项实验室研究(N = 36)，其中参与者收集信息以满足特定类型的LO。为了表征LOs和通路，我们利用了Anderson和Krathwohl (A&K)的分类法[3]。A&K的分类法将LOs置于两个正交维度的交叉点:(1)认知过程(CP)(记住、理解、应用、分析、评估和创造)和(2)知识类型(事实、概念、程序和元认知知识)。参与者完成了以学习为导向的搜索任务，这些任务沿着三个cp(应用、评估和创造)和三个知识类型(事实、概念和程序知识)变化。路径被定义为一系列学习实例(例如，子目标)，这些学习实例也被A&K的分类法分类为细胞。我们的研究使用了有声思考协议，并通过对参与者的有声思考评论和记录的屏幕活动进行定性分析来生成路径。我们调查了三个研究问题。首先，在RQ1中，我们研究了LO对路径特性(如路径长度)的影响。其次，在RQ2中，我们研究了LO对沿着该通路穿越的A&K细胞类型的影响。第三，在RQ3中，我们研究了A&K细胞之间沿目标知识类型的通路的常见和不常见转换。我们讨论了我们的结果对设计支持学习的搜索系统的影响。

{"title":"Understanding the “Pathway” Towards a Searcher’s Learning Objective","authors":"Kelsey Urgo, Jaime Arguello","doi":"10.1145/3495222","DOIUrl":"https://doi.org/10.1145/3495222","url":null,"abstract":"Search systems are often used to support learning-oriented goals. This trend has given rise to the “search-as-learning” movement, which proposes that search systems should be designed to support learning. To this end, an important research question is: How does a searcher’s type of learning objective (LO) influence their trajectory (or pathway) toward that objective? We report on a lab study (N = 36) in which participants gathered information to meet a specific type of LO. To characterize LOs and pathways, we leveraged Anderson and Krathwohl’s (A&K’s) taxonomy [3]. A&K’s taxonomy situates LOs at the intersection of two orthogonal dimensions: (1) cognitive process (CP) (remember, understand, apply, analyze, evaluate, and create) and (2) knowledge type (factual, conceptual, procedural, and metacognitive knowledge). Participants completed learning-oriented search tasks that varied along three CPs (apply, evaluate, and create) and three knowledge types (factual, conceptual, and procedural knowledge). A pathway is defined as a sequence of learning instances (e.g., subgoals) that were also each classified into cells from A&K’s taxonomy. Our study used a think-aloud protocol, and pathways were generated through a qualitative analysis of participants’ think-aloud comments and recorded screen activities. We investigate three research questions. First, in RQ1, we study the impact of the LO on pathway characteristics (e.g., pathway length). Second, in RQ2, we study the impact of the LO on the types of A&K cells traversed along the pathway. Third, in RQ3, we study common and uncommon transitions between A&K cells along pathways conditioned on the knowledge type of the objective. We discuss implications of our results for designing search systems to support learning.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"1 1","pages":"1 - 43"},"PeriodicalIF":0.0,"publicationDate":"2022-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84006104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

MiDTD: A Simple and Effective Distillation Framework for Distantly Supervised Relation Extraction 一种用于远程监督关系提取的简单有效的蒸馏框架

ACM Transactions on Information Systems (TOIS)

Pub Date : 2022-01-12 DOI: 10.1145/3503917

Rui Li, Cheng Yang, Tingwei Li, Sen Su

Relation extraction (RE), an important information extraction task, faced the great challenge brought by limited annotation data. To this end, distant supervision was proposed to automatically label RE data, and thus largely increased the number of annotated instances. Unfortunately, lots of noise relation annotations brought by automatic labeling become a new obstacle. Some recent studies have shown that the teacher-student framework of knowledge distillation can alleviate the interference of noise relation annotations via label softening. Nevertheless, we find that they still suffer from two problems: propagation of inaccurate dark knowledge and constraint of a unified distillation temperature. In this article, we propose a simple and effective Multi-instance Dynamic Temperature Distillation (MiDTD) framework, which is model-agnostic and mainly involves two modules: multi-instance target fusion (MiTF) and dynamic temperature regulation (DTR). MiTF combines the teacher’s predictions for multiple sentences with the same entity pair to amend the inaccurate dark knowledge in each student’s target. DTR allocates alterable distillation temperatures to different training instances to enable the softness of most student’s targets to be regulated to a moderate range. In experiments, we construct three concrete MiDTD instantiations with BERT, PCNN, and BiLSTM-based RE models, and the distilled students significantly outperform their teachers and the state-of-the-art (SOTA) methods.

关系抽取作为一项重要的信息抽取任务，由于标注数据的有限性，给关系抽取带来了巨大的挑战。为此，提出了远程监督来自动标记RE数据，从而大大增加了注释实例的数量。然而，自动标注带来的大量噪声关系标注成为自动标注的新障碍。最近的一些研究表明，师生知识蒸馏框架可以通过标签软化来缓解噪声关系标注的干扰。然而，我们发现它们仍然存在两个问题:传播不准确的暗知识和统一蒸馏温度的约束。本文提出了一种简单有效的多实例动态温度蒸馏(MiDTD)框架，该框架与模型无关，主要包括两个模块:多实例目标融合(MiTF)和动态温度调节(DTR)。MiTF将教师对多个句子的预测与同一实体对结合起来，以修正每个学生目标中不准确的暗知识。DTR为不同的训练实例分配可变的蒸馏温度，以使大多数学生目标的柔软度被调节到一个适中的范围。在实验中，我们用BERT、PCNN和基于bilstm的RE模型构建了三个具体的MiDTD实例，提炼出来的学生明显优于他们的老师和最先进的(SOTA)方法。

{"title":"MiDTD: A Simple and Effective Distillation Framework for Distantly Supervised Relation Extraction","authors":"Rui Li, Cheng Yang, Tingwei Li, Sen Su","doi":"10.1145/3503917","DOIUrl":"https://doi.org/10.1145/3503917","url":null,"abstract":"Relation extraction (RE), an important information extraction task, faced the great challenge brought by limited annotation data. To this end, distant supervision was proposed to automatically label RE data, and thus largely increased the number of annotated instances. Unfortunately, lots of noise relation annotations brought by automatic labeling become a new obstacle. Some recent studies have shown that the teacher-student framework of knowledge distillation can alleviate the interference of noise relation annotations via label softening. Nevertheless, we find that they still suffer from two problems: propagation of inaccurate dark knowledge and constraint of a unified distillation temperature. In this article, we propose a simple and effective Multi-instance Dynamic Temperature Distillation (MiDTD) framework, which is model-agnostic and mainly involves two modules: multi-instance target fusion (MiTF) and dynamic temperature regulation (DTR). MiTF combines the teacher’s predictions for multiple sentences with the same entity pair to amend the inaccurate dark knowledge in each student’s target. DTR allocates alterable distillation temperatures to different training instances to enable the softness of most student’s targets to be regulated to a moderate range. In experiments, we construct three concrete MiDTD instantiations with BERT, PCNN, and BiLSTM-based RE models, and the distilled students significantly outperform their teachers and the state-of-the-art (SOTA) methods.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"8 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2022-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77071427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Jointly Predicting Future Content in Multiple Social Media Sites Based on Multi-task Learning 基于多任务学习的多个社交媒体网站未来内容联合预测

ACM Transactions on Information Systems (TOIS)

Pub Date : 2022-01-12 DOI: 10.1145/3495530

Peng Zhang, Baoxi Liu, T. Lu, X. Ding, Hansu Gu, Ning Gu

User-generated contents (UGC) in social media are the direct expression of users’ interests, preferences, and opinions. User behavior prediction based on UGC has increasingly been investigated in recent years. Compared to learning a person’s behavioral patterns in each social media site separately, jointly predicting user behavior in multiple social media sites and complementing each other (cross-site user behavior prediction) can be more accurate. However, cross-site user behavior prediction based on UGC is a challenging task due to the difficulty of cross-site data sampling, the complexity of UGC modeling, and uncertainty of knowledge sharing among different sites. For these problems, we propose a Cross-Site Multi-Task (CSMT) learning method to jointly predict user behavior in multiple social media sites. CSMT mainly derives from the hierarchical attention network and multi-task learning. Using this method, the UGC in each social media site can obtain fine-grained representations in terms of words, topics, posts, hashtags, and time slices as well as the relevances among them, and prediction tasks in different social media sites can be jointly implemented and complement each other. By utilizing two cross-site datasets sampled from Weibo, Douban, Facebook, and Twitter, we validate our method’s superiority on several classification metrics compared with existing related methods.

社交媒体中的用户生成内容(User-generated content, UGC)是用户兴趣、偏好和观点的直接表达。近年来，基于UGC的用户行为预测研究越来越多。与单独学习一个人在每个社交媒体网站的行为模式相比，联合预测多个社交媒体网站的用户行为，并相互补充(跨站点用户行为预测)可以更准确。然而，由于跨站点数据采样的困难、UGC建模的复杂性以及不同站点之间知识共享的不确定性，基于UGC的跨站点用户行为预测是一项具有挑战性的任务。针对这些问题，我们提出了一种跨站点多任务(CSMT)学习方法来联合预测多个社交媒体网站中的用户行为。CSMT主要来源于分层注意网络和多任务学习。利用该方法，各社交媒体网站的UGC可以获得词、话题、帖子、标签、时间片等方面的细粒度表示，以及它们之间的相关性，不同社交媒体网站的预测任务可以共同实现，相互补充。通过使用来自微博、豆瓣、Facebook和Twitter的两个跨站点数据集，与现有的相关方法相比，我们验证了我们的方法在几个分类指标上的优越性。

{"title":"Jointly Predicting Future Content in Multiple Social Media Sites Based on Multi-task Learning","authors":"Peng Zhang, Baoxi Liu, T. Lu, X. Ding, Hansu Gu, Ning Gu","doi":"10.1145/3495530","DOIUrl":"https://doi.org/10.1145/3495530","url":null,"abstract":"User-generated contents (UGC) in social media are the direct expression of users’ interests, preferences, and opinions. User behavior prediction based on UGC has increasingly been investigated in recent years. Compared to learning a person’s behavioral patterns in each social media site separately, jointly predicting user behavior in multiple social media sites and complementing each other (cross-site user behavior prediction) can be more accurate. However, cross-site user behavior prediction based on UGC is a challenging task due to the difficulty of cross-site data sampling, the complexity of UGC modeling, and uncertainty of knowledge sharing among different sites. For these problems, we propose a Cross-Site Multi-Task (CSMT) learning method to jointly predict user behavior in multiple social media sites. CSMT mainly derives from the hierarchical attention network and multi-task learning. Using this method, the UGC in each social media site can obtain fine-grained representations in terms of words, topics, posts, hashtags, and time slices as well as the relevances among them, and prediction tasks in different social media sites can be jointly implemented and complement each other. By utilizing two cross-site datasets sampled from Weibo, Douban, Facebook, and Twitter, we validate our method’s superiority on several classification metrics compared with existing related methods.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"43 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2022-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90804755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Scaling High-Quality Pairwise Link-Based Similarity Retrieval on Billion-Edge Graphs 在十亿边图上扩展高质量的基于成对链接的相似性检索

ACM Transactions on Information Systems (TOIS)

Pub Date : 2022-01-12 DOI: 10.1145/3495209

Weiren Yu, J. Mccann, Chengyuan Zhang, H. Ferhatosmanoğlu

SimRank is an attractive link-based similarity measure used in fertile fields of Web search and sociometry. However, the existing deterministic method by Kusumoto et al. [24] for retrieving SimRank does not always produce high-quality similarity results, as it fails to accurately obtain diagonal correction matrix D. Moreover, SimRank has a “connectivity trait” problem: increasing the number of paths between a pair of nodes would decrease its similarity score. The best-known remedy, SimRank++ [1], cannot completely fix this problem, since its score would still be zero if there are no common in-neighbors between two nodes. In this article, we study fast high-quality link-based similarity search on billion-scale graphs. (1) We first devise a “varied-D” method to accurately compute SimRank in linear memory. We also aggregate duplicate computations, which reduces the time of [24] from quadratic to linear in the number of iterations. (2) We propose a novel “cosine-based” SimRank model to circumvent the “connectivity trait” problem. (3) To substantially speed up the partial-pairs “cosine-based” SimRank search on large graphs, we devise an efficient dimensionality reduction algorithm, PSR#, with guaranteed accuracy. (4) We give mathematical insights to the semantic difference between SimRank and its variant, and correct an argument in [24] that “if D is replaced by a scaled identity matrix (1-Ɣ)I, their top-K rankings will not be affected much”. (5) We propose a novel method that can accurately convert from Li et al. SimRank ~{S} to Jeh and Widom’s SimRank S. (6) We propose GSR#, a generalisation of our “cosine-based” SimRank model, to quantify pairwise similarities across two distinct graphs, unlike SimRank that would assess nodes across two graphs as completely dissimilar. Extensive experiments on various datasets demonstrate the superiority of our proposed approaches in terms of high search quality, computational efficiency, accuracy, and scalability on billion-edge graphs.

simmrank是一种有吸引力的基于链接的相似性度量，用于网络搜索和社会计量学的肥沃领域。然而，现有的Kusumoto等人[24]的确定性方法在检索simmrank时，由于不能准确地获得对角修正矩阵d，并不能得到高质量的相似度结果。此外，simmrank存在“连通性”问题:增加一对节点之间的路径数会降低其相似度得分。最著名的补救方法simrank++[1]不能完全解决这个问题，因为如果两个节点之间没有共同的内邻居，它的得分仍然是零。在本文中，我们研究了在十亿尺度图上快速高质量的基于链接的相似度搜索。(1)我们首先设计了一种“变d”方法来精确计算线性存储器中的simmrank。我们还聚合了重复计算，这将[24]的迭代次数从二次型减少到线性。(2)我们提出了一种新的基于余弦的simmrank模型来规避“连通性特征”问题。(3)为了大大加快在大图上的部分对“基于余弦”的simmrank搜索，我们设计了一种有效的降维算法psr#，并保证了准确性。(4)我们对simmrank及其变体之间的语义差异进行了数学分析，并纠正了[24]中的一个论点，即“如果D被缩放的单位矩阵(1-Ɣ)I取代，它们的top-K排名不会受到太大影响”。(5)我们提出了一种新的方法，可以准确地从Li等人那里转换。(6)我们提出了GSR#，这是我们的“基于余弦的”simmrank模型的推广，用于量化两个不同图之间的两两相似性，不像simmrank会评估两个图之间的节点完全不相似。在各种数据集上进行的大量实验表明，我们提出的方法在高搜索质量、计算效率、准确性和数十亿边图的可扩展性方面具有优势。

{"title":"Scaling High-Quality Pairwise Link-Based Similarity Retrieval on Billion-Edge Graphs","authors":"Weiren Yu, J. Mccann, Chengyuan Zhang, H. Ferhatosmanoğlu","doi":"10.1145/3495209","DOIUrl":"https://doi.org/10.1145/3495209","url":null,"abstract":"SimRank is an attractive link-based similarity measure used in fertile fields of Web search and sociometry. However, the existing deterministic method by Kusumoto et al. [24] for retrieving SimRank does not always produce high-quality similarity results, as it fails to accurately obtain diagonal correction matrix D. Moreover, SimRank has a “connectivity trait” problem: increasing the number of paths between a pair of nodes would decrease its similarity score. The best-known remedy, SimRank++ [1], cannot completely fix this problem, since its score would still be zero if there are no common in-neighbors between two nodes. In this article, we study fast high-quality link-based similarity search on billion-scale graphs. (1) We first devise a “varied-D” method to accurately compute SimRank in linear memory. We also aggregate duplicate computations, which reduces the time of [24] from quadratic to linear in the number of iterations. (2) We propose a novel “cosine-based” SimRank model to circumvent the “connectivity trait” problem. (3) To substantially speed up the partial-pairs “cosine-based” SimRank search on large graphs, we devise an efficient dimensionality reduction algorithm, PSR#, with guaranteed accuracy. (4) We give mathematical insights to the semantic difference between SimRank and its variant, and correct an argument in [24] that “if D is replaced by a scaled identity matrix (1-Ɣ)I, their top-K rankings will not be affected much”. (5) We propose a novel method that can accurately convert from Li et al. SimRank ~{S} to Jeh and Widom’s SimRank S. (6) We propose GSR#, a generalisation of our “cosine-based” SimRank model, to quantify pairwise similarities across two distinct graphs, unlike SimRank that would assess nodes across two graphs as completely dissimilar. Extensive experiments on various datasets demonstrate the superiority of our proposed approaches in terms of high search quality, computational efficiency, accuracy, and scalability on billion-edge graphs.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"34 1","pages":"1 - 45"},"PeriodicalIF":0.0,"publicationDate":"2022-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77502118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Simulating and Modeling the Risk of Conversational Search 会话搜索风险的模拟和建模

ACM Transactions on Information Systems (TOIS)

Pub Date : 2022-01-01 DOI: 10.1145/3507357

Zhenduo Wang, Qingyao Ai

In conversational search, agents can interact with users by asking clarifying questions to increase their chance of finding better results. Many recent works and shared tasks in both natural language processing and information retrieval communities have focused on identifying the need to ask clarifying questions and methodologies of generating them. These works assume that asking a clarifying question is a safe alternative to retrieving results. As existing conversational search models are far from perfect, it is possible and common that they could retrieve/generate bad clarifying questions. Asking too many clarifying questions can also drain a user’s patience when the user prefers searching efficiency over correctness. Hence, these models can backfire and harm a user’s search experience due to these risks from asking clarifying questions. In this work, we propose a simulation framework to simulate the risk of asking questions in conversational search and further revise a risk-aware conversational search model to control the risk. We show the model’s robustness and effectiveness through extensive experiments on three conversational datasets — MSDialog, Ubuntu Dialog Corpus, and Opendialkg — in which we compare it with multiple baselines. We show that the risk-control module can work with two different re-ranker models and outperform all of the baselines in most of our experiments.

在会话搜索中，代理可以通过询问澄清性问题来与用户互动，以增加他们找到更好结果的机会。在自然语言处理和信息检索社区中，许多最近的工作和共同的任务都集中在确定提出澄清性问题的需要和产生这些问题的方法上。这些工作假设问一个澄清的问题是一个安全的替代检索结果。由于现有的会话搜索模型还远远不够完美，它们可能会检索/生成不好的澄清问题。当用户更喜欢搜索效率而不是正确性时，问太多澄清性问题也会耗尽用户的耐心。因此，这些模型可能会适得其反，损害用户的搜索体验，因为这些风险来自于提出澄清性问题。在这项工作中，我们提出了一个模拟框架来模拟会话搜索中提问的风险，并进一步修改风险感知会话搜索模型来控制风险。我们通过在三个会话数据集(MSDialog、Ubuntu Dialog语料库和Opendialkg)上进行广泛的实验来展示模型的鲁棒性和有效性，并将其与多个基线进行比较。我们证明了风险控制模块可以与两种不同的重新排序模型一起工作，并且在我们的大多数实验中优于所有基线。

{"title":"Simulating and Modeling the Risk of Conversational Search","authors":"Zhenduo Wang, Qingyao Ai","doi":"10.1145/3507357","DOIUrl":"https://doi.org/10.1145/3507357","url":null,"abstract":"In conversational search, agents can interact with users by asking clarifying questions to increase their chance of finding better results. Many recent works and shared tasks in both natural language processing and information retrieval communities have focused on identifying the need to ask clarifying questions and methodologies of generating them. These works assume that asking a clarifying question is a safe alternative to retrieving results. As existing conversational search models are far from perfect, it is possible and common that they could retrieve/generate bad clarifying questions. Asking too many clarifying questions can also drain a user’s patience when the user prefers searching efficiency over correctness. Hence, these models can backfire and harm a user’s search experience due to these risks from asking clarifying questions. In this work, we propose a simulation framework to simulate the risk of asking questions in conversational search and further revise a risk-aware conversational search model to control the risk. We show the model’s robustness and effectiveness through extensive experiments on three conversational datasets — MSDialog, Ubuntu Dialog Corpus, and Opendialkg — in which we compare it with multiple baselines. We show that the risk-control module can work with two different re-ranker models and outperform all of the baselines in most of our experiments.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"64 1","pages":"1 - 33"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90345383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Are Topics Interesting or Not? An LDA-based Topic-graph Probabilistic Model for Web Search Personalization 话题是否有趣?基于lda的网络搜索个性化主题图概率模型

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-30 DOI: 10.1145/3476106

Jiashu Zhao, J. Huang, Hongbo Deng, Yi Chang, Long Xia

In this article, we propose a Latent Dirichlet Allocation– (LDA) based topic-graph probabilistic personalization model for Web search. This model represents a user graph in a latent topic graph and simultaneously estimates the probabilities that the user is interested in the topics, as well as the probabilities that the user is not interested in the topics. For a given query issued by the user, the webpages that have higher relevancy to the interested topics are promoted, and the webpages more relevant to the non-interesting topics are penalized. In particular, we simulate a user’s search intent by building two profiles: A positive user profile for the probabilities of the user is interested in the topics and a corresponding negative user profile for the probabilities of being not interested in the the topics. The profiles are estimated based on the user’s search logs. A clicked webpage is assumed to include interesting topics. A skipped (viewed but not clicked) webpage is assumed to cover some non-interesting topics to the user. Such estimations are performed in the latent topic space generated by LDA. Moreover, a new approach is proposed to estimate the correlation between a given query and the user’s search history so as to determine how much personalization should be considered for the query. We compare our proposed models with several strong baselines including state-of-the-art personalization approaches. Experiments conducted on a large-scale real user search log collection illustrate the effectiveness of the proposed models.

在本文中，我们提出了一个基于潜在狄利克雷分配(Latent Dirichlet Allocation, LDA)的Web搜索主题图概率个性化模型。该模型在潜在主题图中表示用户图，并同时估计用户对主题感兴趣的概率，以及用户对主题不感兴趣的概率。对于用户发出的给定查询，与感兴趣的主题相关度较高的网页会被提升，而与不感兴趣的主题相关度较高的网页会被扣分。特别是，我们通过构建两个配置文件来模拟用户的搜索意图:一个积极的用户配置文件用于用户对主题感兴趣的概率，一个相应的消极用户配置文件用于用户对主题不感兴趣的概率。概要文件是根据用户的搜索日志估计的。被点击的网页被认为包含有趣的主题。跳过的网页(已浏览但未点击)被认为涵盖了用户不感兴趣的主题。这种估计是在LDA生成的潜在主题空间中进行的。此外，提出了一种新的方法来估计给定查询与用户搜索历史之间的相关性，从而确定查询应该考虑多少个性化。我们将我们提出的模型与几个强大的基线进行比较，包括最先进的个性化方法。在大规模真实用户搜索日志收集上进行的实验证明了所提出模型的有效性。

{"title":"Are Topics Interesting or Not? An LDA-based Topic-graph Probabilistic Model for Web Search Personalization","authors":"Jiashu Zhao, J. Huang, Hongbo Deng, Yi Chang, Long Xia","doi":"10.1145/3476106","DOIUrl":"https://doi.org/10.1145/3476106","url":null,"abstract":"In this article, we propose a Latent Dirichlet Allocation– (LDA) based topic-graph probabilistic personalization model for Web search. This model represents a user graph in a latent topic graph and simultaneously estimates the probabilities that the user is interested in the topics, as well as the probabilities that the user is not interested in the topics. For a given query issued by the user, the webpages that have higher relevancy to the interested topics are promoted, and the webpages more relevant to the non-interesting topics are penalized. In particular, we simulate a user’s search intent by building two profiles: A positive user profile for the probabilities of the user is interested in the topics and a corresponding negative user profile for the probabilities of being not interested in the the topics. The profiles are estimated based on the user’s search logs. A clicked webpage is assumed to include interesting topics. A skipped (viewed but not clicked) webpage is assumed to cover some non-interesting topics to the user. Such estimations are performed in the latent topic space generated by LDA. Moreover, a new approach is proposed to estimate the correlation between a given query and the user’s search history so as to determine how much personalization should be considered for the query. We compare our proposed models with several strong baselines including state-of-the-art personalization approaches. Experiments conducted on a large-scale real user search log collection illustrate the effectiveness of the proposed models.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"14 1","pages":"1 - 24"},"PeriodicalIF":0.0,"publicationDate":"2021-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83646035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Toward Personalized Answer Generation in E-Commerce via Multi-perspective Preference Modeling 基于多视角偏好模型的电子商务个性化答案生成

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-27 DOI: 10.1145/3507782

Yang Deng, Yaliang Li, Wenxuan Zhang, Bolin Ding, W. Lam

Recently, Product Question Answering (PQA) on E-Commerce platforms has attracted increasing attention as it can act as an intelligent online shopping assistant and improve the customer shopping experience. Its key function, automatic answer generation for product-related questions, has been studied by aiming to generate content-preserving while question-related answers. However, an important characteristic of PQA, i.e., personalization, is neglected by existing methods. It is insufficient to provide the same “completely summarized” answer to all customers, since many customers are more willing to see personalized answers with customized information only for themselves, by taking into consideration their own preferences toward product aspects or information needs. To tackle this challenge, we propose a novel Personalized Answer GEneration method with multi-perspective preference modeling, which explores historical user-generated contents to model user preference for generating personalized answers in PQA. Specifically, we first retrieve question-related user history as external knowledge to model knowledge-level user preference. Then, we leverage the Gaussian Softmax distribution model to capture latent aspect-level user preference. Finally, we develop a persona-aware pointer network to generate personalized answers in terms of both content and style by utilizing personal user preference and dynamic user vocabulary. Experimental results on real-world E-Commerce QA datasets demonstrate that the proposed method outperforms existing methods by generating informative and customized answers and show that answer generation in E-Commerce can benefit from personalization.

近年来，电子商务平台上的产品问答(PQA)越来越受到人们的关注，因为它可以作为智能的网上购物助手，改善顾客的购物体验。对其关键功能——产品相关问题的自动答案生成进行了研究，旨在生成内容保留的问题相关答案。然而，现有的方法忽略了PQA的一个重要特征，即个性化。给所有的客户提供相同的“完全总结”的答案是不够的，因为很多客户更愿意看到个性化的答案和定制的信息，只是为了他们自己，考虑到自己对产品方面的偏好或信息需求。为了解决这一挑战，我们提出了一种新的个性化答案生成方法，该方法采用多角度偏好建模，通过探索历史用户生成的内容来建模用户偏好，从而在PQA中生成个性化答案。具体来说，我们首先检索与问题相关的用户历史作为外部知识来建模知识级用户偏好。然后，我们利用高斯Softmax分布模型来捕获潜在的方面级用户偏好。最后，我们开发了一个角色感知的指针网络，利用个人用户偏好和动态用户词汇，在内容和风格方面生成个性化的答案。在实际电子商务QA数据集上的实验结果表明，本文提出的方法在生成信息丰富和个性化的答案方面优于现有方法，并表明电子商务中的答案生成可以从个性化中受益。

{"title":"Toward Personalized Answer Generation in E-Commerce via Multi-perspective Preference Modeling","authors":"Yang Deng, Yaliang Li, Wenxuan Zhang, Bolin Ding, W. Lam","doi":"10.1145/3507782","DOIUrl":"https://doi.org/10.1145/3507782","url":null,"abstract":"Recently, Product Question Answering (PQA) on E-Commerce platforms has attracted increasing attention as it can act as an intelligent online shopping assistant and improve the customer shopping experience. Its key function, automatic answer generation for product-related questions, has been studied by aiming to generate content-preserving while question-related answers. However, an important characteristic of PQA, i.e., personalization, is neglected by existing methods. It is insufficient to provide the same “completely summarized” answer to all customers, since many customers are more willing to see personalized answers with customized information only for themselves, by taking into consideration their own preferences toward product aspects or information needs. To tackle this challenge, we propose a novel Personalized Answer GEneration method with multi-perspective preference modeling, which explores historical user-generated contents to model user preference for generating personalized answers in PQA. Specifically, we first retrieve question-related user history as external knowledge to model knowledge-level user preference. Then, we leverage the Gaussian Softmax distribution model to capture latent aspect-level user preference. Finally, we develop a persona-aware pointer network to generate personalized answers in terms of both content and style by utilizing personal user preference and dynamic user vocabulary. Experimental results on real-world E-Commerce QA datasets demonstrate that the proposed method outperforms existing methods by generating informative and customized answers and show that answer generation in E-Commerce can benefit from personalization.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"54 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2021-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80427754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

LkeRec: Toward Lightweight End-to-End Joint Representation Learning for Building Accurate and Effective Recommendation 面向构建准确有效推荐的轻量级端到端联合表示学习

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-14 DOI: 10.1145/3486673

Surong Yan, Kwei-Jay Lin, Xiaolin Zheng, Haosen Wang

Explicit and implicit knowledge about users and items have been used to describe complex and heterogeneous side information for recommender systems (RSs). Many existing methods use knowledge graph embedding (KGE) to learn the representation of a user-item knowledge graph (KG) in low-dimensional space. In this article, we propose a lightweight end-to-end joint learning framework for fusing the tasks of KGE and RSs at the model level. Our method proposes a lightweight KG embedding method by using bidirectional bijection relation-type modeling to enable scalability for large graphs while using self-adaptive negative sampling to optimize negative sample generating. Our method further generates the integrated views for users and items based on relation-types to explicitly model users’ preferences and items’ features, respectively. Finally, we add virtual “recommendation” relations between the integrated views of users and items to model the preferences of users on items, seamlessly integrating RS with user-item KG over a unified graph. Experimental results on multiple datasets and benchmarks show that our method can achieve a better accuracy of recommendation compared with existing state-of-the-art methods. Complexity and runtime analysis suggests that our method can gain a lower time and space complexity than most of existing methods and improve scalability.

关于用户和项目的显式和隐式知识已被用于描述推荐系统(RSs)的复杂和异构侧信息。现有的许多方法使用知识图嵌入(KGE)来学习用户-项目知识图在低维空间中的表示。在本文中，我们提出了一个轻量级的端到端联合学习框架，用于在模型级别融合KGE和RSs的任务。我们的方法提出了一种轻量级的KG嵌入方法，该方法使用双向双射关系型建模来实现大型图的可扩展性，同时使用自适应负采样来优化负样本的生成。我们的方法进一步基于关系类型为用户和项目生成集成视图，分别显式地为用户的偏好和项目的特征建模。最后，我们在用户和物品的集成视图之间添加虚拟“推荐”关系，以模拟用户对物品的偏好，在统一的图上将RS与用户-物品KG无缝集成。在多个数据集和基准测试上的实验结果表明，与现有的最先进的推荐方法相比，我们的方法可以达到更好的推荐精度。复杂度和运行时分析表明，该方法比大多数现有方法具有更低的时间和空间复杂度，并提高了可扩展性。

{"title":"LkeRec: Toward Lightweight End-to-End Joint Representation Learning for Building Accurate and Effective Recommendation","authors":"Surong Yan, Kwei-Jay Lin, Xiaolin Zheng, Haosen Wang","doi":"10.1145/3486673","DOIUrl":"https://doi.org/10.1145/3486673","url":null,"abstract":"Explicit and implicit knowledge about users and items have been used to describe complex and heterogeneous side information for recommender systems (RSs). Many existing methods use knowledge graph embedding (KGE) to learn the representation of a user-item knowledge graph (KG) in low-dimensional space. In this article, we propose a lightweight end-to-end joint learning framework for fusing the tasks of KGE and RSs at the model level. Our method proposes a lightweight KG embedding method by using bidirectional bijection relation-type modeling to enable scalability for large graphs while using self-adaptive negative sampling to optimize negative sample generating. Our method further generates the integrated views for users and items based on relation-types to explicitly model users’ preferences and items’ features, respectively. Finally, we add virtual “recommendation” relations between the integrated views of users and items to model the preferences of users on items, seamlessly integrating RS with user-item KG over a unified graph. Experimental results on multiple datasets and benchmarks show that our method can achieve a better accuracy of recommendation compared with existing state-of-the-art methods. Complexity and runtime analysis suggests that our method can gain a lower time and space complexity than most of existing methods and improve scalability.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"12 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75139229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Introduction to the Special Section on Graph Technologies for User Modeling and Recommendation, Part 2 介绍用于用户建模和推荐的图形技术特别部分，第2部分

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-14 DOI: 10.1145/3490180

Xiangnan He, Z. Ren, Emine Yilmaz, Marc Najork, Tat-seng Chua

As a powerful data structure that represents the relationships among data objects, graph-structure data is ubiquitous in real-world applications. Learning on graph-structure data has become a hot spot in machine learning and data mining. Since most data in user-oriented services can be naturally organized as graphs, graph technologies have attracted increasing attention from IR community and achieved immense success, especially in two major research topics—user modeling and recommendation. In the recent decade, the IR and related communities have witnessed a number of major contributions to the field of graph learning. They include but not limited to collaborative filtering (e.g., He et al. [2020], Wang et al. [2019b], Wu et al. [2021], and Ying et al. [2018]), knowledge-aware recommendation (e.g., Cao et al. [2019] andWang et al. [2018, 2019a]), user profiling and demographic inference (e.g., Chen et al. [2019] and Rahimi et al. [2018]), social and sequential recommendation (e.g., Wang et al. [2020b] and Wu et al. [2019a, b]), bias and fairness (e.g., Rahman et al. [2019], Zhang et al. [2021a], and Zheng et al. [2021]). The growing body of work in this area has been supplemented by an increasing number of recent workshops (e.g., Cui et al. [2021], Ding et al. [2020], Jannach et al. [2020], and Yin et al. [2021]) and tutorials (e.g., Chen et al. [2020], Mehrotra et al. [2020], Tang and Dong [2019], Wang et al. [2020a], and Xu et al. [2018]). Despite such great

作为表示数据对象之间关系的强大数据结构，图结构数据在实际应用程序中无处不在。图结构数据的学习已成为机器学习和数据挖掘领域的研究热点。由于面向用户的服务中的大多数数据都可以自然地组织成图形，因此图形技术越来越受到IR社区的关注，并取得了巨大的成功，特别是在用户建模和推荐这两个主要研究课题上。在最近的十年中，IR和相关社区见证了对图学习领域的许多重大贡献。包括但不限于协同过滤(例如,他et al。[2020],王et al。(2019 b),吴et al。[2021],并应et al . [2018]), knowledge-aware建议(例如,曹et al。[2019]andWang et al .(2018, 2019)),用户分析和统计推断(例如,Chen等人[2019]和拉希米et al .[2018]),社会和顺序推荐王(例如,et al。(2020 b)和吴et al . (2019 a, b)),偏见和公平(例如,拉赫曼et al。[2019],Zhang et al。(2021),郑等[2021])。最近越来越多的研讨会(例如，Cui等人[2021]、Ding等人[2020]、Jannach等人[2020]和Yin等人[2021])和教程(例如，Chen等人[2020]、Mehrotra等人[2020]、Tang和Dong[2019]、Wang等人[2020a]和Xu等人[2018])补充了这一领域不断增长的工作。尽管如此伟大

{"title":"Introduction to the Special Section on Graph Technologies for User Modeling and Recommendation, Part 2","authors":"Xiangnan He, Z. Ren, Emine Yilmaz, Marc Najork, Tat-seng Chua","doi":"10.1145/3490180","DOIUrl":"https://doi.org/10.1145/3490180","url":null,"abstract":"As a powerful data structure that represents the relationships among data objects, graph-structure data is ubiquitous in real-world applications. Learning on graph-structure data has become a hot spot in machine learning and data mining. Since most data in user-oriented services can be naturally organized as graphs, graph technologies have attracted increasing attention from IR community and achieved immense success, especially in two major research topics—user modeling and recommendation. In the recent decade, the IR and related communities have witnessed a number of major contributions to the field of graph learning. They include but not limited to collaborative filtering (e.g., He et al. [2020], Wang et al. [2019b], Wu et al. [2021], and Ying et al. [2018]), knowledge-aware recommendation (e.g., Cao et al. [2019] andWang et al. [2018, 2019a]), user profiling and demographic inference (e.g., Chen et al. [2019] and Rahimi et al. [2018]), social and sequential recommendation (e.g., Wang et al. [2020b] and Wu et al. [2019a, b]), bias and fairness (e.g., Rahman et al. [2019], Zhang et al. [2021a], and Zheng et al. [2021]). The growing body of work in this area has been supplemented by an increasing number of recent workshops (e.g., Cui et al. [2021], Ding et al. [2020], Jannach et al. [2020], and Yin et al. [2021]) and tutorials (e.g., Chen et al. [2020], Mehrotra et al. [2020], Tang and Dong [2019], Wang et al. [2020a], and Xu et al. [2018]). Despite such great","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"266 1","pages":"1 - 5"},"PeriodicalIF":0.0,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77493648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

“What Can I Cook with these Ingredients?” - Understanding Cooking-Related Information Needs in Conversational Search “我能用这些食材做什么?”——在对话搜索中理解烹饪相关信息需求

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-09 DOI: 10.1145/3498330

Alexander Frummet, David Elsweiler, Bernd Ludwig

As conversational search becomes more pervasive, it becomes increasingly important to understand the users’ underlying information needs when they converse with such systems in diverse domains. We conduct an in situ study to understand information needs arising in a home cooking context as well as how they are verbally communicated to an assistant. A human experimenter plays this role in our study. Based on the transcriptions of utterances, we derive a detailed hierarchical taxonomy of diverse information needs occurring in this context, which require different levels of assistance to be solved. The taxonomy shows that needs can be communicated through different linguistic means and require different amounts of context to be understood. In a second contribution, we perform classification experiments to determine the feasibility of predicting the type of information need a user has during a dialogue using the turn provided. For this multi-label classification problem, we achieve average F1 measures of 40% using BERT-based models. We demonstrate with examples which types of needs are difficult to predict and show why, concluding that models need to include more context information in order to improve both information need classification and assistance to make such systems usable.

随着会话搜索变得越来越普遍，当用户与不同领域的此类系统进行对话时，理解用户的潜在信息需求变得越来越重要。我们进行了一项现场研究，以了解在家庭烹饪环境中产生的信息需求，以及如何将这些需求口头传达给助理。人类实验者在我们的研究中扮演着这个角色。基于话语的转录，我们得出了在这种情况下发生的不同信息需求的详细层次分类，这些需求需要不同程度的帮助来解决。该分类表明，需求可以通过不同的语言手段进行沟通，并且需要不同数量的上下文才能被理解。在第二个贡献中，我们执行分类实验，以确定使用提供的回合预测用户在对话期间所需信息类型的可行性。对于这个多标签分类问题，我们使用基于bert的模型实现了40%的平均F1度量。我们用例子说明了哪些类型的需求是难以预测的，并说明了原因，结论是模型需要包括更多的上下文信息，以便改进信息需求分类和帮助，使此类系统可用。

{"title":"“What Can I Cook with these Ingredients?” - Understanding Cooking-Related Information Needs in Conversational Search","authors":"Alexander Frummet, David Elsweiler, Bernd Ludwig","doi":"10.1145/3498330","DOIUrl":"https://doi.org/10.1145/3498330","url":null,"abstract":"As conversational search becomes more pervasive, it becomes increasingly important to understand the users’ underlying information needs when they converse with such systems in diverse domains. We conduct an in situ study to understand information needs arising in a home cooking context as well as how they are verbally communicated to an assistant. A human experimenter plays this role in our study. Based on the transcriptions of utterances, we derive a detailed hierarchical taxonomy of diverse information needs occurring in this context, which require different levels of assistance to be solved. The taxonomy shows that needs can be communicated through different linguistic means and require different amounts of context to be understood. In a second contribution, we perform classification experiments to determine the feasibility of predicting the type of information need a user has during a dialogue using the turn provided. For this multi-label classification problem, we achieve average F1 measures of 40% using BERT-based models. We demonstrate with examples which types of needs are difficult to predict and show why, concluding that models need to include more context information in order to improve both information need classification and assistance to make such systems usable.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"38 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73263535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9