首页 > 最新文献

ACM Transactions on Information Systems最新文献

英文 中文
Using Neural and Graph Neural Recommender systems to Overcome Choice Overload: Evidence from a Music Education Platform 使用神经和图神经推荐系统克服选择过载:来自音乐教育平台的证据
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-20 DOI: 10.1145/3637873
Hédi Razgallah, Michalis Vlachos, Ahmad Ajalloeian, Ninghao Liu, Johannes Schneider, Alexis Steinmann

The application of recommendation technologies has been crucial in the promotion of physical and digital content across numerous global platforms such as Amazon, Apple, and Netflix. Our study aims to investigate the advantages of employing recommendation technologies on educational platforms, with a particular focus on an educational platform for learning and practicing music.

Our research is based on data from Tomplay, a music platform that offers sheet music with professional audio recordings, enabling users to discover and practice music content at varying levels of difficulty. Through our analysis, we emphasize the distinct interaction patterns on educational platforms like Tomplay, which we compare with other commonly used recommendation datasets. We find that interactions are comparatively sparse on educational platforms, with users often focusing on specific content as they learn, rather than interacting with a broader range of material. Therefore, our primary goal is to address the issue of data sparsity. We achieve this through entity resolution principles and propose a neural network (NN) based recommendation model. Further, we improve this model by utilizing graph neural networks (GNNs), which provide superior predictive accuracy compared to NNs. Notably, our study demonstrates that GNNs are highly effective even for users with little or no historical preferences (cold-start problem).

Our cold-start experiments also provide valuable insights into an independent issue, namely the number of historical interactions needed by a recommendation model to gain a comprehensive understanding of a user. Our findings demonstrate that a platform acquires a solid knowledge of a user’s general preferences and characteristics with 50 past interactions. Overall, our study makes significant contributions to information systems research on business analytics and prescriptive analytics. Moreover, our framework and evaluation results offer implications for various stakeholders, including online educational institutions, education policymakers, and learning platform users.

在亚马逊、苹果和 Netflix 等众多全球平台上推广实体和数字内容时,推荐技术的应用至关重要。我们的研究旨在探讨在教育平台上应用推荐技术的优势,尤其关注音乐学习和练习的教育平台。我们的研究基于 Tomplay 的数据,Tomplay 是一个音乐平台,提供带有专业录音的乐谱,使用户能够发现并练习不同难度的音乐内容。通过分析,我们强调了 Tomplay 等教育平台上独特的交互模式,并将其与其他常用的推荐数据集进行了比较。我们发现,教育平台上的交互相对稀少,用户在学习过程中往往只关注特定内容,而不是与更广泛的材料进行交互。因此,我们的首要目标是解决数据稀少的问题。我们通过实体解析原则来实现这一目标,并提出了一个基于神经网络 (NN) 的推荐模型。此外,我们还利用图神经网络(GNN)改进了这一模型,与神经网络相比,图神经网络具有更高的预测准确性。值得注意的是,我们的研究表明,即使用户很少或没有历史偏好(冷启动问题),图神经网络也非常有效。我们的冷启动实验还为一个独立问题提供了有价值的见解,即推荐模型全面了解用户所需的历史交互数量。我们的研究结果表明,一个平台通过过去 50 次互动就能获得关于用户一般偏好和特征的可靠知识。总之,我们的研究为商业分析和描述性分析方面的信息系统研究做出了重大贡献。此外,我们的框架和评估结果对在线教育机构、教育政策制定者和学习平台用户等各利益相关方都有借鉴意义。
{"title":"Using Neural and Graph Neural Recommender systems to Overcome Choice Overload: Evidence from a Music Education Platform","authors":"Hédi Razgallah, Michalis Vlachos, Ahmad Ajalloeian, Ninghao Liu, Johannes Schneider, Alexis Steinmann","doi":"10.1145/3637873","DOIUrl":"https://doi.org/10.1145/3637873","url":null,"abstract":"<p>The application of recommendation technologies has been crucial in the promotion of physical and digital content across numerous global platforms such as Amazon, Apple, and Netflix. Our study aims to investigate the advantages of employing recommendation technologies on educational platforms, with a particular focus on an educational platform for learning and practicing music. </p><p>Our research is based on data from Tomplay, a music platform that offers sheet music with professional audio recordings, enabling users to discover and practice music content at varying levels of difficulty. Through our analysis, we emphasize the distinct interaction patterns on educational platforms like Tomplay, which we compare with other commonly used recommendation datasets. We find that interactions are comparatively sparse on educational platforms, with users often focusing on specific content as they learn, rather than interacting with a broader range of material. Therefore, our primary goal is to address the issue of data sparsity. We achieve this through entity resolution principles and propose a neural network (NN) based recommendation model. Further, we improve this model by utilizing graph neural networks (GNNs), which provide superior predictive accuracy compared to NNs. Notably, our study demonstrates that GNNs are highly effective even for users with little or no historical preferences (cold-start problem). </p><p>Our cold-start experiments also provide valuable insights into an independent issue, namely the number of historical interactions needed by a recommendation model to gain a comprehensive understanding of a user. Our findings demonstrate that a platform acquires a solid knowledge of a user’s general preferences and characteristics with 50 past interactions. Overall, our study makes significant contributions to information systems research on business analytics and prescriptive analytics. Moreover, our framework and evaluation results offer implications for various stakeholders, including online educational institutions, education policymakers, and learning platform users.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138817699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Impact of Showing Evidence from Peers in Crowdsourced Truthfulness Assessments 论在众包真实性评估中展示同行证据的影响
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-19 DOI: 10.1145/3637872
Jiechen Xu, Lei Han, Shazia Sadiq, Gianluca Demartini

Misinformation has been rapidly spreading online. The common approach to deal with it is deploying expert fact-checkers that follow forensic processes to identify the veracity of statements. Unfortunately, such an approach does not scale well. To deal with this, crowdsourcing has been looked at as an opportunity to complement the work done by trained journalists. In this paper, we look at the effect of presenting the crowd with evidence from others while judging the veracity of statements. We implement various variants of the judgment task design to understand if and how the presented evidence may or may not affect the way crowd workers judge truthfulness and their performance. Our results show that, in certain cases, the presented evidence and the way in which it is presented may mislead crowd workers who would otherwise be more accurate if judging independently from others. Those who make appropriate use of the provided evidence, however, can benefit from it and generate better judgments.

错误信息在网上迅速传播。常见的应对方法是部署专家事实核查人员,按照取证流程识别言论的真实性。遗憾的是,这种方法不能很好地扩展。为了解决这个问题,众包被视为补充训练有素的记者工作的一个机会。在本文中,我们研究了在判断言论的真实性时,向人群展示他人证据的效果。我们实施了各种变体的判断任务设计,以了解呈现的证据是否会影响或如何影响人群工作者判断真实性的方式及其表现。我们的结果表明,在某些情况下,提供的证据和提供证据的方式可能会误导人群工作者,否则他们在独立于他人进行判断时会更加准确。然而,那些适当利用所提供证据的人却能从中受益,做出更好的判断。
{"title":"On the Impact of Showing Evidence from Peers in Crowdsourced Truthfulness Assessments","authors":"Jiechen Xu, Lei Han, Shazia Sadiq, Gianluca Demartini","doi":"10.1145/3637872","DOIUrl":"https://doi.org/10.1145/3637872","url":null,"abstract":"<p>Misinformation has been rapidly spreading online. The common approach to deal with it is deploying expert fact-checkers that follow forensic processes to identify the veracity of statements. Unfortunately, such an approach does not scale well. To deal with this, crowdsourcing has been looked at as an opportunity to complement the work done by trained journalists. In this paper, we look at the effect of presenting the crowd with evidence from others while judging the veracity of statements. We implement various variants of the judgment task design to understand if and how the presented evidence may or may not affect the way crowd workers judge truthfulness and their performance. Our results show that, in certain cases, the presented evidence and the way in which it is presented may mislead crowd workers who would otherwise be more accurate if judging independently from others. Those who make appropriate use of the provided evidence, however, can benefit from it and generate better judgments.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138743466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SMLP4Rec: An Efficient all-MLP Architecture for Sequential Recommendations SMLP4Rec:顺序推荐的高效全 MLP 架构
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-18 DOI: 10.1145/3637871
Jingtong Gao, Xiangyu Zhao, Muyang Li, Minghao Zhao, Runze Wu, Ruocheng Guo, Yiding Liu, Dawei Yin

Self-attention models have achieved the state-of-the-art performance in sequential recommender systems by capturing the sequential dependencies among user-item interactions. However, they rely on adding positional embeddings to the item sequence to retain the sequential information, which may break the semantics of item embeddings due to the heterogeneity between these two types of embeddings. In addition, most existing works assume that such dependencies exist solely in the item embeddings, but neglect their existence among the item features. In our previous study, we proposed a novel sequential recommendation model, i.e., MLP4Rec, based on the recent advances of MLP-Mixer architectures, which is naturally sensitive to the order of items in a sequence because matrix elements related to different positions of a sequence will be given different weights in training. We developed a tri-directional fusion scheme to coherently capture sequential, cross-channel, and cross-feature correlations with linear computational complexity as well as much fewer model parameters than existing self-attention methods. However, the cascading mixer structure, the large number of normalization layers between different mixer layers, and the noise generated by these operations limit the efficiency of information extraction and the effectiveness of MLP4Rec. In this extended version, we propose a novel framework – SMLP4Rec for sequential recommendation to address the aforementioned issues. The new framework changes the flawed cascading structure to a parallel mode, and integrates normalization layers to minimize their impact on the model’s efficiency while maximizing their effectiveness. As a result, the training speed and prediction accuracy of SMLP4Rec are vastly improved in comparison to MLP4Rec. Extensive experimental results demonstrate that the proposed method is significantly superior to the state-of-the-art approaches. The implementation code is available online to ease reproducibility.

自我关注模型通过捕捉用户与项目交互之间的顺序依赖关系,在顺序推荐系统中取得了最先进的性能。然而,它们依赖于在项目序列中添加位置嵌入来保留序列信息,这可能会破坏项目嵌入的语义,因为这两种类型的嵌入之间存在异质性。此外,现有的大多数研究都假定这种依赖关系只存在于项目嵌入中,而忽略了它们在项目特征中的存在。在之前的研究中,我们基于 MLP-Mixer 体系结构的最新进展,提出了一种新颖的序列推荐模型,即 MLP4Rec,它对序列中项目的顺序具有天然的敏感性,因为与序列中不同位置相关的矩阵元素在训练中会被赋予不同的权重。我们开发了一种三向融合方案,以线性计算复杂度和比现有自注意方法更少的模型参数,连贯地捕捉序列、跨信道和跨特征相关性。然而,级联混频器结构、不同混频器层之间的大量归一化层以及这些操作产生的噪声限制了信息提取的效率和 MLP4Rec 的有效性。在本扩展版本中,我们提出了一种用于顺序推荐的新型框架--SMLP4Rec,以解决上述问题。新框架将有缺陷的级联结构改为并行模式,并整合了归一化层,以尽量减少其对模型效率的影响,同时最大限度地提高其有效性。因此,与 MLP4Rec 相比,SMLP4Rec 的训练速度和预测准确性都有了大幅提高。广泛的实验结果表明,所提出的方法明显优于最先进的方法。实现代码可在线获取,以方便重现。
{"title":"SMLP4Rec: An Efficient all-MLP Architecture for Sequential Recommendations","authors":"Jingtong Gao, Xiangyu Zhao, Muyang Li, Minghao Zhao, Runze Wu, Ruocheng Guo, Yiding Liu, Dawei Yin","doi":"10.1145/3637871","DOIUrl":"https://doi.org/10.1145/3637871","url":null,"abstract":"<p>Self-attention models have achieved the state-of-the-art performance in sequential recommender systems by capturing the sequential dependencies among user-item interactions. However, they rely on adding positional embeddings to the item sequence to retain the sequential information, which may break the semantics of item embeddings due to the heterogeneity between these two types of embeddings. In addition, most existing works assume that such dependencies exist solely in the item embeddings, but neglect their existence among the item features. In our previous study, we proposed a novel sequential recommendation model, i.e., MLP4Rec, based on the recent advances of MLP-Mixer architectures, which is naturally sensitive to the order of items in a sequence because matrix elements related to different positions of a sequence will be given different weights in training. We developed a tri-directional fusion scheme to coherently capture sequential, cross-channel, and cross-feature correlations with linear computational complexity as well as much fewer model parameters than existing self-attention methods. However, the cascading mixer structure, the large number of normalization layers between different mixer layers, and the noise generated by these operations limit the efficiency of information extraction and the effectiveness of MLP4Rec. In this extended version, we propose a novel framework – SMLP4Rec for sequential recommendation to address the aforementioned issues. The new framework changes the flawed cascading structure to a parallel mode, and integrates normalization layers to minimize their impact on the model’s efficiency while maximizing their effectiveness. As a result, the training speed and prediction accuracy of SMLP4Rec are vastly improved in comparison to MLP4Rec. Extensive experimental results demonstrate that the proposed method is significantly superior to the state-of-the-art approaches. The implementation code is available online to ease reproducibility.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138717249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dense Text Retrieval based on Pretrained Language Models: A Survey 基于预训练语言模型的密集文本检索:调查
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-18 DOI: 10.1145/3637870
Wayne Xin Zhao, Jing Liu, Ruiyang Ren, Ji-Rong Wen

Text retrieval is a long-standing research topic on information seeking, where a system is required to return relevant information resources to user’s queries in natural language. From heuristic-based retrieval methods to learning-based ranking functions, the underlying retrieval models have been continually evolved with the ever-lasting technical innovation. To design effective retrieval models, a key point lies in how to learn text representations and model the relevance matching. The recent success of pretrained language models (PLM) sheds light on developing more capable text retrieval approaches by leveraging the excellent modeling capacity of PLMs. With powerful PLMs, we can effectively learn the semantic representations of queries and texts in the latent representation space, and further construct the semantic matching function between the dense vectors for relevance modeling. Such a retrieval approach is called dense retrieval, since it employs dense vectors to represent the texts. Considering the rapid progress on dense retrieval, this survey systematically reviews the recent progress on PLM-based dense retrieval. Different from previous surveys on dense retrieval, we take a new perspective to organize the related studies by four major aspects, including architecture, training, indexing and integration, and thoroughly summarize the mainstream techniques for each aspect. We extensively collect the recent advances on this topic, and include 300+ reference papers. To support our survey, we create a website for providing useful resources, and release a code repository for dense retrieval. This survey aims to provide a comprehensive, practical reference focused on the major progress for dense text retrieval.

文本检索是信息搜索领域的一个长期研究课题,系统需要根据用户的自然语言查询返回相关的信息资源。从基于启发式的检索方法到基于学习的排序功能,随着技术的不断创新,基础检索模型也在不断发展。要设计有效的检索模型,关键在于如何学习文本表征和建立相关性匹配模型。最近,预训练语言模型(PLM)取得了成功,这为我们利用 PLM 的出色建模能力开发更强大的文本检索方法提供了启示。利用功能强大的 PLM,我们可以有效地学习潜在表征空间中查询和文本的语义表征,并进一步构建密集向量之间的语义匹配函数,从而建立相关性模型。这种检索方法采用密集向量来表示文本,因此被称为密集检索。考虑到高密度检索的快速发展,本调查系统地回顾了基于 PLM 的高密度检索的最新进展。与以往的密集检索研究不同,我们从一个全新的视角出发,从架构、训练、索引和集成四个主要方面对相关研究进行了梳理,并对每个方面的主流技术进行了全面总结。我们广泛收集了该主题的最新进展,并收录了 300 多篇参考文献。为了支持我们的调查,我们创建了一个提供有用资源的网站,并发布了一个用于密集检索的代码库。本调查旨在为密集文本检索的主要进展提供全面、实用的参考。
{"title":"Dense Text Retrieval based on Pretrained Language Models: A Survey","authors":"Wayne Xin Zhao, Jing Liu, Ruiyang Ren, Ji-Rong Wen","doi":"10.1145/3637870","DOIUrl":"https://doi.org/10.1145/3637870","url":null,"abstract":"<p>Text retrieval is a long-standing research topic on information seeking, where a system is required to return relevant information resources to user’s queries in natural language. From heuristic-based retrieval methods to learning-based ranking functions, the underlying retrieval models have been continually evolved with the ever-lasting technical innovation. To design effective retrieval models, a key point lies in how to learn text representations and model the relevance matching. The recent success of pretrained language models (PLM) sheds light on developing more capable text retrieval approaches by leveraging the excellent modeling capacity of PLMs. With powerful PLMs, we can effectively learn the semantic representations of queries and texts in the latent representation space, and further construct the semantic matching function between the dense vectors for relevance modeling. Such a retrieval approach is called <i>dense retrieval</i>, since it employs dense vectors to represent the texts. Considering the rapid progress on dense retrieval, this survey systematically reviews the recent progress on PLM-based dense retrieval. Different from previous surveys on dense retrieval, we take a new perspective to organize the related studies by four major aspects, including architecture, training, indexing and integration, and thoroughly summarize the mainstream techniques for each aspect. We extensively collect the recent advances on this topic, and include 300+ reference papers. To support our survey, we create a website for providing useful resources, and release a code repository for dense retrieval. This survey aims to provide a comprehensive, practical reference focused on the major progress for dense text retrieval.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138716968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relevance Feedback with Brain Signals 利用大脑信号进行相关性反馈
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-18 DOI: 10.1145/3637874
Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su, Min Zhang

The Relevance Feedback (RF) process relies on accurate and real-time relevance estimation of feedback documents to improve retrieval performance. Since collecting explicit relevance annotations imposes an extra burden on the user, extensive studies have explored using pseudo-relevance signals and implicit feedback signals as substitutes. However, such signals are indirect indicators of relevance and suffer from complex search scenarios where user interactions are absent or biased.

Recently, the advances in portable and high-precision brain-computer interface (BCI) devices have shown the possibility to monitor user’s brain activities during search process. Brain signals can directly reflect user’s psychological responses to search results and thus it can act as additional and unbiased RF signals. To explore the effectiveness of brain signals in the context of RF, we propose a novel RF framework that combines BCI-based relevance feedback with pseudo-relevance signals and implicit signals to improve the performance of document re-ranking. The experimental results on the user study dataset show that incorporating brain signals leads to significant performance improvement in our RF framework. Besides, we observe that brain signals perform particularly well in several hard search scenarios, especially when implicit signals as feedback are missing or noisy. This reveals when and how to exploit brain signals in the context of RF.

相关性反馈(RF)过程依赖于对反馈文档进行准确和实时的相关性估计,以提高检索性能。由于收集明确的相关性注释会给用户带来额外负担,因此大量研究都在探索使用伪相关性信号和隐式反馈信号作为替代。然而,这些信号都是相关性的间接指标,在用户互动缺失或有偏差的复杂搜索场景中会受到影响。最近,便携式高精度脑机接口(BCI)设备的发展为监测用户在搜索过程中的大脑活动提供了可能。脑信号可以直接反映用户对搜索结果的心理反应,因此可以作为额外的、无偏见的射频信号。为了探索大脑信号在搜索相关性方面的有效性,我们提出了一个新颖的搜索相关性框架,该框架将基于 BCI 的相关性反馈与伪相关性信号和隐式信号相结合,以提高文档重新排序的性能。在用户研究数据集上的实验结果表明,在我们的 RF 框架中,结合大脑信号可显著提高性能。此外,我们还观察到大脑信号在几种困难搜索场景中表现尤为出色,尤其是在作为反馈的隐含信号缺失或存在噪声的情况下。这揭示了何时以及如何在射频范围内利用大脑信号。
{"title":"Relevance Feedback with Brain Signals","authors":"Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su, Min Zhang","doi":"10.1145/3637874","DOIUrl":"https://doi.org/10.1145/3637874","url":null,"abstract":"<p>The Relevance Feedback (RF) process relies on accurate and real-time relevance estimation of feedback documents to improve retrieval performance. Since collecting explicit relevance annotations imposes an extra burden on the user, extensive studies have explored using pseudo-relevance signals and implicit feedback signals as substitutes. However, such signals are indirect indicators of relevance and suffer from complex search scenarios where user interactions are absent or biased. </p><p>Recently, the advances in portable and high-precision brain-computer interface (BCI) devices have shown the possibility to monitor user’s brain activities during search process. Brain signals can directly reflect user’s psychological responses to search results and thus it can act as additional and unbiased RF signals. To explore the effectiveness of brain signals in the context of RF, we propose a novel RF framework that combines BCI-based relevance feedback with pseudo-relevance signals and implicit signals to improve the performance of document re-ranking. The experimental results on the user study dataset show that incorporating brain signals leads to significant performance improvement in our RF framework. Besides, we observe that brain signals perform particularly well in several hard search scenarios, especially when implicit signals as feedback are missing or noisy. This reveals when and how to exploit brain signals in the context of RF.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138717032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems 理解还是操纵?反思现代推荐系统的在线性能收益
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-15 DOI: 10.1145/3637869
Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu†, Yong Yu, Weinan Zhang†

Recommender systems are expected to be assistants that help human users find relevant information automatically without explicit queries. As recommender systems evolve, increasingly sophisticated learning techniques are applied and have achieved better performance in terms of user engagement metrics such as clicks and browsing time. The increase in the measured performance, however, can have two possible attributions: a better understanding of user preferences, and a more proactive ability to utilize human bounded rationality to seduce user over-consumption. A natural following question is whether current recommendation algorithms are manipulating user preferences. If so, can we measure the manipulation level? In this paper, we present a general framework for benchmarking the degree of manipulations of recommendation algorithms, in both slate recommendation and sequential recommendation scenarios. The framework consists of four stages, initial preference calculation, training data collection, algorithm training and interaction, and metrics calculation that involves two proposed metrics, Manipulation Score and Preference Shift. We benchmark some representative recommendation algorithms in both synthetic and real-world datasets under the proposed framework. We have observed that a high online click-through rate does not necessarily mean a better understanding of user initial preference, but ends in prompting users to choose more documents they initially did not favor. Moreover, we find that the training data have notable impacts on the manipulation degrees, and algorithms with more powerful modeling abilities are more sensitive to such impacts. The experiments also verified the usefulness of the proposed metrics for measuring the degree of manipulations. We advocate that future recommendation algorithm studies should be treated as an optimization problem with constrained user preference manipulations.

推荐系统有望成为帮助人类用户在没有明确查询的情况下自动查找相关信息的助手。随着推荐系统的发展,人们应用了越来越复杂的学习技术,并在用户参与度指标(如点击量和浏览时间)方面取得了更好的性能。然而,衡量性能的提高可能有两个原因:一是对用户偏好有了更好的理解,二是能够更主动地利用人类的有限理性来诱导用户过度消费。接下来的一个自然问题是,当前的推荐算法是否操纵了用户偏好。如果是,我们能否衡量操纵程度?在本文中,我们提出了一个通用框架,用于在板块推荐和顺序推荐两种情况下对推荐算法的操纵程度进行基准测试。该框架包括四个阶段:初始偏好计算、训练数据收集、算法训练和交互,以及指标计算,其中涉及两个建议的指标:操纵分数和偏好偏移。我们根据提出的框架,在合成数据集和真实数据集中对一些具有代表性的推荐算法进行了基准测试。我们发现,在线点击率高并不一定意味着能更好地了解用户的初始偏好,而是会促使用户选择更多他们最初并不喜欢的文档。此外,我们还发现训练数据对操作度有显著影响,而建模能力更强的算法对这种影响更为敏感。实验还验证了所提出的衡量操纵程度的指标的实用性。我们主张,未来的推荐算法研究应将用户偏好操作作为一个优化问题来处理。
{"title":"Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems","authors":"Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu†, Yong Yu, Weinan Zhang†","doi":"10.1145/3637869","DOIUrl":"https://doi.org/10.1145/3637869","url":null,"abstract":"<p>Recommender systems are expected to be assistants that help human users find relevant information automatically without explicit queries. As recommender systems evolve, increasingly sophisticated learning techniques are applied and have achieved better performance in terms of user engagement metrics such as clicks and browsing time. The increase in the measured performance, however, can have two possible attributions: a better understanding of user preferences, and a more proactive ability to utilize human bounded rationality to seduce user over-consumption. A natural following question is whether current recommendation algorithms are manipulating user preferences. If so, can we measure the manipulation level? In this paper, we present a general framework for benchmarking the degree of manipulations of recommendation algorithms, in both slate recommendation and sequential recommendation scenarios. The framework consists of four stages, initial preference calculation, training data collection, algorithm training and interaction, and metrics calculation that involves two proposed metrics, Manipulation Score and Preference Shift. We benchmark some representative recommendation algorithms in both synthetic and real-world datasets under the proposed framework. We have observed that a high online click-through rate does not necessarily mean a better understanding of user initial preference, but ends in prompting users to choose more documents they initially did not favor. Moreover, we find that the training data have notable impacts on the manipulation degrees, and algorithms with more powerful modeling abilities are more sensitive to such impacts. The experiments also verified the usefulness of the proposed metrics for measuring the degree of manipulations. We advocate that future recommendation algorithm studies should be treated as an optimization problem with constrained user preference manipulations.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138691243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Effectiveness of Sampled Softmax Loss for Item Recommendation 论采样软最大损失在项目推荐中的有效性
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-13 DOI: 10.1145/3637061
Jiancan Wu, Xiang Wang, Xingyu Gao, Jiawei Chen, Hongcheng Fu, Tianyu Qiu, Xiangnan He

The learning objective plays a fundamental role to build a recommender system. Most methods routinely adopt either pointwise (e.g., binary cross-entropy) or pairwise (e.g., BPR) loss to train the model parameters, while rarely pay attention to softmax loss, which assumes the probabilities of all classes sum up to 1, due to its computational complexity when scaling up to large datasets or intractability for streaming data where the complete item space is not always available. The sampled softmax (SSM) loss emerges as an efficient substitute for softmax loss. Its special case, InfoNCE loss, has been widely used in self-supervised learning and exhibited remarkable performance for contrastive learning. Nonetheless, limited recommendation work uses the SSM loss as the learning objective. Worse still, none of them explores its properties thoroughly and answers “Does SSM loss suit for item recommendation?” and “What are the conceptual advantages of SSM loss, as compared with the prevalent losses?”, to the best of our knowledge.

In this work, we aim to offer a better understanding of SSM for item recommendation. Specifically, we first theoretically reveal three model-agnostic advantages: (1) mitigating popularity bias, which is beneficial to long-tail recommendation; (2) mining hard negative samples, which offers informative gradients to optimize model parameters; and (3) maximizing the ranking metric, which facilitates top-K performance. However, based on our empirical studies, we recognize that the default choice of cosine similarity function in SSM limits its ability in learning the magnitudes of representation vectors. As such, the combinations of SSM with the models that also fall short in adjusting magnitudes (e.g., matrix factorization) may result in poor representations. One step further, we provide mathematical proof that message passing schemes in graph convolution networks can adjust representation magnitude according to node degree, which naturally compensates for the shortcoming of SSM. Extensive experiments on four benchmark datasets justify our analyses, demonstrating the superiority of SSM for item recommendation. Our implementations are available in both TensorFlow and PyTorch.

学习目标对建立推荐系统起着至关重要的作用。大多数方法通常采用点状损失(如二元交叉熵)或成对损失(如 BPR)来训练模型参数,而很少关注软最大损失,因为软最大损失假定所有类别的概率总和为 1,这是因为它在扩展到大型数据集时计算复杂,或者对于并非总是有完整项目空间的流数据来说难以处理。采样软最大(SSM)损失作为软最大损失的有效替代品应运而生。它的特例--InfoNCE 损失已被广泛应用于自我监督学习中,并在对比学习中表现出卓越的性能。然而,使用 SSM 损失作为学习目标的推荐工作非常有限。更糟糕的是,据我们所知,没有一项研究深入探讨了 SSM 损失的特性,并回答了 "SSM 损失是否适合项目推荐?"以及 "与流行的损失相比,SSM 损失在概念上有哪些优势?"的问题。在这项工作中,我们旨在更好地理解用于项目推荐的 SSM。具体来说,我们首先从理论上揭示了与模型无关的三个优势:(1) 减少流行度偏差,这有利于长尾推荐;(2) 挖掘硬负样本,这为优化模型参数提供了信息梯度;(3) 最大化排名度量,这有利于 Top-K 性能。然而,根据我们的实证研究,我们认识到 SSM 默认选择的余弦相似度函数限制了其学习表示向量大小的能力。因此,将 SSM 与同样无法调整大小的模型(如矩阵因式分解)相结合,可能会导致较差的表示。我们进一步提供了数学证明,即图卷积网络中的消息传递方案可以根据节点度调整表示量,这自然弥补了 SSM 的不足。在四个基准数据集上进行的广泛实验证明了我们的分析,证明了 SSM 在项目推荐方面的优越性。我们的实现可在 TensorFlow 和 PyTorch 中使用。
{"title":"On the Effectiveness of Sampled Softmax Loss for Item Recommendation","authors":"Jiancan Wu, Xiang Wang, Xingyu Gao, Jiawei Chen, Hongcheng Fu, Tianyu Qiu, Xiangnan He","doi":"10.1145/3637061","DOIUrl":"https://doi.org/10.1145/3637061","url":null,"abstract":"<p>The learning objective plays a fundamental role to build a recommender system. Most methods routinely adopt either pointwise (<i>e.g.,</i> binary cross-entropy) or pairwise (<i>e.g.,</i> BPR) loss to train the model parameters, while rarely pay attention to softmax loss, which assumes the probabilities of all classes sum up to 1, due to its computational complexity when scaling up to large datasets or intractability for streaming data where the complete item space is not always available. The sampled softmax (SSM) loss emerges as an efficient substitute for softmax loss. Its special case, InfoNCE loss, has been widely used in self-supervised learning and exhibited remarkable performance for contrastive learning. Nonetheless, limited recommendation work uses the SSM loss as the learning objective. Worse still, none of them explores its properties thoroughly and answers “Does SSM loss suit for item recommendation?” and “What are the conceptual advantages of SSM loss, as compared with the prevalent losses?”, to the best of our knowledge. </p><p>In this work, we aim to offer a better understanding of SSM for item recommendation. Specifically, we first theoretically reveal three model-agnostic advantages: (1) mitigating popularity bias, which is beneficial to long-tail recommendation; (2) mining hard negative samples, which offers informative gradients to optimize model parameters; and (3) maximizing the ranking metric, which facilitates top-<i>K</i> performance. However, based on our empirical studies, we recognize that the default choice of cosine similarity function in SSM limits its ability in learning the magnitudes of representation vectors. As such, the combinations of SSM with the models that also fall short in adjusting magnitudes (<i>e.g.,</i> matrix factorization) may result in poor representations. One step further, we provide mathematical proof that message passing schemes in graph convolution networks can adjust representation magnitude according to node degree, which naturally compensates for the shortcoming of SSM. Extensive experiments on four benchmark datasets justify our analyses, demonstrating the superiority of SSM for item recommendation. Our implementations are available in both TensorFlow and PyTorch.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138580490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-Preserving Individual-Level COVID-19 Infection Prediction via Federated Graph Learning 通过联合图学习进行保护隐私的个人级 COVID-19 感染预测
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-07 DOI: 10.1145/3633202
Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

Accurately predicting individual-level infection state is of great value since its essential role in reducing the damage of the epidemic. However, there exists an inescapable risk of privacy leakage in the fine-grained user mobility trajectories required by individual-level infection prediction. In this paper, we focus on developing a framework of privacy-preserving individual-level infection prediction based on federated learning (FL) and graph neural networks (GNN). We propose Falcon, a Federated grAph Learning method for privacy-preserving individual-level infeCtion predictiON. It utilizes a novel hypergraph structure with spatio-temporal hyperedges to describe the complex interactions between individuals and locations in the contagion process. By organically combining the FL framework with hypergraph neural networks, the information propagation process of the graph machine learning is able to be divided into two stages distributed on the server and the clients, respectively, so as to effectively protect user privacy while transmitting high-level information. Furthermore, it elaborately designs a differential privacy perturbation mechanism as well as a plausible pseudo location generation approach to preserve user privacy in the graph structure. Besides, it introduces a cooperative coupling mechanism between the individual-level prediction model and an additional region-level model to mitigate the detrimental impacts caused by the injected obfuscation mechanisms. Extensive experimental results show that our methodology outperforms state-of-the-art algorithms and is able to protect user privacy against actual privacy attacks. Our code and datasets are available at the link: https://github.com/wjfu99/FL-epidemic.

准确预测个人层面的感染状态具有重要价值,因为它在减少疫情危害方面发挥着至关重要的作用。然而,个体级感染预测所需的细粒度用户移动轨迹存在不可避免的隐私泄露风险。在本文中,我们重点开发了一种基于联合学习(FL)和图神经网络(GNN)的保护隐私的个体级感染预测框架。我们提出了一种用于保护隐私的个体级推断预测的联合图神经网络学习方法 Falcon。它利用具有时空超边缘的新型超图结构来描述传染过程中个体与地点之间的复杂互动。通过将 FL 框架与超图神经网络有机结合,将图机器学习的信息传播过程分为两个阶段,分别分布在服务器和客户端,从而在传输高级信息的同时有效保护用户隐私。此外,它还精心设计了一种差分隐私扰动机制和一种可信的伪位置生成方法,以保护图结构中的用户隐私。此外,它还引入了个人级预测模型和附加区域级模型之间的合作耦合机制,以减轻注入式混淆机制造成的不利影响。广泛的实验结果表明,我们的方法优于最先进的算法,能够在实际隐私攻击中保护用户隐私。我们的代码和数据集可从以下链接获取:https://github.com/wjfu99/FL-epidemic。
{"title":"Privacy-Preserving Individual-Level COVID-19 Infection Prediction via Federated Graph Learning","authors":"Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang","doi":"10.1145/3633202","DOIUrl":"https://doi.org/10.1145/3633202","url":null,"abstract":"<p>Accurately predicting individual-level infection state is of great value since its essential role in reducing the damage of the epidemic. However, there exists an inescapable risk of privacy leakage in the fine-grained user mobility trajectories required by individual-level infection prediction. In this paper, we focus on developing a framework of privacy-preserving individual-level infection prediction based on federated learning (FL) and graph neural networks (GNN). We propose <i>Falcon</i>, a <b>F</b>ederated gr<b>A</b>ph <b>L</b>earning method for privacy-preserving individual-level infe<b>C</b>tion predicti<b>ON</b>. It utilizes a novel hypergraph structure with spatio-temporal hyperedges to describe the complex interactions between individuals and locations in the contagion process. By organically combining the FL framework with hypergraph neural networks, the information propagation process of the graph machine learning is able to be divided into two stages distributed on the server and the clients, respectively, so as to effectively protect user privacy while transmitting high-level information. Furthermore, it elaborately designs a differential privacy perturbation mechanism as well as a plausible pseudo location generation approach to preserve user privacy in the graph structure. Besides, it introduces a cooperative coupling mechanism between the individual-level prediction model and an additional region-level model to mitigate the detrimental impacts caused by the injected obfuscation mechanisms. Extensive experimental results show that our methodology outperforms state-of-the-art algorithms and is able to protect user privacy against actual privacy attacks. Our code and datasets are available at the link: https://github.com/wjfu99/FL-epidemic.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138546759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Effective and Efficient Sparse Neural Information Retrieval 实现有效、高效的稀疏神经信息检索
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-12-02 DOI: 10.1145/3634912
Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant

Sparse representation learning based on Pre-trained Language Models has seen a growing interest in Information Retrieval. Such approaches can take advantage of the proven efficiency of inverted indexes, and inherit desirable IR priors such as explicit lexical matching or some degree of interpretability. In this work, we thoroughly develop the framework of sparse representation learning in IR, which unifies term weighting and expansion in a supervised setting. We then build on SPLADE – a sparse expansion-based retriever – and show to which extent it is able to benefit from the same training improvements as dense bi-encoders, by studying the effect of distillation, hard negative mining as well as the Pre-trained Language Model’s initialization on its effectiveness – leading to state-of-the-art results in both in- and out-of-domain evaluation settings (SPLADE++). We furthermore propose efficiency improvements, allowing us to reach latency requirements on par with traditional keyword-based approaches (Efficient-SPLADE).

基于预训练语言模型的稀疏表示学习在信息检索领域越来越受到关注。这种方法可以利用倒排索引的公认效率,并继承理想的 IR 先验,如明确的词性匹配或一定程度的可解释性。在这项工作中,我们深入开发了 IR 中的稀疏表示学习框架,该框架将术语加权和扩展统一在一个有监督的环境中。然后,我们建立了基于稀疏扩展的检索器 SPLADE,并通过研究蒸馏、硬否定挖掘以及预训练语言模型的初始化对其有效性的影响,展示了 SPLADE 在多大程度上能够从与密集双编码器相同的训练改进中获益,从而在域内和域外评估设置(SPLADE++)中都取得了最先进的结果。此外,我们还提出了提高效率的建议,使我们能够达到与传统基于关键词的方法(Efficient-SPLADE)同等的延迟要求。
{"title":"Towards Effective and Efficient Sparse Neural Information Retrieval","authors":"Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant","doi":"10.1145/3634912","DOIUrl":"https://doi.org/10.1145/3634912","url":null,"abstract":"<p>Sparse representation learning based on Pre-trained Language Models has seen a growing interest in Information Retrieval. Such approaches can take advantage of the proven efficiency of inverted indexes, and inherit desirable IR priors such as explicit lexical matching or some degree of interpretability. In this work, we thoroughly develop the framework of sparse representation learning in IR, which unifies term weighting and expansion in a supervised setting. We then build on SPLADE – a sparse expansion-based retriever – and show to which extent it is able to benefit from the same training improvements as dense bi-encoders, by studying the effect of distillation, hard negative mining as well as the Pre-trained Language Model’s initialization on its <i>effectiveness</i> – leading to state-of-the-art results in both in- and out-of-domain evaluation settings (SPLADE++). We furthermore propose <i>efficiency</i> improvements, allowing us to reach latency requirements on par with traditional keyword-based approaches (Efficient-SPLADE).</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138559919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Augmentation for Sample Efficient and Robust Document Ranking 基于样本高效鲁棒排序的数据增强
IF 5.6 2区 计算机科学 Q1 Business, Management and Accounting Pub Date : 2023-11-29 DOI: 10.1145/3634911
Abhijit Anand, Jurek Leonhardt, Jaspreet Singh, Koustav Rudra, Avishek Anand

Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine-tuning. In this paper, we propose data-augmentation methods for effective and robust ranking performance. One of the key benefits of using data augmentation is in achieving sample efficiency or learning effectively when we have only a small amount of training data. We propose supervised and unsupervised data augmentation schemes by creating training data using parts of the relevant documents in the query-document pairs. We then adapt a family of contrastive losses for the document ranking task that can exploit the augmented data to learn an effective ranking model. Our extensive experiments on subsets of the MS MARCO and TREC-DL test sets show that data augmentation, along with the ranking-adapted contrastive losses, results in performance improvements under most dataset sizes. Apart from sample efficiency, we conclusively show that data augmentation results in robust models when transferred to out-of-domain benchmarks. Our performance improvements in in-domain and more prominently in out-of-domain benchmarks show that augmentation regularizes the ranking model and improves its robustness and generalization capability.

在文档排序任务中,上下文排序模型比经典模型提供了令人印象深刻的性能改进。然而,这些高度过度参数化的模型往往需要大量数据,甚至需要大量数据进行微调。在本文中,我们提出了有效和稳健的排名性能的数据增强方法。使用数据增强的主要好处之一是在我们只有少量训练数据的情况下实现样本效率或有效学习。我们通过使用查询文档对中相关文档的部分创建训练数据,提出了有监督和无监督的数据增强方案。然后,我们为文档排序任务调整了一系列对比损失,可以利用增强的数据来学习有效的排序模型。我们对MS MARCO和TREC-DL测试集的子集进行了广泛的实验,结果表明,在大多数数据集大小下,数据增强以及与排名相适应的对比损失都能提高性能。除了样本效率之外,我们最后表明,当转移到域外基准测试时,数据增强会产生鲁棒模型。我们在域内和域外的性能改进表明,增强使排名模型规范化,提高了其鲁棒性和泛化能力。
{"title":"Data Augmentation for Sample Efficient and Robust Document Ranking","authors":"Abhijit Anand, Jurek Leonhardt, Jaspreet Singh, Koustav Rudra, Avishek Anand","doi":"10.1145/3634911","DOIUrl":"https://doi.org/10.1145/3634911","url":null,"abstract":"<p>Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine-tuning. In this paper, we propose data-augmentation methods for effective and robust ranking performance. One of the key benefits of using data augmentation is in achieving <i>sample efficiency</i> or learning effectively when we have only a small amount of training data. We propose supervised and unsupervised data augmentation schemes by creating training data using parts of the relevant documents in the query-document pairs. We then adapt a family of contrastive losses for the document ranking task that can exploit the augmented data to learn an effective ranking model. Our extensive experiments on subsets of the <span>MS MARCO</span> and <span>TREC-DL</span> test sets show that data augmentation, along with the ranking-adapted contrastive losses, results in performance improvements under most dataset sizes. Apart from sample efficiency, we conclusively show that data augmentation results in robust models when transferred to out-of-domain benchmarks. Our performance improvements in in-domain and more prominently in out-of-domain benchmarks show that augmentation regularizes the ranking model and improves its robustness and generalization capability.</p>","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":null,"pages":null},"PeriodicalIF":5.6,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138537120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
ACM Transactions on Information Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1