首页 > 最新文献

Foundations and Trends in Information Retrieval最新文献

英文 中文
Efficient and Effective Tree-based and Neural Learning to Rank 高效的基于树和神经学习的排名
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-14 DOI: 10.1561/1500000071
Sebastian Bruch, Claudio Lucchese, Franco Maria Nardini

As information retrieval researchers, we not only develop algorithmic solutions to hard problems, but we also insist on a proper, multifaceted evaluation of ideas. The literature on the fundamental topic of retrieval and ranking, for instance, has a rich history of studying the effectiveness of indexes, retrieval algorithms, and complex machine learning rankers, while at the same time quantifying their computational costs, from creation and training to application and inference. This is evidenced, for example, by more than a decade of research on efficient training and inference of large decision forest models in Learning to Rank (LtR). As we move towards even more complex, deep learning models in a wide range of applications, questions on efficiency have once again resurfaced with renewed urgency. Indeed, efficiency is no longer limited to time and space; instead it has found new, challenging dimensions that stretch to resource-, sample- and energy-efficiency with ramifications for researchers, users, and the environment.

This monograph takes a step towards promoting the study of efficiency in the era of neural information retrieval by offering a comprehensive survey of the literature on efficiency and effectiveness in ranking, and to a limited extent, retrieval. This monograph was inspired by the parallels that exist between the challenges in neural network-based ranking solutions and their predecessors, decision forest-based LtR models, as well as the connections between the solutions the literature to date has to offer. We believe that by understanding the fundamentals underpinning these algorithmic and data structure solutions for containing the contentious relationship between efficiency and effectiveness, one can better identify future directions and more efficiently determine the merits of ideas. We also present what we believe to be important research directions in the forefront of efficiency and effectiveness in retrieval and ranking.

作为信息检索研究人员,我们不仅开发解决难题的算法,而且还坚持对思想进行适当的、多方面的评估。例如,关于检索和排名这一基本主题的文献,在研究索引、检索算法和复杂机器学习排名器的有效性方面有着丰富的历史,同时量化它们的计算成本,从创建和训练到应用和推理。例如,十多年来对学习排序(LtR)中大型决策森林模型的有效训练和推理的研究证明了这一点。随着我们在更广泛的应用中向更复杂的深度学习模型迈进,效率问题再次以新的紧迫性重新浮出水面。事实上,效率不再局限于时间和空间;相反,它发现了新的、具有挑战性的维度,延伸到资源、样本和能源效率,对研究人员、用户和环境产生影响。本专著采取了一个步骤,以促进效率的研究在神经信息检索的时代,提供了一个全面的调查文献效率和有效性的排名,并在有限的程度上,检索。这本专著的灵感来自于基于神经网络的排名解决方案与其前身,基于决策森林的LtR模型的挑战之间存在的相似之处,以及迄今为止文献中所提供的解决方案之间的联系。我们相信,通过理解这些算法和数据结构解决方案的基础,以包含效率和有效性之间有争议的关系,人们可以更好地确定未来的方向,更有效地确定想法的优点。我们还提出了我们认为在检索和排名的效率和有效性方面的重要研究方向。
{"title":"Efficient and Effective Tree-based and Neural Learning to Rank","authors":"Sebastian Bruch, Claudio Lucchese, Franco Maria Nardini","doi":"10.1561/1500000071","DOIUrl":"https://doi.org/10.1561/1500000071","url":null,"abstract":"<p>As information retrieval researchers, we not only develop algorithmic solutions to hard problems, but we also insist on a proper, multifaceted evaluation of ideas. The literature on the fundamental topic of retrieval and ranking, for instance, has a rich history of studying the effectiveness of indexes, retrieval algorithms, and complex machine learning rankers, while at the same time quantifying their computational costs, from creation and training to application and inference. This is evidenced, for example, by more than a decade of research on efficient training and inference of large decision forest models in Learning to Rank (LtR). As we move towards even more complex, deep learning models in a wide range of applications, questions on efficiency have once again resurfaced with renewed urgency. Indeed, efficiency is no longer limited to time and space; instead it has found new, challenging dimensions that stretch to resource-, sample- and energy-efficiency with ramifications for researchers, users, and the environment.<p>This monograph takes a step towards promoting the study of efficiency in the era of neural information retrieval by offering a comprehensive survey of the literature on efficiency and effectiveness in ranking, and to a limited extent, retrieval. This monograph was inspired by the parallels that exist between the challenges in neural network-based ranking solutions and their predecessors, decision forest-based LtR models, as well as the connections between the solutions the literature to date has to offer. We believe that by understanding the fundamentals underpinning these algorithmic and data structure solutions for containing the contentious relationship between efficiency and effectiveness, one can better identify future directions and more efficiently determine the merits of ideas. We also present what we believe to be important research directions in the forefront of efficiency and effectiveness in retrieval and ranking.</p></p>","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"72 4","pages":""},"PeriodicalIF":10.4,"publicationDate":"2023-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49697987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Quantum-Inspired Neural Language Representation, Matching and Understanding 量子启发的神经语言表示、匹配和理解
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-04-18 DOI: 10.1561/1500000091
Peng Zhang, Hui Gao, Jing Zhang, Dawei Song

The introduction of Quantum Theory (QT) provides a unified mathematical framework for Information Retrieval (IR). Compared with the classical IR framework, the quantuminspired IR framework is based on user-centered modeling methods to model non-classical cognitive phenomena in human relevance judgment in the IR process. With the increase of data and computing resources, neural IR methods have been applied to the text matching and understanding task of IR. Neural networks have a strong learning ability of effective representation and generalization of matching patterns from raw data. However, these methods show some unavoidable defects, such as the inability to model user cognitive phenomena, large number of model parameters and the “black box” characteristics of network structure. These problems greatly limit the development of neural IR and related fields. Although the quantum-inspired retrieval framework can theoretically solve the above problems, it is faced with problems such as poor model efficiency and difficulty in integrating with neural network, which lead to a huge gap between QT and neural network modeling.

This review gives a systematic introduction to quantuminspired neural IR, including quantum-inspired neural language representation, matching and understanding. This is not only helpful to non-classical phenomena modeling in IR but also to break the theoretical bottleneck of neural networks and design more transparent neural IR models. We introduce the language representation method based on QT and the quantum-inspired text matching and decision making model under neural network, which shows its theoretical advantages in document ranking, relevance matching, multimodal IR, and can be integrated with neural networks to jointly promote the development of IR. The latest progress of quantum language understanding is introduced and further topics on QT and language modeling provide readers with more materials for thinking.

量子理论(QT)的引入为信息检索(IR)提供了统一的数学框架。与经典红外框架相比,量子启发红外框架基于以用户为中心的建模方法,对红外过程中人类关联判断中的非经典认知现象进行建模。随着数据量和计算资源的增加,神经红外方法已被应用于红外文本的匹配和理解任务。神经网络具有较强的学习能力,可以从原始数据中有效地表示和泛化匹配模式。然而,这些方法存在一些不可避免的缺陷,如无法对用户认知现象进行建模、模型参数过多、网络结构的“黑箱”特征等。这些问题极大地限制了神经红外及其相关领域的发展。虽然量子启发检索框架理论上可以解决上述问题,但它面临着模型效率差、难以与神经网络集成等问题,导致QT与神经网络建模存在巨大差距。本文系统地介绍了量子启发神经红外,包括量子启发神经语言表示、匹配和理解。这不仅有助于红外中的非经典现象建模,而且有助于突破神经网络的理论瓶颈,设计出更加透明的神经红外模型。本文介绍了基于QT的语言表示方法和神经网络下的量子启发文本匹配与决策模型,在文档排序、关联匹配、多模态IR等方面显示了其理论优势,并可与神经网络相结合,共同推动IR的发展。介绍了量子语言理解的最新进展,并进一步介绍了QT和语言建模的主题,为读者提供了更多的思考材料。
{"title":"Quantum-Inspired Neural Language Representation, Matching and Understanding","authors":"Peng Zhang, Hui Gao, Jing Zhang, Dawei Song","doi":"10.1561/1500000091","DOIUrl":"https://doi.org/10.1561/1500000091","url":null,"abstract":"<p>The introduction of Quantum Theory (QT) provides a unified mathematical framework for Information Retrieval (IR). Compared with the classical IR framework, the quantuminspired IR framework is based on user-centered modeling methods to model non-classical cognitive phenomena in human relevance judgment in the IR process. With the increase of data and computing resources, neural IR methods have been applied to the text matching and understanding task of IR. Neural networks have a strong learning ability of effective representation and generalization of matching patterns from raw data. However, these methods show some unavoidable defects, such as the inability to model user cognitive phenomena, large number of model parameters and the “black box” characteristics of network structure. These problems greatly limit the development of neural IR and related fields. Although the quantum-inspired retrieval framework can theoretically solve the above problems, it is faced with problems such as poor model efficiency and difficulty in integrating with neural network, which lead to a huge gap between QT and neural network modeling.<p>This review gives a systematic introduction to quantuminspired neural IR, including quantum-inspired neural language representation, matching and understanding. This is not only helpful to non-classical phenomena modeling in IR but also to break the theoretical bottleneck of neural networks and design more transparent neural IR models. We introduce the language representation method based on QT and the quantum-inspired text matching and decision making model under neural network, which shows its theoretical advantages in document ranking, relevance matching, multimodal IR, and can be integrated with neural networks to jointly promote the development of IR. The latest progress of quantum language understanding is introduced and further topics on QT and language modeling provide readers with more materials for thinking.</p></p>","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"54 45","pages":""},"PeriodicalIF":10.4,"publicationDate":"2023-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49698420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning for Dialogue Systems: Chit-Chat and Beyond 对话系统的深度学习:闲聊和超越
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-01-01 DOI: 10.1561/1500000083
Rui Yan, Juntao Li, Zhou Yu
Recommendation, information retrieval, and other information access systems pose unique challenges for investigating and applying the fairness and non-discrimination concepts that have been developed for studying other machine learning systems. While fair information access shares many commonalities with fair classification, there are important differences: the multistakeholder nature of information access applications, the rank-based problem setting, the centrality of personalization in many cases, and the role of user response all complicate the problem of identifying precisely what types and operationalizations of fairness may be relevant. In this monograph, we present a taxonomy of the various dimensions of fair information access and survey the literature to date on this new and rapidly-growing topic. We Michael D. Ekstrand, Anubrata Das, Robin Burke and Fernando Diaz (2022), “Fairness in Information Access Systems”, Foundations and Trends® in Information Retrieval: Vol. 16, No. 1-2, pp 1–177. DOI: 10.1561/1500000079. ©2022 M. D. Ekstrand et al. Full text available at: http://dx.doi.org/10.1561/1500000079
推荐、信息检索和其他信息访问系统对调查和应用公平和非歧视概念提出了独特的挑战,这些概念已经为研究其他机器学习系统而开发。虽然公平的信息访问与公平的分类有许多共同点,但也有重要的区别:信息访问应用的多利益相关者性质、基于排名的问题设置、在许多情况下个性化的中心地位以及用户响应的作用,所有这些都使准确识别公平的类型和操作可能相关的问题复杂化。在这本专著中,我们提出了公平信息获取的各个维度的分类,并调查了迄今为止关于这个新的和快速增长的主题的文献。我们Michael D. Ekstrand, Anubrata Das, Robin Burke和Fernando Diaz(2022),“信息获取系统的公平性”,《信息检索的基础与趋势》,第16卷第1-2期,第1-177页。DOI: 10.1561 / 1500000079。©2022 M. D. Ekstrand等。全文可在:http://dx.doi.org/10.1561/1500000079
{"title":"Deep Learning for Dialogue Systems: Chit-Chat and Beyond","authors":"Rui Yan, Juntao Li, Zhou Yu","doi":"10.1561/1500000083","DOIUrl":"https://doi.org/10.1561/1500000083","url":null,"abstract":"Recommendation, information retrieval, and other information access systems pose unique challenges for investigating and applying the fairness and non-discrimination concepts that have been developed for studying other machine learning systems. While fair information access shares many commonalities with fair classification, there are important differences: the multistakeholder nature of information access applications, the rank-based problem setting, the centrality of personalization in many cases, and the role of user response all complicate the problem of identifying precisely what types and operationalizations of fairness may be relevant. In this monograph, we present a taxonomy of the various dimensions of fair information access and survey the literature to date on this new and rapidly-growing topic. We Michael D. Ekstrand, Anubrata Das, Robin Burke and Fernando Diaz (2022), “Fairness in Information Access Systems”, Foundations and Trends® in Information Retrieval: Vol. 16, No. 1-2, pp 1–177. DOI: 10.1561/1500000079. ©2022 M. D. Ekstrand et al. Full text available at: http://dx.doi.org/10.1561/1500000079","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"19 1","pages":"417-589"},"PeriodicalIF":10.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72783210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Pre-training Methods in Information Retrieval 信息检索中的预训练方法
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-01-01 DOI: 10.1561/1500000100
Yixing Fan, Xiaohui Xie, Yinqiong Cai, Jia Chen, Xinyv Ma, Xiangsheng Li, Ruqing Zhang, Jiafeng Guo
{"title":"Pre-training Methods in Information Retrieval","authors":"Yixing Fan, Xiaohui Xie, Yinqiong Cai, Jia Chen, Xinyv Ma, Xiangsheng Li, Ruqing Zhang, Jiafeng Guo","doi":"10.1561/1500000100","DOIUrl":"https://doi.org/10.1561/1500000100","url":null,"abstract":"","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"29 1","pages":"178-317"},"PeriodicalIF":10.4,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74908117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Psychology-informed Recommender Systems 基于心理学的推荐系统
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-12-06 DOI: 10.1561/1500000090
E. Lex, Dominik Kowald, Paul Seitlinger, Thi Ngoc Trang Tran, A. Felfernig, M. Schedl
Personalized recommender systems have become indispensable in today’s online world. Most of today’s recommendation algorithms are data-driven and based on behavioral data. While such systems can produce useful recommendations, they are often uninterpretable, black-box models, which do not incorporate the underlying cognitive reasons for user behavior in the algorithms’ design. The aim of this survey is to present a thorough review of the state of the art of recommender systems that leverage psychological constructs and theories to model and predict user behavior and improve the recommendation process. We call such systems psychology-informed recommender systems. The survey identifies three categories of psychology-informed recommender systems: cognition-inspired, personality-aware, and affectaware recommender systems. Moreover, for each category, Elisabeth Lex, Dominik Kowald, Paul Seitlinger, Thi Ngoc Trang Tran, Alexander Felfernig and Markus Schedl (2021), “Psychology-informed Recommender Systems”, Foundations and Trends® in Information Retrieval: Vol. 15, No. 2, pp 134–242. DOI: 10.1561/1500000090. Full text available at: http://dx.doi.org/10.1561/1500000090
个性化推荐系统在当今的网络世界中已经不可或缺。今天的大多数推荐算法都是数据驱动的,基于行为数据。虽然这样的系统可以产生有用的建议,但它们通常是不可解释的黑箱模型,没有在算法设计中纳入用户行为的潜在认知原因。本调查的目的是对推荐系统的现状进行全面的回顾,这些系统利用心理学结构和理论来建模和预测用户行为,并改进推荐过程。我们称这种系统为基于心理的推荐系统。该调查确定了三类基于心理学的推荐系统:认知启发型、个性感知型和情感感知型推荐系统。此外,对于每个类别,Elisabeth Lex, Dominik Kowald, Paul Seitlinger, Thi Ngoc Trang Tran, Alexander felferning和Markus Schedl(2021),“心理通知推荐系统”,信息检索的基础和趋势®:第15卷,第2期,第134-242页。DOI: 10.1561 / 1500000090。全文可在:http://dx.doi.org/10.1561/1500000090
{"title":"Psychology-informed Recommender Systems","authors":"E. Lex, Dominik Kowald, Paul Seitlinger, Thi Ngoc Trang Tran, A. Felfernig, M. Schedl","doi":"10.1561/1500000090","DOIUrl":"https://doi.org/10.1561/1500000090","url":null,"abstract":"Personalized recommender systems have become indispensable in today’s online world. Most of today’s recommendation algorithms are data-driven and based on behavioral data. While such systems can produce useful recommendations, they are often uninterpretable, black-box models, which do not incorporate the underlying cognitive reasons for user behavior in the algorithms’ design. The aim of this survey is to present a thorough review of the state of the art of recommender systems that leverage psychological constructs and theories to model and predict user behavior and improve the recommendation process. We call such systems psychology-informed recommender systems. The survey identifies three categories of psychology-informed recommender systems: cognition-inspired, personality-aware, and affectaware recommender systems. Moreover, for each category, Elisabeth Lex, Dominik Kowald, Paul Seitlinger, Thi Ngoc Trang Tran, Alexander Felfernig and Markus Schedl (2021), “Psychology-informed Recommender Systems”, Foundations and Trends® in Information Retrieval: Vol. 15, No. 2, pp 134–242. DOI: 10.1561/1500000090. Full text available at: http://dx.doi.org/10.1561/1500000090","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"10 1","pages":"134-242"},"PeriodicalIF":10.4,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87436257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Search and Discovery in Personal Email Collections 搜索和发现在个人电子邮件收藏
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-07-05 DOI: 10.1561/1500000069
Michael Bendersky, Xuanhui Wang, Marc Najork, Donald Metzler

Email has been an essential communication medium for many years. As a result, the information accumulated in our mailboxes has become valuable for all of our personal and professional activities. For years, researchers have been developing interfaces, models and algorithms to facilitate search, discovery and organization of email data. In this survey, we attempt to bring together these diverse research directions, and provide both a historical background, as well as a comprehensive overview of the recent advances in the field. In particular, we lay out all the components needed in the design of a privacy-centric email search engine, including search interface, indexing, document and query understanding, retrieval, ranking and evaluation. We also go beyond search, presenting recent work on intelligent task assistance in email. Finally, we discuss some emerging trends and future directions in email search and discovery research.

多年来,电子邮件一直是一种重要的沟通媒介。因此,我们邮箱里积累的信息对我们所有的个人和职业活动都变得很有价值。多年来,研究人员一直在开发接口、模型和算法,以促进电子邮件数据的搜索、发现和组织。在本调查中,我们试图汇集这些不同的研究方向,并提供历史背景,以及该领域最新进展的全面概述。特别是,我们列出了设计一个以隐私为中心的电子邮件搜索引擎所需的所有组件,包括搜索界面、索引、文档和查询理解、检索、排名和评估。我们也超越了搜索,展示了最近在电子邮件中的智能任务协助方面的工作。最后,我们讨论了电子邮件搜索和发现研究的一些新趋势和未来方向。
{"title":"Search and Discovery in Personal Email Collections","authors":"Michael Bendersky, Xuanhui Wang, Marc Najork, Donald Metzler","doi":"10.1561/1500000069","DOIUrl":"https://doi.org/10.1561/1500000069","url":null,"abstract":"<p>Email has been an essential communication medium for many years. As a result, the information accumulated in our mailboxes has become valuable for all of our personal and professional activities. For years, researchers have been developing interfaces, models and algorithms to facilitate search, discovery and organization of email data. In this survey, we attempt to bring together these diverse research directions, and provide both a historical background, as well as a comprehensive overview of the recent advances in the field. In particular, we lay out all the components needed in the design of a privacy-centric email search engine, including search interface, indexing, document and query understanding, retrieval, ranking and evaluation. We also go beyond search, presenting recent work on intelligent task assistance in email. Finally, we discuss some emerging trends and future directions in email search and discovery research.</p>","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"318 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2021-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fairness in Information Access Systems 信息获取系统的公平性
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-05-12 DOI: 10.1561/1500000079
Michael D. Ekstrand, Anubrata Das, R. Burke, Fernando Diaz
Recommendation, information retrieval, and other information access systems pose unique challenges for investigating and applying the fairness and non-discrimination concepts that have been developed for studying other machine learning systems. While fair information access shares many commonalities with fair classification, the multistakeholder nature of information access applications, the rank-based problem setting, the centrality of personalization in many cases, and the role of user response complicate the problem of identifying precisely what types and operationalizations of fairness may be relevant, let alone measuring or promoting them. In this monograph, we present a taxonomy of the various dimensions of fair information access and survey the literature to date on this new and rapidly-growing topic. We preface this with brief introductions to information access and algorithmic fairness, to facilitate use of this work by scholars with experience in one (or neither) of these fields who wish to learn about their intersection. We conclude with several open problems in fair information access, along with some suggestions for how to approach research in this space.
推荐、信息检索和其他信息访问系统对调查和应用公平和非歧视概念提出了独特的挑战,这些概念已经为研究其他机器学习系统而开发。虽然公平的信息访问与公平分类有许多共同点,但信息访问应用的多利益相关者性质、基于排名的问题设置、在许多情况下个性化的中心地位以及用户响应的作用使准确识别公平的类型和操作可能相关的问题复杂化,更不用说衡量或促进它们了。在这本专著中,我们提出了公平信息获取的各个维度的分类,并调查了迄今为止关于这个新的和快速增长的主题的文献。在此之前,我们简要介绍了信息获取和算法公平,以方便在这些领域中有一个(或两个)经验的学者使用这项工作,他们希望了解他们的交集。最后,我们提出了公平信息获取中存在的几个问题,并就如何开展这方面的研究提出了一些建议。
{"title":"Fairness in Information Access Systems","authors":"Michael D. Ekstrand, Anubrata Das, R. Burke, Fernando Diaz","doi":"10.1561/1500000079","DOIUrl":"https://doi.org/10.1561/1500000079","url":null,"abstract":"Recommendation, information retrieval, and other information access systems pose unique challenges for investigating and applying the fairness and non-discrimination concepts that have been developed for studying other machine learning systems. While fair information access shares many commonalities with fair classification, the multistakeholder nature of information access applications, the rank-based problem setting, the centrality of personalization in many cases, and the role of user response complicate the problem of identifying precisely what types and operationalizations of fairness may be relevant, let alone measuring or promoting them. In this monograph, we present a taxonomy of the various dimensions of fair information access and survey the literature to date on this new and rapidly-growing topic. We preface this with brief introductions to information access and algorithmic fairness, to facilitate use of this work by scholars with experience in one (or neither) of these fields who wish to learn about their intersection. We conclude with several open problems in fair information access, along with some suggestions for how to approach research in this space.","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"1 1","pages":"1-177"},"PeriodicalIF":10.4,"publicationDate":"2021-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89800926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Search Interface Design and Evaluation 搜索界面设计与评价
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-01-01 DOI: 10.1561/1500000073
Chang Liu, Ying-Hsang Liu, Jingjing Liu, R. Bierig
{"title":"Search Interface Design and Evaluation","authors":"Chang Liu, Ying-Hsang Liu, Jingjing Liu, R. Bierig","doi":"10.1561/1500000073","DOIUrl":"https://doi.org/10.1561/1500000073","url":null,"abstract":"","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"77 1","pages":"243-416"},"PeriodicalIF":10.4,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87093182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Extracting, Mining and Predicting Users' Interests from Social Media 从社交媒体中提取、挖掘和预测用户兴趣
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2020-11-04 DOI: 10.1561/1500000078
F. Zarrinkalam, Stefano Faralli, Guangyuan Piao, E. Bagheri
The abundance of user generated content on social media provides the opportunity to build models that are able to accurately and effectively extract, mine and predict users’ interests with the hopes of enabling more effective user engagement, better quality delivery of appropriate services and higher user satisfaction. While traditional methods for building user profiles relied on AI-based preference elicitation techniques that could have been considered to be intrusive and undesirable by the users, more recent advances are focused on a non-intrusive yet accurate way of determining users’ interests and preferences. In this monograph, we will cover five important subjects related to the mining of user interests from social media: (1) the foundations of social user interest modeling, such as information sources, various types of representation models and temporal features, (2) techniques that have been adopted or proposed for Fattane Zarrinkalam, Stefano Faralli, Guangyuan Piao and Ebrahim Bagheri (2020), “Extracting, Mining and Predicting Users’ Interests from Social Media”, Foundations and Trends © in Information Retrieval: Vol. 14, No. 5, pp 445–617. DOI: 10.1561/1500000078. Full text available at: http://dx.doi.org/10.1561/1500000078
社交媒体上丰富的用户生成内容为建立能够准确有效地提取、挖掘和预测用户兴趣的模型提供了机会,以期实现更有效的用户参与,更好地提供适当的服务质量和更高的用户满意度。虽然建立用户档案的传统方法依赖于基于人工智能的偏好提取技术,这可能被用户认为是侵入性的和不受欢迎的,但最近的进展集中在一种非侵入性但准确的方式来确定用户的兴趣和偏好。在这本专著中,我们将涵盖与从社交媒体中挖掘用户兴趣相关的五个重要主题:(1)社会用户兴趣建模的基础,如信息源、各种类型的表示模型和时间特征;(2)Fattane Zarrinkalam、Stefano Faralli、Guangyuan Piao和Ebrahim Bagheri(2020)采用或提出的技术,“从社交媒体中提取、挖掘和预测用户兴趣”,《信息检索的基础与趋势©》,第14卷,第5期,445-617页。DOI: 10.1561 / 1500000078。全文可在:http://dx.doi.org/10.1561/1500000078
{"title":"Extracting, Mining and Predicting Users' Interests from Social Media","authors":"F. Zarrinkalam, Stefano Faralli, Guangyuan Piao, E. Bagheri","doi":"10.1561/1500000078","DOIUrl":"https://doi.org/10.1561/1500000078","url":null,"abstract":"The abundance of user generated content on social media provides the opportunity to build models that are able to accurately and effectively extract, mine and predict users’ interests with the hopes of enabling more effective user engagement, better quality delivery of appropriate services and higher user satisfaction. While traditional methods for building user profiles relied on AI-based preference elicitation techniques that could have been considered to be intrusive and undesirable by the users, more recent advances are focused on a non-intrusive yet accurate way of determining users’ interests and preferences. In this monograph, we will cover five important subjects related to the mining of user interests from social media: (1) the foundations of social user interest modeling, such as information sources, various types of representation models and temporal features, (2) techniques that have been adopted or proposed for Fattane Zarrinkalam, Stefano Faralli, Guangyuan Piao and Ebrahim Bagheri (2020), “Extracting, Mining and Predicting Users’ Interests from Social Media”, Foundations and Trends © in Information Retrieval: Vol. 14, No. 5, pp 445–617. DOI: 10.1561/1500000078. Full text available at: http://dx.doi.org/10.1561/1500000078","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"724 1","pages":"445-617"},"PeriodicalIF":10.4,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78742778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Knowledge Graphs: An Information Retrieval Perspective 知识图谱:信息检索的视角
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2020-10-14 DOI: 10.1561/1500000063
Ridho Reinanda, E. Meij, M. de Rijke
In this survey, we provide an overview of the literature on knowledge graphs (KGs) in the context of information retrieval (IR). Modern IR systems can benefit from information available in KGs in multiple ways, independent of whether the KGs are publicly available or proprietary ones. We provide an overview of the components required when building IR systems that leverage KGs and use a task-oriented organization of the material that we discuss. As an understanding of the intersection of IR and KGs is beneficial to many researchers and practitioners, we consider prior work from two complementary angles: leveraging KGs for information retrieval and enriching KGs using IR techniques. We start by discussing how KGs can be employed to support IR tasks, including document and entity retrieval. We then proceed by describing how IR—and language technology in general—can be utilized for the construction and completion of KGs. This includes tasks such as entity recognition, typing, and relation extraction. We discuss common issues that appear across the tasks that we consider and identify future directions for addressing them. We also provide pointers to datasets and other resources that should be useful for both newcomers and experienced researchers in the area. Ridho Reinanda, Edgar Meij and Maarten de Rijke (2020), “Knowledge Graphs: An Information Retrieval Perspective”, Foundations and Trends® in Information Retrieval: Vol. 14, No. 4, pp 289–444. DOI: 10.1561/1500000063. Full text available at: http://dx.doi.org/10.1561/1500000063
在本调查中,我们概述了知识图在信息检索(IR)背景下的文献。现代红外系统可以以多种方式受益于kg中提供的信息,而不依赖于kg是公开可用的还是专有的。我们提供了构建利用kg的IR系统所需组件的概述,并使用我们讨论的材料的面向任务的组织。由于理解IR和KGs的交集对许多研究人员和从业者都是有益的,我们从两个互补的角度来考虑之前的工作:利用KGs进行信息检索和使用IR技术丰富KGs。我们首先讨论如何使用kg来支持IR任务,包括文档和实体检索。然后,我们继续描述ir和一般语言技术如何用于构建和完成kg,这包括实体识别、输入和关系提取等任务。我们讨论在我们考虑的任务中出现的常见问题,并确定解决这些问题的未来方向。我们还提供了指向数据集和其他资源的指针,这些资源对该领域的新手和经验丰富的研究人员都很有用。Ridho Reinanda, Edgar Meij和Maarten de Rijke(2020),“知识图谱:信息检索视角”,信息检索的基础和趋势®:第14卷,第4期,第289-444页。DOI: 10.1561 / 1500000063。全文可在:http://dx.doi.org/10.1561/1500000063
{"title":"Knowledge Graphs: An Information Retrieval Perspective","authors":"Ridho Reinanda, E. Meij, M. de Rijke","doi":"10.1561/1500000063","DOIUrl":"https://doi.org/10.1561/1500000063","url":null,"abstract":"In this survey, we provide an overview of the literature on knowledge graphs (KGs) in the context of information retrieval (IR). Modern IR systems can benefit from information available in KGs in multiple ways, independent of whether the KGs are publicly available or proprietary ones. We provide an overview of the components required when building IR systems that leverage KGs and use a task-oriented organization of the material that we discuss. As an understanding of the intersection of IR and KGs is beneficial to many researchers and practitioners, we consider prior work from two complementary angles: leveraging KGs for information retrieval and enriching KGs using IR techniques. We start by discussing how KGs can be employed to support IR tasks, including document and entity retrieval. We then proceed by describing how IR—and language technology in general—can be utilized for the construction and completion of KGs. This includes tasks such as entity recognition, typing, and relation extraction. We discuss common issues that appear across the tasks that we consider and identify future directions for addressing them. We also provide pointers to datasets and other resources that should be useful for both newcomers and experienced researchers in the area. Ridho Reinanda, Edgar Meij and Maarten de Rijke (2020), “Knowledge Graphs: An Information Retrieval Perspective”, Foundations and Trends® in Information Retrieval: Vol. 14, No. 4, pp 289–444. DOI: 10.1561/1500000063. Full text available at: http://dx.doi.org/10.1561/1500000063","PeriodicalId":48829,"journal":{"name":"Foundations and Trends in Information Retrieval","volume":"3 1","pages":"289-444"},"PeriodicalIF":10.4,"publicationDate":"2020-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77053977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
期刊
Foundations and Trends in Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1