Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval最新文献

英文中文

Fairness and Discrimination in Retrieval and Recommendation 检索与推荐中的公平与歧视

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331380

Michael D. Ekstrand, R. Burke, Fernando Diaz

Fairness and related concerns have become of increasing importance in a variety of AI and machine learning contexts. They are also highly relevant to information retrieval and related problems such as recommendation, as evidenced by the growing literature in SIGIR, FAT*, RecSys, and special sessions such as the FATREC workshop and the Fairness track at TREC 2019; however, translating algorithmic fairness constructs from classification, scoring, and even many ranking settings into information retrieval and recommendation scenarios is not a straightforward task. This tutorial will help to orient IR researchers to algorithmic fairness, understand how concepts do and do not translate from other settings, and provide an introduction to the growing literature on this topic.

在各种人工智能和机器学习环境中，公平性和相关问题变得越来越重要。它们也与信息检索和推荐等相关问题高度相关，SIGIR、FAT*、RecSys中越来越多的文献以及FATREC研讨会和TREC 2019的公平专场等特别会议证明了这一点;然而，将算法公平性结构从分类、评分甚至许多排名设置转换为信息检索和推荐场景并不是一项简单的任务。本教程将帮助IR研究人员了解算法公平性，了解概念如何从其他设置中翻译和不翻译，并介绍关于该主题的日益增长的文献。

引用次数: 33

TDP

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1007/978-3-662-48986-4_313617

Zhenlong Zhu, Ruixuan Li, Minghui Shan, Yuhua Li, Lu Gao, Fei Wang, Jixing Xu, X. Gu

引用次数: 1

Human Behavior Inspired Machine Reading Comprehension 人类行为启发机器阅读理解

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331231

Yukun Zheng, Jiaxin Mao, Yiqun Liu, Zixin Ye, Min Zhang, Shaoping Ma

Machine Reading Comprehension (MRC) is one of the most challenging tasks in both NLP and IR researches. Recently, a number of deep neural models have been successfully adopted to some simplified MRC task settings, whose performances were close to or even better than human beings. However, these models still have large performance gaps with human beings in more practical settings, such as MS MARCO and DuReader datasets. Although there are many works studying human reading behavior, the behavior patterns in complex reading comprehension scenarios remain under-investigated. We believe that a better understanding of how human reads and allocates their attention during reading comprehension processes can help improve the performance of MRC tasks. In this paper, we conduct a lab study to investigate human's reading behavior patterns during reading comprehension tasks, where 32 users are recruited to take 60 distinct tasks. By analyzing the collected eye-tracking data and answers from participants, we propose a two-stage reading behavior model, in which the first stage is to search for possible answer candidates and the second stage is to generate the final answer through a comparison and verification process. We also find that human's attention distribution is affected by both question-dependent factors (e.g., answer and soft matching signal with questions) and question-independent factors (e.g., position, IDF and Part-of-Speech tags of words). We extract features derived from the two-stage reading behavior model to predict human's attention signals during reading comprehension, which significantly improves performance in the MRC task. Findings in our work may bring insight into the understanding of human reading and information seeking processes, and help the machine to better meet users' information needs.

机器阅读理解(MRC)是NLP和IR研究中最具挑战性的任务之一。近年来，许多深度神经模型被成功地应用于一些简化的MRC任务设置中，其表现接近甚至优于人类。然而，这些模型在更实际的环境中仍然与人类有很大的性能差距，比如MS MARCO和DuReader数据集。虽然有很多研究人类阅读行为的著作，但对复杂阅读理解情境下的行为模式的研究仍然不足。我们认为，更好地了解人类在阅读理解过程中如何阅读和分配注意力有助于提高MRC任务的表现。本文通过实验研究了阅读理解任务中人类的阅读行为模式，招募了32名用户完成60个不同的任务。通过分析收集到的眼动数据和参与者的回答，我们提出了一个两阶段的阅读行为模型，其中第一阶段是搜索可能的答案候选人，第二阶段是通过比较和验证过程生成最终答案。我们还发现，人的注意力分布同时受到问题相关因素(如答案和与问题的软匹配信号)和问题无关因素(如词的位置、IDF和词性标签)的影响。我们从两阶段阅读行为模型中提取特征来预测人类在阅读理解过程中的注意信号，从而显著提高了在MRC任务中的表现。我们的研究结果可能会对人类阅读和信息寻求过程的理解带来深刻的见解，并帮助机器更好地满足用户的信息需求。

{"title":"Human Behavior Inspired Machine Reading Comprehension","authors":"Yukun Zheng, Jiaxin Mao, Yiqun Liu, Zixin Ye, Min Zhang, Shaoping Ma","doi":"10.1145/3331184.3331231","DOIUrl":"https://doi.org/10.1145/3331184.3331231","url":null,"abstract":"Machine Reading Comprehension (MRC) is one of the most challenging tasks in both NLP and IR researches. Recently, a number of deep neural models have been successfully adopted to some simplified MRC task settings, whose performances were close to or even better than human beings. However, these models still have large performance gaps with human beings in more practical settings, such as MS MARCO and DuReader datasets. Although there are many works studying human reading behavior, the behavior patterns in complex reading comprehension scenarios remain under-investigated. We believe that a better understanding of how human reads and allocates their attention during reading comprehension processes can help improve the performance of MRC tasks. In this paper, we conduct a lab study to investigate human's reading behavior patterns during reading comprehension tasks, where 32 users are recruited to take 60 distinct tasks. By analyzing the collected eye-tracking data and answers from participants, we propose a two-stage reading behavior model, in which the first stage is to search for possible answer candidates and the second stage is to generate the final answer through a comparison and verification process. We also find that human's attention distribution is affected by both question-dependent factors (e.g., answer and soft matching signal with questions) and question-independent factors (e.g., position, IDF and Part-of-Speech tags of words). We extract features derived from the two-stage reading behavior model to predict human's attention signals during reading comprehension, which significantly improves performance in the MRC task. Findings in our work may bring insight into the understanding of human reading and information seeking processes, and help the machine to better meet users' information needs.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"40 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84858261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 37

Looking for Opportunities: Challenges in Procurement Search 寻找机会:采购搜索中的挑战

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331428

Stuart Mackie, David Macdonald, L. Azzopardi, Yashar Moshfeghi

Procurement legislation stipulates that information about the goods, services, or works, that tax-funded authorities wish to purchase are made publicly available in a procurement contract notice. However, for businesses wishing to tender for such competitive opportunities, finding relevant procurement contract notices presents a challenging professional search task. In this talk, we will provide an overview of procurement search and then describe the challenges in addressing the related search and recommendation tasks.

采购法规定，有关税收资助当局希望购买的货物、服务或工程的信息，应在采购合同通知中公开提供。然而，对于希望投标这种竞争机会的企业来说，寻找相关的采购合同通知是一项具有挑战性的专业搜索任务。在这次演讲中，我们将提供采购搜索的概述，然后描述解决相关搜索和推荐任务的挑战。

引用次数: 1

Adaptive Multi-Attention Network Incorporating Answer Information for Duplicate Question Detection 基于答案信息的自适应多注意网络重复问题检测

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331228

Di Liang, Fubao Zhang, Weidong Zhang, Qi Zhang, Jinlan Fu, Minlong Peng, Tao Gui, Xuanjing Huang

Community-based question answering (CQA), which provides a platform for people with diverse backgrounds to share information and knowledge, has become increasingly popular. With the accumulation of site data, methods to detect duplicate questions in CQA sites have attracted considerable attention. Existing methods typically use only questions to complete the task. However, the paired answers may also provide valuable information. In this paper, we propose an answer information- enhanced adaptive multi-attention network (AMAN) to perform this task. AMAN takes full advantage of the semantic information in the paired answers while alleviating the noise problem caused by adding the answers. To evaluate the proposed method, we use a CQADupStack set and the Quora question-pair dataset expanded with paired answers. Experimental results demonstrate that the proposed model can achieve state-of-the-art performance on the above two data sets.

基于社区的问答(CQA)为不同背景的人们提供了一个分享信息和知识的平台，已经变得越来越流行。随着站点数据的积累，CQA站点中重复问题的检测方法受到了广泛的关注。现有的方法通常只使用问题来完成任务。然而，配对的答案也可能提供有价值的信息。在本文中，我们提出了一个答案信息增强的自适应多注意网络(AMAN)来完成这个任务。AMAN充分利用了配对答案中的语义信息，同时减轻了添加答案带来的噪声问题。为了评估所提出的方法，我们使用了CQADupStack集和Quora的问题对数据集，这些数据集扩展了成对的答案。实验结果表明，所提出的模型在上述两个数据集上都能达到最先进的性能。

引用次数: 23

Query Performance Prediction for Pseudo-Feedback-Based Retrieval 基于伪反馈的检索查询性能预测

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331369

Haggai Roitman, Oren Kurland

The query performance prediction task (QPP) is estimating retrieval effectiveness in the absence of relevance judgments. Prior work has focused on prediction for retrieval methods based on surface level query-document similarities (e.g., query likelihood). We address the prediction challenge for pseudo-feedback-based retrieval methods which utilize an initial retrieval to induce a new query model; the query model is then used for a second (final) retrieval. Our suggested approach accounts for the presumed effectiveness of the initially retrieved list, its similarity with the final retrieved list and properties of the latter. Empirical evaluation demonstrates the clear merits of our approach.

查询性能预测任务(query performance prediction task, QPP)是在没有相关性判断的情况下估计检索的有效性。先前的工作集中在基于表面级查询文档相似性(例如，查询似然)的检索方法的预测上。我们解决了基于伪反馈的检索方法的预测挑战，该方法利用初始检索来诱导新的查询模型;然后将查询模型用于第二次(最终)检索。我们建议的方法考虑了最初检索列表的假定有效性、它与最终检索列表的相似性以及后者的属性。实证评估表明了我们的方法的明显优点。

引用次数: 6

The SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019) SIGIR 2019开源IR可复制性挑战(OSIRRC 2019)

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331647

R. Clancy, N. Ferro, C. Hauff, Jimmy J. Lin, T. Sakai, Z. Z. Wu

The importance of repeatability, replicability, and reproducibility is broadly recognized in the computational sciences, both in supporting desirable scientific methodology as well as sustaining empirical progress. This workshop tackles the replicability challenge for ad hoc document retrieval, via a common Docker interface specification to support images that capture systems performing ad hoc retrieval experiments on standard test collections.

在计算科学中，可重复性、可复制性和可再现性的重要性得到了广泛的认可，无论是在支持理想的科学方法方面，还是在维持经验进展方面。本研讨会解决了临时文档检索的可复制性挑战，通过一个通用的Docker接口规范来支持捕获系统在标准测试集合上执行临时检索实验的图像。

引用次数: 22

AliISA

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331409

F. Xiao, Zhen Wang, Haikuan Huang, Jun Huang, Xi Chen, Hongbo Deng, Minghui Qiu, Xiaoli Gong

Online shopping has been a habit of more and more people, while most users are unable to craft an informative query, and thus it often takes a long search session to satisfy their purchase intents. We present AliISA - a shopping assistant which offers users some tips to further specify their queries during a search session. With such an interactive search, users tend to find targeted items with fewer page requests, which often means a better user experience. Currently, AliISA assists tens of millions of users per day, earns more usage than existing systems, and consequently brings in a 5% improvement in CVR. In this paper, we present our system, describe the underlying techniques, and discuss our experience in stabilizing reinforcement learning under an E-commerce environment.

引用次数: 1

Automatic Task Completion Flows from Web APIs 来自Web api的自动任务完成流

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331318

Kyle Williams, Seyyed Hadi Hashemi, I. Zitouni

The Web contains many APIs that could be combined in countless ways to enable Intelligent Assistants to complete all sorts of tasks. We propose a method to automatically produce task completion flows from a collection of these APIs by combining them in a graph and automatically extracting paths from the graph for task completion. These paths chain together API calls and use the output of executed APIs as inputs to others. We automatically extract these paths from an API graph in response to a user query and then rank the paths by the likelihood of them leading to user satisfaction. We apply our approach for task completion in the email and calendar domains and show how it can be used to automatically create task completion flows.

Web包含许多api，它们可以以无数种方式组合在一起，使智能助手能够完成各种任务。我们提出了一种方法，通过将这些api集合组合在一个图中并自动从图中提取任务完成的路径，从这些api集合中自动生成任务完成流。这些路径将API调用链接在一起，并使用已执行API的输出作为其他API的输入。我们自动从API图中提取这些路径以响应用户查询，然后根据它们导致用户满意度的可能性对路径进行排序。我们将在电子邮件和日历域中应用我们的任务完成方法，并展示如何使用它来自动创建任务完成流。

引用次数: 3

Adversarial Training for Review-Based Recommendations 基于审查的推荐的对抗性训练

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331313

Dimitrios Rafailidis, F. Crestani

Recent studies have shown that incorporating users' reviews into the collaborative filtering strategy can significantly boost the recommendation accuracy. A pressing challenge resides on learning how reviews influence users' rating behaviors. In this paper, we propose an Adversarial Training approach for Review-based recommendations, namely ATR. We design a neural architecture of sequence-to-sequence learning to calculate the deep representations of users' reviews on items following an adversarial training strategy. At the same time we jointly learn to factorize the rating matrix, by regularizing the deep representations of reviews with the user and item latent features. In doing so, our model captures the non-linear associations among reviews and ratings while producing a review for each user-item pair. Our experiments on publicly available datasets demonstrate the effectiveness of the proposed model, outperforming other state-of-the-art methods.

最近的研究表明，将用户评论纳入协同过滤策略可以显著提高推荐的准确性。一个紧迫的挑战在于了解评论如何影响用户的评分行为。在本文中，我们提出了一种基于评论的推荐的对抗性训练方法，即ATR。我们设计了一个序列到序列学习的神经结构，以计算用户在对抗性训练策略下对物品评论的深度表示。同时，通过将评论的深度表征与用户和物品的潜在特征进行正则化，我们共同学习了评分矩阵的因式分解。在这样做的过程中，我们的模型捕获了评论和评级之间的非线性关联，同时为每个用户-项目对生成评论。我们在公开可用数据集上的实验证明了所提出模型的有效性，优于其他最先进的方法。

引用次数: 18

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀