Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval最新文献

英文中文

EARS 2019: The 2nd International Workshop on ExplainAble Recommendation and Search EARS 2019:第二届可解释推荐和搜索国际研讨会

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331649

Yongfeng Zhang, Yi Zhang, Min Zhang, C. Shah

Explainable recommendation and search attempt to develop models or methods that not only generate high-quality recommendation or search results, but also interpretability of the models or explanations of the results for users or system designers, which can help to improve the system transparency, persuasiveness, trustworthiness, and effectiveness, etc. This is even more important in personalized search and recommendation scenarios, where users would like to know why a particular product, web page, news report, or friend suggestion exists in his or her own search and recommendation lists. The workshop focuses on the research and application of explainable recommendation, search, and a broader scope of IR tasks. It will gather researchers as well as practitioners in the field for discussions, idea communications, and research promotions. It will also generate insightful debates about the recent regulations regarding AI interpretability, to a broader community including but not limited to IR, machine learning, AI, Data Science, and beyond.

可解释的推荐和搜索试图开发模型或方法，不仅产生高质量的推荐或搜索结果，而且模型或结果的解释对于用户或系统设计者来说是可解释性的，这有助于提高系统的透明度、说服力、可信度和有效性等。这在个性化搜索和推荐场景中更为重要，用户想知道为什么特定的产品、网页、新闻报道或朋友建议会出现在他或她自己的搜索和推荐列表中。研讨会的重点是可解释的推荐、搜索和更广泛的红外任务的研究和应用。它将聚集该领域的研究人员和实践者进行讨论、思想交流和研究推广。它还将引发有关人工智能可解释性的最新法规的深刻辩论，涉及更广泛的社区，包括但不限于人工智能、机器学习、人工智能、数据科学等。

引用次数: 8

Text Retrieval Priors for Bayesian Logistic Regression 基于贝叶斯逻辑回归的文本检索先验

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331299

Eugene Yang, D. Lewis, O. Frieder

Discriminative learning algorithms such as logistic regression excel when training data are plentiful, but falter when it is meager. An extreme case is text retrieval (zero training data), where discriminative learning is impossible and heuristics such as BM25, which combine domain knowledge (a topical keyword query) with generative learning (Naive Bayes), are dominant. Building on past work, we show that BM25-inspired Gaussian priors for Bayesian logistic regression based on topical keywords provide better effectiveness than the usual L2 (zero mode, uniform variance) Gaussian prior. On two high recall retrieval datasets, the resulting models transition smoothly from BM25 level effectiveness to discriminative effectiveness as training data volume increases, dominating L2 regularization even when substantial training data is available.

判别学习算法，如逻辑回归，在训练数据丰富的情况下表现优异，但在训练数据不足的情况下则表现不佳。一个极端的例子是文本检索(零训练数据)，其中判别学习是不可能的，而像BM25这样的启发式方法将领域知识(主题关键字查询)与生成学习(朴素贝叶斯)相结合，占主导地位。在过去工作的基础上，我们发现基于主题关键词的贝叶斯逻辑回归的bm25启发的高斯先验比通常的L2(零模式，均匀方差)高斯先验提供了更好的有效性。在两个高查全率检索数据集上，随着训练数据量的增加，所得模型从BM25水平的有效性平稳地过渡到判别有效性，即使在大量训练数据可用时也能主导L2正则化。

引用次数: 3

WestSearch Plus: A Non-factoid Question-Answering System for the Legal Domain WestSearch Plus:法律领域的非事实问答系统

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331397

Gayle McElvain, George Sanchez, S. Matthews, Don Teo, Filippo Pompili, Tonya Custis

We present a non-factoid QA system that provides legally accurate, jurisdictionally relevant, and conversationally responsive answers to user-entered questions in the legal domain. This commercially available system is entirely based on NLP and IR, and does not rely on a structured knowledge base. WestSearch Plus aims to provide concise one sentence answers for basic questions about the law. It is not restricted in scope to particular topics or jurisdictions. The corpus of potential answers contains approximately 22M documents classified to over 120K legal topics.

我们提出了一个非事实性的QA系统，该系统为法律领域的用户输入问题提供法律上准确、司法上相关和对话式响应的答案。这个商业上可用的系统完全基于NLP和IR，不依赖于结构化的知识库。WestSearch Plus旨在为有关法律的基本问题提供简洁的一句话答案。它的范围不限于特定主题或司法管辖区。潜在答案的语料库包含大约2200万份文档，分类为超过12万个法律主题。

引用次数: 17

Online Multi-modal Hashing with Dynamic Query-adaption 具有动态查询适应性的在线多模态哈希

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331217

X. Lu, Lei Zhu, Zhiyong Cheng, Liqiang Nie, Huaxiang Zhang

Multi-modal hashing is an effective technique to support large-scale multimedia retrieval, due to its capability of encoding heterogeneous multi-modal features into compact and similarity-preserving binary codes. Although great progress has been achieved so far, existing methods still suffer from several problems, including: 1) All existing methods simply adopt fixed modality combination weights in online hashing process to generate the query hash codes. This strategy cannot adaptively capture the variations of different queries. 2) They either suffer from insufficient semantics (for unsupervised methods) or require high computation and storage cost (for the supervised methods, which rely on pair-wise semantic matrix). 3) They solve the hash codes with relaxed optimization strategy or bit-by-bit discrete optimization, which results in significant quantization loss or consumes considerable computation time. To address the above limitations, in this paper, we propose an Online Multi-modal Hashing with Dynamic Query-adaption (OMH-DQ) method in a novel fashion. Specifically, a self-weighted fusion strategy is designed to adaptively preserve the multi-modal feature information into hash codes by exploiting their complementarity. The hash codes are learned with the supervision of pair-wise semantic labels to enhance their discriminative capability, while avoiding the challenging symmetric similarity matrix factorization. Under such learning framework, the binary hash codes can be directly obtained with efficient operations and without quantization errors. Accordingly, our method can benefit from the semantic labels, and simultaneously, avoid the high computation complexity. Moreover, to accurately capture the query variations, at the online retrieval stage, we design a parameter-free online hashing module which can adaptively learn the query hash codes according to the dynamic query contents. Extensive experiments demonstrate the state-of-the-art performance of the proposed approach from various aspects.

多模态哈希能够将异构多模态特征编码成紧凑且保持相似性的二进制编码，是支持大规模多媒体检索的有效技术。虽然目前已经取得了很大的进展，但是现有的方法仍然存在一些问题，包括:1)所有现有的方法在在线哈希过程中都简单地采用固定模态组合权值来生成查询哈希码。这种策略不能自适应地捕捉不同查询的变化。2)它们要么语义不足(对于无监督方法)，要么需要高计算和存储成本(对于依赖成对语义矩阵的监督方法)。3)他们采用宽松的优化策略或逐位离散优化来求解哈希码，这会导致严重的量化损失或消耗大量的计算时间。为了解决上述限制，在本文中，我们以一种新颖的方式提出了一种带有动态查询自适应的在线多模态哈希(OMH-DQ)方法。具体而言，设计了一种自加权融合策略，利用多模态特征信息的互补性，自适应地将多模态特征信息保存到哈希码中。哈希码是在对语义标签的监督下学习的，以增强其判别能力，同时避免了具有挑战性的对称相似矩阵分解。在这种学习框架下，可以直接得到二进制哈希码，运算效率高，没有量化误差。因此，我们的方法可以受益于语义标签，同时避免了高计算复杂度。此外，为了准确捕捉查询变化，在在线检索阶段，我们设计了一个无参数的在线哈希模块，该模块可以根据动态查询内容自适应学习查询哈希码。大量的实验从各个方面证明了所提出方法的最先进性能。

{"title":"Online Multi-modal Hashing with Dynamic Query-adaption","authors":"X. Lu, Lei Zhu, Zhiyong Cheng, Liqiang Nie, Huaxiang Zhang","doi":"10.1145/3331184.3331217","DOIUrl":"https://doi.org/10.1145/3331184.3331217","url":null,"abstract":"Multi-modal hashing is an effective technique to support large-scale multimedia retrieval, due to its capability of encoding heterogeneous multi-modal features into compact and similarity-preserving binary codes. Although great progress has been achieved so far, existing methods still suffer from several problems, including: 1) All existing methods simply adopt fixed modality combination weights in online hashing process to generate the query hash codes. This strategy cannot adaptively capture the variations of different queries. 2) They either suffer from insufficient semantics (for unsupervised methods) or require high computation and storage cost (for the supervised methods, which rely on pair-wise semantic matrix). 3) They solve the hash codes with relaxed optimization strategy or bit-by-bit discrete optimization, which results in significant quantization loss or consumes considerable computation time. To address the above limitations, in this paper, we propose an Online Multi-modal Hashing with Dynamic Query-adaption (OMH-DQ) method in a novel fashion. Specifically, a self-weighted fusion strategy is designed to adaptively preserve the multi-modal feature information into hash codes by exploiting their complementarity. The hash codes are learned with the supervision of pair-wise semantic labels to enhance their discriminative capability, while avoiding the challenging symmetric similarity matrix factorization. Under such learning framework, the binary hash codes can be directly obtained with efficient operations and without quantization errors. Accordingly, our method can benefit from the semantic labels, and simultaneously, avoid the high computation complexity. Moreover, to accurately capture the query variations, at the online retrieval stage, we design a parameter-free online hashing module which can adaptively learn the query hash codes according to the dynamic query contents. Extensive experiments demonstrate the state-of-the-art performance of the proposed approach from various aspects.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"115 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79398273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 106

Session details: Session 8C: Summarization and Information Extraction 会议详情:8C部分:总结与信息提取

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3349696

Bárbara Poblete

引用次数: 0

Answer-enhanced Path-aware Relation Detection over Knowledge Base 基于知识库的答案增强路径感知关系检测

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331328

Daoyuan Chen, Min Yang, Haitao Zheng, Yaliang Li, Ying Shen

Knowledge Based Question Answering (KBQA) is one of the most promising approaches to provide suitable answers for the queries posted by users. Relation detection that aims to take full advantage of the substantial knowledge contained in knowledge base (KB) becomes increasingly important. Significant progress has been made in performing relation detection over KB. However, recent deep neural networks that achieve the state of the art on KB-based relation detection task only consider the context information of question sentences rather than the relatedness between question and answer candidates, and exclusively extract the relation from KB triple rather than learn informative relational path. In this paper, we propose a Knowledge-driven Relation Detection network (KRD) to interactively learn answer-enhanced question representations and path-aware relation representations for relation detection. A Siamese LSTM is employed into a similarity matching process between the question representation and relation representation. Experimental results on the SimpleQuestions and WebQSP datasets demonstrate that KRD outperforms the state-of-the-art methods. In addition, a series of ablation test show the robust superiority of the proposed method.

基于知识的问答(KBQA)是为用户提出的问题提供合适答案的最有前途的方法之一。以充分利用知识库中包含的大量知识为目标的关系检测变得越来越重要。在KB上执行关系检测方面取得了重大进展。然而，目前在基于知识库的关系检测任务上取得最新进展的深度神经网络只考虑问题句子的上下文信息，而没有考虑问题和答案候选之间的相关性，并且只从知识库三元组中提取关系，而没有学习信息关系路径。在本文中，我们提出了一个知识驱动的关系检测网络(KRD)，以交互式地学习答案增强的问题表示和路径感知的关系表示，用于关系检测。采用Siamese LSTM对问题表示和关系表示进行相似度匹配。在SimpleQuestions和WebQSP数据集上的实验结果表明，KRD优于最先进的方法。此外，一系列烧蚀试验表明了该方法的鲁棒性。

{"title":"Answer-enhanced Path-aware Relation Detection over Knowledge Base","authors":"Daoyuan Chen, Min Yang, Haitao Zheng, Yaliang Li, Ying Shen","doi":"10.1145/3331184.3331328","DOIUrl":"https://doi.org/10.1145/3331184.3331328","url":null,"abstract":"Knowledge Based Question Answering (KBQA) is one of the most promising approaches to provide suitable answers for the queries posted by users. Relation detection that aims to take full advantage of the substantial knowledge contained in knowledge base (KB) becomes increasingly important. Significant progress has been made in performing relation detection over KB. However, recent deep neural networks that achieve the state of the art on KB-based relation detection task only consider the context information of question sentences rather than the relatedness between question and answer candidates, and exclusively extract the relation from KB triple rather than learn informative relational path. In this paper, we propose a Knowledge-driven Relation Detection network (KRD) to interactively learn answer-enhanced question representations and path-aware relation representations for relation detection. A Siamese LSTM is employed into a similarity matching process between the question representation and relation representation. Experimental results on the SimpleQuestions and WebQSP datasets demonstrate that KRD outperforms the state-of-the-art methods. In addition, a series of ablation test show the robust superiority of the proposed method.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"70 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77720921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Leveraging Emotional Signals for Credibility Detection 利用情感信号进行可信度检测

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331285

Anastasia Giahanou, Paolo Rosso, F. Crestani

The spread of false information on the Web is one of the main problems of our society. Automatic detection of fake news posts is a hard task since they are intentionally written to mislead the readers and to trigger intense emotions to them in an attempt to be disseminated in the social networks. Even though recent studies have explored different linguistic patterns of false claims, the role of emotional signals has not yet been explored. In this paper, we study the role of emotional signals in fake news detection. In particular, we propose an LSTM model that incorporates emotional signals extracted from the text of the claims to differentiate between credible and non-credible ones. Experiments on real world datasets show the importance of emotional signals for credibility assessment.

网络上虚假信息的传播是我们社会的主要问题之一。虚假新闻是为了误导读者，引发读者的强烈情绪，并试图在社交网络上传播，因此自动检测虚假新闻是一项艰巨的任务。尽管最近的研究已经探索了虚假陈述的不同语言模式，但情感信号的作用尚未得到探讨。本文研究了情感信号在假新闻检测中的作用。特别是，我们提出了一个LSTM模型，该模型结合了从声明文本中提取的情感信号，以区分可信和不可信的声明。真实世界数据集的实验表明情绪信号对可信度评估的重要性。

引用次数: 112

From Semantic Retrieval to Pairwise Ranking: Applying Deep Learning in E-commerce Search 从语义检索到配对排序:深度学习在电子商务搜索中的应用

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331434

Rui Li, Yunjiang Jiang, Wen-Yun Yang, Guoyu Tang, Songlin Wang, Chaoyi Ma, Wei He, Xi Xiong, Yun Xiao, Y. Zhao

We introduce deep learning models to the two most important stages in product search at JD.com, one of the largest e-commerce platforms in the world. Specifically, we outline the design of a deep learning system that retrieves semantically relevant items to a query within milliseconds, and a pairwise deep re-ranking system, which learns subtle user preferences. Compared to traditional search systems, the proposed approaches are better at semantic retrieval and personalized ranking, achieving significant improvements.

我们将深度学习模型引入京东(世界上最大的电子商务平台之一)产品搜索的两个最重要阶段。具体来说，我们概述了一个深度学习系统的设计，该系统可以在几毫秒内检索与查询相关的语义项，以及一个两两深度重新排序系统，该系统可以学习微妙的用户偏好。与传统的搜索系统相比，本文提出的方法在语义检索和个性化排序方面取得了显著的进步。

引用次数: 5

Explanatory and Actionable Debugging for Machine Learning: A TableQA Demonstration 解释性和可操作的机器学习调试:一个TableQA演示

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331404

Minseok Cho, Gyeongbok Lee, Seung-won Hwang

Question answering from tables (TableQA) extracting answers from tables from the question given in natural language, has been actively studied. Existing models have been trained and evaluated mostly with respect to answer accuracy using public benchmark datasets such as WikiSQL. The goal of this demonstration is to show a debugging tool for such models, explaining answers to humans, known as explanatory debugging. Our key distinction is making it "actionable" to allow users to directly correct models upon explanation. Specifically, our tool surfaces annotation and models errors for users to correct, and provides actionable insights.

表格问答(TableQA)从自然语言给出的问题中提取表格中的答案，已经得到了积极的研究。现有模型的训练和评估主要是基于使用公共基准数据集(如WikiSQL)的答案准确性。本演示的目标是展示用于此类模型的调试工具，向人们解释答案，即解释性调试。我们的关键区别是使其“可操作”，以允许用户在解释后直接纠正模型。具体来说，我们的工具显示注释和模型错误，供用户纠正，并提供可操作的见解。

引用次数: 6

Learning Unsupervised Semantic Document Representation for Fine-grained Aspect-based Sentiment Analysis 学习用于细粒度基于方面的情感分析的无监督语义文档表示

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2019-07-18 DOI: 10.1145/3331184.3331320

Hao-Ming Fu, Pu-Jen Cheng

Document representation is the core of many NLP tasks on machine understanding. A general representation learned in an unsupervised manner reserves generality and can be used for various applications. In practice, sentiment analysis (SA) has been a challenging task that is regarded to be deeply semantic-related and is often used to assess general representations. Existing methods on unsupervised document representation learning can be separated into two families: sequential ones, which explicitly take the ordering of words into consideration, and non-sequential ones, which do not explicitly do so. However, both of them suffer from their own weaknesses. In this paper, we propose a model that overcomes difficulties encountered by both families of methods. Experiments show that our model outperforms state-of-the-art methods on popular SA datasets and a fine-grained aspect-based SA by a large margin.

文档表示是机器理解中许多NLP任务的核心。以无监督方式学习的一般表示保留了通用性，可用于各种应用。在实践中，情感分析(SA)一直是一项具有挑战性的任务，被认为是与语义深度相关的，通常用于评估一般表征。现有的无监督文档表示学习方法可以分为两大类:顺序方法，明确考虑单词的顺序;非顺序方法，不明确考虑单词的顺序。然而，他们都有自己的弱点。在本文中，我们提出了一个模型，克服了这两种方法所遇到的困难。实验表明，我们的模型在流行的SA数据集和细粒度的基于方面的SA上表现优于最先进的方法。

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀