首页 > 最新文献

Proceedings of the 2015 International Conference on The Theory of Information Retrieval最新文献

英文 中文
Revisiting Optimal Rank Aggregation: A Dynamic Programming Approach 重访最优秩聚合:一种动态规划方法
Shayan A. Tabrizi, J. Dadashkarimi, Mostafa Dehghani, H. Esfahani, A. Shakery
Rank aggregation, that is merging multiple ranked lists, is a pivotal challenge in many information retrieval (IR) systems, especially in distributed IR and multilingual IR. From the evaluation point of view, being able to calculate the upper-bound of performance of the final aggregated list lays the ground for evaluating different aggregation strategies, independently. In this paper, we propose an algorithm based on dynamic programming which, using relevancy information, obtains the aggregated list with the maximum performance that could be possibly achieved by any aggregation strategy. We also provide a detailed proof for the optimality of the result of the algorithm. Furthermore, we demonstrate that the previous proposed algorithm fails to reach the optimal result in many circumstances, due to its greedy essence.
在许多信息检索系统中,特别是在分布式信息检索和多语言信息检索中,排名聚合是一个关键的挑战。从评估的角度来看,能够计算最终聚合列表的性能上限,为独立评估不同的聚合策略奠定了基础。在本文中,我们提出了一种基于动态规划的算法,该算法利用关联信息,获得任何聚合策略所能达到的最大性能的聚合列表。我们还对算法结果的最优性提供了详细的证明。此外,我们还证明了由于其贪婪的本质,之前提出的算法在许多情况下无法达到最优结果。
{"title":"Revisiting Optimal Rank Aggregation: A Dynamic Programming Approach","authors":"Shayan A. Tabrizi, J. Dadashkarimi, Mostafa Dehghani, H. Esfahani, A. Shakery","doi":"10.1145/2808194.2809490","DOIUrl":"https://doi.org/10.1145/2808194.2809490","url":null,"abstract":"Rank aggregation, that is merging multiple ranked lists, is a pivotal challenge in many information retrieval (IR) systems, especially in distributed IR and multilingual IR. From the evaluation point of view, being able to calculate the upper-bound of performance of the final aggregated list lays the ground for evaluating different aggregation strategies, independently. In this paper, we propose an algorithm based on dynamic programming which, using relevancy information, obtains the aggregated list with the maximum performance that could be possibly achieved by any aggregation strategy. We also provide a detailed proof for the optimality of the result of the algorithm. Furthermore, we demonstrate that the previous proposed algorithm fails to reach the optimal result in many circumstances, due to its greedy essence.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134639934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Session Search by Direct Policy Learning 通过直接策略学习进行会话搜索
Jiyun Luo, Xuchu Dong, G. Yang
This paper proposes a novel retrieval model for session search. Through gradient descent, the model finds optimal policies for the best search engine actions from what is observed in the user and search engine interactions. The proposed framework applies direct policy learning to session search such that it greatly reduce the model complexity than prior work. It is also a flexible design, which includes a wide range of features describing the rich interactions in session search. The framework is shown to be highly effective evaluated on the recent TREC Session Tracks. As part of the efforts to bring reinforcement learning to information retrieval, this paper makes a novel contribution in theoretical modeling for session search.
提出了一种新的会话检索模型。通过梯度下降,该模型从用户和搜索引擎交互中观察到的内容中找到最佳搜索引擎操作的最优策略。提出的框架将直接策略学习应用于会话搜索,从而大大降低了模型的复杂性。它也是一种灵活的设计,它包含了描述会话搜索中丰富交互的广泛功能。该框架在最近的TREC会议轨道上被证明是非常有效的。作为将强化学习引入信息检索的一部分,本文在会话搜索的理论建模方面做出了新的贡献。
{"title":"Session Search by Direct Policy Learning","authors":"Jiyun Luo, Xuchu Dong, G. Yang","doi":"10.1145/2808194.2809461","DOIUrl":"https://doi.org/10.1145/2808194.2809461","url":null,"abstract":"This paper proposes a novel retrieval model for session search. Through gradient descent, the model finds optimal policies for the best search engine actions from what is observed in the user and search engine interactions. The proposed framework applies direct policy learning to session search such that it greatly reduce the model complexity than prior work. It is also a flexible design, which includes a wide range of features describing the rich interactions in session search. The framework is shown to be highly effective evaluated on the recent TREC Session Tracks. As part of the efforts to bring reinforcement learning to information retrieval, this paper makes a novel contribution in theoretical modeling for session search.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"337 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115608860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
On the Behavior of PRES Using Incomplete Judgment Sets 不完全判断集下press的行为
E. Voorhees
PRES, the Patent Retrieval Evaluation Score, is a family of retrieval evaluation measures that combines recall and user effort to reflect the quality of a retrieval run with respect to recall-oriented search tasks. Previous analysis of the measure was done using the test collection for the CLEF-IP 2009 track, a collection that contains a limited range of number of relevant documents, making it difficult to assess the behavior of PRES for varying recall contexts. This paper examines the effect of incomplete judgments on PRES scores using the well-studied TREC-8 ad hoc test collection, a collection with a much more varied number-of-relevants profile. Experiments with small judgment sets created through a typical collection-building process show the PRES measures are resilient to incomplete judgment sets.
PRES,专利检索评价分数,是一系列检索评价措施,结合了召回和用户努力来反映检索运行的质量,相对于面向召回的搜索任务。之前对该方法的分析是使用CLEF-IP 2009轨道的测试集合完成的,该集合包含有限数量的相关文档,因此很难评估PRES在不同召回上下文中的行为。本文使用经过充分研究的TREC-8特设测试集(一个具有更多不同数量的相关概况的集合)来检查不完整判断对PRES分数的影响。通过典型的集合构建过程创建的小判断集的实验表明,PRES措施对不完全判断集具有弹性。
{"title":"On the Behavior of PRES Using Incomplete Judgment Sets","authors":"E. Voorhees","doi":"10.1145/2808194.2809484","DOIUrl":"https://doi.org/10.1145/2808194.2809484","url":null,"abstract":"PRES, the Patent Retrieval Evaluation Score, is a family of retrieval evaluation measures that combines recall and user effort to reflect the quality of a retrieval run with respect to recall-oriented search tasks. Previous analysis of the measure was done using the test collection for the CLEF-IP 2009 track, a collection that contains a limited range of number of relevant documents, making it difficult to assess the behavior of PRES for varying recall contexts. This paper examines the effect of incomplete judgments on PRES scores using the well-studied TREC-8 ad hoc test collection, a collection with a much more varied number-of-relevants profile. Experiments with small judgment sets created through a typical collection-building process show the PRES measures are resilient to incomplete judgment sets.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123519799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing Aspects in Retrieved Search Results 在检索的搜索结果平衡方面
David Wemhoener, James Allan
Many queries contain explicit aspects which must be balanced in any retrieved result in order to meet a user's information need: if aspects of the query are missing or disproportionately represented in documents, the results will be of lower quality than desired. This balancing thus needs to occur both within the retrieved documents individually and across the entire set. We introduce the concept of query-aspect balance and describe a new evaluation measure, β-NDCG, that allows the evaluation of query-aspect balance on multivalued query-aspect judgments. We apply β-NDCG to a small test collection and explore its utility. We show that β-NDCG-NDCG captures problems of query aspect balance within and across documents in the ranked list.
许多查询包含显式方面,为了满足用户的信息需求,必须在任何检索结果中平衡这些方面:如果查询的方面缺失或在文档中不成比例地表示,则结果的质量将低于期望。因此,这种平衡既需要在单独检索的文档中进行,也需要在整个集合中进行。我们引入了查询方面平衡的概念,并描述了一个新的评价度量β-NDCG,它允许对多值查询方面判断进行查询方面平衡的评价。我们将β-NDCG应用于一个小型测试集,并探索其效用。我们证明了β-NDCG-NDCG捕获了排名列表中文档内部和文档之间的查询方面平衡问题。
{"title":"Balancing Aspects in Retrieved Search Results","authors":"David Wemhoener, James Allan","doi":"10.1145/2808194.2809492","DOIUrl":"https://doi.org/10.1145/2808194.2809492","url":null,"abstract":"Many queries contain explicit aspects which must be balanced in any retrieved result in order to meet a user's information need: if aspects of the query are missing or disproportionately represented in documents, the results will be of lower quality than desired. This balancing thus needs to occur both within the retrieved documents individually and across the entire set. We introduce the concept of query-aspect balance and describe a new evaluation measure, β-NDCG, that allows the evaluation of query-aspect balance on multivalued query-aspect judgments. We apply β-NDCG to a small test collection and explore its utility. We show that β-NDCG-NDCG captures problems of query aspect balance within and across documents in the ranked list.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122121060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improving Pseudo Relevance Feedback in the Divergence from Randomness Model 发散随机模型中伪相关反馈的改进
Dipasree Pal, Mandar Mitra, S. Bhattacharya
In an earlier analysis of Pseudo Relevance Feedback (PRF) models by Clinchant and Gaussier (2013), five desirable properties that PRF models should satisfy were formalised. Also, modifications to two PRF models were proposed in order to improve compliance with the desirable properties. These resulted in improved retrieval effectiveness. In this study, we introduce a sixth property that we believe PRF models should satisfy. We also extend the earlier exercise to Bo1, a standard PRF model. Experimental results on the robust, wt10g and gov2 datasets show that the proposed modifications yield improvements in effectiveness.
在Clinchant和Gaussier(2013)对伪相关反馈(PRF)模型的早期分析中,PRF模型应该满足的五个理想属性被形式化了。此外,还提出了对两个PRF模型的修改,以提高对理想特性的遵从性。这提高了检索效率。在本研究中,我们引入了我们认为PRF模型应该满足的第六个属性。我们还将前面的练习扩展到Bo1,一个标准的PRF模型。在鲁棒、wt10g和gov2数据集上的实验结果表明,所提出的改进方法提高了算法的有效性。
{"title":"Improving Pseudo Relevance Feedback in the Divergence from Randomness Model","authors":"Dipasree Pal, Mandar Mitra, S. Bhattacharya","doi":"10.1145/2808194.2809494","DOIUrl":"https://doi.org/10.1145/2808194.2809494","url":null,"abstract":"In an earlier analysis of Pseudo Relevance Feedback (PRF) models by Clinchant and Gaussier (2013), five desirable properties that PRF models should satisfy were formalised. Also, modifications to two PRF models were proposed in order to improve compliance with the desirable properties. These resulted in improved retrieval effectiveness. In this study, we introduce a sixth property that we believe PRF models should satisfy. We also extend the earlier exercise to Bo1, a standard PRF model. Experimental results on the robust, wt10g and gov2 datasets show that the proposed modifications yield improvements in effectiveness.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125129136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Predicting Relevance Feedback Effectiveness with the Help of the Principle of Polyrepresentation in MIR 基于多表示原理的MIR关联反馈有效性预测
David Zellhöfer
The principle of polyrepresentation - a representative of the cognitive viewpoint on IR, takes a holistic perspective on interactive IR research. One of the principle's core hypotheses is that a document is described by different representations such as visual low-level features, textual content, or relational metadata. The conjunctive combination of these representations, the so-called cognitive overlap, is assumed to compensate the inherent insecurity in relevance assessments of documents w.r.t. an information need. Recently, the cognitively motivated principle of polyrepresentation has been shown to correlate with quantum mechanics-inspired IR models. However, the principle's effectiveness has not been examined in relevance feedback-based interactive MIR. In this work, the principle's utility is studied in interactive MIR in order to investigate whether its main hypothesis can serve as a predictor of retrieval performance during relevance feedback. In order to obtain resilient results all experiments have been carried out with 6 different standard test sets that provide evidence of the utility of the presented approach and the underlying polyrepresentative hypothesis.
多元表征理论是人际关系认知观的代表,它从整体的角度看待互动人际关系研究。该原则的核心假设之一是,文档由不同的表示(如可视化的低级特征、文本内容或关系元数据)来描述。这些表征的结合,即所谓的认知重叠,被认为是为了补偿文件相关性评估中固有的不安全感,而不是信息需求。最近,多元表征的认知动机原理已被证明与量子力学启发的红外模型相关。然而,该原则的有效性尚未在基于相关反馈的交互式MIR中得到检验。在这项工作中,研究了该原则在交互式MIR中的效用,以探讨其主要假设是否可以作为相关反馈期间检索性能的预测因子。为了获得有弹性的结果,所有实验都使用6种不同的标准测试集进行,这些测试集提供了所提出方法和潜在多代表性假设的实用性的证据。
{"title":"Predicting Relevance Feedback Effectiveness with the Help of the Principle of Polyrepresentation in MIR","authors":"David Zellhöfer","doi":"10.1145/2808194.2809485","DOIUrl":"https://doi.org/10.1145/2808194.2809485","url":null,"abstract":"The principle of polyrepresentation - a representative of the cognitive viewpoint on IR, takes a holistic perspective on interactive IR research. One of the principle's core hypotheses is that a document is described by different representations such as visual low-level features, textual content, or relational metadata. The conjunctive combination of these representations, the so-called cognitive overlap, is assumed to compensate the inherent insecurity in relevance assessments of documents w.r.t. an information need. Recently, the cognitively motivated principle of polyrepresentation has been shown to correlate with quantum mechanics-inspired IR models. However, the principle's effectiveness has not been examined in relevance feedback-based interactive MIR. In this work, the principle's utility is studied in interactive MIR in order to investigate whether its main hypothesis can serve as a predictor of retrieval performance during relevance feedback. In order to obtain resilient results all experiments have been carried out with 6 different standard test sets that provide evidence of the utility of the presented approach and the underlying polyrepresentative hypothesis.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128037330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Signaling Game Approach to Databases Querying and Interaction 数据库查询与交互的信令博弈方法
Arash Termehchy, B. Touri
As most database users cannot precisely express their information needs, it is challenging for database querying and exploration interfaces to understand them. We propose a novel formal framework for representing and understanding information needs in database querying and exploration. Our framework considers querying as a collaboration between the user and the database system to establish a mutual language for representing information needs. We formalize this collaboration as a signaling game, where each mutual language is an equilibrium for the game. A query interface is more effective if it establishes a less ambiguous mutual language faster. We discuss some equilibria, strategies, and the convergence rates in this game. In particular, we propose a reinforcement learning mechanism and analyze it within our framework. We prove that this adaptation mechanism for the query interface improves the effectiveness of answering queries stochastically speaking, and converges almost surely.
由于大多数数据库用户不能准确地表达他们的信息需求,因此数据库查询和探索接口很难理解这些需求。我们提出了一个新的形式化框架来表示和理解数据库查询和探索中的信息需求。我们的框架将查询视为用户和数据库系统之间的协作,以建立一种表示信息需求的共同语言。我们将这种合作形式化为一种信号游戏,其中每种相互语言都是游戏的平衡。如果查询接口能够更快地建立不那么模糊的相互语言,那么它将更加有效。我们讨论了这个博弈中的一些均衡、策略和收敛速度。特别是,我们提出了一种强化学习机制,并在我们的框架内对其进行了分析。我们证明了这种查询接口的自适应机制提高了随机回答查询的有效性,并且几乎肯定收敛。
{"title":"A Signaling Game Approach to Databases Querying and Interaction","authors":"Arash Termehchy, B. Touri","doi":"10.1145/2808194.2809487","DOIUrl":"https://doi.org/10.1145/2808194.2809487","url":null,"abstract":"As most database users cannot precisely express their information needs, it is challenging for database querying and exploration interfaces to understand them. We propose a novel formal framework for representing and understanding information needs in database querying and exploration. Our framework considers querying as a collaboration between the user and the database system to establish a mutual language for representing information needs. We formalize this collaboration as a signaling game, where each mutual language is an equilibrium for the game. A query interface is more effective if it establishes a less ambiguous mutual language faster. We discuss some equilibria, strategies, and the convergence rates in this game. In particular, we propose a reinforcement learning mechanism and analyze it within our framework. We prove that this adaptation mechanism for the query interface improves the effectiveness of answering queries stochastically speaking, and converges almost surely.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128813785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Verboseness Fission for BM25 Document Length Normalization BM25文档长度规范化的冗余裂变
Aldo Lipani, M. Lupu, A. Hanbury, Akiko Aizawa
BM25 is probably the most well known term weighting model in Information Retrieval. It has, depending on the formula variant at hand, 2 or 3 parameters (k1, b, and k3). This paper addresses b - the document length normalization parameter. Based on the observation that the two cases previously discussed for length normalization (multi-topicality and verboseness) are actually three: multi-topicality, verboseness with word repetition (repetitiveness) and verboseness with synonyms, we propose and test a new length normalization method that removes the need for a b parameter in BM25. Testing the new method on a set of purposefully varied test collections, we observe that we can obtain results statistically indistinguishable from the optimal results, therefore removing the need for ground-truth based optimization.
BM25可能是信息检索中最著名的术语加权模型。根据手头的公式变体,它有2或3个参数(k1, b和k3)。本文讨论了文档长度规范化参数b。基于前面讨论的长度规范化的两种情况(多主题性和冗长性)实际上是三种情况:多主题性、单词重复的冗长性(重复性)和同义词的冗长性,我们提出并测试了一种新的长度规范化方法,该方法在BM25中不需要b参数。在一组有目的地变化的测试集合上测试新方法,我们观察到我们可以获得与最优结果在统计上无法区分的结果,因此无需基于真值的优化。
{"title":"Verboseness Fission for BM25 Document Length Normalization","authors":"Aldo Lipani, M. Lupu, A. Hanbury, Akiko Aizawa","doi":"10.1145/2808194.2809486","DOIUrl":"https://doi.org/10.1145/2808194.2809486","url":null,"abstract":"BM25 is probably the most well known term weighting model in Information Retrieval. It has, depending on the formula variant at hand, 2 or 3 parameters (k1, b, and k3). This paper addresses b - the document length normalization parameter. Based on the observation that the two cases previously discussed for length normalization (multi-topicality and verboseness) are actually three: multi-topicality, verboseness with word repetition (repetitiveness) and verboseness with synonyms, we propose and test a new length normalization method that removes the need for a b parameter in BM25. Testing the new method on a set of purposefully varied test collections, we observe that we can obtain results statistically indistinguishable from the optimal results, therefore removing the need for ground-truth based optimization.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126551449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Towards a Formal Framework for Utility-oriented Measurements of Retrieval Effectiveness 面向效用的检索有效性度量的形式化框架
M. Ferrante, N. Ferro, Maria Maistro
In this paper we present a formal framework to define and study the properties of utility-oriented measurements of retrieval effectiveness, like AP, RBP, ERR and many other popular IR evaluation measures. The proposed framework is laid in the wake of the representational theory of measurement, which provides the foundations of the modern theory of measurement in both physical and social sciences, thus contributing to explicitly link IR evaluation to a broader context. The proposed framework is minimal, in the sense that it relies on just one axiom, from which other properties are derived. Finally, it contributes to a better understanding and a clear separation of what issues are due to the inherent problems in comparing systems in terms of retrieval effectiveness and what others are due to the expected numerical properties of a measurement.
在本文中,我们提出了一个正式的框架来定义和研究面向效用的检索有效性度量,如AP、RBP、ERR和许多其他流行的IR评价度量。该框架是在表征性测量理论的基础上提出的,表征性测量理论为物理科学和社会科学的现代测量理论奠定了基础,从而有助于将IR评估明确地与更广泛的背景联系起来。所提出的框架是最小的,因为它只依赖于一个公理,从这个公理中推导出其他属性。最后,它有助于更好地理解和明确区分哪些问题是由于检索有效性方面比较系统的固有问题造成的,以及其他问题是由于测量的预期数值特性造成的。
{"title":"Towards a Formal Framework for Utility-oriented Measurements of Retrieval Effectiveness","authors":"M. Ferrante, N. Ferro, Maria Maistro","doi":"10.1145/2808194.2809452","DOIUrl":"https://doi.org/10.1145/2808194.2809452","url":null,"abstract":"In this paper we present a formal framework to define and study the properties of utility-oriented measurements of retrieval effectiveness, like AP, RBP, ERR and many other popular IR evaluation measures. The proposed framework is laid in the wake of the representational theory of measurement, which provides the foundations of the modern theory of measurement in both physical and social sciences, thus contributing to explicitly link IR evaluation to a broader context. The proposed framework is minimal, in the sense that it relies on just one axiom, from which other properties are derived. Finally, it contributes to a better understanding and a clear separation of what issues are due to the inherent problems in comparing systems in terms of retrieval effectiveness and what others are due to the expected numerical properties of a measurement.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128314795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
Context Retrieval for Web Tables Web表的上下文检索
Hong Wang, Anqi Liu, Jing Wang, Brian D. Ziebart, Clement T. Yu, Warren Shen
Many modern knowledge bases are built by extracting information from millions of web pages. Though existing extraction methods primarily focus on web pages' main text, a huge amount of information is embedded within other web structures, such as web tables. Previous studies have shown that linking web page tables and textual context is beneficial for extracting more information from web pages. However, using the text surrounding each table without carefully assessing its relevance introduces noise in the extracted information, degrading its accuracy. To the best of our knowledge, we provide the first systematic study of the problem of table-related context retrieval: given a table and the sentences within the same web page, determine for each sentence whether it is relevant to the table. We define the concept of relevance and introduce a Table-Related Context Retrieval system (TRCR) in this paper. We experiment with different machine learning algorithms, including a recently developed algorithm that is robust to biases in the training data, and show that our system retrieves table-related context with F1=0.735.
许多现代知识库是通过从数以百万计的网页中提取信息而建立起来的。虽然现有的提取方法主要集中在网页的主要文本上,但是大量的信息被嵌入到其他的网页结构中,比如网页表。以往的研究表明,链接网页表和文本上下文有利于从网页中提取更多的信息。然而,在没有仔细评估其相关性的情况下使用每个表周围的文本会在提取的信息中引入噪声,从而降低其准确性。据我们所知,我们提供了第一个与表相关的上下文检索问题的系统研究:给定一个表和同一网页中的句子,确定每个句子是否与表相关。本文定义了关联的概念,并介绍了一个表相关上下文检索系统(TRCR)。我们实验了不同的机器学习算法,包括最近开发的一种对训练数据中的偏差具有鲁棒性的算法,并表明我们的系统检索F1=0.735的表相关上下文。
{"title":"Context Retrieval for Web Tables","authors":"Hong Wang, Anqi Liu, Jing Wang, Brian D. Ziebart, Clement T. Yu, Warren Shen","doi":"10.1145/2808194.2809453","DOIUrl":"https://doi.org/10.1145/2808194.2809453","url":null,"abstract":"Many modern knowledge bases are built by extracting information from millions of web pages. Though existing extraction methods primarily focus on web pages' main text, a huge amount of information is embedded within other web structures, such as web tables. Previous studies have shown that linking web page tables and textual context is beneficial for extracting more information from web pages. However, using the text surrounding each table without carefully assessing its relevance introduces noise in the extracted information, degrading its accuracy. To the best of our knowledge, we provide the first systematic study of the problem of table-related context retrieval: given a table and the sentences within the same web page, determine for each sentence whether it is relevant to the table. We define the concept of relevance and introduce a Table-Related Context Retrieval system (TRCR) in this paper. We experiment with different machine learning algorithms, including a recently developed algorithm that is robust to biases in the training data, and show that our system retrieves table-related context with F1=0.735.","PeriodicalId":440325,"journal":{"name":"Proceedings of the 2015 International Conference on The Theory of Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131128206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings of the 2015 International Conference on The Theory of Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1