Is the unigram relevance model term independent?: classifying term dependencies in query expansion

Mike Symonds, P. Bruza, G. Zuccon, Laurianne Sitbon, I. Turner
{"title":"Is the unigram relevance model term independent?: classifying term dependencies in query expansion","authors":"Mike Symonds, P. Bruza, G. Zuccon, Laurianne Sitbon, I. Turner","doi":"10.1145/2407085.2407102","DOIUrl":null,"url":null,"abstract":"This paper develops a framework for classifying term dependencies in query expansion with respect to the role terms play in structural linguistic associations. The framework is used to classify and compare the query expansion terms produced by the unigram and positional relevance models. As the unigram relevance model does not explicitly model term dependencies in its estimation process it is often thought to ignore dependencies that exist between words in natural language.\n The framework presented in this paper is underpinned by two types of linguistic association, namely syntagmatic and paradigmatic associations. It was found that syntagmatic associations were a more prevalent form of linguistic association used in query expansion. Paradoxically, it was the unigram model that exhibited this association more than the positional relevance model. This surprising finding has two potential implications for information retrieval models: (1) if linguistic associations underpin query expansion, then a probabilistic term dependence assumption based on position is inadequate for capturing them; (2) the unigram relevance model captures more term dependency information than its underlying theoretical model suggests, so its normative position as a baseline that ignores term dependencies should perhaps be reviewed.","PeriodicalId":402985,"journal":{"name":"Australasian Document Computing Symposium","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australasian Document Computing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2407085.2407102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

This paper develops a framework for classifying term dependencies in query expansion with respect to the role terms play in structural linguistic associations. The framework is used to classify and compare the query expansion terms produced by the unigram and positional relevance models. As the unigram relevance model does not explicitly model term dependencies in its estimation process it is often thought to ignore dependencies that exist between words in natural language. The framework presented in this paper is underpinned by two types of linguistic association, namely syntagmatic and paradigmatic associations. It was found that syntagmatic associations were a more prevalent form of linguistic association used in query expansion. Paradoxically, it was the unigram model that exhibited this association more than the positional relevance model. This surprising finding has two potential implications for information retrieval models: (1) if linguistic associations underpin query expansion, then a probabilistic term dependence assumption based on position is inadequate for capturing them; (2) the unigram relevance model captures more term dependency information than its underlying theoretical model suggests, so its normative position as a baseline that ignores term dependencies should perhaps be reviewed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
单图相关模型项独立吗?:对查询扩展中的词依赖进行分类
本文根据术语在结构化语言关联中的作用,开发了一个用于分类查询扩展中的术语依赖关系的框架。该框架用于对单图和位置关联模型产生的查询扩展项进行分类和比较。由于单图关联模型在其估计过程中没有明确地对词之间的依赖关系进行建模,因此通常被认为忽略了自然语言中词之间存在的依赖关系。本文提出的框架以两种类型的语言关联为基础,即句法关联和范式关联。研究发现,在查询扩展中,组合联想是一种更为普遍的语言联想形式。矛盾的是,单图模型比位置关联模型更能显示这种关联。这一令人惊讶的发现对信息检索模型有两个潜在的影响:(1)如果语言关联是查询扩展的基础,那么基于位置的概率术语依赖假设不足以捕获它们;(2)单图关联模型比其基础理论模型所显示的捕获了更多的术语依赖信息,因此它作为忽略术语依赖的基线的规范地位也许应该被审查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exploring the magic of WAND Classifying microblogs for disasters Using eye tracking for evaluating web search interfaces Power walk: revisiting the random surfer Efficient top-k retrieval with signatures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1