Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval最新文献

英文中文

Analyzing User's Sequential Behavior in Query Auto-Completion via Markov Processes 基于马尔可夫过程的查询自动完成中用户顺序行为分析

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767723

Liangda Li, Hongbo Deng, Anlei Dong, Yi Chang, H. Zha, R. Baeza-Yates

Query auto-completion (QAC) plays an important role in assisting users typing less while submitting a query. The QAC engine generally offers a list of suggested queries that start with a user's input as a prefix, and the list of suggestions is changed to match the updated input after the user types each keystroke. Therefore rich user interactions can be observed along with each keystroke until a user clicks a suggestion or types the entire query manually. It becomes increasingly important to analyze and understand users' interactions with the QAC engine, to improve its performance. Existing works on QAC either ignored users' interaction data, or assumed that their interactions at each keystroke are independent from others. Our paper pays high attention to users' sequential interactions with a QAC engine in and across QAC sessions, rather than users' interactions at each keystroke of each QAC session separately. Analyzing the dependencies in users' sequential interactions improves our understanding of the following three questions: 1) how is a user's skipping/viewing move at the current keystroke influenced by that at the previous keystroke? 2) how to improve search engines' query suggestions at short keystrokes based on those at latter long keystrokes? and 3) facing a targeted query shown in the suggestion list, why does a user decide to continue typing rather than click the intended suggestion? We propose a probabilistic model that addresses those three questions in a unified way, and illustrate how the model determines users' final click decisions. By comparing with state-of-the-art methods, our proposed model does suggest queries that better satisfy users' intents.

查询自动完成(QAC)在帮助用户在提交查询时减少输入方面发挥着重要作用。QAC引擎通常提供以用户输入作为前缀开始的建议查询列表，并且在用户键入每个击键后，建议列表将被更改以匹配更新的输入。因此，随着每次击键，可以观察到丰富的用户交互，直到用户单击建议或手动键入整个查询。为了提高QAC引擎的性能，分析和理解用户与QAC引擎的交互变得越来越重要。现有的QAC工作要么忽略用户的交互数据，要么假设他们在每次击键时的交互是独立的。我们的论文高度关注用户在QAC会话中和跨QAC会话中与QAC引擎的顺序交互，而不是用户在每个QAC会话的每次击键时分别进行的交互。分析用户顺序交互中的依赖关系有助于我们理解以下三个问题:1)用户在当前击键时的跳过/查看移动是如何受到前一次击键的影响的?2)如何在后期长击的基础上改进短击时搜索引擎的查询建议?3)面对建议列表中显示的目标查询，为什么用户决定继续输入而不是点击预期的建议?我们提出了一个概率模型，以统一的方式解决这三个问题，并说明该模型如何决定用户的最终点击决策。通过与最先进的方法进行比较，我们提出的模型确实提出了更好地满足用户意图的查询。

{"title":"Analyzing User's Sequential Behavior in Query Auto-Completion via Markov Processes","authors":"Liangda Li, Hongbo Deng, Anlei Dong, Yi Chang, H. Zha, R. Baeza-Yates","doi":"10.1145/2766462.2767723","DOIUrl":"https://doi.org/10.1145/2766462.2767723","url":null,"abstract":"Query auto-completion (QAC) plays an important role in assisting users typing less while submitting a query. The QAC engine generally offers a list of suggested queries that start with a user's input as a prefix, and the list of suggestions is changed to match the updated input after the user types each keystroke. Therefore rich user interactions can be observed along with each keystroke until a user clicks a suggestion or types the entire query manually. It becomes increasingly important to analyze and understand users' interactions with the QAC engine, to improve its performance. Existing works on QAC either ignored users' interaction data, or assumed that their interactions at each keystroke are independent from others. Our paper pays high attention to users' sequential interactions with a QAC engine in and across QAC sessions, rather than users' interactions at each keystroke of each QAC session separately. Analyzing the dependencies in users' sequential interactions improves our understanding of the following three questions: 1) how is a user's skipping/viewing move at the current keystroke influenced by that at the previous keystroke? 2) how to improve search engines' query suggestions at short keystrokes based on those at latter long keystrokes? and 3) facing a targeted query shown in the suggestion list, why does a user decide to continue typing rather than click the intended suggestion? We propose a probabilistic model that addresses those three questions in a unified way, and illustrate how the model determines users' final click decisions. By comparing with state-of-the-art methods, our proposed model does suggest queries that better satisfy users' intents.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131254189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 30

Towards Understanding the Impact of Length in Web Search Result Summaries over a Speech-only Communication Channel 在纯语音通信通道上理解长度对网络搜索结果摘要的影响

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767826

Johanne R. Trippas, Damiano Spina, M. Sanderson, L. Cavedon

Presenting search results over a speech-only communication channel involves a number of challenges for users due to cognitive limitations and the serial nature of speech. We investigated the impact of search result summary length in speech-based web search, and compared our results to a text baseline. Based on crowdsourced workers, we found that users preferred longer, more informative summaries for text presentation. For audio, user preferences depended on the style of query. For single-facet queries, shortened audio summaries were preferred, additionally users were found to judge relevance with a similar accuracy compared to text-based summaries. For multi-facet queries, user preferences were not as clear, suggesting that more sophisticated techniques are required to handle such queries.

由于认知限制和语音的连续性，在纯语音通信通道上显示搜索结果给用户带来了许多挑战。我们研究了基于语音的网络搜索中搜索结果摘要长度的影响，并将我们的结果与文本基线进行了比较。根据众包工作者的调查，我们发现用户更喜欢更长、更有信息量的文本摘要。对于音频，用户的偏好取决于查询的样式。对于单面查询，缩短的音频摘要是首选，此外，与基于文本的摘要相比，用户发现判断相关性的准确性相似。对于多方面查询，用户首选项不那么清楚，这表明需要更复杂的技术来处理此类查询。

引用次数: 25

Modeling Multi-query Retrieval Tasks Using Density Matrix Transformation 基于密度矩阵变换的多查询检索任务建模

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767819

Qiuchi Li, Jingfei Li, Peng Zhang, D. Song

The quantum probabilistic framework has recently been applied to Information Retrieval (IR). A representative is the Quantum Language Model (QLM), which is developed for the ad-hoc retrieval with single queries and has achieved significant improvements over traditional language models. In QLM, a density matrix, defined on the quantum probabilistic space, is estimated as a representation of user's search intention with respect to a specific query. However, QLM is unable to capture the dynamics of user's information need in query history. This limitation restricts its further application on the dynamic search tasks, e.g., session search. In this paper, we propose a Session-based Quantum Language Model (SQLM) that deals with multi-query session search task. In SQLM, a transformation model of density matrices is proposed to model the evolution of user's information need in response to the user's interaction with search engine, by incorporating features extracted from both positive feedback (clicked documents) and negative feedback (skipped documents). Extensive experiments conducted on TREC 2013 and 2014 session track data demonstrate the effectiveness of SQLM in comparison with the classic QLM.

近年来，量子概率框架在信息检索领域得到了广泛的应用。一个代表是量子语言模型(Quantum Language Model, QLM)，它是为使用单个查询的特别检索而开发的，并且比传统的语言模型取得了显著的改进。在QLM中，定义在量子概率空间上的密度矩阵被估计为用户相对于特定查询的搜索意图的表示。但是，QLM无法在查询历史中捕捉用户信息需求的动态。这限制了它在动态搜索任务(如会话搜索)上的进一步应用。本文提出了一种基于会话的量子语言模型(SQLM)，用于处理多查询会话搜索任务。在SQLM中，提出了密度矩阵的转换模型，通过结合从正反馈(点击文档)和负反馈(跳过文档)中提取的特征，来模拟用户与搜索引擎交互时用户信息需求的演变。在TREC 2013年和2014年的会话轨迹数据上进行的大量实验表明，与经典的QLM相比，SQLM是有效的。

{"title":"Modeling Multi-query Retrieval Tasks Using Density Matrix Transformation","authors":"Qiuchi Li, Jingfei Li, Peng Zhang, D. Song","doi":"10.1145/2766462.2767819","DOIUrl":"https://doi.org/10.1145/2766462.2767819","url":null,"abstract":"The quantum probabilistic framework has recently been applied to Information Retrieval (IR). A representative is the Quantum Language Model (QLM), which is developed for the ad-hoc retrieval with single queries and has achieved significant improvements over traditional language models. In QLM, a density matrix, defined on the quantum probabilistic space, is estimated as a representation of user's search intention with respect to a specific query. However, QLM is unable to capture the dynamics of user's information need in query history. This limitation restricts its further application on the dynamic search tasks, e.g., session search. In this paper, we propose a Session-based Quantum Language Model (SQLM) that deals with multi-query session search task. In SQLM, a transformation model of density matrices is proposed to model the evolution of user's information need in response to the user's interaction with search engine, by incorporating features extracted from both positive feedback (clicked documents) and negative feedback (skipped documents). Extensive experiments conducted on TREC 2013 and 2014 session track data demonstrate the effectiveness of SQLM in comparison with the classic QLM.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114316445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 29

Time Pressure in Information Search 信息搜索中的时间压力

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767851

Anita Crescenzi

The primary purpose of this research is to explore the impact of perceived time pressure on search behaviors, searcher perceptions of the search system and the search experience. Are there observable behavioral changes when a searcher is time-pressured? To what extent are search behavior differences attributable to objective experimental manipulation versus to the subjective experience of time pressure? An important secondary purpose of this work is to identify appropriate outcome measures that allow for the comparison of session-level search behaviors when time is manipulated.

本研究的主要目的是探讨感知时间压力对搜索行为、搜索者对搜索系统的感知和搜索体验的影响。当搜索者有时间压力时，是否有可观察到的行为变化?搜索行为差异在多大程度上可归因于客观实验操作与主观时间压力体验?这项工作的一个重要的次要目的是确定适当的结果度量，以便在操纵时间时对会话级搜索行为进行比较。

引用次数: 0

Relevance Scores for Triples from Type-Like Relations 类类型关系中三元组的相关性评分

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767734

H. Bast, Björn Buchhold, Elmar Haussmann

We compute and evaluate relevance scores for knowledge-base triples from type-like relations. Such a score measures the degree to which an entity "belongs" to a type. For example, Quentin Tarantino has various professions, including Film Director, Screenwriter, and Actor. The first two would get a high score in our setting, because those are his main professions. The third would get a low score, because he mostly had cameo appearances in his own movies. Such scores are essential in the ranking for entity queries, e.g. "American actors" or "Quentin Tarantino professions". These scores are different from scores for "correctness" or "accuracy" (all three professions above are correct and accurate). We propose a variety of algorithms to compute these scores. For our evaluation we designed a new benchmark, which includes a ground truth based on about 14K human judgments obtained via crowdsourcing. Inter-judge agreement is slightly over 90%. Existing approaches from the literature give results far from the optimum. Our best algorithms achieve an agreement of about 80% with the ground truth.

我们计算和评估来自类类型关系的知识库三元组的相关性分数。这样的分数衡量一个实体“属于”某种类型的程度。例如，昆汀·塔伦蒂诺有多种职业，包括电影导演、编剧和演员。在我们的设置中，前两个会得到高分，因为这是他的主要职业。第三位的得分很低，因为他大多在自己的电影中客串。这样的分数在实体查询的排名中是必不可少的。“美国演员”或“昆汀·塔伦蒂诺职业”。这些分数不同于“正确性”或“准确性”的分数(以上三个职业都是正确和准确的)。我们提出了各种算法来计算这些分数。对于我们的评估，我们设计了一个新的基准，其中包括基于通过众包获得的大约14K个人类判断的基本事实。法官间的一致性略高于90%。现有的文献方法给出的结果远非最佳。我们最好的算法与实际情况的一致性约为80%。

{"title":"Relevance Scores for Triples from Type-Like Relations","authors":"H. Bast, Björn Buchhold, Elmar Haussmann","doi":"10.1145/2766462.2767734","DOIUrl":"https://doi.org/10.1145/2766462.2767734","url":null,"abstract":"We compute and evaluate relevance scores for knowledge-base triples from type-like relations. Such a score measures the degree to which an entity \"belongs\" to a type. For example, Quentin Tarantino has various professions, including Film Director, Screenwriter, and Actor. The first two would get a high score in our setting, because those are his main professions. The third would get a low score, because he mostly had cameo appearances in his own movies. Such scores are essential in the ranking for entity queries, e.g. \"American actors\" or \"Quentin Tarantino professions\". These scores are different from scores for \"correctness\" or \"accuracy\" (all three professions above are correct and accurate). We propose a variety of algorithms to compute these scores. For our evaluation we designed a new benchmark, which includes a ground truth based on about 14K human judgments obtained via crowdsourcing. Inter-judge agreement is slightly over 90%. Existing approaches from the literature give results far from the optimum. Our best algorithms achieve an agreement of about 80% with the ground truth.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116310516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 40

Personalizing Search on Shared Devices 在共享设备上个性化搜索

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767736

Ryen W. White, Ahmed Hassan Awadallah

Search personalization tailors the search experience to individual searchers. To do this, search engines construct interest models comprising signals from observed behavior associated with ma-chines, often via Web browser cookies or other user identifiers. However, shared device usage is common, meaning that the activities of multiple searchers may be interwoven in the interest models generated. Recent research on activity attribution has led to methods to automatically disentangle the histories of multiple searchers and correctly ascribe newly-observed search activity to the correct per-son. Building on this, we introduce attribution-based personalization (ABP), a procedure that extends traditional personalization to target individual searchers on shared devices. Activity attribution may improve personalization, but its benefits are not yet fully understood. We present an oracle study (with perfect knowledge of which searchers perform each action on each machine) to under-stand the effectiveness of ABP in predicting searchers' future interests. We utilize a large Web search log dataset containing both per-son identifiers and machine identifiers to quantify the gain in personalization performance from ABP, identify the circumstances under which ABP is most effective, and develop a classifier to determine when to apply it that yields sizable gains in personalization performance. ABP allows search providers to personalize experiences for individuals rather than targeting all users of a device collectively.

搜索个性化为单个搜索者定制搜索体验。为此，搜索引擎构建兴趣模型，包括与机器相关的观察行为的信号，通常通过Web浏览器cookie或其他用户标识符。然而，共享设备的使用是常见的，这意味着多个搜索者的活动可能在生成的兴趣模型中交织在一起。最近对活动归因的研究已经产生了自动解开多个搜索者的历史，并将新观察到的搜索活动正确地归因于正确的个人的方法。在此基础上，我们引入了基于归因的个性化(ABP)，这是一种将传统个性化扩展到针对共享设备上的单个搜索者的过程。活动归因可能会提高个性化，但其好处尚未得到充分理解。我们提出了一个oracle研究(完全了解搜索者在每台机器上执行的每个动作)，以了解ABP在预测搜索者未来兴趣方面的有效性。我们利用包含个人标识符和机器标识符的大型Web搜索日志数据集来量化ABP在个性化性能方面的收益，确定ABP最有效的情况，并开发一个分类器来确定何时应用它以产生相当大的个性化性能收益。ABP允许搜索提供商为个人提供个性化体验，而不是针对同一设备的所有用户。

{"title":"Personalizing Search on Shared Devices","authors":"Ryen W. White, Ahmed Hassan Awadallah","doi":"10.1145/2766462.2767736","DOIUrl":"https://doi.org/10.1145/2766462.2767736","url":null,"abstract":"Search personalization tailors the search experience to individual searchers. To do this, search engines construct interest models comprising signals from observed behavior associated with ma-chines, often via Web browser cookies or other user identifiers. However, shared device usage is common, meaning that the activities of multiple searchers may be interwoven in the interest models generated. Recent research on activity attribution has led to methods to automatically disentangle the histories of multiple searchers and correctly ascribe newly-observed search activity to the correct per-son. Building on this, we introduce attribution-based personalization (ABP), a procedure that extends traditional personalization to target individual searchers on shared devices. Activity attribution may improve personalization, but its benefits are not yet fully understood. We present an oracle study (with perfect knowledge of which searchers perform each action on each machine) to under-stand the effectiveness of ABP in predicting searchers' future interests. We utilize a large Web search log dataset containing both per-son identifiers and machine identifiers to quantify the gain in personalization performance from ABP, identify the circumstances under which ABP is most effective, and develop a classifier to determine when to apply it that yields sizable gains in personalization performance. ABP allows search providers to personalize experiences for individuals rather than targeting all users of a device collectively.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116415460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Finding Money in the Haystack: Information Retrieval at Bloomberg 在干草堆里找钱:彭博社的信息检索

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2776782

Jonathan J. Dorando, Konstantine Arkoudas, P. Vasa, Gary Kazantsev, Gideon Mann

The financial markets are a rich domain for search, and it is not simple to serving the entire scope of financial professionals, who make their living on accurate, timely, and deep information. The data sources are many and disparate. This includes domains with rich structured data such as company and security attributes, textual data like research reports, and time sensitive news stories. Not only is the domain complicated, but some of the techniques that work for web search have to be adapted and reconsidered in an enterprise context with fewer eyeballs but just as complicated questions. At Bloomberg, we have been addressing these problems over the past four years in the search and discoverability group, heavily leveraging the insights from the academic and open-source communities to apply to our problems. We'll discuss about our efforts in Natural Language Question & Answer (NLQA), learning to rank, federated search, crowd sourcing, and how this all comes together to make search effective for our users.

金融市场是一个丰富的搜索领域，为所有金融专业人士提供服务并不简单，他们靠准确、及时和深入的信息为生。数据源很多，而且完全不同。这包括具有丰富结构化数据(如公司和安全属性)、文本数据(如研究报告)和时间敏感的新闻故事的域。不仅这个领域很复杂，而且一些适用于网络搜索的技术必须在企业环境中进行调整和重新考虑，因为企业环境的关注较少，但问题同样复杂。在彭博社，在过去的四年里，我们一直在搜索和可发现性小组中解决这些问题，大量利用来自学术和开源社区的见解来解决我们的问题。我们将讨论我们在自然语言问答(Natural Language Question & Answer, NLQA)、学习排序、联合搜索、众包方面所做的努力，以及这一切是如何结合在一起使搜索对我们的用户有效的。

引用次数: 0

Different Users, Different Opinions: Predicting Search Satisfaction with Mouse Movement Information 不同的用户，不同的意见:用鼠标移动信息预测搜索满意度

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767721

Yiqun Liu, Ye Chen, Jinhui Tang, Jiashen Sun, Min Zhang, Shaoping Ma, Xuan Zhu

Satisfaction prediction is one of the prime concerns in search performance evaluation. It is a non-trivial task for two major reasons: (1) The definition of satisfaction is rather subjective and different users may have different opinions in satisfaction judgement. (2) Most existing studies on satisfaction prediction mainly rely on users' click-through or query reformulation behaviors but there are many sessions without such kind of interactions. To shed light on these research questions, we construct an experimental search engine that could collect users' satisfaction feedback as well as mouse click-through/movement data. Different from existing studies, we compare for the first time search users' and external assessors' opinions on satisfaction. We find that search users pay more attention to the utility of results while external assessors emphasize on the efforts spent in search sessions. Inspired by recent studies in predicting result relevance based on mouse movement patterns (namely motifs), we propose to estimate the utilities of search results and the efforts in search sessions with motifs extracted from mouse movement data on search result pages (SERPs). Besides the existing frequency-based motif selection method, two novel selection strategies (distance-based and distribution-based) are also adopted to extract high quality motifs for satisfaction prediction. Experimental results on over 1,000 user sessions show that the proposed strategies outperform existing methods and also have promising generalization capability for different users and queries.

满意度预测是搜索性能评估中主要关注的问题之一。这是一项不平凡的任务，主要有两个原因:(1)满意度的定义是相当主观的，不同的用户在满意度判断上可能有不同的意见。(2)大多数现有的满意度预测研究主要依赖于用户的点击或查询重构行为，但有很多会话没有这种交互。为了阐明这些研究问题，我们构建了一个实验搜索引擎，可以收集用户的满意度反馈以及鼠标点击/移动数据。与已有的研究不同，我们首次比较了搜索用户和外部评估者对满意度的看法。我们发现，搜索用户更关注结果的效用，而外部评估者则强调在搜索会话中所花费的努力。受最近基于鼠标移动模式(即基序)预测结果相关性研究的启发，我们提出使用从搜索结果页面(serp)上的鼠标移动数据提取的基序来估计搜索结果的效用和搜索会话的努力。除了现有的基于频率的基序选择方法外，还采用了基于距离和基于分布的两种新的基序选择策略来提取高质量的基序进行满意度预测。超过1000个用户会话的实验结果表明，该策略优于现有的方法，并且对不同的用户和查询具有良好的泛化能力。

{"title":"Different Users, Different Opinions: Predicting Search Satisfaction with Mouse Movement Information","authors":"Yiqun Liu, Ye Chen, Jinhui Tang, Jiashen Sun, Min Zhang, Shaoping Ma, Xuan Zhu","doi":"10.1145/2766462.2767721","DOIUrl":"https://doi.org/10.1145/2766462.2767721","url":null,"abstract":"Satisfaction prediction is one of the prime concerns in search performance evaluation. It is a non-trivial task for two major reasons: (1) The definition of satisfaction is rather subjective and different users may have different opinions in satisfaction judgement. (2) Most existing studies on satisfaction prediction mainly rely on users' click-through or query reformulation behaviors but there are many sessions without such kind of interactions. To shed light on these research questions, we construct an experimental search engine that could collect users' satisfaction feedback as well as mouse click-through/movement data. Different from existing studies, we compare for the first time search users' and external assessors' opinions on satisfaction. We find that search users pay more attention to the utility of results while external assessors emphasize on the efforts spent in search sessions. Inspired by recent studies in predicting result relevance based on mouse movement patterns (namely motifs), we propose to estimate the utilities of search results and the efforts in search sessions with motifs extracted from mouse movement data on search result pages (SERPs). Besides the existing frequency-based motif selection method, two novel selection strategies (distance-based and distribution-based) are also adopted to extract high quality motifs for satisfaction prediction. Experimental results on over 1,000 user sessions show that the proposed strategies outperform existing methods and also have promising generalization capability for different users and queries.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125901368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 71

Reachability based Ranking in Interactive Image Retrieval 基于可达性排序的交互式图像检索

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767777

Jiyi Li

In some interactive image retrieval systems, users can select images from image search results and click to view their similar or related images until they reach the targets. Existing image ranking options are based on relevance, update time, interestingness and so on. Because the inexact description of user targets or unsatisfying performance of image retrieval methods, it is possible that users cannot reach their targets in single-round interaction. When we consider multi-round interactions, how to assist users to select the images that are easier to reach the targets in fewer rounds is a useful issue. In this paper, we propose a new kind of ranking option to users by ranking the images according to their difficulties of reaching potential targets. We model the interactive image search behavior as navigation on information network constructed by an image collection and an image retrieval method. We use the properties of this information network for reachability based ranking. Experiments based on a social image collection show the efficiency of our approach.

在一些交互式图像检索系统中，用户可以从图像搜索结果中选择图像，点击查看与自己相似或相关的图像，直到到达目标。现有的图片排名选项是基于相关性，更新时间，兴趣等。由于对用户目标的描述不准确或图像检索方法的性能不理想，有可能导致用户在单轮交互中无法到达目标。当我们考虑多轮交互时，如何帮助用户在更少的回合中选择更容易到达目标的图像是一个有用的问题。在本文中，我们提出了一种新的排序选项，根据图像达到潜在目标的困难程度对用户进行排序。我们将交互式图像搜索行为建模为在由图像集合和图像检索方法构建的信息网络上的导航行为。我们使用该信息网络的属性进行基于可达性的排名。基于社会图像采集的实验表明了该方法的有效性。

引用次数: 1

Using Term Location Information to Enhance Probabilistic Information Retrieval 利用词位信息增强概率信息检索

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pub Date : 2015-08-09 DOI: 10.1145/2766462.2767827

Baiyan Liu, X. An, Xiangji Huang

Nouns are more important than other parts of speech in information retrieval and are more often found near the beginning or the end of sentences. In this paper, we investigate the effects of rewarding terms based on their location in sentences on information retrieval. Particularly, we propose a novel Term Location (TEL) retrieval model based on BM25 to enhance probabilistic information retrieval, where a kernel-based method is used to capture term placement patterns. Experiments on five TREC datasets of varied size and content indicate the proposed model significantly outperforms the optimized BM25 and DirichletLM in MAP over all datasets with all kernel functions, and excels the optimized BM25 and DirichletLM over most of the datasets in P@5 and P@20 with different kernel functions.

名词在信息检索中比其他词类更重要，通常出现在句子的开头或结尾。本文研究了基于句子位置的奖励词对信息检索的影响。特别地，我们提出了一种新的基于BM25的术语定位(TEL)检索模型来增强概率信息检索，其中使用基于核的方法来捕获术语放置模式。在5个不同大小和内容的TREC数据集上进行的实验表明，该模型在具有所有核函数的所有数据集上都明显优于MAP中优化后的BM25和DirichletLM，并且在具有不同核函数的P@5和P@20的大多数数据集上都优于优化后的BM25和DirichletLM。

引用次数: 25

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀