首页 > 最新文献

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval最新文献

英文 中文
ViewSer: enabling large-scale remote user studies of web search examination and interaction ViewSer:允许大规模远程用户研究网络搜索、检查和交互
Dmitry Lagun, Eugene Agichtein
Web search behaviour studies, including eye-tracking studies of search result examination, have resulted in numerous insights to improve search result quality and presentation. Yet, eye tracking studies have been restricted in scale, due to the expense and the effort required. Furthermore, as the reach of the Web expands, it becomes increasingly important to understand how searchers around the world see and interact with the search results. To address both challenges, we introduce ViewSer, a novel methodology for performing web search examination studies remotely, at scale, and without requiring eye-tracking equipment. ViewSer operates by automatically modifying the appearance of a search engine result page, to clearly show one search result at a time as if through a "viewport", while partially blurring the rest and allowing the participant to move the viewport naturally with a computer mouse or trackpad. Remarkably, the resulting result viewing and clickthrough patterns agree closely with unrestricted viewing of results, as measured by eye-tracking equipment, validated by a study with over 100 participants. We also explore applications of ViewSer to practical search tasks, such as analyzing the search result summary (snip- pet) attractiveness, result re-ranking, and evaluating snippet quality. These experiments could have only be done previously by tracking the eye movements for a small number of subjects in the lab. In contrast, our study was performed with over 100 participants, allowing us to reproduce and extend previous findings, establishing ViewSer as a valuable tool for large-scale search behavior experiments.
网络搜索行为研究,包括对搜索结果检查的眼球追踪研究,已经产生了许多改进搜索结果质量和呈现的见解。然而,由于费用和所需的努力,眼动追踪研究在规模上受到限制。此外,随着网络范围的扩大,了解世界各地的搜索者如何查看搜索结果并与之交互变得越来越重要。为了解决这两个挑战,我们引入了ViewSer,这是一种新的方法,用于远程、大规模地执行网络搜索检查研究,并且不需要眼球追踪设备。ViewSer通过自动修改搜索引擎结果页面的外观来运行,一次清楚地显示一个搜索结果,就像通过一个“视口”一样,同时部分模糊其余的,并允许参与者使用计算机鼠标或触控板自然地移动视口。值得注意的是,结果的浏览和点击模式与不受限制的浏览结果非常一致,这是由眼球追踪设备测量的,并得到了100多名参与者的验证。我们还探索了ViewSer在实际搜索任务中的应用,如分析搜索结果摘要(片段-宠物)的吸引力、结果重新排序和评估片段质量。这些实验以前只能通过跟踪实验室中少数受试者的眼球运动来完成。相比之下,我们的研究有超过100名参与者,允许我们复制和扩展以前的发现,建立ViewSer作为大规模搜索行为实验的有价值的工具。
{"title":"ViewSer: enabling large-scale remote user studies of web search examination and interaction","authors":"Dmitry Lagun, Eugene Agichtein","doi":"10.1145/2009916.2009967","DOIUrl":"https://doi.org/10.1145/2009916.2009967","url":null,"abstract":"Web search behaviour studies, including eye-tracking studies of search result examination, have resulted in numerous insights to improve search result quality and presentation. Yet, eye tracking studies have been restricted in scale, due to the expense and the effort required. Furthermore, as the reach of the Web expands, it becomes increasingly important to understand how searchers around the world see and interact with the search results. To address both challenges, we introduce ViewSer, a novel methodology for performing web search examination studies remotely, at scale, and without requiring eye-tracking equipment. ViewSer operates by automatically modifying the appearance of a search engine result page, to clearly show one search result at a time as if through a \"viewport\", while partially blurring the rest and allowing the participant to move the viewport naturally with a computer mouse or trackpad. Remarkably, the resulting result viewing and clickthrough patterns agree closely with unrestricted viewing of results, as measured by eye-tracking equipment, validated by a study with over 100 participants. We also explore applications of ViewSer to practical search tasks, such as analyzing the search result summary (snip- pet) attractiveness, result re-ranking, and evaluating snippet quality. These experiments could have only be done previously by tracking the eye movements for a small number of subjects in the lab. In contrast, our study was performed with over 100 participants, allowing us to reproduce and extend previous findings, establishing ViewSer as a valuable tool for large-scale search behavior experiments.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127512211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
What-you-retrieve-is-what-you-see: a preliminary cyber-physical search engine 你所检索的就是你所看到的:一个初步的网络物理搜索引擎
L. Shou, Ke Chen, Gang Chen, Chao Zhang, Yi Ma, X. Zhang
The cyber-physical systems (CPS) are envisioned as a class of real-time systems integrating the computing, communication and storage facilities with monitoring and control of the physical world. One interesting CPS application in the mobile Internet is to provide Web search "on the spot" regarding the physical world that a user sees, or literally WYRIWYS (What-You-Retrieve-Is-What-You-See). The objective of our work is to develop server/browser software for supporting WYRIWYS search in our prototype cyber-physical search engine. A WYRIWYS search retrieves visible Web objects and ranks them by their cyber-physical relevances (term, visual, spatial, temporal etc.). This work is distinguished from previous LWS as it provides quality Web search geared with the physical world. Therefore it suggests a very promising solution to cyber-physical Web search.
网络物理系统(CPS)被设想为将计算、通信和存储设施与物理世界的监视和控制集成在一起的一类实时系统。移动互联网中一个有趣的CPS应用程序是提供关于用户所看到的物理世界的“现场”Web搜索,或字面上的WYRIWYS (What-You-Retrieve-Is-What-You-See)。我们的工作目标是开发服务器/浏览器软件,以支持我们的原型网络物理搜索引擎中的WYRIWYS搜索。WYRIWYS搜索检索可见的Web对象,并根据它们的网络物理相关性(术语、视觉、空间、时间等)对它们进行排序。这项工作与以前的LWS不同,因为它提供了与物理世界相适应的高质量Web搜索。因此,它提出了一种非常有前途的网络物理网络搜索解决方案。
{"title":"What-you-retrieve-is-what-you-see: a preliminary cyber-physical search engine","authors":"L. Shou, Ke Chen, Gang Chen, Chao Zhang, Yi Ma, X. Zhang","doi":"10.1145/2009916.2010156","DOIUrl":"https://doi.org/10.1145/2009916.2010156","url":null,"abstract":"The cyber-physical systems (CPS) are envisioned as a class of real-time systems integrating the computing, communication and storage facilities with monitoring and control of the physical world. One interesting CPS application in the mobile Internet is to provide Web search \"on the spot\" regarding the physical world that a user sees, or literally WYRIWYS (What-You-Retrieve-Is-What-You-See). The objective of our work is to develop server/browser software for supporting WYRIWYS search in our prototype cyber-physical search engine. A WYRIWYS search retrieves visible Web objects and ranks them by their cyber-physical relevances (term, visual, spatial, temporal etc.). This work is distinguished from previous LWS as it provides quality Web search geared with the physical world. Therefore it suggests a very promising solution to cyber-physical Web search.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"217 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123254757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Competition-based user expertise score estimation 基于竞争的用户经验评分估计
Jing Liu, Young-In Song, Chin-Yew Lin
In this paper, we consider the problem of estimating the relative expertise score of users in community question and answering services (CQA). Previous approaches typically only utilize the explicit question answering relationship between askers and an-swerers and apply link analysis to address this problem. The im-plicit pairwise comparison between two users that is implied in the best answer selection is ignored. Given a question and answering thread, it's likely that the expertise score of the best answerer is higher than the asker's and all other non-best answerers'. The goal of this paper is to explore such pairwise comparisons inferred from best answer selections to estimate the relative expertise scores of users. Formally, we treat each pairwise comparison between two users as a two-player competition with one winner and one loser. Two competition models are proposed to estimate user expertise from pairwise comparisons. Using the NTCIR-8 CQA task data with 3 million questions and introducing answer quality prediction based evaluation metrics, the experimental results show that the pairwise comparison based competition model significantly outperforms link analysis based approaches (PageRank and HITS) and pointwise approaches (number of best answers and best answer ratio) for estimating the expertise of active users. Furthermore, it's shown that pairwise comparison based competi-tion models have better discriminative power than other methods. It's also found that answer quality (best answer) is an important factor to estimate user expertise.
在本文中,我们考虑了社区问答服务(CQA)中用户相对专业知识评分的估计问题。以前的方法通常只利用提问者和回答者之间的明确问答关系,并应用链接分析来解决这个问题。在最佳答案选择中隐含的两个用户之间的隐式两两比较被忽略。给定一个问题和回答线程,最佳答案的专业知识得分可能高于提问者和所有其他非最佳答案的专业知识得分。本文的目的是探索从最佳答案选择推断的两两比较,以估计用户的相对专业知识分数。在形式上,我们将两个用户之间的每一次两两比较视为一个赢家和一个输家的双人竞争。提出了两种竞争模型,通过两两比较来估计用户的专业知识。利用含有300万个问题的ntcirr -8 CQA任务数据,并引入基于答案质量预测的评价指标,实验结果表明,基于两两比较的竞争模型在估计活跃用户专业程度方面显著优于基于链接分析的方法(PageRank和HITS)和基于点的方法(最佳答案数量和最佳答案比率)。此外,基于两两比较的竞争模型比其他方法具有更好的判别能力。研究还发现,答案质量(最佳答案)是评估用户专业程度的重要因素。
{"title":"Competition-based user expertise score estimation","authors":"Jing Liu, Young-In Song, Chin-Yew Lin","doi":"10.1145/2009916.2009975","DOIUrl":"https://doi.org/10.1145/2009916.2009975","url":null,"abstract":"In this paper, we consider the problem of estimating the relative expertise score of users in community question and answering services (CQA). Previous approaches typically only utilize the explicit question answering relationship between askers and an-swerers and apply link analysis to address this problem. The im-plicit pairwise comparison between two users that is implied in the best answer selection is ignored. Given a question and answering thread, it's likely that the expertise score of the best answerer is higher than the asker's and all other non-best answerers'. The goal of this paper is to explore such pairwise comparisons inferred from best answer selections to estimate the relative expertise scores of users. Formally, we treat each pairwise comparison between two users as a two-player competition with one winner and one loser. Two competition models are proposed to estimate user expertise from pairwise comparisons. Using the NTCIR-8 CQA task data with 3 million questions and introducing answer quality prediction based evaluation metrics, the experimental results show that the pairwise comparison based competition model significantly outperforms link analysis based approaches (PageRank and HITS) and pointwise approaches (number of best answers and best answer ratio) for estimating the expertise of active users. Furthermore, it's shown that pairwise comparison based competi-tion models have better discriminative power than other methods. It's also found that answer quality (best answer) is an important factor to estimate user expertise.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125532952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
Active learning to maximize accuracy vs. effort in interactive information retrieval 主动学习,最大限度地提高准确性与努力在交互式信息检索
Aibo Tian, Matthew Lease
We consider an interactive information retrieval task in which the user is interested in finding several to many relevant documents with minimal effort. Given an initial document ranking, user interaction with the system produces relevance feedback (RF) which the system then uses to revise the ranking. This interactive process repeats until the user terminates the search. To maximize accuracy relative to user effort, we propose an active learning strategy. At each iteration, the document whose relevance is maximally uncertain to the system is slotted high into the ranking in order to obtain user feedback for it. Simulated feedback on the Robust04 TREC collection shows our active learning approach dominates several standard RF baselines relative to the amount of feedback provided by the user. Evaluation on Robust04 under noisy feedback and on LETOR collections further demonstrate the effectiveness of active learning, as well as value of negative feedback in this task scenario.
我们考虑一个交互式信息检索任务,其中用户有兴趣以最小的努力找到几个或许多相关文档。给定最初的文档排名,用户与系统的交互产生相关反馈(RF),然后系统使用该反馈来修改排名。这个交互过程不断重复,直到用户终止搜索。为了最大限度地提高相对于用户努力的准确性,我们提出了一种主动学习策略。在每次迭代中,为了获得用户的反馈,与系统相关性最大的不确定文档被排在排名的前面。对Robust04 TREC集合的模拟反馈表明,我们的主动学习方法相对于用户提供的反馈量在几个标准RF基线中占主导地位。在噪声反馈和LETOR集合下对Robust04的评估进一步证明了主动学习的有效性,以及负反馈在该任务场景中的价值。
{"title":"Active learning to maximize accuracy vs. effort in interactive information retrieval","authors":"Aibo Tian, Matthew Lease","doi":"10.1145/2009916.2009939","DOIUrl":"https://doi.org/10.1145/2009916.2009939","url":null,"abstract":"We consider an interactive information retrieval task in which the user is interested in finding several to many relevant documents with minimal effort. Given an initial document ranking, user interaction with the system produces relevance feedback (RF) which the system then uses to revise the ranking. This interactive process repeats until the user terminates the search. To maximize accuracy relative to user effort, we propose an active learning strategy. At each iteration, the document whose relevance is maximally uncertain to the system is slotted high into the ranking in order to obtain user feedback for it. Simulated feedback on the Robust04 TREC collection shows our active learning approach dominates several standard RF baselines relative to the amount of feedback provided by the user. Evaluation on Robust04 under noisy feedback and on LETOR collections further demonstrate the effectiveness of active learning, as well as value of negative feedback in this task scenario.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116847582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Intent-oriented diversity in recommender systems 推荐系统中意向导向的多样性
S. Vargas, P. Castells, D. Vallet
Diversity as a relevant dimension of retrieval quality is receiving increasing attention in the Information Retrieval and Recommender Systems (RS) fields. The problem has nonetheless been approached under different views and formulations in IR and RS respectively, giving rise to different models, methodologies, and metrics, with little convergence between both fields. In this poster we explore the adaptation of diversity metrics, techniques, and principles from ad-hoc IR to the recommendation task, by introducing the notion of user profile aspect as an analogue of query intent. As a particular approach, user aspects are automatically extracted from latent item features. Empirical results support the proposed approach and provide further insights.
多样性作为检索质量的一个相关维度在信息检索和推荐系统(RS)领域受到越来越多的关注。然而,在IR和RS中,这个问题分别在不同的观点和公式下进行了处理,产生了不同的模型、方法和指标,两个领域之间几乎没有收敛。在这张海报中,我们通过引入用户资料方面的概念作为查询意图的类比,探索了多样性指标、技术和原则从ad-hoc IR到推荐任务的适应性。作为一种特殊的方法,从潜在的项目特征中自动提取用户方面。实证结果支持所提出的方法,并提供进一步的见解。
{"title":"Intent-oriented diversity in recommender systems","authors":"S. Vargas, P. Castells, D. Vallet","doi":"10.1145/2009916.2010124","DOIUrl":"https://doi.org/10.1145/2009916.2010124","url":null,"abstract":"Diversity as a relevant dimension of retrieval quality is receiving increasing attention in the Information Retrieval and Recommender Systems (RS) fields. The problem has nonetheless been approached under different views and formulations in IR and RS respectively, giving rise to different models, methodologies, and metrics, with little convergence between both fields. In this poster we explore the adaptation of diversity metrics, techniques, and principles from ad-hoc IR to the recommendation task, by introducing the notion of user profile aspect as an analogue of query intent. As a particular approach, user aspects are automatically extracted from latent item features. Empirical results support the proposed approach and provide further insights.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128968612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 73
Enhancing mobile search using web search log data 使用web搜索日志数据增强移动搜索
Yoshiyuki Inagaki, Jiang Bian, Yi Chang, Motoko Maki
Mobile search is still in infancy compared with general purpose web search. With limited training data and weak relevance features, the ranking performance in mobile search is far from satisfactory. To address this problem, we propose to leverage the knowledge of Web search to enhance the ranking of mobile search. In this paper, we first develop an equivalent page conversion between web search and mobile search, then we design a few novel ranking features, generated from the click-through data in web search, for estimating the relevance of mobile search. Large scale evaluations demonstrate that the knowledge from web search is quite effective for boosting the relevance of ranking on mobile search.
与通用网络搜索相比,移动搜索仍处于起步阶段。由于训练数据有限,相关性特征较弱,移动搜索中的排名性能远远不能令人满意。为了解决这个问题,我们建议利用网络搜索的知识来提高移动搜索的排名。在本文中,我们首先建立了网页搜索和移动搜索之间的等效页面转换,然后我们设计了一些新的排名特征,这些特征是由网页搜索的点击率数据产生的,用于估计移动搜索的相关性。大规模的评估表明,来自网络搜索的知识对于提高移动搜索排名的相关性是非常有效的。
{"title":"Enhancing mobile search using web search log data","authors":"Yoshiyuki Inagaki, Jiang Bian, Yi Chang, Motoko Maki","doi":"10.1145/2009916.2010119","DOIUrl":"https://doi.org/10.1145/2009916.2010119","url":null,"abstract":"Mobile search is still in infancy compared with general purpose web search. With limited training data and weak relevance features, the ranking performance in mobile search is far from satisfactory. To address this problem, we propose to leverage the knowledge of Web search to enhance the ranking of mobile search. In this paper, we first develop an equivalent page conversion between web search and mobile search, then we design a few novel ranking features, generated from the click-through data in web search, for estimating the relevance of mobile search. Large scale evaluations demonstrate that the knowledge from web search is quite effective for boosting the relevance of ranking on mobile search.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124677859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Detecting success in mobile search from interaction 从交互中检测移动搜索的成功
Qi Guo, Shuai Yuan, Eugene Agichtein
Predicting searcher success and satisfaction is a key problem in Web search, which is essential for automatic evaluating and improving search engine performance. This problem has been studied actively in the desktop search setting, but not specifically for mobile search, despite many known differences between the two modalities. As mobile devices become increasingly popular for searching the Web, improving the searcher experience on such devices is becoming crucially important. In this paper, we explore the possibility of predicting searcher success and satisfaction in mobile search with a smart phone. Specifically, we investigate client-side interaction signals, including the number of browsed pages, and touch screen-specific actions such as zooming and sliding. Exploiting this information with machine learning techniques results in nearly 80% accuracy for predicting searcher success -- significantly outperforming the previous models.
预测搜索成功和满意度是网络搜索中的一个关键问题,是自动评估和改进搜索引擎性能的关键。这个问题已经在桌面搜索设置中进行了积极的研究,但没有专门针对移动搜索,尽管两种模式之间存在许多已知的差异。随着移动设备在网络搜索方面变得越来越流行,改善这些设备上的搜索体验变得至关重要。在这篇文章中,我们探讨了用智能手机预测搜索成功和满意度的可能性。具体来说,我们研究客户端交互信号,包括浏览页面的数量,以及特定于触摸屏的操作,如缩放和滑动。利用机器学习技术利用这些信息,预测搜索成功的准确率接近80%,显著优于以前的模型。
{"title":"Detecting success in mobile search from interaction","authors":"Qi Guo, Shuai Yuan, Eugene Agichtein","doi":"10.1145/2009916.2010133","DOIUrl":"https://doi.org/10.1145/2009916.2010133","url":null,"abstract":"Predicting searcher success and satisfaction is a key problem in Web search, which is essential for automatic evaluating and improving search engine performance. This problem has been studied actively in the desktop search setting, but not specifically for mobile search, despite many known differences between the two modalities. As mobile devices become increasingly popular for searching the Web, improving the searcher experience on such devices is becoming crucially important. In this paper, we explore the possibility of predicting searcher success and satisfaction in mobile search with a smart phone. Specifically, we investigate client-side interaction signals, including the number of browsed pages, and touch screen-specific actions such as zooming and sliding. Exploiting this information with machine learning techniques results in nearly 80% accuracy for predicting searcher success -- significantly outperforming the previous models.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126674080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Region-based landmark discovery by crowdsourcing geo-referenced photos 通过众包地理参考照片发现基于区域的地标
Yen-Ta Huang, A. Cheng, Liang-Chi Hsieh, Winston H. Hsu, Kuo-Wei Chang
We propose a novel model for landmark discovery that locates region-based landmarks on map in contrast to the traditional point-based landmarks. The proposed method preserves more information and automatically identifies candidate regions on map by crowdsourcing geo-referenced photos. Gaussian kernel convolution is applied to remove noises and generate detected region. We adopt F1 measure to evaluate discovered landmarks and manually check the association between tags and regions. The experiment results show that more than 90% of attractions in the selected city can be correctly located by this method.
我们提出了一种新的地标发现模型,该模型将基于区域的地标定位在地图上,而不是传统的基于点的地标。该方法保留了更多的信息,并通过众包地理参考照片在地图上自动识别候选区域。采用高斯核卷积去除噪声,生成检测区域。我们采用F1度量来评估发现的地标,并手动检查标签和区域之间的关联。实验结果表明,该方法可以对所选城市90%以上的景点进行正确定位。
{"title":"Region-based landmark discovery by crowdsourcing geo-referenced photos","authors":"Yen-Ta Huang, A. Cheng, Liang-Chi Hsieh, Winston H. Hsu, Kuo-Wei Chang","doi":"10.1145/2009916.2010089","DOIUrl":"https://doi.org/10.1145/2009916.2010089","url":null,"abstract":"We propose a novel model for landmark discovery that locates region-based landmarks on map in contrast to the traditional point-based landmarks. The proposed method preserves more information and automatically identifies candidate regions on map by crowdsourcing geo-referenced photos. Gaussian kernel convolution is applied to remove noises and generate detected region. We adopt F1 measure to evaluate discovered landmarks and manually check the association between tags and regions. The experiment results show that more than 90% of attractions in the selected city can be correctly located by this method.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116363294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Parameterized concept weighting in verbose queries 详细查询中的参数化概念权重
Michael Bendersky, Donald Metzler, W. Bruce Croft
The majority of the current information retrieval models weight the query concepts (e.g., terms or phrases) in an unsupervised manner, based solely on the collection statistics. In this paper, we go beyond the unsupervised estimation of concept weights, and propose a parameterized concept weighting model. In our model, the weight of each query concept is determined using a parameterized combination of diverse importance features. Unlike the existing supervised ranking methods, our model learns importance weights not only for the explicit query concepts, but also for the latent concepts that are associated with the query through pseudo-relevance feedback. The experimental results on both newswire and web TREC corpora show that our model consistently and significantly outperforms a wide range of state-of-the-art retrieval models. In addition, our model significantly reduces the number of latent concepts used for query expansion compared to the non-parameterized pseudo-relevance feedback based models.
当前的大多数信息检索模型仅基于集合统计数据,以一种无监督的方式对查询概念(例如,术语或短语)进行加权。本文超越了概念权值的无监督估计,提出了一种参数化的概念权值模型。在我们的模型中,每个查询概念的权重是使用不同重要性特征的参数化组合来确定的。与现有的监督排序方法不同,我们的模型不仅可以学习显式查询概念的重要性权重,还可以通过伪相关反馈学习与查询相关的潜在概念的重要性权重。在新闻通讯社和web TREC语料库上的实验结果表明,我们的模型一致且显著优于许多最先进的检索模型。此外,与非参数化的基于伪相关反馈的模型相比,我们的模型显著减少了用于查询扩展的潜在概念的数量。
{"title":"Parameterized concept weighting in verbose queries","authors":"Michael Bendersky, Donald Metzler, W. Bruce Croft","doi":"10.1145/2009916.2009998","DOIUrl":"https://doi.org/10.1145/2009916.2009998","url":null,"abstract":"The majority of the current information retrieval models weight the query concepts (e.g., terms or phrases) in an unsupervised manner, based solely on the collection statistics. In this paper, we go beyond the unsupervised estimation of concept weights, and propose a parameterized concept weighting model. In our model, the weight of each query concept is determined using a parameterized combination of diverse importance features. Unlike the existing supervised ranking methods, our model learns importance weights not only for the explicit query concepts, but also for the latent concepts that are associated with the query through pseudo-relevance feedback. The experimental results on both newswire and web TREC corpora show that our model consistently and significantly outperforms a wide range of state-of-the-art retrieval models. In addition, our model significantly reduces the number of latent concepts used for query expansion compared to the non-parameterized pseudo-relevance feedback based models.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121551360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 117
A comparison of time-aware ranking methods 时间感知排序方法的比较
Nattiya Kanhabua, K. Nørvåg
When searching a temporal document collection, e.g., news archives or blogs, the time dimension must be explicitly incorporated into a retrieval model in order to improve relevance ranking. Previous work has followed one of two main approaches: 1) a mixture model linearly combining textual similarity and temporal similarity, or 2) a probabilistic model generating a query from the textual and temporal part of a document independently. In this paper, we compare the effectiveness of different time-aware ranking methods by using a mixture model applied to all methods. Extensive evaluation is conducted using the New York Times Annotated Corpus, queries and relevance judgments obtained using the Amazon Mechanical Turk.
当搜索一个时间文档集合时,例如,新闻档案或博客,时间维度必须明确地合并到检索模型中,以提高相关性排名。以前的工作遵循两种主要方法之一:1)线性组合文本相似性和时间相似性的混合模型,或2)从文档的文本和时间部分独立生成查询的概率模型。在本文中,我们通过一个适用于所有方法的混合模型,比较了不同时间感知排序方法的有效性。使用《纽约时报》注释语料库进行广泛的评估,使用亚马逊机械土耳其人获得查询和相关性判断。
{"title":"A comparison of time-aware ranking methods","authors":"Nattiya Kanhabua, K. Nørvåg","doi":"10.1145/2009916.2010147","DOIUrl":"https://doi.org/10.1145/2009916.2010147","url":null,"abstract":"When searching a temporal document collection, e.g., news archives or blogs, the time dimension must be explicitly incorporated into a retrieval model in order to improve relevance ranking. Previous work has followed one of two main approaches: 1) a mixture model linearly combining textual similarity and temporal similarity, or 2) a probabilistic model generating a query from the textual and temporal part of a document independently. In this paper, we compare the effectiveness of different time-aware ranking methods by using a mixture model applied to all methods. Extensive evaluation is conducted using the New York Times Annotated Corpus, queries and relevance judgments obtained using the Amazon Mechanical Turk.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114876380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
期刊
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1