首页 > 最新文献

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval最新文献

英文 中文
Session details: Salton Award 会议详情:索尔顿奖
C. Clarke
{"title":"Session details: Salton Award","authors":"C. Clarke","doi":"10.1145/3255914","DOIUrl":"https://doi.org/10.1145/3255914","url":null,"abstract":"","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115492268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Topic-centric Classification of Twitter User's Political Orientation 以话题为中心的Twitter用户政治倾向分类
Anjie Fang, I. Ounis, P. Habel, C. Macdonald, Nut Limsopatham
In the recent Scottish Independence Referendum (hereafter, IndyRef), Twitter offered a broad platform for people to express their opinions, with millions of IndyRef tweets posted over the campaign period. In this paper, we aim to classify people's voting intentions by the content of their tweets---their short messages communicated on Twitter. By observing tweets related to the IndyRef, we find that people not only discussed the vote, but raised topics related to an independent Scotland including oil reserves, currency, nuclear weapons, and national debt. We show that the views communicated on these topics can inform us of the individuals' voting intentions ("Yes"--in favour of Independence vs. "No"--Opposed). In particular, we argue that an accurate classifier can be designed by leveraging the differences in the features' usage across different topics related to voting intentions. We demonstrate improvements upon a Naive Bayesian classifier using the topics enrichment method. Our new classifier identifies the closest topic for each unseen tweet, based on those topics identified in the training data. Our experiments show that our Topics-Based Naive Bayesian classifier improves accuracy by 7.8% over the classical Naive Bayesian baseline.
在最近的苏格兰独立公投(以下简称IndyRef)中,Twitter为人们提供了一个广泛的表达意见的平台,在竞选期间发布了数百万条IndyRef推文。在本文中,我们的目标是通过他们的推文内容——他们在Twitter上传播的短消息——来分类人们的投票意图。通过观察与IndyRef相关的推文,我们发现人们不仅讨论投票,还提出了与独立苏格兰相关的话题,包括石油储备、货币、核武器和国债。我们表明,在这些主题上传达的观点可以告诉我们个人的投票意图(“是”-支持独立vs.独立)。“不”——反对)。特别是,我们认为可以通过利用与投票意图相关的不同主题之间特征使用的差异来设计准确的分类器。我们演示了使用主题富集方法对朴素贝叶斯分类器的改进。我们的新分类器根据训练数据中确定的主题,为每条未见过的推文识别最接近的主题。我们的实验表明,我们的基于主题的朴素贝叶斯分类器比经典朴素贝叶斯基线提高了7.8%的准确率。
{"title":"Topic-centric Classification of Twitter User's Political Orientation","authors":"Anjie Fang, I. Ounis, P. Habel, C. Macdonald, Nut Limsopatham","doi":"10.1145/2766462.2767833","DOIUrl":"https://doi.org/10.1145/2766462.2767833","url":null,"abstract":"In the recent Scottish Independence Referendum (hereafter, IndyRef), Twitter offered a broad platform for people to express their opinions, with millions of IndyRef tweets posted over the campaign period. In this paper, we aim to classify people's voting intentions by the content of their tweets---their short messages communicated on Twitter. By observing tweets related to the IndyRef, we find that people not only discussed the vote, but raised topics related to an independent Scotland including oil reserves, currency, nuclear weapons, and national debt. We show that the views communicated on these topics can inform us of the individuals' voting intentions (\"Yes\"--in favour of Independence vs. \"No\"--Opposed). In particular, we argue that an accurate classifier can be designed by leveraging the differences in the features' usage across different topics related to voting intentions. We demonstrate improvements upon a Naive Bayesian classifier using the topics enrichment method. Our new classifier identifies the closest topic for each unseen tweet, based on those topics identified in the training data. Our experiments show that our Topics-Based Naive Bayesian classifier improves accuracy by 7.8% over the classical Naive Bayesian baseline.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124396325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
An Aspect-driven Social Media Explorer 方面驱动的社交媒体浏览器
Nedim Lipka, W. Bruce Croft
We demonstrate an exploration tool that organizes social media content under diverse aspects enabling comprehensive explorations. Unlike existing approaches that group content by trending topics, we present a holistic view of diverse and relevant content with respect to a given query.
我们展示了一个探索工具,它可以从多个方面组织社交媒体内容,从而实现全面的探索。与现有的根据趋势主题对内容进行分组的方法不同,我们针对给定的查询提供了多样化和相关内容的整体视图。
{"title":"An Aspect-driven Social Media Explorer","authors":"Nedim Lipka, W. Bruce Croft","doi":"10.1145/2766462.2767864","DOIUrl":"https://doi.org/10.1145/2766462.2767864","url":null,"abstract":"We demonstrate an exploration tool that organizes social media content under diverse aspects enabling comprehensive explorations. Unlike existing approaches that group content by trending topics, we present a holistic view of diverse and relevant content with respect to a given query.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127090063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring Searcher Attention by Jointly Modeling User Interactions and Content Salience 通过用户交互和内容显著性联合建模来推断搜索者的注意力
Dmitry Lagun, Eugene Agichtein
Modeling and predicting user attention is crucial for interpreting search behavior. The numerous applications include quantifying web search satisfaction, estimating search quality, and measuring and predicting online user engagement. While prior research has demonstrated the value of mouse cursor data and other interactions as a rough proxy of user attention, precisely predicting where a user is looking on a page remains a challenge, exacerbated in Web pages beyond the traditional search results. To improve attention prediction on a wider variety of Web pages, we propose a new way of modeling searcher behavior data by connecting the user interactions to the underlying Web page content. Specifically, we propose a principled model for predicting a searcher's gaze position on a page, that we call Mixture of Interactions and Content Salience (MICS). To our knowledge, our model is the first to effectively combine user interaction data, such as mouse cursor and scrolling positions, with the visual prominence, or salience, of the page content elements. Extensive experiments on multiple popular types of Web content demonstrate that the proposed MICS model significantly outperforms previous approaches to searcher gaze prediction that use only the interaction information. Grounding the observed interactions to the underlying page content provides a general and robust approach to user attention modeling, enabling more powerful tool for search behavior interpretation and ultimately search quality improvements.
建模和预测用户注意力对于解释搜索行为至关重要。众多的应用包括量化网络搜索满意度,估计搜索质量,以及测量和预测在线用户参与度。虽然先前的研究已经证明了鼠标光标数据和其他交互作为用户注意力的粗略代理的价值,但准确预测用户在页面上查看的位置仍然是一个挑战,在传统搜索结果之外的Web页面中更是如此。为了在更广泛的网页上改进注意力预测,我们提出了一种新的方法,通过将用户交互与底层网页内容联系起来,对搜索者行为数据进行建模。具体来说,我们提出了一个原则性模型来预测搜索者在页面上的凝视位置,我们称之为交互和内容显著性的混合(MICS)。据我们所知,我们的模型是第一个有效地将用户交互数据(如鼠标光标和滚动位置)与页面内容元素的视觉突出性或显著性结合起来的模型。在多种流行的Web内容类型上进行的大量实验表明,所提出的MICS模型显著优于先前仅使用交互信息的搜索者凝视预测方法。将观察到的交互与底层页面内容结合起来,为用户注意力建模提供了一种通用的、健壮的方法,为搜索行为解释提供了更强大的工具,并最终提高了搜索质量。
{"title":"Inferring Searcher Attention by Jointly Modeling User Interactions and Content Salience","authors":"Dmitry Lagun, Eugene Agichtein","doi":"10.1145/2766462.2767745","DOIUrl":"https://doi.org/10.1145/2766462.2767745","url":null,"abstract":"Modeling and predicting user attention is crucial for interpreting search behavior. The numerous applications include quantifying web search satisfaction, estimating search quality, and measuring and predicting online user engagement. While prior research has demonstrated the value of mouse cursor data and other interactions as a rough proxy of user attention, precisely predicting where a user is looking on a page remains a challenge, exacerbated in Web pages beyond the traditional search results. To improve attention prediction on a wider variety of Web pages, we propose a new way of modeling searcher behavior data by connecting the user interactions to the underlying Web page content. Specifically, we propose a principled model for predicting a searcher's gaze position on a page, that we call Mixture of Interactions and Content Salience (MICS). To our knowledge, our model is the first to effectively combine user interaction data, such as mouse cursor and scrolling positions, with the visual prominence, or salience, of the page content elements. Extensive experiments on multiple popular types of Web content demonstrate that the proposed MICS model significantly outperforms previous approaches to searcher gaze prediction that use only the interaction information. Grounding the observed interactions to the underlying page content provides a general and robust approach to user attention modeling, enabling more powerful tool for search behavior interpretation and ultimately search quality improvements.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"184 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125836644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Location in Search 搜索位置
Vanessa Murdock
As users turn increasingly to handheld devices to find information, the research community has focused on real-time location signals (GPS signals) to improve search engine effectiveness. Location signals have been investigated for predicting businesses the user will frequent[3], assigning geographic coordinates to media files[1], and to improve mobile search ranking[2]. While the increased focus on real-time user location has produced excellent research, there remains a gap between the capabilities being developed in the research community, and the capabilities being developed by commercial search engines. The core of this discrepancy between the advances in research and advances in industry is understanding the user's location. The vast majority of research on user location assumes that the user's location is known, because the user has provided a GPS signal. For many systems, there is no GPS signal available. The user may choose not enable it, or the system chooses not to prompt the user for the location because doing so degrades the user experience. For these interactions, the system relies on the user's IP address for location information. Further, much of the current research uses public geocoded data such as Foursquare (http://www.foursquare.com visited June 2015), and Twitter (http://www.twitter.com visited June 2015). These data are an incomplete picture of places a user may visit, and are potentially biased in their representation of actual users. The information contained in these data is not the same type of information typically available to a commercial search engine. In this talk we discuss gaps between current research on location, and industry advances in using location signals to improve search results. We focus on user location as one example of a gap between research and development.
随着用户越来越多地使用手持设备来查找信息,研究团体将重点放在实时定位信号(GPS信号)上,以提高搜索引擎的效率。位置信号已被用于预测用户将经常光顾的业务[3],为媒体文件[1]分配地理坐标,以及提高移动搜索排名[2]。虽然对实时用户位置的日益关注产生了优秀的研究成果,但研究社区正在开发的功能与商业搜索引擎正在开发的功能之间仍然存在差距。研究进展与工业进展之间差异的核心在于对用户位置的理解。绝大多数关于用户定位的研究都假设用户的位置是已知的,因为用户已经提供了GPS信号。对于许多系统来说,没有可用的GPS信号。用户可以选择不启用它,或者系统选择不提示用户输入位置,因为这样做会降低用户体验。对于这些交互,系统依赖于用户的IP地址来获取位置信息。此外,目前的许多研究使用公共地理编码数据,如Foursquare (http://www.foursquare.com访问2015年6月)和Twitter (http://www.twitter.com访问2015年6月)。这些数据是用户可能访问的地方的不完整图片,并且在代表实际用户时可能存在偏见。这些数据中包含的信息与商业搜索引擎通常提供的信息不同。在这次演讲中,我们将讨论当前位置研究与使用位置信号改善搜索结果的行业进展之间的差距。我们将用户定位作为研究与开发之间差距的一个例子。
{"title":"Location in Search","authors":"Vanessa Murdock","doi":"10.1145/2766462.2776783","DOIUrl":"https://doi.org/10.1145/2766462.2776783","url":null,"abstract":"As users turn increasingly to handheld devices to find information, the research community has focused on real-time location signals (GPS signals) to improve search engine effectiveness. Location signals have been investigated for predicting businesses the user will frequent[3], assigning geographic coordinates to media files[1], and to improve mobile search ranking[2]. While the increased focus on real-time user location has produced excellent research, there remains a gap between the capabilities being developed in the research community, and the capabilities being developed by commercial search engines. The core of this discrepancy between the advances in research and advances in industry is understanding the user's location. The vast majority of research on user location assumes that the user's location is known, because the user has provided a GPS signal. For many systems, there is no GPS signal available. The user may choose not enable it, or the system chooses not to prompt the user for the location because doing so degrades the user experience. For these interactions, the system relies on the user's IP address for location information. Further, much of the current research uses public geocoded data such as Foursquare (http://www.foursquare.com visited June 2015), and Twitter (http://www.twitter.com visited June 2015). These data are an incomplete picture of places a user may visit, and are potentially biased in their representation of actual users. The information contained in these data is not the same type of information typically available to a commercial search engine. In this talk we discuss gaps between current research on location, and industry advances in using location signals to improve search results. We focus on user location as one example of a gap between research and development.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125972686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-Preserving IR 2015: When Information Retrieval Meets Privacy and Security 隐私保护IR 2015:当信息检索满足隐私和安全
G. Yang, I. Soboroff
Information retrieval (IR) and information privacy/security are two fast-growing computer science disciplines. There are many synergies and connections between these two disciplines. However, there have been very limited efforts to connect the two important disciplines. On the other hand, due to lack of mature techniques in privacy-preserving IR, concerns about information privacy and security have become serious obstacles that prevent valuable user data to be used in IR research such as studies on query logs, social media, tweets, and medical record retrieval. We propose this privacy-preserving IR workshop to connect the two disciplines of information retrieval and information privacy and security. We look forward to spurring research that aims to bring together the research fields of IR and privacy/security. Last year, the first privacy-preserving IR workshop focused on mitigating privacy threats in information retrieval by novel algorithms and tools that enable web users to better understand associated privacy risks.
信息检索(IR)和信息隐私/安全是两个快速发展的计算机科学学科。这两个学科之间有许多协同作用和联系。然而,将这两个重要学科联系起来的努力非常有限。另一方面,由于缺乏成熟的隐私保护IR技术,对信息隐私和安全的担忧已经成为阻碍有价值的用户数据用于IR研究的严重障碍,例如查询日志、社交媒体、tweet和病历检索的研究。为了将信息检索和信息隐私与安全这两个学科联系起来,我们提出了这个隐私保护IR研讨会。我们期待着促进旨在将IR和隐私/安全研究领域结合在一起的研究。去年,首届隐私保护IR研讨会的重点是通过新颖的算法和工具减轻信息检索中的隐私威胁,使网络用户更好地了解相关的隐私风险。
{"title":"Privacy-Preserving IR 2015: When Information Retrieval Meets Privacy and Security","authors":"G. Yang, I. Soboroff","doi":"10.1145/2766462.2767857","DOIUrl":"https://doi.org/10.1145/2766462.2767857","url":null,"abstract":"Information retrieval (IR) and information privacy/security are two fast-growing computer science disciplines. There are many synergies and connections between these two disciplines. However, there have been very limited efforts to connect the two important disciplines. On the other hand, due to lack of mature techniques in privacy-preserving IR, concerns about information privacy and security have become serious obstacles that prevent valuable user data to be used in IR research such as studies on query logs, social media, tweets, and medical record retrieval. We propose this privacy-preserving IR workshop to connect the two disciplines of information retrieval and information privacy and security. We look forward to spurring research that aims to bring together the research fields of IR and privacy/security. Last year, the first privacy-preserving IR workshop focused on mitigating privacy threats in information retrieval by novel algorithms and tools that enable web users to better understand associated privacy risks.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125445526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An Entity Class-Dependent Discriminative Mixture Model for Cumulative Citation Recommendation 基于实体分类的累积引文推荐判别混合模型
Jingang Wang, Dandan Song, Qifan Wang, Zhiwei Zhang, Luo Si, L. Liao, Chin-Yew Lin
This paper studies Cumulative Citation Recommendation (CCR) for Knowledge Base Acceleration (KBA). The CCR task aims to detect potential citations of a set of target entities with priorities from a volume of temporally-ordered stream corpus. Previous approaches for CCR that build an individual relevance model for each entity fail to handle unseen entities without annotation. A baseline solution is to build a global entity-unspecific model for all entities regardless of the relationship information among entities, which cannot guarantee to achieve satisfactory result for each entity. In this paper, we propose a novel entity class-dependent discriminative mixture model by introducing a latent entity class layer to model the correlations between entities and latent entity classes. The model can better adjust to different types of entities and achieve better performance when dealing with a broad range of entities. An extensive set of experiments has been conducted on TREC-KBA-2013 dataset, and the experimental results demonstrate that the proposed model can achieve the state-of-the-art performance.
本文研究了知识库加速(KBA)中的累积引文推荐(CCR)。CCR任务旨在从大量临时有序的流语料库中检测一组具有优先级的目标实体的潜在引用。以前的CCR方法为每个实体构建单独的关联模型,但在没有注释的情况下无法处理看不见的实体。基线解决方案是不考虑实体之间的关系信息,为所有实体构建一个全局实体非特定模型,不能保证每个实体都能得到满意的结果。本文通过引入潜在实体类层来建模实体与潜在实体类之间的相关性,提出了一种新的实体类依赖的判别混合模型。该模型可以更好地适应不同类型的实体,并在处理广泛的实体时获得更好的性能。在TREC-KBA-2013数据集上进行了大量的实验,实验结果表明该模型可以达到最先进的性能。
{"title":"An Entity Class-Dependent Discriminative Mixture Model for Cumulative Citation Recommendation","authors":"Jingang Wang, Dandan Song, Qifan Wang, Zhiwei Zhang, Luo Si, L. Liao, Chin-Yew Lin","doi":"10.1145/2766462.2767698","DOIUrl":"https://doi.org/10.1145/2766462.2767698","url":null,"abstract":"This paper studies Cumulative Citation Recommendation (CCR) for Knowledge Base Acceleration (KBA). The CCR task aims to detect potential citations of a set of target entities with priorities from a volume of temporally-ordered stream corpus. Previous approaches for CCR that build an individual relevance model for each entity fail to handle unseen entities without annotation. A baseline solution is to build a global entity-unspecific model for all entities regardless of the relationship information among entities, which cannot guarantee to achieve satisfactory result for each entity. In this paper, we propose a novel entity class-dependent discriminative mixture model by introducing a latent entity class layer to model the correlations between entities and latent entity classes. The model can better adjust to different types of entities and achieve better performance when dealing with a broad range of entities. An extensive set of experiments has been conducted on TREC-KBA-2013 dataset, and the experimental results demonstrate that the proposed model can achieve the state-of-the-art performance.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116089695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Adaptive User Engagement Evaluation via Multi-task Learning 基于多任务学习的自适应用户粘性评估
Hamed Zamani, Pooya Moradi, A. Shakery
User engagement evaluation task in social networks has recently attracted considerable attention due to its applications in recommender systems. In this task, the posts containing users' opinions about items, e.g., the tweets containing the users' ratings about movies in the IMDb website, are studied. In this paper, we try to make use of tweets from different web applications to improve the user engagement evaluation performance. To this aim, we propose an adaptive method based on multi-task learning. Since in this paper we study the problem of detecting tweets with positive engagement which is a highly imbalanced classification problem, we modify the loss function of multi-task learning algorithms to cope with the imbalanced data. Our evaluations over a dataset including the tweets of four diverse and popular data sources, i.e., IMDb, YouTube, Goodreads, and Pandora, demonstrate the effectiveness of the proposed method. Our findings suggest that transferring knowledge between data sources can improve the user engagement evaluation performance.
社交网络中的用户参与评价任务由于在推荐系统中的应用,近年来引起了人们的广泛关注。在这个任务中,我们研究了包含用户对项目的意见的帖子,例如,在IMDb网站上包含用户对电影评分的tweets。在本文中,我们尝试利用来自不同web应用程序的tweet来提高用户参与度评估性能。为此,我们提出了一种基于多任务学习的自适应方法。由于本文研究的是一个高度不平衡的分类问题——积极参与推文检测问题,因此我们修改了多任务学习算法的损失函数来处理不平衡数据。我们对一个数据集进行了评估,其中包括四个不同的流行数据源的推文,即IMDb, YouTube, Goodreads和Pandora,证明了所提出方法的有效性。我们的研究结果表明,在数据源之间转移知识可以提高用户参与评估的绩效。
{"title":"Adaptive User Engagement Evaluation via Multi-task Learning","authors":"Hamed Zamani, Pooya Moradi, A. Shakery","doi":"10.1145/2766462.2767785","DOIUrl":"https://doi.org/10.1145/2766462.2767785","url":null,"abstract":"User engagement evaluation task in social networks has recently attracted considerable attention due to its applications in recommender systems. In this task, the posts containing users' opinions about items, e.g., the tweets containing the users' ratings about movies in the IMDb website, are studied. In this paper, we try to make use of tweets from different web applications to improve the user engagement evaluation performance. To this aim, we propose an adaptive method based on multi-task learning. Since in this paper we study the problem of detecting tweets with positive engagement which is a highly imbalanced classification problem, we modify the loss function of multi-task learning algorithms to cope with the imbalanced data. Our evaluations over a dataset including the tweets of four diverse and popular data sources, i.e., IMDb, YouTube, Goodreads, and Pandora, demonstrate the effectiveness of the proposed method. Our findings suggest that transferring knowledge between data sources can improve the user engagement evaluation performance.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"2528 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128654111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Cognitive Activity during Web Search 网络搜索过程中的认知活动
Md. Hedayetul Islam Shovon, N. Nandagopal, J. Du, Vijayalakshmi Ramasamy, Bernadine Cocks
Searching on the Web or Net-surfing is a part of everyday life for many people, but little is known about the brain activity during Web searching. Such knowledge is essential for better understanding of the cognitive demands imposed by the search system and search tasks. The current study contributes to this understanding by constructing brain networks from EEG data using normalized transfer entropy (NTE) during three Web search task stages: query formulation, viewing of a search result list and reading each individual content page. This study further contributes to the connectivity analysis of the constructed brain networks, since it is an advanced quantitative technique which enables the exploration of brain function by distinct and varied brain areas. By using this approach, we identified that the cognitive activities during the three stages of Web searching are different, with various brain areas becoming more active during the three Web search task stages. Of note, query formulation generated higher interaction between cortical regions than viewing a result list or reading a content page. These findings will have implications for the improvement of Web search engines and search interfaces.
上网搜索或上网冲浪是许多人日常生活的一部分,但人们对上网搜索时大脑的活动知之甚少。这些知识对于更好地理解搜索系统和搜索任务所施加的认知需求是必不可少的。目前的研究通过在三个Web搜索任务阶段(查询制定、查看搜索结果列表和阅读每个单独的内容页面)中使用归一化传递熵(NTE)从EEG数据构建脑网络来促进这一理解。这项研究进一步有助于对构建的脑网络的连通性分析,因为它是一种先进的定量技术,可以通过不同的脑区域来探索脑功能。通过这一方法,我们发现在网络搜索任务的三个阶段,认知活动是不同的,在网络搜索任务的三个阶段,不同的大脑区域变得更加活跃。值得注意的是,与查看结果列表或阅读内容页相比,查询公式在皮质区域之间产生了更高的交互。这些发现将对Web搜索引擎和搜索界面的改进产生影响。
{"title":"Cognitive Activity during Web Search","authors":"Md. Hedayetul Islam Shovon, N. Nandagopal, J. Du, Vijayalakshmi Ramasamy, Bernadine Cocks","doi":"10.1145/2766462.2767784","DOIUrl":"https://doi.org/10.1145/2766462.2767784","url":null,"abstract":"Searching on the Web or Net-surfing is a part of everyday life for many people, but little is known about the brain activity during Web searching. Such knowledge is essential for better understanding of the cognitive demands imposed by the search system and search tasks. The current study contributes to this understanding by constructing brain networks from EEG data using normalized transfer entropy (NTE) during three Web search task stages: query formulation, viewing of a search result list and reading each individual content page. This study further contributes to the connectivity analysis of the constructed brain networks, since it is an advanced quantitative technique which enables the exploration of brain function by distinct and varied brain areas. By using this approach, we identified that the cognitive activities during the three stages of Web searching are different, with various brain areas becoming more active during the three Web search task stages. Of note, query formulation generated higher interaction between cortical regions than viewing a result list or reading a content page. These findings will have implications for the improvement of Web search engines and search interfaces.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130576445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using Sensor Metadata Streams to Identify Topics of Local Events in the City 使用传感器元数据流来识别城市本地事件的主题
M. Albakour, C. Macdonald, I. Ounis
In this paper, we study the emerging Information Retrieval (IR) task of local event retrieval using sensor metadata streams. Sensor metadata streams include information such as the crowd density from video processing, audio classifications, and social media activity. We propose to use these metadata streams to identify the topics of local events within a city, where each event topic corresponds to a set of terms representing a type of events such as a concert or a protest. We develop a supervised approach that is capable of mapping sensor metadata observations to an event topic. In addition to using a variety of sensor metadata observations about the current status of the environment as learning features, our approach incorporates additional background features to model cyclic event patterns. Through experimentation with data collected from two locations in a major Spanish city, we show that our approach markedly outperforms an alternative baseline. We also show that modelling background information improves event topic identification.
本文研究了基于传感器元数据流的信息检索(Information Retrieval, IR)任务。传感器元数据流包括来自视频处理、音频分类和社交媒体活动的人群密度等信息。我们建议使用这些元数据流来识别城市中本地事件的主题,其中每个事件主题对应于一组表示音乐会或抗议等事件类型的术语。我们开发了一种监督方法,能够将传感器元数据观测映射到事件主题。除了使用关于环境当前状态的各种传感器元数据观测作为学习特征外,我们的方法还结合了额外的背景特征来模拟循环事件模式。通过对从西班牙一个主要城市的两个地点收集的数据进行实验,我们表明我们的方法明显优于替代基线。我们还表明,建模背景信息提高了事件主题识别。
{"title":"Using Sensor Metadata Streams to Identify Topics of Local Events in the City","authors":"M. Albakour, C. Macdonald, I. Ounis","doi":"10.1145/2766462.2767837","DOIUrl":"https://doi.org/10.1145/2766462.2767837","url":null,"abstract":"In this paper, we study the emerging Information Retrieval (IR) task of local event retrieval using sensor metadata streams. Sensor metadata streams include information such as the crowd density from video processing, audio classifications, and social media activity. We propose to use these metadata streams to identify the topics of local events within a city, where each event topic corresponds to a set of terms representing a type of events such as a concert or a protest. We develop a supervised approach that is capable of mapping sensor metadata observations to an event topic. In addition to using a variety of sensor metadata observations about the current status of the environment as learning features, our approach incorporates additional background features to model cyclic event patterns. Through experimentation with data collected from two locations in a major Spanish city, we show that our approach markedly outperforms an alternative baseline. We also show that modelling background information improves event topic identification.","PeriodicalId":297035,"journal":{"name":"Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"159 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121311970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1