首页 > 最新文献

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval最新文献

英文 中文
Browse with a social web directory 浏览与社会网络目录
H. Huang, Yunjun Gao, Lu Chen, Rui Li, K. Chiew, Qinming He
Browse with either web directories or social bookmarks is an important complementation to search by keywords in web information retrieval. To improve users' browse experiences and facilitate the web directory construction, in this paper, we propose a novel browse system called Social Web Directory (SWD for short) by integrating web directories and social bookmarks. In SWD, (1) web pages are automatically categorized to a hierarchical structure to be retrieved efficiently, and (2) the popular web pages, hottest tags, and expert users in each category are ranked to help users find information more conveniently. Extensive experimental results demonstrate the effectiveness of our SWD system.
在网络信息检索中,利用网络目录或社交书签进行浏览是对关键词搜索的重要补充。为了提高用户的浏览体验,方便网络目录的构建,本文提出了一种将网络目录与社交书签相结合的新型浏览系统Social web directory(简称SWD)。在SWD中,(1)自动将网页分类为一个层次结构,以便高效地检索;(2)对每个类别中的热门网页、最热标签和专家用户进行排名,以帮助用户更方便地查找信息。大量的实验结果证明了我们的SWD系统的有效性。
{"title":"Browse with a social web directory","authors":"H. Huang, Yunjun Gao, Lu Chen, Rui Li, K. Chiew, Qinming He","doi":"10.1145/2484028.2484141","DOIUrl":"https://doi.org/10.1145/2484028.2484141","url":null,"abstract":"Browse with either web directories or social bookmarks is an important complementation to search by keywords in web information retrieval. To improve users' browse experiences and facilitate the web directory construction, in this paper, we propose a novel browse system called Social Web Directory (SWD for short) by integrating web directories and social bookmarks. In SWD, (1) web pages are automatically categorized to a hierarchical structure to be retrieved efficiently, and (2) the popular web pages, hottest tags, and expert users in each category are ranked to help users find information more conveniently. Extensive experimental results demonstrate the effectiveness of our SWD system.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122223130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Informational friend recommendation in social media 社交媒体中的信息好友推荐
Shengxian Wan, Yanyan Lan, J. Guo, Chaosheng Fan, Xueqi Cheng
It is well recognized that users rely on social media (e.g. Twitter or Digg) to fulfill two common needs (i.e. social need and informational need) that is to keep in touch with their friends in the real world and to have access to information they are interested in. Traditional friend recommendation methods in social media mainly focus on a user's social need, but seldom address their informational need (i.e. suggesting friends that can provide information one may be interested in but have not been able to obtain so far). In this paper, we propose to recommend friends according to the informational utility, which stands for the degree to which a friend satisfies the target user's unfulfilled informational need, called informational friend recommendation. In order to capture users' informational need, we view a post in social media as an item and utilize collaborative filtering techniques to predict the rating for each post. The candidate friends are then ranked according to their informational utility for recommendation. In addition, we also show how to further consider diversity in such recommendations. Experiments on benchmark datasets demonstrate that our approach can significantly outperform the traditional friend recommendation methods under informational evaluation measures.
众所周知,用户依赖社交媒体(如Twitter或Digg)来满足两种共同的需求(即社交需求和信息需求),即与现实世界中的朋友保持联系,并获得他们感兴趣的信息。传统的社交媒体好友推荐方法主要关注用户的社交需求,而很少关注用户的信息需求(即推荐可以提供自己可能感兴趣但目前还无法获得的信息的朋友)。在本文中,我们提出根据信息效用推荐朋友,即朋友满足目标用户未满足的信息需求的程度,称为信息推荐。为了捕捉用户的信息需求,我们将社交媒体中的帖子视为一个项目,并利用协同过滤技术来预测每个帖子的评级。然后根据推荐的信息效用对候选朋友进行排名。此外,我们还展示了如何在这些建议中进一步考虑多样性。在基准数据集上的实验表明,在信息评价度量下,我们的方法可以显著优于传统的朋友推荐方法。
{"title":"Informational friend recommendation in social media","authors":"Shengxian Wan, Yanyan Lan, J. Guo, Chaosheng Fan, Xueqi Cheng","doi":"10.1145/2484028.2484179","DOIUrl":"https://doi.org/10.1145/2484028.2484179","url":null,"abstract":"It is well recognized that users rely on social media (e.g. Twitter or Digg) to fulfill two common needs (i.e. social need and informational need) that is to keep in touch with their friends in the real world and to have access to information they are interested in. Traditional friend recommendation methods in social media mainly focus on a user's social need, but seldom address their informational need (i.e. suggesting friends that can provide information one may be interested in but have not been able to obtain so far). In this paper, we propose to recommend friends according to the informational utility, which stands for the degree to which a friend satisfies the target user's unfulfilled informational need, called informational friend recommendation. In order to capture users' informational need, we view a post in social media as an item and utilize collaborative filtering techniques to predict the rating for each post. The candidate friends are then ranked according to their informational utility for recommendation. In addition, we also show how to further consider diversity in such recommendations. Experiments on benchmark datasets demonstrate that our approach can significantly outperform the traditional friend recommendation methods under informational evaluation measures.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131202271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Explicit feedback in local search tasks 本地搜索任务的显式反馈
Dmitry Lagun, Avneesh Sud, Ryen W. White, P. Bailey, Georg Buscher
Modern search engines make extensive use of people's contextual information to finesse result rankings. Using a searcher's location provides an especially strong signal for adjusting results for certain classes of queries where people may have clear preference for local results, without explicitly specifying the location in the query direct-ly. However, if the location estimate is inaccurate or searchers want to obtain many results from a particular location, they have limited control on the location focus in the search results returned. In this paper we describe a user study that examines the effect of offering searchers more control over how local preferences are gathered and used. We studied providing users with functionality to offer explicit relevance feedback (ERF) adjacent to results automatically identi-fied as location-dependent (i.e., more from this location). They can use this functionality to indicate whether they are interested in a particular search result and desire more results from that result's location. We compared the ERF system against a baseline (NoERF) that used the same underlying mechanisms to retrieve and rank results, but did not offer ERF support. User performance was as-sessed across 12 experimental participants over 12 location-sensitive topics, in a fully counter-balanced design. We found that participants interacted with ERF frequently, and there were signs that ERF has the potential to improve success rates and lead to more efficient searching for location-sensitive search tasks than NoERF.
现代搜索引擎广泛利用人们的上下文信息来优化结果排名。使用搜索者的位置为调整某些查询类别的结果提供了一个特别强的信号,在这些查询类别中,人们可能对本地结果有明确的偏好,而无需直接在查询中显式指定位置。但是,如果位置估计不准确,或者搜索者希望从特定位置获得许多结果,则他们对返回的搜索结果中的位置焦点的控制有限。在本文中,我们描述了一项用户研究,该研究检验了为搜索者提供更多控制如何收集和使用本地偏好的效果。我们研究了为用户提供提供显式相关反馈(ERF)的功能,该功能与自动识别为位置相关的结果相邻(即,来自该位置的更多内容)。他们可以使用这个功能来表明他们是否对特定的搜索结果感兴趣,并希望从该结果的位置获得更多结果。我们将ERF系统与基线(NoERF)进行了比较,NoERF使用相同的底层机制来检索和排序结果,但不提供ERF支持。在一个完全平衡的设计中,对12个位置敏感主题的12名实验参与者的用户表现进行了评估。我们发现参与者经常与ERF进行交互,并且有迹象表明,ERF有可能提高成功率,并且比NoERF更有效地搜索位置敏感的搜索任务。
{"title":"Explicit feedback in local search tasks","authors":"Dmitry Lagun, Avneesh Sud, Ryen W. White, P. Bailey, Georg Buscher","doi":"10.1145/2484028.2484123","DOIUrl":"https://doi.org/10.1145/2484028.2484123","url":null,"abstract":"Modern search engines make extensive use of people's contextual information to finesse result rankings. Using a searcher's location provides an especially strong signal for adjusting results for certain classes of queries where people may have clear preference for local results, without explicitly specifying the location in the query direct-ly. However, if the location estimate is inaccurate or searchers want to obtain many results from a particular location, they have limited control on the location focus in the search results returned. In this paper we describe a user study that examines the effect of offering searchers more control over how local preferences are gathered and used. We studied providing users with functionality to offer explicit relevance feedback (ERF) adjacent to results automatically identi-fied as location-dependent (i.e., more from this location). They can use this functionality to indicate whether they are interested in a particular search result and desire more results from that result's location. We compared the ERF system against a baseline (NoERF) that used the same underlying mechanisms to retrieve and rank results, but did not offer ERF support. User performance was as-sessed across 12 experimental participants over 12 location-sensitive topics, in a fully counter-balanced design. We found that participants interacted with ERF frequently, and there were signs that ERF has the potential to improve success rates and lead to more efficient searching for location-sensitive search tasks than NoERF.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132015235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Learning to name faces: a multimodal learning scheme for search-based face annotation 学习命名人脸:基于搜索的人脸标注的多模态学习方案
Dayong Wang, S. Hoi, Pengcheng Wu, Jianke Zhu, Ying He, C. Miao
Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) -- a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the "label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task.
自动人脸标注旨在从照片中自动检测人脸,并进一步使用相应的人名对人脸进行命名。在本文中,我们通过研究一种基于搜索的面部注释(SBFA)范式来解决这个开放的问题,该范式用于挖掘WWW上免费提供的大量网络面部图像。给定要标注的查询面部图像,SBFA的思想是首先从web面部图像数据库中搜索top-n个相似的面部图像,然后利用这些排名靠前的相似面部图像及其弱标签为查询面部图像命名。为了充分挖掘这些信息,本文提出了一种新的面孔命名学习框架(L2NF)——一种基于搜索的人脸标注的统一多模态学习方法,它由以下主要组成部分组成:(i)利用“标签平滑”假设增强排名靠前的相似图像的弱标签;(ii)我们通过提取不同类型的特征来构建面部图像的多模态表示;(iii)我们使用距离度量学习技术对每种特征的距离度量进行优化;最后(iv)我们通过学习排序方案来学习标注的多种模式的最优组合。我们在两个真实世界的人脸图像数据库上进行了大量的实证研究,结果令人鼓舞,表明所提出的算法显著提高了基于搜索的人脸标注任务的命名准确率。
{"title":"Learning to name faces: a multimodal learning scheme for search-based face annotation","authors":"Dayong Wang, S. Hoi, Pengcheng Wu, Jianke Zhu, Ying He, C. Miao","doi":"10.1145/2484028.2484040","DOIUrl":"https://doi.org/10.1145/2484028.2484040","url":null,"abstract":"Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) -- a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the \"label smoothness\" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134090909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
ThemeStreams: visualizing the stream of themes discussed in politics 主题流:可视化政治中讨论的主题流
O. D. Rooij, Daan Odijk, M. de Rijke
The political landscape is fluid. Discussions are always ongoing and new "hot topics" continue to appear in the headlines. But what made people start talking about that topic? And who started it? Because of the speed at which discussions sometimes take place this can be difficult to track down. We describe ThemeStreams: a demonstrator that maps political discussions to themes and influencers and illustrate how this mapping is used in an interactive visualization that shows us which themes are being discussed, and that helps us answer the question "Who put this issue on the map?" in streams of political data.
政治形势多变。讨论总是在进行,新的“热门话题”不断出现在头条新闻中。但是是什么让人们开始谈论这个话题呢?是谁挑起的?由于讨论有时发生的速度很快,因此很难追踪。我们描述了ThemeStreams:一个将政治讨论映射到主题和影响者的演示器,并说明如何在交互式可视化中使用这种映射,向我们展示正在讨论的主题,这有助于我们在政治数据流中回答“谁把这个问题放在地图上?”
{"title":"ThemeStreams: visualizing the stream of themes discussed in politics","authors":"O. D. Rooij, Daan Odijk, M. de Rijke","doi":"10.1145/2484028.2484215","DOIUrl":"https://doi.org/10.1145/2484028.2484215","url":null,"abstract":"The political landscape is fluid. Discussions are always ongoing and new \"hot topics\" continue to appear in the headlines. But what made people start talking about that topic? And who started it? Because of the speed at which discussions sometimes take place this can be difficult to track down. We describe ThemeStreams: a demonstrator that maps political discussions to themes and influencers and illustrate how this mapping is used in an interactive visualization that shows us which themes are being discussed, and that helps us answer the question \"Who put this issue on the map?\" in streams of political data.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133881364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Designing search usability 设计搜索可用性
Tony Russell-Rose
{"title":"Designing search usability","authors":"Tony Russell-Rose","doi":"10.1007/978-3-319-12979-2_10","DOIUrl":"https://doi.org/10.1007/978-3-319-12979-2_10","url":null,"abstract":"","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133906244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective approaches to retrieving and using expertise in social media 检索和使用社交媒体专业知识的有效方法
Reyyan Yeniterzi
Expert retrieval has been widely studied especially after the introduction of Expert Finding task in the TREC's Enterprise Track in 2005 [3]. This track provided two different test collections crawled from two organizations' public-facing websites and internal emails which led to the development of many state-of-the-art algorithms on expert retrieval [1]. Until recently, these datasets were considered good representatives of the information resources available within enterprise. However, the recent growth of social media also influenced the work environment, and social media became a common communication and collaboration tool within organizations. According to a recent survey by McKinsey Global Institute [2], 29% of the companies use at least one social media tool for matching their employees to tasks, and 26% of them assess their employees' performance by using social media. This shows that intra-organizational social media became an important resource to identify expertise within organizations. In recent years, in addition to the intra-organizational social media, public social media tools like Twitter, Facebook, LinkedIn also became common environments for searching expertise. These tools provide an opportunity for their users to show their specific skills to the world which motivates recruiters to look for talented job candidates on social media, or writers and reporters to find experts for consulting on specific topics they are working on. With these motivations in mind, in this work we propose to develop expert retrieval algorithms for intra-organizational and public social media tools. Social media datasets have both challenges and advantages. In terms of challenges, they do not always contain context on one specific domain, instead one social media tool may contain discussions on technical stuff, hobbies or news concurrently. They may also contain spam posts or advertisements. Compared to well-edited enterprise documents, they are much more informal in language. Furthermore, depending on the social media platform, they may have limits on the number of characters used in posts. Even though they include the challenges stated above, they also bring some unique authority signals, such as votes, comments, follower/following information, which can be useful in estimating expertise. Furthermore, compared to previously used enterprise documents, social media provides clear associations between documents and candidates in the context of authorship information. In this work, we propose to develop expert retrieval approaches which will handle these challenges while making use of the advantages. Expert retrieval is a very useful application by itself; furthermore, it can be a step towards improving other social media applications. Social media is different than other web based tools mainly because it is dependent on its users. In social media, users are not just content consumers, but they are also the primary and sometimes the only content creators
自2005年在TREC的企业轨道中引入专家查找任务以来,专家检索得到了广泛的研究。这条赛道提供了从两个组织的面向公众的网站和内部电子邮件中抓取的两个不同的测试集合,这导致了专家检索[1]上许多最先进算法的发展。直到最近,这些数据集还被认为是企业内可用信息资源的良好代表。然而,最近社交媒体的发展也影响了工作环境,社交媒体成为组织内部常见的沟通和协作工具。根据麦肯锡全球研究院(McKinsey Global Institute)最近的一项调查,29%的公司至少使用一种社交媒体工具来匹配员工的任务,26%的公司通过使用社交媒体来评估员工的表现。这表明组织内社交媒体成为组织内识别专业知识的重要资源。近年来,除了组织内部的社交媒体外,Twitter、Facebook、LinkedIn等公共社交媒体工具也成为搜索专业知识的常见环境。这些工具为他们的用户提供了一个向世界展示他们的特定技能的机会,这促使招聘人员在社交媒体上寻找有才华的求职者,或者作家和记者找到专家来咨询他们正在研究的特定主题。考虑到这些动机,在这项工作中,我们建议为组织内部和公共社交媒体工具开发专家检索算法。社交媒体数据集既有挑战,也有优势。就挑战而言,它们并不总是包含特定领域的上下文,相反,一个社交媒体工具可能同时包含有关技术内容、爱好或新闻的讨论。它们也可能包含垃圾邮件或广告。与精心编辑的企业文档相比,它们在语言上要随意得多。此外,根据社交媒体平台的不同,他们可能会限制帖子中使用的字符数量。尽管它们包括上述挑战,但它们也带来了一些独特的权威信号,如投票、评论、追随者/跟踪信息,这些信息在评估专业知识时很有用。此外,与以前使用的企业文档相比,社交媒体在作者信息上下文中提供了文档和候选人之间的明确关联。在这项工作中,我们建议开发专家检索方法来处理这些挑战,同时利用优势。专家检索本身就是一个非常有用的应用;此外,它可以成为改进其他社交媒体应用程序的一步。社交媒体不同于其他基于网络的工具,主要是因为它依赖于它的用户。在社交媒体中,用户不仅仅是内容的消费者,他们也是主要的,有时甚至是唯一的内容创造者。因此,社交媒体中任何用户生成内容的质量取决于其创建者。在本文中,我们建议使用用户的专业知识来改进现有的应用程序,以便他们不仅可以基于内容,还可以基于内容创建者的专业知识来估计内容的相关性。通过使用内容生成器的专业知识,我们也希望增加更可靠的内容。我们建议利用这些用户的专业知识信息来改进社交媒体中的特别搜索和问答应用程序。在这项工作中,以前的TREC企业数据集,可用的组织内部社交媒体和公共社交媒体数据集将用于测试所提出的算法。
{"title":"Effective approaches to retrieving and using expertise in social media","authors":"Reyyan Yeniterzi","doi":"10.1145/2484028.2484230","DOIUrl":"https://doi.org/10.1145/2484028.2484230","url":null,"abstract":"Expert retrieval has been widely studied especially after the introduction of Expert Finding task in the TREC's Enterprise Track in 2005 [3]. This track provided two different test collections crawled from two organizations' public-facing websites and internal emails which led to the development of many state-of-the-art algorithms on expert retrieval [1]. Until recently, these datasets were considered good representatives of the information resources available within enterprise. However, the recent growth of social media also influenced the work environment, and social media became a common communication and collaboration tool within organizations. According to a recent survey by McKinsey Global Institute [2], 29% of the companies use at least one social media tool for matching their employees to tasks, and 26% of them assess their employees' performance by using social media. This shows that intra-organizational social media became an important resource to identify expertise within organizations. In recent years, in addition to the intra-organizational social media, public social media tools like Twitter, Facebook, LinkedIn also became common environments for searching expertise. These tools provide an opportunity for their users to show their specific skills to the world which motivates recruiters to look for talented job candidates on social media, or writers and reporters to find experts for consulting on specific topics they are working on. With these motivations in mind, in this work we propose to develop expert retrieval algorithms for intra-organizational and public social media tools. Social media datasets have both challenges and advantages. In terms of challenges, they do not always contain context on one specific domain, instead one social media tool may contain discussions on technical stuff, hobbies or news concurrently. They may also contain spam posts or advertisements. Compared to well-edited enterprise documents, they are much more informal in language. Furthermore, depending on the social media platform, they may have limits on the number of characters used in posts. Even though they include the challenges stated above, they also bring some unique authority signals, such as votes, comments, follower/following information, which can be useful in estimating expertise. Furthermore, compared to previously used enterprise documents, social media provides clear associations between documents and candidates in the context of authorship information. In this work, we propose to develop expert retrieval approaches which will handle these challenges while making use of the advantages. Expert retrieval is a very useful application by itself; furthermore, it can be a step towards improving other social media applications. Social media is different than other web based tools mainly because it is dependent on its users. In social media, users are not just content consumers, but they are also the primary and sometimes the only content creators","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127693238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Answering natural language queries over linked data graphs: a distributional semantics approach 回答链接数据图上的自然语言查询:一种分布式语义方法
A. Freitas, Fabrício F. de Faria, Seán O'Riain, E. Curry
This paper demonstrates Treo, a natural language query mechanism for Linked Data graphs. The approach uses a distributional semantic vector space model to semantically match user query terms with data, supporting vocabulary-independent (or schema-agnostic) queries over structured data.
本文演示了Treo,一种关联数据图的自然语言查询机制。该方法使用分布式语义向量空间模型在语义上匹配用户查询术语和数据,支持对结构化数据进行与词汇表无关(或与模式无关)的查询。
{"title":"Answering natural language queries over linked data graphs: a distributional semantics approach","authors":"A. Freitas, Fabrício F. de Faria, Seán O'Riain, E. Curry","doi":"10.1145/2484028.2484209","DOIUrl":"https://doi.org/10.1145/2484028.2484209","url":null,"abstract":"This paper demonstrates Treo, a natural language query mechanism for Linked Data graphs. The approach uses a distributional semantic vector space model to semantically match user query terms with data, supporting vocabulary-independent (or schema-agnostic) queries over structured data.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127934250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A query and patient understanding framework for medical records search 用于医疗记录搜索的查询和患者理解框架
Nut Limsopatham
Electronic medical records (EMRs) are being increasingly used worldwide to facilitate improved healthcare services [2,3]. They describe the clinical decision process relating to a patient, detailing the observed symptoms, the conducted diagnostic tests, the identified diagnoses and the prescribed treatments. However, medical records search is challenging, due to the implicit knowledge inherent within the medical records - such knowledge may be known by medical practitioners, but hidden to an information retrieval (IR) system [3]. For instance, the mention of a treatment such as a drug may indicate to a practitioner that a particular diagnosis has been made even if this was not explicitly mentioned in the patient's EMRs. Moreover, the fact that a symptom has not been observed by a clinician may rule out some specific diagnoses. Our work focuses on searching EMRs to identify patients with medical histories relevant to the medical condition(s) stated in a query. The resulting system can be beneficial to healthcare providers, administrators, and researchers who may wish to analyse the effectiveness of a particular medical procedure to combat a specific disease [2,4]. During retrieval, a healthcare provider may indicate a number of inclusion criteria to describe the type of patients of interest. For example, the used criteria may include personal profiles (e.g. age and gender) or some specific medical symptoms and tests, allowing to identify patients that have EMRs matching the criteria. To attain effective retrieval performance, we hypothesise that, in such a medical IR system, both the information needs and patients should be modelled based on how the medical process is developed. Specifically, our thesis states that since the medical decision process typically encompasses four aspects (symptom, diagnostic test, diagnosis, and treatment), a medical search system should take into account these aspects and apply inferences to recover possible implicit knowledge. We postulate that considering these aspects and their derived implicit knowledge at different levels of the retrieval process (namely, sentence, record, and inter-record level) enhances the retrieval performance. Indeed, we propose to build a query and patient understanding framework that can gain insights from EMRs and queries, by modelling and reasoning during retrieval in terms of the four aforementioned aspects (symptom, diagnostic test, diagnosis, and treatment) at three different levels of the retrieval process.
电子医疗记录(emr)在世界范围内越来越多地用于促进改善医疗保健服务[2,3]。它们描述了与患者有关的临床决策过程,详细说明了观察到的症状、进行的诊断测试、确定的诊断和规定的治疗。然而,由于病历中固有的隐性知识——这些知识可能为医生所知,但却隐藏在信息检索(IR)系统中[3],因此,病历搜索具有挑战性。例如,提及治疗(如药物)可能会向医生表明已经做出了特定的诊断,即使在患者的电子病历中没有明确提到这一点。此外,临床医生未观察到症状的事实可能会排除某些特定的诊断。我们的工作重点是搜索电子病历,以识别与查询中所述医疗状况相关的病史的患者。由此产生的系统可以有利于医疗保健提供者、管理人员和研究人员,他们可能希望分析特定医疗程序对抗特定疾病的有效性[2,4]。在检索过程中,医疗保健提供者可能指示许多包含标准来描述感兴趣的患者类型。例如,使用的标准可能包括个人概况(例如年龄和性别)或一些特定的医学症状和测试,从而可以识别电子病历符合标准的患者。为了获得有效的检索性能,我们假设,在这样一个医疗IR系统中,信息需求和患者都应该基于医疗过程的发展方式进行建模。具体来说,我们的论文指出,由于医疗决策过程通常包括四个方面(症状,诊断测试,诊断和治疗),医疗搜索系统应该考虑到这些方面,并应用推断来恢复可能的隐性知识。我们假设在检索过程的不同层次(即句子、记录和记录间层次)考虑这些方面及其衍生的隐性知识可以提高检索性能。事实上,我们建议建立一个查询和患者理解框架,通过在检索过程的三个不同层次上对上述四个方面(症状、诊断测试、诊断和治疗)进行建模和推理,可以从emr和查询中获得见解。
{"title":"A query and patient understanding framework for medical records search","authors":"Nut Limsopatham","doi":"10.1145/2484028.2484228","DOIUrl":"https://doi.org/10.1145/2484028.2484228","url":null,"abstract":"Electronic medical records (EMRs) are being increasingly used worldwide to facilitate improved healthcare services [2,3]. They describe the clinical decision process relating to a patient, detailing the observed symptoms, the conducted diagnostic tests, the identified diagnoses and the prescribed treatments. However, medical records search is challenging, due to the implicit knowledge inherent within the medical records - such knowledge may be known by medical practitioners, but hidden to an information retrieval (IR) system [3]. For instance, the mention of a treatment such as a drug may indicate to a practitioner that a particular diagnosis has been made even if this was not explicitly mentioned in the patient's EMRs. Moreover, the fact that a symptom has not been observed by a clinician may rule out some specific diagnoses. Our work focuses on searching EMRs to identify patients with medical histories relevant to the medical condition(s) stated in a query. The resulting system can be beneficial to healthcare providers, administrators, and researchers who may wish to analyse the effectiveness of a particular medical procedure to combat a specific disease [2,4]. During retrieval, a healthcare provider may indicate a number of inclusion criteria to describe the type of patients of interest. For example, the used criteria may include personal profiles (e.g. age and gender) or some specific medical symptoms and tests, allowing to identify patients that have EMRs matching the criteria. To attain effective retrieval performance, we hypothesise that, in such a medical IR system, both the information needs and patients should be modelled based on how the medical process is developed. Specifically, our thesis states that since the medical decision process typically encompasses four aspects (symptom, diagnostic test, diagnosis, and treatment), a medical search system should take into account these aspects and apply inferences to recover possible implicit knowledge. We postulate that considering these aspects and their derived implicit knowledge at different levels of the retrieval process (namely, sentence, record, and inter-record level) enhances the retrieval performance. Indeed, we propose to build a query and patient understanding framework that can gain insights from EMRs and queries, by modelling and reasoning during retrieval in terms of the four aforementioned aspects (symptom, diagnostic test, diagnosis, and treatment) at three different levels of the retrieval process.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129079654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
TweetMogaz: a news portal of tweets TweetMogaz: tweets的新闻门户
Walid Magdy
Twitter is currently one of the largest social hubs for users to spread and discuss news. For most of the top news stories happening, there are corresponding discussions on social media. In this demonstration TweetMogaz is presented, which is a platform for microblog search and filtering. It creates a real-time comprehensive report about what people discuss and share around news happening in certain regions. TweetMogaz reports the most popular tweets, jokes, videos, images, and news articles that people share about top news stories. Moreover, it allows users to search for specific topics. A scalable automatic technique for microblog filtering is used to obtain relevant tweets to a certain news category in a region. TweetMogaz.com demonstrates the effectiveness of our filtering technique for reporting public response toward news in different Arabic regions including Egypt and Syria in real-time.
推特目前是用户传播和讨论新闻的最大社交中心之一。对于大多数正在发生的头条新闻,社交媒体上都有相应的讨论。在这个演示中,TweetMogaz是一个微博搜索和过滤平台。它创建了一个实时的综合报告,关于人们讨论和分享在某些地区发生的新闻。TweetMogaz报告最受欢迎的推文、笑话、视频、图片和人们分享的新闻报道。此外,它还允许用户搜索特定的主题。采用一种可扩展的微博自动过滤技术,获取某一地区某一新闻类别的相关推文。TweetMogaz.com展示了我们的过滤技术在不同阿拉伯地区(包括埃及和叙利亚)实时报道公众对新闻反应的有效性。
{"title":"TweetMogaz: a news portal of tweets","authors":"Walid Magdy","doi":"10.1145/2484028.2484212","DOIUrl":"https://doi.org/10.1145/2484028.2484212","url":null,"abstract":"Twitter is currently one of the largest social hubs for users to spread and discuss news. For most of the top news stories happening, there are corresponding discussions on social media. In this demonstration TweetMogaz is presented, which is a platform for microblog search and filtering. It creates a real-time comprehensive report about what people discuss and share around news happening in certain regions. TweetMogaz reports the most popular tweets, jokes, videos, images, and news articles that people share about top news stories. Moreover, it allows users to search for specific topics. A scalable automatic technique for microblog filtering is used to obtain relevant tweets to a certain news category in a region. TweetMogaz.com demonstrates the effectiveness of our filtering technique for reporting public response toward news in different Arabic regions including Egypt and Syria in real-time.","PeriodicalId":178818,"journal":{"name":"Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129305104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1