首页 > 最新文献

2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)最新文献

英文 中文
Predicting time-sensitive user locations from social media 从社交媒体预测时间敏感的用户位置
A. Jaiswal, Wei Peng, Tong Sun
Access to massive real-time user generated personal information from micro blogging services, such as Twitter and Facebook, has the potential to enable new location-based recommendation and advertising services. However, sparse user profile information and low adoption of per-message geo-coordinate information necessitates development of location detection techniques that exposes a user's location from message content. We propose and evaluate content-based machine learning techniques to a) identify tweets containing a user's location, and, b) categorize a user location into the author's present or future location. Such an approach is advantageous because it a) relies purely on message content, b) can be used to predict a user's future presence at a location, c) relates user locations to some context (activities, trip plans, etc.), and, d) can be used to profile users constantly evolving location. Our experimental evaluation shows that the proposed techniques can identify and categorize user locations from message content with high accuracy. We also extract the time entities associated with a user's future location to show when the user would be at that location. Finally we illustrate the location-based data analytics potential of these techniques on two real-world datasets.
从Twitter和Facebook等微博服务中获取大量实时用户生成的个人信息,有可能催生新的基于位置的推荐和广告服务。然而,稀疏的用户概要信息和对每条消息地理坐标信息的低采用率需要开发从消息内容中暴露用户位置的位置检测技术。我们提出并评估了基于内容的机器学习技术,以a)识别包含用户位置的推文,以及b)将用户位置分类为作者现在或未来的位置。这种方法是有利的,因为它a)完全依赖于消息内容,b)可用于预测用户未来在某个位置的存在,c)将用户位置与某些上下文(活动、旅行计划等)联系起来,并且d)可用于描述用户不断变化的位置。实验结果表明,该方法能够较准确地从消息内容中识别和分类用户位置。我们还提取与用户未来位置相关的时间实体,以显示用户将在该位置的时间。最后,我们说明了这些技术在两个现实世界数据集上基于位置的数据分析的潜力。
{"title":"Predicting time-sensitive user locations from social media","authors":"A. Jaiswal, Wei Peng, Tong Sun","doi":"10.1145/2492517.2500229","DOIUrl":"https://doi.org/10.1145/2492517.2500229","url":null,"abstract":"Access to massive real-time user generated personal information from micro blogging services, such as Twitter and Facebook, has the potential to enable new location-based recommendation and advertising services. However, sparse user profile information and low adoption of per-message geo-coordinate information necessitates development of location detection techniques that exposes a user's location from message content. We propose and evaluate content-based machine learning techniques to a) identify tweets containing a user's location, and, b) categorize a user location into the author's present or future location. Such an approach is advantageous because it a) relies purely on message content, b) can be used to predict a user's future presence at a location, c) relates user locations to some context (activities, trip plans, etc.), and, d) can be used to profile users constantly evolving location. Our experimental evaluation shows that the proposed techniques can identify and categorize user locations from message content with high accuracy. We also extract the time entities associated with a user's future location to show when the user would be at that location. Finally we illustrate the location-based data analytics potential of these techniques on two real-world datasets.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127041489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
OverCite: Finding overlapping communities in citation network OverCite:在引文网络中寻找重叠的社区
Tanmoy Chakraborty, Abhijnan Chakraborty
Citation analysis is a popular area of research, which has been usually used to rank the authors and the publication venues of research papers. With huge number of publications every year, it has become difficult for the users to find relevant publication materials. One simple solution to this problem is to detect communities from the citation network and recommend papers based on the common membership in communities. But, in today's research scenario, many researchers' fields of interest spread into multiple research directions resulting in an increasing number of interdisciplinary publications. Therefore, it is necessary to detect overlapping communities for relevant recommendation. In this paper, we represent publication information as a tripartite `Publication Hypergraph' consisting of authors, papers and publication venues (conferences/journals) in three partitions. We then propose an algorithm called `OverCite', which can detect overlapping communities of authors, papers and venues simultaneously using the publication hypergraph and the citation network information. We compare OverCite with two existing overlapping community detection algorithms, Clique Percolation Method (CPM) and iLCD, applied on citation network. The experiments on a large real-world citation dataset show that OverCite outperforms other two algorithms. We also present a simple paper search and recommendation system. Based on the relevance judgements of the users, we further prove the effectiveness of OverCite over other two algorithms.
引文分析是一个热门的研究领域,通常用于对研究论文的作者和发表地点进行排名。由于每年出版的出版物数量巨大,用户很难找到相关的出版物资料。解决这一问题的一个简单方法是从引文网络中检测社区,并根据社区的共同成员资格推荐论文。但是,在当今的研究情况下,许多研究人员感兴趣的领域向多个研究方向扩展,导致跨学科出版物越来越多。因此,有必要检测重叠的社区,以便进行相关的推荐。在本文中,我们将出版信息表示为一个由作者、论文和出版场所(会议/期刊)组成的三方“出版超图”。然后,我们提出了一种名为“OverCite”的算法,该算法可以利用出版超图和引文网络信息同时检测作者、论文和场地的重叠社区。我们将OverCite与现有的两种用于引文网络的重叠社区检测算法Clique per渗法(CPM)和iLCD进行了比较。在大型真实引文数据集上的实验表明,OverCite优于其他两种算法。我们还提出了一个简单的论文搜索和推荐系统。基于用户的相关性判断,我们进一步证明了OverCite优于其他两种算法的有效性。
{"title":"OverCite: Finding overlapping communities in citation network","authors":"Tanmoy Chakraborty, Abhijnan Chakraborty","doi":"10.1145/2492517.2500255","DOIUrl":"https://doi.org/10.1145/2492517.2500255","url":null,"abstract":"Citation analysis is a popular area of research, which has been usually used to rank the authors and the publication venues of research papers. With huge number of publications every year, it has become difficult for the users to find relevant publication materials. One simple solution to this problem is to detect communities from the citation network and recommend papers based on the common membership in communities. But, in today's research scenario, many researchers' fields of interest spread into multiple research directions resulting in an increasing number of interdisciplinary publications. Therefore, it is necessary to detect overlapping communities for relevant recommendation. In this paper, we represent publication information as a tripartite `Publication Hypergraph' consisting of authors, papers and publication venues (conferences/journals) in three partitions. We then propose an algorithm called `OverCite', which can detect overlapping communities of authors, papers and venues simultaneously using the publication hypergraph and the citation network information. We compare OverCite with two existing overlapping community detection algorithms, Clique Percolation Method (CPM) and iLCD, applied on citation network. The experiments on a large real-world citation dataset show that OverCite outperforms other two algorithms. We also present a simple paper search and recommendation system. Based on the relevance judgements of the users, we further prove the effectiveness of OverCite over other two algorithms.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114196272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
An interactive visualization interface for studying egocentric, categorical, contact diary datasets 一个交互式的可视化界面,用于研究自我中心,分类,接触日记数据集
Chris Bryan, K. Ma, Yang-chih Fu
Contact diaries are interpersonal communication logs which are obtained in sociological and epidemiological studies. These logs can be used to study the social patterns of communities over a period of time. A dataset composed of diaries maps well to a set of one-tiered, categorical, independent and egocentric networks. This paper presents an interface for visualization and analysis of contact diaries datasets using an interactive radial mapping scheme, with case studies illustrating a standard workflow using the application. We facilitate individual diary analysis, multi-dataset comparison, and an overlay interface for investigating a set of many diaries in a singular space. With this interface, network researchers can utilize visualization to enhance their analysis of contact diaries.
接触日记是在社会学和流行病学研究中获得的人际交往日志。这些日志可以用来研究一段时间内社区的社会模式。由日记组成的数据集可以很好地映射到一组单层的、分类的、独立的、以自我为中心的网络。本文提出了一个使用交互式径向映射方案可视化和分析接触日记数据集的界面,并通过案例研究说明了使用该应用程序的标准工作流。我们促进了个人日记分析,多数据集比较,以及一个覆盖界面,用于在单一空间中调查一组许多日记。有了这个界面,网络研究人员可以利用可视化来增强他们对接触日记的分析。
{"title":"An interactive visualization interface for studying egocentric, categorical, contact diary datasets","authors":"Chris Bryan, K. Ma, Yang-chih Fu","doi":"10.1145/2492517.2492636","DOIUrl":"https://doi.org/10.1145/2492517.2492636","url":null,"abstract":"Contact diaries are interpersonal communication logs which are obtained in sociological and epidemiological studies. These logs can be used to study the social patterns of communities over a period of time. A dataset composed of diaries maps well to a set of one-tiered, categorical, independent and egocentric networks. This paper presents an interface for visualization and analysis of contact diaries datasets using an interactive radial mapping scheme, with case studies illustrating a standard workflow using the application. We facilitate individual diary analysis, multi-dataset comparison, and an overlay interface for investigating a set of many diaries in a singular space. With this interface, network researchers can utilize visualization to enhance their analysis of contact diaries.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122477186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The forecast for electronic health record access: Partly cloudy 电子健康记录访问预报:部分多云
Brian Coats, Subrata Acharya
The mounting pressure to enable widespread access to electronic health record systems is being felt by healthcare providers. The federal government's Meaningful Use incentives are reason alone for providers to address this significant usability issue. As the healthcare industry considers solutions, attention should be given to the Cloud and the considerable investment that has been made related to the establishment of digital identities and making them interoperable across heterogeneous systems. This research considered how the Cloud could be leveraged by healthcare providers to not only provide patients with a familiar way of accessing electronic resources but also creating a significant cost savings for providers. An examination was performed of similar work being done in other industries as well as the standards laid out by the federal government for EHRs and digital identities. This research lays out a comprehensive framework for healthcare providers to easily follow to integrate with the Cloud for identity validation, while meeting compliance guidelines for security and privacy. To demonstrate the viability of this research, a number of pilots and proof of concept projects have already been implemented at a large regional hospital and have produced immediate and tangible improvements.
医疗保健提供者已经感受到广泛使用电子健康记录系统的压力越来越大。联邦政府的“有意义的使用”激励措施是供应商解决这一重大可用性问题的唯一理由。在医疗保健行业考虑解决方案时,应该关注云和与建立数字身份并使其在异构系统之间可互操作相关的大量投资。这项研究考虑了医疗保健提供商如何利用云,不仅为患者提供一种熟悉的访问电子资源的方式,而且还为提供商节省了大量成本。对其他行业正在进行的类似工作以及联邦政府为电子病历和数字身份制定的标准进行了检查。这项研究为医疗保健提供商提供了一个全面的框架,可以轻松地与云集成以进行身份验证,同时满足安全性和隐私性的合规性指导方针。为了证明这项研究的可行性,已经在一家大型区域医院实施了一些试点和概念验证项目,并产生了直接和切实的改进。
{"title":"The forecast for electronic health record access: Partly cloudy","authors":"Brian Coats, Subrata Acharya","doi":"10.1145/2492517.2500329","DOIUrl":"https://doi.org/10.1145/2492517.2500329","url":null,"abstract":"The mounting pressure to enable widespread access to electronic health record systems is being felt by healthcare providers. The federal government's Meaningful Use incentives are reason alone for providers to address this significant usability issue. As the healthcare industry considers solutions, attention should be given to the Cloud and the considerable investment that has been made related to the establishment of digital identities and making them interoperable across heterogeneous systems. This research considered how the Cloud could be leveraged by healthcare providers to not only provide patients with a familiar way of accessing electronic resources but also creating a significant cost savings for providers. An examination was performed of similar work being done in other industries as well as the standards laid out by the federal government for EHRs and digital identities. This research lays out a comprehensive framework for healthcare providers to easily follow to integrate with the Cloud for identity validation, while meeting compliance guidelines for security and privacy. To demonstrate the viability of this research, a number of pilots and proof of concept projects have already been implemented at a large regional hospital and have produced immediate and tangible improvements.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125724716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Combining information extraction and text mining for cancer biomarker detection 结合信息提取和文本挖掘的癌症生物标志物检测
Khaled Dawoud, Shang Gao, Ala Qabaja, P. Karampelas, R. Alhajj
Information technology is advancing faster than anticipated. The amount of data captured and stored in electronic form by far exceeds the capabilities available for comprehensive analysis and effective knowledge discovery. There is always a need for new sophisticated techniques that could extract more of the knowledge hidden in the raw data collected continuously in huge repositories. Biomedicine and computational biology is one of the domains overwhelmed with huge amounts of data that should be carefully analyzed for valuable knowledge that may help uncovering many of the still unknown information related to various diseases threatening the human body. Biomarker detection is one of the areas which have received considerable attention in the research community. There are two sources of data that could be analyzed for biomarker detection, namely gene expression data and the rich literature related to the domain. Our research group has reported achievements analyzing both domains. In this paper, we concentrate on the latter domain by describing a powerful tool which is capable of extracting from the content of a repository (like PubMed) the parts related to a given specific domain like cancer, analyze the retrieved text to extract the key terms with high frequency, present the extracted terms to domain experts for selecting those most relevant to the investigated domain, retrieve from the analyzed text molecules related to the domain by considering the relevant terms, derive the network which will be analyzed to identify potential biomarkers. For the work described in this paper, we considered PubMed and extracted abstracts related to prostate and breast cancer. The reported results are promising; they demonstrate the effectiveness and applicability of the proposed approach.
信息技术的发展比预期的要快。以电子形式捕获和存储的数据量远远超过了全面分析和有效发现知识的能力。总是需要新的复杂技术来提取隐藏在庞大存储库中不断收集的原始数据中的更多知识。生物医学和计算生物学是一个被大量数据淹没的领域,应该仔细分析有价值的知识,这些知识可能有助于揭示与威胁人体的各种疾病有关的许多未知信息。生物标志物检测是近年来备受关注的研究领域之一。生物标志物检测可以分析的数据有两个来源,即基因表达数据和丰富的与该域相关的文献。我们的研究小组已经报告了分析这两个领域的成果。在本文中,我们通过描述一个强大的工具来关注后一个领域,该工具能够从存储库(如PubMed)的内容中提取与特定领域(如癌症)相关的部分,分析检索到的文本以提取高频率的关键术语,将提取的术语呈现给领域专家以选择与所研究领域最相关的术语,并通过考虑相关术语从分析的文本中检索与该领域相关的分子。导出将被分析以识别潜在生物标记物的网络。对于本文中描述的工作,我们参考了PubMed和与前列腺癌和乳腺癌相关的提取摘要。报告的结果是有希望的;它们证明了所提出方法的有效性和适用性。
{"title":"Combining information extraction and text mining for cancer biomarker detection","authors":"Khaled Dawoud, Shang Gao, Ala Qabaja, P. Karampelas, R. Alhajj","doi":"10.1145/2492517.2500281","DOIUrl":"https://doi.org/10.1145/2492517.2500281","url":null,"abstract":"Information technology is advancing faster than anticipated. The amount of data captured and stored in electronic form by far exceeds the capabilities available for comprehensive analysis and effective knowledge discovery. There is always a need for new sophisticated techniques that could extract more of the knowledge hidden in the raw data collected continuously in huge repositories. Biomedicine and computational biology is one of the domains overwhelmed with huge amounts of data that should be carefully analyzed for valuable knowledge that may help uncovering many of the still unknown information related to various diseases threatening the human body. Biomarker detection is one of the areas which have received considerable attention in the research community. There are two sources of data that could be analyzed for biomarker detection, namely gene expression data and the rich literature related to the domain. Our research group has reported achievements analyzing both domains. In this paper, we concentrate on the latter domain by describing a powerful tool which is capable of extracting from the content of a repository (like PubMed) the parts related to a given specific domain like cancer, analyze the retrieved text to extract the key terms with high frequency, present the extracted terms to domain experts for selecting those most relevant to the investigated domain, retrieve from the analyzed text molecules related to the domain by considering the relevant terms, derive the network which will be analyzed to identify potential biomarkers. For the work described in this paper, we considered PubMed and extracted abstracts related to prostate and breast cancer. The reported results are promising; they demonstrate the effectiveness and applicability of the proposed approach.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126552787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The influence of feedback with different opinions on continued user participation in online newsgroups 不同意见的反馈对用户持续参与在线新闻组的影响
Teng Wang, Keith C. Wang, Fredrik Erlandsson, S. F. Wu, Robert W. Faris
With the popularity of social media in recent years, it has been a critical topic for social network designer to understand the factors that influence continued user participation in online newsgroups. Our study examines how feedback with different opinions is associated with participants' lifetime in online newsgroups. Firstly, we propose a new method of classifying different opinions among user interaction contents. Generally, we leverage user behavior information in online newsgroups to estimate their opinions and evaluate our classification results based on linguistic features. In addition, we also implement this opinion classification method into our SINCERE system as a real-time service. Based on this opinion classification tool, we use survival analysis to examine how others' feedback with different opinions influence continued participation. In our experiment, we analyze more than 88,770 interactions on the official Occupy LA Facebook page. Our final result shows that not only the feedback with the same opinions as the user, but also the feedback with different opinions can motivate continued user participation in online newsgroup. Furthermore, an interaction of feedback with both the same and different opinions can boost user continued participation to the greatest extent. This finding forms the basis of understanding how to improve online service in social media.
随着近年来社交媒体的普及,了解影响用户持续参与在线新闻组的因素已成为社交网络设计者的一个重要课题。我们的研究考察了不同意见的反馈与在线新闻组参与者的寿命之间的关系。首先,我们提出了一种对用户交互内容中不同意见进行分类的新方法。通常,我们利用在线新闻组中的用户行为信息来估计他们的意见,并基于语言特征评估我们的分类结果。此外,我们还将这种意见分类方法作为实时服务,落实到我们的诚信系统中。基于这个意见分类工具,我们使用生存分析来检验其他人的不同意见反馈如何影响持续参与。在我们的实验中,我们分析了占领洛杉矶官方Facebook页面上超过88,770个互动。我们的最终结果表明,无论是与用户意见一致的反馈,还是与用户意见不同的反馈,都能激发用户在网络新闻组中的持续参与。此外,相同意见和不同意见的反馈互动可以最大程度地促进用户的持续参与。这一发现为理解如何改善社交媒体的在线服务奠定了基础。
{"title":"The influence of feedback with different opinions on continued user participation in online newsgroups","authors":"Teng Wang, Keith C. Wang, Fredrik Erlandsson, S. F. Wu, Robert W. Faris","doi":"10.1145/2492517.2492555","DOIUrl":"https://doi.org/10.1145/2492517.2492555","url":null,"abstract":"With the popularity of social media in recent years, it has been a critical topic for social network designer to understand the factors that influence continued user participation in online newsgroups. Our study examines how feedback with different opinions is associated with participants' lifetime in online newsgroups. Firstly, we propose a new method of classifying different opinions among user interaction contents. Generally, we leverage user behavior information in online newsgroups to estimate their opinions and evaluate our classification results based on linguistic features. In addition, we also implement this opinion classification method into our SINCERE system as a real-time service. Based on this opinion classification tool, we use survival analysis to examine how others' feedback with different opinions influence continued participation. In our experiment, we analyze more than 88,770 interactions on the official Occupy LA Facebook page. Our final result shows that not only the feedback with the same opinions as the user, but also the feedback with different opinions can motivate continued user participation in online newsgroup. Furthermore, an interaction of feedback with both the same and different opinions can boost user continued participation to the greatest extent. This finding forms the basis of understanding how to improve online service in social media.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131239375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Recommender system by grasping individual preference and influence from other users 推荐系统通过掌握个人偏好和其他用户的影响
Tae Sato, Masanori Fujita, Minoru Kobayashi, Koji Ito
We propose a recommendation method that considers the user's individual preference and influence from other users in social media. This method predicts the user's individual preference and influence from other users by applying the probability of divergence from random-selection based on a statistical hypothesis test as a form of modified content-based filtering. We evaluated the proposed method by focusing on the rate at which items that have recommended tags are contained among all items. The proposed method is shown to have higher accuracy than traditional content-based filtering. It is especially effective when some percentage of the items have recommendation tags.
我们提出了一种考虑用户个人偏好和社交媒体中其他用户影响的推荐方法。该方法将基于统计假设检验的随机选择的偏离概率作为一种改进的基于内容的过滤形式,预测用户的个人偏好和其他用户的影响。我们通过关注具有推荐标签的项目在所有项目中的包含率来评估所提出的方法。结果表明,该方法比传统的基于内容的过滤方法具有更高的准确率。当一定比例的项目有推荐标签时,这种方法尤其有效。
{"title":"Recommender system by grasping individual preference and influence from other users","authors":"Tae Sato, Masanori Fujita, Minoru Kobayashi, Koji Ito","doi":"10.1145/2492517.2500283","DOIUrl":"https://doi.org/10.1145/2492517.2500283","url":null,"abstract":"We propose a recommendation method that considers the user's individual preference and influence from other users in social media. This method predicts the user's individual preference and influence from other users by applying the probability of divergence from random-selection based on a statistical hypothesis test as a form of modified content-based filtering. We evaluated the proposed method by focusing on the rate at which items that have recommended tags are contained among all items. The proposed method is shown to have higher accuracy than traditional content-based filtering. It is especially effective when some percentage of the items have recommendation tags.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124192025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
How do people link? Analysis of contact structures in human face-to-face proximity networks 人们是如何联系起来的?人类面对面接近网络中的接触结构分析
Christoph Scholz, M. Atzmüller, Mark Kibanov, Gerd Stumme
Understanding the process of link creation is rather important for link prediction in social networks. Therefore, this paper analyzes contact structures in networks of face-to-face spatial proximity, and presents new insights on the dynamic and static contact behavior in such real world networks. We focus on face-to-face contact networks collected at different conferences using the social conference guidance system Conferator. Specifically, we investigate the strength of ties and its connection to triadic closures in face-to-face proximity networks. Furthermore, we analyze the predictability of all, new and recurring links at different points of time during the conference. In addition, we consider network dynamics for the prediction of new links.
了解链接创建的过程对于社交网络中的链接预测非常重要。因此,本文分析了面对面空间接近网络中的接触结构,并对这种现实世界网络中的动态和静态接触行为提出了新的见解。我们专注于使用社交会议指导系统Conferator在不同会议上收集的面对面联系网络。具体来说,我们研究了面对面接近网络中联系的强度及其与三合一闭包的联系。此外,我们分析了会议期间不同时间点的所有新链接和重复链接的可预测性。此外,我们考虑网络动力学预测新的链接。
{"title":"How do people link? Analysis of contact structures in human face-to-face proximity networks","authors":"Christoph Scholz, M. Atzmüller, Mark Kibanov, Gerd Stumme","doi":"10.1145/2492517.2492521","DOIUrl":"https://doi.org/10.1145/2492517.2492521","url":null,"abstract":"Understanding the process of link creation is rather important for link prediction in social networks. Therefore, this paper analyzes contact structures in networks of face-to-face spatial proximity, and presents new insights on the dynamic and static contact behavior in such real world networks. We focus on face-to-face contact networks collected at different conferences using the social conference guidance system Conferator. Specifically, we investigate the strength of ties and its connection to triadic closures in face-to-face proximity networks. Furthermore, we analyze the predictability of all, new and recurring links at different points of time during the conference. In addition, we consider network dynamics for the prediction of new links.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124512754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Exploiting hashtags for adaptive microblog crawling 利用标签进行自适应微博爬行
Xinyue Wang, L. Tokarchuk, F. Cuadrado, S. Poslad
Researchers have capitalized on microblogging services, such as Twitter, for detecting and monitoring real world events. Existing approaches have based their conclusions on data collected by monitoring a set of pre-defined keywords. In this paper, we show that this manner of data collection risks losing a significant amount of relevant information. We then propose an adaptive crawling model that detects emerging popular hashtags, and monitors them to retrieve greater amounts of highly associated data for events of interest. The proposed model analyzes the traffic patterns of the hashtags collected from the live stream to update subsequent collection queries. To evaluate this adaptive crawling model, we apply it to a dataset collected during the 2012 London Olympic Games. Our analysis shows that adaptive crawling based on the proposed Refined Keyword Adaptation algorithm collects a more comprehensive dataset than pre-defined keyword crawling, while only introducing a minimum amount of noise.
研究人员利用微博服务,如Twitter,来检测和监控现实世界的事件。现有的方法是根据监测一组预定义关键字收集的数据得出结论的。在本文中,我们表明这种数据收集方式有丢失大量相关信息的风险。然后,我们提出了一种自适应爬行模型,可以检测新兴的流行标签,并对它们进行监控,以检索更多的高度相关的数据,以获取感兴趣的事件。该模型分析从直播流中收集的标签的流量模式,以更新后续的收集查询。为了评估这种自适应爬行模型,我们将其应用于2012年伦敦奥运会期间收集的数据集。我们的分析表明,基于改进关键字自适应算法的自适应爬行比预定义关键字爬行收集了更全面的数据集,同时只引入了最少的噪声。
{"title":"Exploiting hashtags for adaptive microblog crawling","authors":"Xinyue Wang, L. Tokarchuk, F. Cuadrado, S. Poslad","doi":"10.1145/2492517.2492624","DOIUrl":"https://doi.org/10.1145/2492517.2492624","url":null,"abstract":"Researchers have capitalized on microblogging services, such as Twitter, for detecting and monitoring real world events. Existing approaches have based their conclusions on data collected by monitoring a set of pre-defined keywords. In this paper, we show that this manner of data collection risks losing a significant amount of relevant information. We then propose an adaptive crawling model that detects emerging popular hashtags, and monitors them to retrieve greater amounts of highly associated data for events of interest. The proposed model analyzes the traffic patterns of the hashtags collected from the live stream to update subsequent collection queries. To evaluate this adaptive crawling model, we apply it to a dataset collected during the 2012 London Olympic Games. Our analysis shows that adaptive crawling based on the proposed Refined Keyword Adaptation algorithm collects a more comprehensive dataset than pre-defined keyword crawling, while only introducing a minimum amount of noise.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116810319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Enhancing tag-based collaborative filtering via integrated social networking information 通过整合社交网络信息增强基于标签的协同过滤
Sogol Naseri, Arash Bahrehmand, Chen Ding, Chi-Hung Chi
Recently, researchers have taken tremendous strides in attempting to synthesize conventional social judgments and automated filtering within recommender systems. In this study, we aim to enhance recommendation efficiency via integrating social networking information with traditional recommendation algorithms. To achieve this objective, we first propose a new user similarity metric that not only considers tagging activities of users, but also incorporates their social relationships, such as friendship and membership, in measuring the closeness of two users. Subsequently, we define a new item prediction method which makes use of both user-to-user similarity and item-to-item similarity. Experimental outcomes on Last.fm show some positive results that attest the efficiency of our proposed approach.
最近,研究人员在尝试综合传统的社会判断和推荐系统中的自动过滤方面取得了巨大进展。在本研究中,我们的目标是通过将社交网络信息与传统推荐算法相结合来提高推荐效率。为了实现这一目标,我们首先提出了一种新的用户相似度度量,该度量不仅考虑了用户的标记活动,还结合了他们的社会关系,如友谊和成员关系,来衡量两个用户的亲密度。随后,我们定义了一种同时利用用户与用户相似度和物品与物品相似度的物品预测方法。最后的实验结果。FM显示了一些积极的结果,证明了我们所提出的方法的有效性。
{"title":"Enhancing tag-based collaborative filtering via integrated social networking information","authors":"Sogol Naseri, Arash Bahrehmand, Chen Ding, Chi-Hung Chi","doi":"10.1145/2492517.2492658","DOIUrl":"https://doi.org/10.1145/2492517.2492658","url":null,"abstract":"Recently, researchers have taken tremendous strides in attempting to synthesize conventional social judgments and automated filtering within recommender systems. In this study, we aim to enhance recommendation efficiency via integrating social networking information with traditional recommendation algorithms. To achieve this objective, we first propose a new user similarity metric that not only considers tagging activities of users, but also incorporates their social relationships, such as friendship and membership, in measuring the closeness of two users. Subsequently, we define a new item prediction method which makes use of both user-to-user similarity and item-to-item similarity. Experimental outcomes on Last.fm show some positive results that attest the efficiency of our proposed approach.","PeriodicalId":442230,"journal":{"name":"2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)","volume":"12 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116824350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1