首页 > 最新文献

Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining最新文献

英文 中文
Mixing bandits: a recipe for improved cold-start recommendations in a social network 混合强盗:在社交网络中改进冷启动推荐的配方
S. Caron, Smriti Bhagat
Recommending items to new or "cold-start" users is a challenging problem for recommender systems. Collaborative filtering approaches fail when the preference history of users is not available. A promising direction that has been explored recently [12] is to utilize the information in the social networks of users to improve the quality of cold-start recommendations. That is, given that users are part of a social network, a new user shows up in the network with no preference history and limited social links, the recommender system tries to learn the user's tastes as fast as possible. In this work, we model the learning of preferences of cold-start users using multi-armed bandits [5] embedded in a social network. We propose two novel strategies leveraging neighborhood estimates to improve the learning rate of bandits for cold-start users. Our first strategy, MixPair, combines estimates from pairs of neighboring bandits. It extends the well-known UCB1 algorithm [5] and inherits its asymptotically optimal guarantees. Although our second strategy, MixNeigh, is a heuristic based on consensus in the neighborhood of a user, it performed the best among the evaluated strategies. Our experiments on a dataset from Last.fm show that our strategies yield significant improvements, learning 2 to 5 times faster than our baseline, UCB1.
对推荐系统来说,向新用户或“新手”用户推荐产品是一个具有挑战性的问题。当用户的偏好历史不可用时,协同过滤方法将失败。利用用户社交网络中的信息来提高冷启动推荐的质量是最近探索的一个很有前景的方向[12]。也就是说,假设用户是社交网络的一部分,一个没有偏好历史和有限社交链接的新用户出现在网络中,推荐系统试图尽可能快地了解用户的口味。在这项工作中,我们使用嵌入在社交网络中的多臂强盗[5]来模拟冷启动用户的偏好学习。我们提出了两种新的策略,利用邻域估计来提高冷启动用户的强盗学习率。我们的第一个策略MixPair结合了对相邻土匪的估计。它扩展了著名的UCB1算法[5],并继承了其渐近最优保证。虽然我们的第二个策略MixNeigh是一个基于用户附近共识的启发式策略,但它在评估的策略中表现最好。我们在Last的数据集上做的实验。fm显示,我们的策略产生了显著的改善,学习速度比基线(UCB1)快2到5倍。
{"title":"Mixing bandits: a recipe for improved cold-start recommendations in a social network","authors":"S. Caron, Smriti Bhagat","doi":"10.1145/2501025.2501029","DOIUrl":"https://doi.org/10.1145/2501025.2501029","url":null,"abstract":"Recommending items to new or \"cold-start\" users is a challenging problem for recommender systems. Collaborative filtering approaches fail when the preference history of users is not available. A promising direction that has been explored recently [12] is to utilize the information in the social networks of users to improve the quality of cold-start recommendations. That is, given that users are part of a social network, a new user shows up in the network with no preference history and limited social links, the recommender system tries to learn the user's tastes as fast as possible.\u0000 In this work, we model the learning of preferences of cold-start users using multi-armed bandits [5] embedded in a social network. We propose two novel strategies leveraging neighborhood estimates to improve the learning rate of bandits for cold-start users. Our first strategy, MixPair, combines estimates from pairs of neighboring bandits. It extends the well-known UCB1 algorithm [5] and inherits its asymptotically optimal guarantees. Although our second strategy, MixNeigh, is a heuristic based on consensus in the neighborhood of a user, it performed the best among the evaluated strategies. Our experiments on a dataset from Last.fm show that our strategies yield significant improvements, learning 2 to 5 times faster than our baseline, UCB1.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"7 1","pages":"11:1-11:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84463365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Epidemiological modeling of news and rumors on Twitter Twitter上新闻和谣言的流行病学建模
Fang Jin, Edward R. Dougherty, Parang Saraf, Yang Cao, Naren Ramakrishnan
Characterizing information diffusion on social platforms like Twitter enables us to understand the properties of underlying media and model communication patterns. As Twitter gains in popularity, it has also become a venue to broadcast rumors and misinformation. We use epidemiological models to characterize information cascades in twitter resulting from both news and rumors. Specifically, we use the SEIZ enhanced epidemic model that explicitly recognizes skeptics to characterize eight events across the world and spanning a range of event types. We demonstrate that our approach is accurate at capturing diffusion in these events. Our approach can be fruitfully combined with other strategies that use content modeling and graph theoretic features to detect (and possibly disrupt) rumors.
对Twitter等社交平台上的信息扩散进行表征,使我们能够理解底层媒体的属性,并为传播模式建模。随着Twitter越来越受欢迎,它也成为了传播谣言和错误信息的场所。我们使用流行病学模型来描述twitter中由新闻和谣言引起的信息级联。具体而言,我们使用明确识别怀疑论者的SEIZ增强流行病模型来描述世界各地的八个事件,并跨越一系列事件类型。我们证明了我们的方法在捕获这些事件中的扩散方面是准确的。我们的方法可以有效地与其他使用内容建模和图论特征来检测(并可能破坏)谣言的策略相结合。
{"title":"Epidemiological modeling of news and rumors on Twitter","authors":"Fang Jin, Edward R. Dougherty, Parang Saraf, Yang Cao, Naren Ramakrishnan","doi":"10.1145/2501025.2501027","DOIUrl":"https://doi.org/10.1145/2501025.2501027","url":null,"abstract":"Characterizing information diffusion on social platforms like Twitter enables us to understand the properties of underlying media and model communication patterns. As Twitter gains in popularity, it has also become a venue to broadcast rumors and misinformation. We use epidemiological models to characterize information cascades in twitter resulting from both news and rumors. Specifically, we use the SEIZ enhanced epidemic model that explicitly recognizes skeptics to characterize eight events across the world and spanning a range of event types. We demonstrate that our approach is accurate at capturing diffusion in these events. Our approach can be fruitfully combined with other strategies that use content modeling and graph theoretic features to detect (and possibly disrupt) rumors.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"10 1","pages":"8:1-8:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88752093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 322
Community finding within the community set space 在社区集合空间内寻找社区
J. Scripps, C. Trefftz
Community finding algorithms strive to find communities that have a higher connectivity within the communities than between them. Recently a framework called the community set space was introduced which provided a way to measure the quality of community sets. We present a new community finding algorithm, CHI, designed to minimize the violations defined by this framework. It will be shown that the CHI algorithm has similarities to kmeans. It is flexible and fast and can also be tuned to find certain types of communities. It is optimized for the community set framework and results so that it performs better than other algorithms within that framework.
社区查找算法努力寻找社区内部比社区之间具有更高连通性的社区。最近引入了一种称为社区集空间的框架,它提供了一种衡量社区集质量的方法。我们提出了一种新的社区查找算法CHI,旨在最大限度地减少该框架定义的违规行为。本文将证明CHI算法与kmeans算法有相似之处。它是灵活和快速的,也可以调整,以找到某些类型的社区。它针对社区集框架和结果进行了优化,因此它比该框架中的其他算法执行得更好。
{"title":"Community finding within the community set space","authors":"J. Scripps, C. Trefftz","doi":"10.1145/2501025.2501032","DOIUrl":"https://doi.org/10.1145/2501025.2501032","url":null,"abstract":"Community finding algorithms strive to find communities that have a higher connectivity within the communities than between them. Recently a framework called the community set space was introduced which provided a way to measure the quality of community sets. We present a new community finding algorithm, CHI, designed to minimize the violations defined by this framework. It will be shown that the CHI algorithm has similarities to kmeans. It is flexible and fast and can also be tuned to find certain types of communities. It is optimized for the community set framework and results so that it performs better than other algorithms within that framework.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"518 1","pages":"13:1-13:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77161785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Customized reviews for small user-databases using iterative SVD and content based filtering 使用迭代SVD和基于内容的过滤为小型用户数据库定制评论
Jonathan Gregg, Nitin Jain
Recommender systems have proven to be a valuable tool for web companies like Amazon and Netflix for attracting and maintaining a large user base. However, in situations when user data is more scarce (e.g., for mid-sized companies, or an online retailer testing a new ratings system) algorithms tailored to smaller datasets can be used to further increase accuracy. This paper explores the potential of combining collaborative and content-based (using user comments) filtering algorithms using Yelp.com data from a single city. We present the method to combine two approaches, and find that the MSE for predicting a user's new rating can be reduced from a baseline MSE of 1.744 to 0.934 given just 2500 rated items in our real-world dataset.
对于像亚马逊和Netflix这样的网络公司来说,推荐系统已经被证明是一个很有价值的工具,可以吸引和维持庞大的用户群。然而,在用户数据更稀缺的情况下(例如,对于中型公司或在线零售商测试新的评级系统),可以使用针对较小数据集的算法来进一步提高准确性。本文利用单个城市的Yelp.com数据,探索了将协作和基于内容(使用用户评论)的过滤算法相结合的潜力。我们提出了结合两种方法的方法,并发现预测用户新评级的MSE可以从1.744的基线MSE降低到0.934,给定我们的真实数据集中只有2500个评级项目。
{"title":"Customized reviews for small user-databases using iterative SVD and content based filtering","authors":"Jonathan Gregg, Nitin Jain","doi":"10.1145/2501025.2501036","DOIUrl":"https://doi.org/10.1145/2501025.2501036","url":null,"abstract":"Recommender systems have proven to be a valuable tool for web companies like Amazon and Netflix for attracting and maintaining a large user base. However, in situations when user data is more scarce (e.g., for mid-sized companies, or an online retailer testing a new ratings system) algorithms tailored to smaller datasets can be used to further increase accuracy. This paper explores the potential of combining collaborative and content-based (using user comments) filtering algorithms using Yelp.com data from a single city. We present the method to combine two approaches, and find that the MSE for predicting a user's new rating can be reduced from a baseline MSE of 1.744 to 0.934 given just 2500 rated items in our real-world dataset.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"12 1","pages":"14:1-14:5"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84277804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network flows and the link prediction problem 网络流与链路预测问题
Kanika Narang, Kristina Lerman, P. Kumaraguru
Link prediction is used by many applications to recommend new products or social connections to people. Link prediction leverages information in network structure to identify missing links or predict which new one will form in the future. Recent research has provided a theoretical justification for the success of some popular link prediction heuristics, such as the number of common neighbors and the Adamic-Adar score, by showing that they estimate the distance between nodes in some latent feature space. In this paper we examine the link prediction task from the novel perspective of network flows. We show that how easily two nodes can interact with or influence each other depends not only on their position in the network, but also on the nature of the flow that mediates interactions between them. We show that different types of flows lead to different notions of network proximity, some of which are mathematically equivalent to existing link prediction heuristics. We measure the performance of different heuristics on the missing link prediction task in a variety of real-world social, technological and biological networks. We show that heuristics based on a random walk-type processes outperform the popular Adamic-Adar and the number of common neighbors heuristics in many networks.
链接预测被许多应用程序用来向人们推荐新产品或社会关系。链路预测利用网络结构中的信息来识别缺失的链路或预测未来将形成哪些新的链路。最近的研究为一些流行的链接预测启发式方法的成功提供了理论依据,例如共同邻居数和adam - adar分数,表明它们估计了一些潜在特征空间中节点之间的距离。本文从网络流的新视角来研究链路预测任务。我们表明,两个节点相互交互或相互影响的容易程度不仅取决于它们在网络中的位置,还取决于它们之间交互的流的性质。我们表明,不同类型的流导致不同的网络接近概念,其中一些在数学上等同于现有的链接预测启发式。我们在各种现实世界的社会、技术和生物网络中测量了不同启发式在缺失链接预测任务上的表现。我们证明了基于随机漫步型过程的启发式算法在许多网络中优于流行的adam - adar和共同邻居数启发式算法。
{"title":"Network flows and the link prediction problem","authors":"Kanika Narang, Kristina Lerman, P. Kumaraguru","doi":"10.1145/2501025.2501031","DOIUrl":"https://doi.org/10.1145/2501025.2501031","url":null,"abstract":"Link prediction is used by many applications to recommend new products or social connections to people. Link prediction leverages information in network structure to identify missing links or predict which new one will form in the future. Recent research has provided a theoretical justification for the success of some popular link prediction heuristics, such as the number of common neighbors and the Adamic-Adar score, by showing that they estimate the distance between nodes in some latent feature space. In this paper we examine the link prediction task from the novel perspective of network flows. We show that how easily two nodes can interact with or influence each other depends not only on their position in the network, but also on the nature of the flow that mediates interactions between them. We show that different types of flows lead to different notions of network proximity, some of which are mathematically equivalent to existing link prediction heuristics. We measure the performance of different heuristics on the missing link prediction task in a variety of real-world social, technological and biological networks. We show that heuristics based on a random walk-type processes outperform the popular Adamic-Adar and the number of common neighbors heuristics in many networks.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"30 1","pages":"3:1-3:8"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74563444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Analysis and identification of spamming behaviors in Sina Weibo microblog 新浪微博垃圾信息行为分析与识别
Chengfeng Lin, Jianhua He, Yi Zhou, Xiaokang Yang, Kai Chen, Li Song
Spamming has been a widespread problem for social networks. In recent years there is an increasing interest in the analysis of anti-spamming for microblogs, such as Twitter. In this paper we present a systematic research on the analysis of spamming in Sina Weibo platform, which is currently a dominant microblogging service provider in China. Our research objectives are to understand the specific spamming behaviors in Sina Weibo and find approaches to identify and block spammers in Sina Weibo based on spamming behavior classifiers. To start with the analysis of spamming behaviors we devise several effective methods to collect a large set of spammer samples, including uses of proactive honeypots and crawlers, keywords based searching and buying spammer samples directly from online merchants. We processed the database associated with these spammer samples and interestingly we found three representative spamming behaviors: aggressive advertising, repeated duplicate reposting and aggressive following. We extract various features and compare the behaviors of spammers and legitimate users with regard to these features. It is found that spamming behaviors and normal behaviors have distinct characteristics. Based on these findings we design an automatic online spammer identification system. Through tests with real data it is demonstrated that the system can effectively detect the spamming behaviors and identify spammers in Sina Weibo.
垃圾邮件一直是社交网络普遍存在的问题。近年来,人们对微博(如Twitter)的反垃圾邮件分析越来越感兴趣。在本文中,我们对新浪微博平台上的垃圾邮件进行了系统的研究分析,新浪微博是目前中国主要的微博服务提供商。我们的研究目标是了解新浪微博中的具体垃圾邮件行为,并找到基于垃圾邮件行为分类器识别和阻止新浪微博中垃圾邮件发送者的方法。为了分析垃圾邮件行为,我们设计了几种有效的方法来收集大量的垃圾邮件发送者样本,包括使用主动蜜罐和爬虫,基于关键字的搜索和直接从在线商家购买垃圾邮件发送者样本。我们处理了与这些垃圾邮件发送者样本相关的数据库,有趣的是,我们发现了三种典型的垃圾邮件行为:积极的广告、重复的转发和积极的关注。我们提取各种特征,并比较垃圾邮件发送者和合法用户在这些特征方面的行为。研究发现,垃圾邮件行为与正常行为具有明显的特点。基于这些发现,我们设计了一个在线垃圾邮件自动识别系统。通过真实数据的测试表明,该系统能够有效地检测新浪微博中的垃圾邮件行为,识别出垃圾邮件发送者。
{"title":"Analysis and identification of spamming behaviors in Sina Weibo microblog","authors":"Chengfeng Lin, Jianhua He, Yi Zhou, Xiaokang Yang, Kai Chen, Li Song","doi":"10.1145/2501025.2501035","DOIUrl":"https://doi.org/10.1145/2501025.2501035","url":null,"abstract":"Spamming has been a widespread problem for social networks. In recent years there is an increasing interest in the analysis of anti-spamming for microblogs, such as Twitter. In this paper we present a systematic research on the analysis of spamming in Sina Weibo platform, which is currently a dominant microblogging service provider in China. Our research objectives are to understand the specific spamming behaviors in Sina Weibo and find approaches to identify and block spammers in Sina Weibo based on spamming behavior classifiers. To start with the analysis of spamming behaviors we devise several effective methods to collect a large set of spammer samples, including uses of proactive honeypots and crawlers, keywords based searching and buying spammer samples directly from online merchants. We processed the database associated with these spammer samples and interestingly we found three representative spamming behaviors: aggressive advertising, repeated duplicate reposting and aggressive following. We extract various features and compare the behaviors of spammers and legitimate users with regard to these features. It is found that spamming behaviors and normal behaviors have distinct characteristics. Based on these findings we design an automatic online spammer identification system. Through tests with real data it is demonstrated that the system can effectively detect the spamming behaviors and identify spammers in Sina Weibo.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"1 1","pages":"5:1-5:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76182607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Twitter volume spikes: analysis and application in stock trading Twitter成交量峰值:股票交易中的分析与应用
Yuexin Mao, Wei Wei, B. Wang
Stock is a popular topic in Twitter. The number of tweets concerning a stock varies over days, and sometimes exhibits a significant spike. In this paper, we investigate Twitter volume spikes related to S&P 500 stocks, and whether they are useful for stock trading. Through correlation analysis, we provide insight on when Twitter volume spikes occur and possible causes of these spikes. We further explore whether these spikes are surprises to market participants by comparing the implied volatility of a stock before and after a Twitter volume spike. Moreover, we develop a Bayesian classifier that uses Twitter volume spikes to assist stock trading, and show that it can provide substantial profit. We further develop an enhanced strategy that combines the Bayesian classifier and a stock bottom picking method, and demonstrate that it can achieve significant gain in a short amount of time. Simulation over a half year's stock market data indicates that it achieves on average 8.6% gain in 27 trading days and 15.0% gain in 55 trading days. Statistical tests show that the gain is statistically significant, and the enhanced strategy significantly outperforms the strategy that only uses the Bayesian classifier as well as a bottom picking method that uses trading volume spikes.
股票是推特上的热门话题。与某只股票相关的推文数量会在几天内发生变化,有时会出现显著的峰值。在本文中,我们研究了推特成交量峰值与标准普尔500指数股票的关系,以及它们是否对股票交易有用。通过相关性分析,我们可以洞察Twitter数量峰值何时发生以及这些峰值的可能原因。我们通过比较推特交易量峰值前后股票的隐含波动率,进一步探讨这些峰值是否令市场参与者感到意外。此外,我们开发了一个贝叶斯分类器,该分类器使用Twitter成交量峰值来辅助股票交易,并表明它可以提供可观的利润。我们进一步开发了一种增强的策略,将贝叶斯分类器和股票底部选择方法相结合,并证明它可以在短时间内获得显着的收益。模拟半年多的股市数据,27个交易日平均涨幅为8.6%,55个交易日平均涨幅为15.0%。统计测试表明,增益在统计上是显著的,并且增强的策略明显优于仅使用贝叶斯分类器的策略以及使用交易量峰值的底部选择方法。
{"title":"Twitter volume spikes: analysis and application in stock trading","authors":"Yuexin Mao, Wei Wei, B. Wang","doi":"10.1145/2501025.2501039","DOIUrl":"https://doi.org/10.1145/2501025.2501039","url":null,"abstract":"Stock is a popular topic in Twitter. The number of tweets concerning a stock varies over days, and sometimes exhibits a significant spike. In this paper, we investigate Twitter volume spikes related to S&P 500 stocks, and whether they are useful for stock trading. Through correlation analysis, we provide insight on when Twitter volume spikes occur and possible causes of these spikes. We further explore whether these spikes are surprises to market participants by comparing the implied volatility of a stock before and after a Twitter volume spike. Moreover, we develop a Bayesian classifier that uses Twitter volume spikes to assist stock trading, and show that it can provide substantial profit. We further develop an enhanced strategy that combines the Bayesian classifier and a stock bottom picking method, and demonstrate that it can achieve significant gain in a short amount of time. Simulation over a half year's stock market data indicates that it achieves on average 8.6% gain in 27 trading days and 15.0% gain in 55 trading days. Statistical tests show that the gain is statistically significant, and the enhanced strategy significantly outperforms the strategy that only uses the Bayesian classifier as well as a bottom picking method that uses trading volume spikes.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"8 1","pages":"4:1-4:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80783528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
ProfileRank: finding relevant content and influential users based on information diffusion ProfileRank:根据信息扩散发现相关内容和有影响力的用户
A. Silva, Sara Guimarães, Wagner Meira Jr, Mohammed J. Zaki
Understanding information diffusion processes that take place on the Web, specially in social media, is a fundamental step towards the design of effective information diffusion mechanisms, recommendation systems, and viral marketing/advertising campaigns. Two key concepts in information diffusion are influence and relevance. Influence is the ability to popularize content in an online community. To this end, influentials introduce and propagate relevant content, in the sense that such content satisfies the information needs of a significant portion of this community. In this paper, we study the problem of identifying influential users and relevant content in information diffusion data. We propose ProfileRank, a new information diffusion model based on random walks over a user-content graph. ProfileRank is a PageRank inspired model that exploits the principle that relevant content is created and propagated by influential users and influential users create relevant content. A convenient property of ProfileRank is that it can be adapted to provide personalized recommendations. Experimental results demonstrate that ProfileRank makes accurate recommendations, outperforming baseline techniques. We also illustrate relevant content and influential users discovered using ProfileRank. Our analysis shows that ProfileRank scores are more correlated with content diffusion than with the network structure. We also show that our new modeling is more efficient than PageRank to perform these calculations.
了解发生在网络上的信息传播过程,特别是在社交媒体上,是设计有效的信息传播机制、推荐系统和病毒式营销/广告活动的基本步骤。信息传播中的两个关键概念是影响力和相关性。影响力是指在网络社区中普及内容的能力。为此,有影响力的人引入和传播相关内容,因为这些内容满足了该社区相当一部分人的信息需求。本文研究了信息传播数据中有影响力的用户和相关内容的识别问题。我们提出了ProfileRank,一个新的基于用户内容图随机游走的信息扩散模型。ProfileRank是一个受PageRank启发的模型,它利用了相关内容由有影响力的用户创建和传播,有影响力的用户创建相关内容的原则。ProfileRank的一个方便属性是,它可以适应提供个性化的推荐。实验结果表明,ProfileRank可以做出准确的推荐,优于基线技术。我们还说明了使用ProfileRank发现的相关内容和有影响力的用户。我们的分析表明,ProfileRank分数与内容扩散的相关性大于与网络结构的相关性。我们还表明,我们的新模型在执行这些计算时比PageRank更有效。
{"title":"ProfileRank: finding relevant content and influential users based on information diffusion","authors":"A. Silva, Sara Guimarães, Wagner Meira Jr, Mohammed J. Zaki","doi":"10.1145/2501025.2501033","DOIUrl":"https://doi.org/10.1145/2501025.2501033","url":null,"abstract":"Understanding information diffusion processes that take place on the Web, specially in social media, is a fundamental step towards the design of effective information diffusion mechanisms, recommendation systems, and viral marketing/advertising campaigns. Two key concepts in information diffusion are influence and relevance. Influence is the ability to popularize content in an online community. To this end, influentials introduce and propagate relevant content, in the sense that such content satisfies the information needs of a significant portion of this community.\u0000 In this paper, we study the problem of identifying influential users and relevant content in information diffusion data. We propose ProfileRank, a new information diffusion model based on random walks over a user-content graph. ProfileRank is a PageRank inspired model that exploits the principle that relevant content is created and propagated by influential users and influential users create relevant content. A convenient property of ProfileRank is that it can be adapted to provide personalized recommendations.\u0000 Experimental results demonstrate that ProfileRank makes accurate recommendations, outperforming baseline techniques. We also illustrate relevant content and influential users discovered using ProfileRank. Our analysis shows that ProfileRank scores are more correlated with content diffusion than with the network structure. We also show that our new modeling is more efficient than PageRank to perform these calculations.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"28 1","pages":"2:1-2:9"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79345648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 76
The user's communication patterns on a mobile social network site 用户在移动社交网站上的通信模式
Youngsoo Kim
Given that users are simultaneously connected in multiple communication channels in a social networking service site (e.g., chat, message, and group message), we explore user's collective networking behavior. We collected the data from a mobile social networking site with 4.8 million registered users. The empirical estimation shows interesting results: (1) there are cross-effects across the communication channels: substitute effects for "chat and message" and complementary effects for "message and group message" and "chat and group message" (2) there is significant local network effect but global network effect is not observed, (3) users utilize communication channels for different purposes according to their networking activity level (conveying simple information vs. building sophisticated inter-relationship), and (4) we identify the distinct evolutionary trajectories of an individual user's networking behavior by channel: negative slopes for chat and message vs. upward trend for a group message. Our experimental study shows that we can better predict the word of mouth (WOM) effects by understanding users' collective networking behavior across diverse channels.
考虑到用户在社交网络服务站点中同时通过多种通信渠道(如聊天、消息和群消息)进行连接,我们探讨了用户的集体网络行为。我们从一个拥有480万注册用户的移动社交网站收集数据。实证估计得出了有趣的结果:(1)跨传播渠道存在交叉效应;“聊天+消息”的替代效应和“消息+群消息”、“聊天+群消息”的互补效应(2)局部网络效应显著,全局网络效应不明显;(3)用户根据其网络活动水平(传递简单的信息与建立复杂的相互关系)不同,使用不同目的的沟通渠道;(4)我们通过渠道确定了个人用户网络行为的不同进化轨迹:聊天和消息的负斜率与群组消息的上升趋势。我们的实验研究表明,通过了解用户在不同渠道的集体网络行为,我们可以更好地预测口碑效应。
{"title":"The user's communication patterns on a mobile social network site","authors":"Youngsoo Kim","doi":"10.1145/2501025.2501037","DOIUrl":"https://doi.org/10.1145/2501025.2501037","url":null,"abstract":"Given that users are simultaneously connected in multiple communication channels in a social networking service site (e.g., chat, message, and group message), we explore user's collective networking behavior. We collected the data from a mobile social networking site with 4.8 million registered users. The empirical estimation shows interesting results: (1) there are cross-effects across the communication channels: substitute effects for \"chat and message\" and complementary effects for \"message and group message\" and \"chat and group message\" (2) there is significant local network effect but global network effect is not observed, (3) users utilize communication channels for different purposes according to their networking activity level (conveying simple information vs. building sophisticated inter-relationship), and (4) we identify the distinct evolutionary trajectories of an individual user's networking behavior by channel: negative slopes for chat and message vs. upward trend for a group message. Our experimental study shows that we can better predict the word of mouth (WOM) effects by understanding users' collective networking behavior across diverse channels.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"67 1","pages":"12:1-12:6"},"PeriodicalIF":0.0,"publicationDate":"2013-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85791903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Advances in Social Network Mining and Analysis, Second International Workshop, SNAKDD 2008, Las Vegas, NV, USA, August 24-27, 2008, Revised Selected Papers 社会网络挖掘与分析的进展,第二届国际研讨会,SNAKDD 2008,拉斯维加斯,内华达州,美国,2008年8月24-27日,修订论文选集
Snakdd, C. Lee Giles
{"title":"Advances in Social Network Mining and Analysis, Second International Workshop, SNAKDD 2008, Las Vegas, NV, USA, August 24-27, 2008, Revised Selected Papers","authors":"Snakdd, C. Lee Giles","doi":"10.1007/978-3-642-14929-0","DOIUrl":"https://doi.org/10.1007/978-3-642-14929-0","url":null,"abstract":"","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83731609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1