{"title":"基于话题、情感和个性的转发预测","authors":"Syeda Nadia Firdaus , Chen Ding , Alireza Sadeghian","doi":"10.1016/j.osnem.2021.100165","DOIUrl":null,"url":null,"abstract":"<div><p><span><span><span>Social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. To understand how information is diffused in these social networks, it is important to examine users’ online activities and behaviors. In this work, we focus on Twitter and study the impact of users’ behaviors on their retweet activities (the major way of information diffusion on Twitter). We consider the topic preference, emotion and personality of a user as part of the user profile to represent their online behavior. The user profile can be built based on all their past tweets, retweets, or both. We propose two types of retweet prediction models, one is using </span>classification algorithms<span>, and the other is using matrix factorization algorithms. In the matrix factorization approach, we include behavior features into the basic factorization model through newly defined </span></span>regularization<span> terms. The experimental results show that in terms of the F1-score, our classification models based on user behavior related features provided 5%-9% improvement over </span></span>baseline models and the matrix factorization model showed 4%-6% improvement over the baseline. We also find that by only considering the retweets, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweets and tweets are included.</p></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.osnem.2021.100165","citationCount":"10","resultStr":"{\"title\":\"Retweet Prediction based on Topic, Emotion and Personality\",\"authors\":\"Syeda Nadia Firdaus , Chen Ding , Alireza Sadeghian\",\"doi\":\"10.1016/j.osnem.2021.100165\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span><span><span>Social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. To understand how information is diffused in these social networks, it is important to examine users’ online activities and behaviors. In this work, we focus on Twitter and study the impact of users’ behaviors on their retweet activities (the major way of information diffusion on Twitter). We consider the topic preference, emotion and personality of a user as part of the user profile to represent their online behavior. The user profile can be built based on all their past tweets, retweets, or both. We propose two types of retweet prediction models, one is using </span>classification algorithms<span>, and the other is using matrix factorization algorithms. In the matrix factorization approach, we include behavior features into the basic factorization model through newly defined </span></span>regularization<span> terms. The experimental results show that in terms of the F1-score, our classification models based on user behavior related features provided 5%-9% improvement over </span></span>baseline models and the matrix factorization model showed 4%-6% improvement over the baseline. We also find that by only considering the retweets, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweets and tweets are included.</p></div>\",\"PeriodicalId\":52228,\"journal\":{\"name\":\"Online Social Networks and Media\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/j.osnem.2021.100165\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Online Social Networks and Media\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468696421000471\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696421000471","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}
Retweet Prediction based on Topic, Emotion and Personality
Social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. To understand how information is diffused in these social networks, it is important to examine users’ online activities and behaviors. In this work, we focus on Twitter and study the impact of users’ behaviors on their retweet activities (the major way of information diffusion on Twitter). We consider the topic preference, emotion and personality of a user as part of the user profile to represent their online behavior. The user profile can be built based on all their past tweets, retweets, or both. We propose two types of retweet prediction models, one is using classification algorithms, and the other is using matrix factorization algorithms. In the matrix factorization approach, we include behavior features into the basic factorization model through newly defined regularization terms. The experimental results show that in terms of the F1-score, our classification models based on user behavior related features provided 5%-9% improvement over baseline models and the matrix factorization model showed 4%-6% improvement over the baseline. We also find that by only considering the retweets, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweets and tweets are included.