Sam Jackson, Feifei Zhang, O. Boichak, Lauren Bryant, Yingya Li, Jeff J. Hemsley, Jennifer Stromer-Galley, Bryan C. Semaan, Nancy J. McCracken
{"title":"Identifying Political Topics in Social Media Messages: A Lexicon-Based Approach","authors":"Sam Jackson, Feifei Zhang, O. Boichak, Lauren Bryant, Yingya Li, Jeff J. Hemsley, Jennifer Stromer-Galley, Bryan C. Semaan, Nancy J. McCracken","doi":"10.1145/3097286.3097298","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce a lexicon-based method for identifying political topics in social media messages. After discussing several critical shortcomings of unsupervised topic identification for this task, we describe the lexicon-based approach. We test our lexicon on candidate-generated campaign messages on Facebook and Twitter in the 2016 U.S. presidential election. The results show that this approach provides reliable results for eight of nine political topic categories. In closing, we describe steps to improve our approach and how it can be used for future research on political topics in social media messages.","PeriodicalId":130378,"journal":{"name":"Proceedings of the 8th International Conference on Social Media & Society","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th International Conference on Social Media & Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3097286.3097298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
In this paper, we introduce a lexicon-based method for identifying political topics in social media messages. After discussing several critical shortcomings of unsupervised topic identification for this task, we describe the lexicon-based approach. We test our lexicon on candidate-generated campaign messages on Facebook and Twitter in the 2016 U.S. presidential election. The results show that this approach provides reliable results for eight of nine political topic categories. In closing, we describe steps to improve our approach and how it can be used for future research on political topics in social media messages.