Ian Macedo Maiwald Santos, Luciana de Oliveira Rech, Ricardo Moraes
{"title":"A Topic Modeling Method for Analyzes of Short-Text Data in Social Media Networks","authors":"Ian Macedo Maiwald Santos, Luciana de Oliveira Rech, Ricardo Moraes","doi":"10.29007/kr1z","DOIUrl":null,"url":null,"abstract":"Currently, many short texts are published online, especially on social media platforms. High impact events, for example, are highly commented on by users. Understanding the subjects and patterns hidden in online discussions is a very important task for contexts such as elections, natural disasters or major sporting events. However, many works of this nature use techniques that, despite showing satisfactory results, are not the most suitable when it comes to the short texts on social media and may suffer a loss in their results. Therefore, this paper presents a text mining method for messages published on social media, with a data pre-processing step and topic modeling for short texts. For this paper, we created a data set from real world tweets related to COVID-19 that is openly available1 for research purposes.","PeriodicalId":93549,"journal":{"name":"EPiC series in computing","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EPiC series in computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29007/kr1z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, many short texts are published online, especially on social media platforms. High impact events, for example, are highly commented on by users. Understanding the subjects and patterns hidden in online discussions is a very important task for contexts such as elections, natural disasters or major sporting events. However, many works of this nature use techniques that, despite showing satisfactory results, are not the most suitable when it comes to the short texts on social media and may suffer a loss in their results. Therefore, this paper presents a text mining method for messages published on social media, with a data pre-processing step and topic modeling for short texts. For this paper, we created a data set from real world tweets related to COVID-19 that is openly available1 for research purposes.