{"title":"Does ensemble really work when facing the twitter semantic classification?","authors":"Wenqiang Luo, Sheng Yi, Jiaxin Chen, Shuqing Weng, Zengwen Dong","doi":"10.1109/ICCIA49625.2020.00015","DOIUrl":null,"url":null,"abstract":"With the rapid development of Internet social media, twitter has gradually become the most mainstream information release and information sharing platform. A large number of twitter users use the platform to express their views, emotions and opinions. However, it is still a challenge on twitter semantic classification based on the observation that Twitters are short, noisy, arbitrary, etc. Thus, we seek in the mainstream NLP algorithms to find out which algorithm performs best in this problem. After that, we analysis the ensemble methods on the former encode expand to get a better result. However, we find that it dosen’t work well as we expected. we analysis the reason and give the potential explain. The extensive experiments have shown that the LCF-BERT based model performs best over the mainstream algorithms and the ensemble model on the Twitter dataset.\\","PeriodicalId":237536,"journal":{"name":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIA49625.2020.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
With the rapid development of Internet social media, twitter has gradually become the most mainstream information release and information sharing platform. A large number of twitter users use the platform to express their views, emotions and opinions. However, it is still a challenge on twitter semantic classification based on the observation that Twitters are short, noisy, arbitrary, etc. Thus, we seek in the mainstream NLP algorithms to find out which algorithm performs best in this problem. After that, we analysis the ensemble methods on the former encode expand to get a better result. However, we find that it dosen’t work well as we expected. we analysis the reason and give the potential explain. The extensive experiments have shown that the LCF-BERT based model performs best over the mainstream algorithms and the ensemble model on the Twitter dataset.\