{"title":"Keyword Extraction for Social Media Short Text","authors":"Dexin Zhao, Nana Du, Zhi Chang, Yukun Li","doi":"10.1109/WISA.2017.12","DOIUrl":null,"url":null,"abstract":"With the booming development of social media in recent years, researchers have begun to pay more attention to extracting personal profiles from information. Keyword extraction plays an important role in extracting personal profiles. However, most of the previous studies are only valid for ordinary text, but not ideal for social media short text. In this paper, we propose an improved method for keyword extraction based on Word2vec and Textrank to solve the unique problem of social media short text. Our approach uses the Word2vec to capture the semantic features between words in selected text, and meanwhile naturally fuses the word frequency, semantic relation and directional relation into Textrank to extract keywords. We conduct the experiments on the three datasets. The experimental results show the superior performance of our method in keyword extraction.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th Web Information Systems and Applications Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2017.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
With the booming development of social media in recent years, researchers have begun to pay more attention to extracting personal profiles from information. Keyword extraction plays an important role in extracting personal profiles. However, most of the previous studies are only valid for ordinary text, but not ideal for social media short text. In this paper, we propose an improved method for keyword extraction based on Word2vec and Textrank to solve the unique problem of social media short text. Our approach uses the Word2vec to capture the semantic features between words in selected text, and meanwhile naturally fuses the word frequency, semantic relation and directional relation into Textrank to extract keywords. We conduct the experiments on the three datasets. The experimental results show the superior performance of our method in keyword extraction.