{"title":"表情符号情感词典的设计与开发","authors":"Fabian Haak","doi":"10.5283/epub.44960","DOIUrl":null,"url":null,"abstract":"Emojis represent an essential means of expressing sentiments such as opinions and attitudes in computer - mediated communication, especially in chats and social media. To effectively capture these sentiments, the sentiments associated with the emojis used must be known. Previous approaches to determining the sentiments expressed with emojis require a large amount of manual annotation. For many emojis, especially less frequently used platform - specific emojis, studies on expressed sentiments do not yet exist. Therefore, these emojis cannot be considered in sentiment analyses so far. In this work, a method for effective and efficient determination of emojis’ sentiments and their compilation in a sentiment lexicon was developed. The determined sentiments are compiled as a sentiment lexicon. For this purpose, software was created in Python to process collections of texts into a corpus. The software derives the emojis’ sentiments as valence values based on the sentiments of the texts in which the emojis appear. The lexicons produced by the method can be used in lexicon - based sentiment analysis approaches. The method also derives other information on the emojis and their usage that can be used to assess the sentiment lexicon produced and the usage of the emojis. Using the developed method, two analyses were conducted with corpora of different text sources. The results and subsequent comparisons with existing sentiment lexicons have shown that the developed method is able to efficiently produce similar results as sentiment lexicons produced with manual annotation.","PeriodicalId":90875,"journal":{"name":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","volume":"35 1","pages":"432-438"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design and Development of an Emoji Sentiment Lexicon\",\"authors\":\"Fabian Haak\",\"doi\":\"10.5283/epub.44960\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Emojis represent an essential means of expressing sentiments such as opinions and attitudes in computer - mediated communication, especially in chats and social media. To effectively capture these sentiments, the sentiments associated with the emojis used must be known. Previous approaches to determining the sentiments expressed with emojis require a large amount of manual annotation. For many emojis, especially less frequently used platform - specific emojis, studies on expressed sentiments do not yet exist. Therefore, these emojis cannot be considered in sentiment analyses so far. In this work, a method for effective and efficient determination of emojis’ sentiments and their compilation in a sentiment lexicon was developed. The determined sentiments are compiled as a sentiment lexicon. For this purpose, software was created in Python to process collections of texts into a corpus. The software derives the emojis’ sentiments as valence values based on the sentiments of the texts in which the emojis appear. The lexicons produced by the method can be used in lexicon - based sentiment analysis approaches. The method also derives other information on the emojis and their usage that can be used to assess the sentiment lexicon produced and the usage of the emojis. Using the developed method, two analyses were conducted with corpora of different text sources. The results and subsequent comparisons with existing sentiment lexicons have shown that the developed method is able to efficiently produce similar results as sentiment lexicons produced with manual annotation.\",\"PeriodicalId\":90875,\"journal\":{\"name\":\"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics\",\"volume\":\"35 1\",\"pages\":\"432-438\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5283/epub.44960\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5283/epub.44960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design and Development of an Emoji Sentiment Lexicon
Emojis represent an essential means of expressing sentiments such as opinions and attitudes in computer - mediated communication, especially in chats and social media. To effectively capture these sentiments, the sentiments associated with the emojis used must be known. Previous approaches to determining the sentiments expressed with emojis require a large amount of manual annotation. For many emojis, especially less frequently used platform - specific emojis, studies on expressed sentiments do not yet exist. Therefore, these emojis cannot be considered in sentiment analyses so far. In this work, a method for effective and efficient determination of emojis’ sentiments and their compilation in a sentiment lexicon was developed. The determined sentiments are compiled as a sentiment lexicon. For this purpose, software was created in Python to process collections of texts into a corpus. The software derives the emojis’ sentiments as valence values based on the sentiments of the texts in which the emojis appear. The lexicons produced by the method can be used in lexicon - based sentiment analysis approaches. The method also derives other information on the emojis and their usage that can be used to assess the sentiment lexicon produced and the usage of the emojis. Using the developed method, two analyses were conducted with corpora of different text sources. The results and subsequent comparisons with existing sentiment lexicons have shown that the developed method is able to efficiently produce similar results as sentiment lexicons produced with manual annotation.