B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi
{"title":"社交媒体文本的数据标注与多情感分类","authors":"B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi","doi":"10.1109/ICCSP48568.2020.9182362","DOIUrl":null,"url":null,"abstract":"In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.","PeriodicalId":321133,"journal":{"name":"2020 International Conference on Communication and Signal Processing (ICCSP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Data Annotation and Multi-Emotion Classification for Social Media Text\",\"authors\":\"B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi\",\"doi\":\"10.1109/ICCSP48568.2020.9182362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.\",\"PeriodicalId\":321133,\"journal\":{\"name\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSP48568.2020.9182362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communication and Signal Processing (ICCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSP48568.2020.9182362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data Annotation and Multi-Emotion Classification for Social Media Text
In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.