社交媒体文本的数据标注与多情感分类

B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi
{"title":"社交媒体文本的数据标注与多情感分类","authors":"B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi","doi":"10.1109/ICCSP48568.2020.9182362","DOIUrl":null,"url":null,"abstract":"In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.","PeriodicalId":321133,"journal":{"name":"2020 International Conference on Communication and Signal Processing (ICCSP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Data Annotation and Multi-Emotion Classification for Social Media Text\",\"authors\":\"B. V. Namrutha Sridhar, K. Mrinalini, P. Vijayalakshmi\",\"doi\":\"10.1109/ICCSP48568.2020.9182362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.\",\"PeriodicalId\":321133,\"journal\":{\"name\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Communication and Signal Processing (ICCSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSP48568.2020.9182362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Communication and Signal Processing (ICCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSP48568.2020.9182362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

近年来,情绪或情绪分析已成为一个关键的研究领域,因为它在从社交媒体评论、市场营销、政治学、心理学、人机交互和人工智能中获得见解方面具有巨大的潜在应用。情绪分析处理识别任何给定数据(如文本、语音或图像)中的情绪。目前的工作建议将社交媒体文本与不同程度的多种情绪进行识别和关联。所提出的工作中采用的数据收集和注释过程是人工和半监督注释方法的结合,其中每个tweet被映射到六维情感向量。总共六种人类情感,如快乐、悲伤、愤怒、厌恶、惊讶和恐惧,被认为是情感标签。本文提出了基于twitter词嵌入(word2vec)的词移动器距离(WMD)来开发标记数据集。在标记数据集上开发了一组分类器,用于在任何给定的文本数据中识别推特级别的情绪。在目前的工作中,KNN分类器、基于树的分类器和神经网络分类器得到了发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Data Annotation and Multi-Emotion Classification for Social Media Text
In recent years, sentiment or emotion analysis has become a key research area due to its vast potential applications in getting insights from social media comments, marketing, political science, psychology, human-computer interaction, and artificial intelligence. Emotion analysis deals with identifying the emotions in any given data such as text, speech, or image. The current work proposes to identify and associate social media text to multiple emotions with varying degrees. The data collection and annotation process employed in the proposed work is a combination of manual and semi-supervised annotation method where each tweet is mapped to a six dimensional emotion vector. Totally six human emotions such as happy, sad, anger, disgust, surprise, and fear are considered for emotion-tagging. Word mover‘s distance (WMD) based on twitter word embeddings (word2vec) is proposed to develop a labelled dataset in the current work. A set of classifiers is developed on the labelled dataset to identify emotions at the tweet-level in any given text data. In the current work, KNN, tree-based, and neural network classifiers are developed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Acoustic Scene Classification in Hearing aid using Deep Learning Plant Disease Detection and Recognition using K means Clustering THD Reduction in Execution of A Nine Level Single Phase Inverter Analysis of Heel Fissure Therapy using Thermal Imaging and Image Processing Malicious Application Detection in Android using Machine Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1