基于距离测量和深度学习技术的社交媒体评论意向精细分析

Akila R, R. S
{"title":"基于距离测量和深度学习技术的社交媒体评论意向精细分析","authors":"Akila R, R. S","doi":"10.58346/jisis.2023.i2.003","DOIUrl":null,"url":null,"abstract":"Intent analysis and classification are performed to identify the expressions of intent in the given text. In this paper, the dataset is classified into emotion classifications by utilizing machine learning model SVM, Bipolar classification, Fine Grained Analysis, and Sarcasm detection, with Naïve Bayes and Random Forest techniques of deep learning, including Long Short-Term Memory to perform intention analysis on social media data. Then Fine-grained or Multi-Class Sentiment analysis is used for further classification of the five classes, viz. negative, strong negative, neutral, positive, and strong positive, which detects the sarcastic reviews in the movie dataset. The emotional intention behind the review comments is classified as happiness, rage, sadness, joy, anger, and disgust by using SVM. The reviews are analyzed and calculated based on their subjectivity and context level similarity using Related Relaxed Word Mover Distance (RRWMD) semantic similarity measure. With the advantage of the RRWMD algorithm, the reviews from the context containing deviated or irrelevant contents were removed before being applied to the classification algorithms, thereby reducing the execution time, which obtains a 3% improvement in accuracy. The disadvantage of the RRWMD algorithm is only one deep learning algorithm is compared. From the observed accuracy scores and classification reports, the LSTM has provided higher accuracy, despite the long execution time. The Naïve Bayes model has produced lower accuracy than the neural network model but was efficient, taking less time to fit and classify. The results from various experiments have proven that the semantic similarity measure provides more accurate results than the state-of-the-art model.","PeriodicalId":36718,"journal":{"name":"Journal of Internet Services and Information Security","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fine Grained Analysis of Intention for Social Media Reviews Using Distance Measure and Deep Learning Technique\",\"authors\":\"Akila R, R. S\",\"doi\":\"10.58346/jisis.2023.i2.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Intent analysis and classification are performed to identify the expressions of intent in the given text. In this paper, the dataset is classified into emotion classifications by utilizing machine learning model SVM, Bipolar classification, Fine Grained Analysis, and Sarcasm detection, with Naïve Bayes and Random Forest techniques of deep learning, including Long Short-Term Memory to perform intention analysis on social media data. Then Fine-grained or Multi-Class Sentiment analysis is used for further classification of the five classes, viz. negative, strong negative, neutral, positive, and strong positive, which detects the sarcastic reviews in the movie dataset. The emotional intention behind the review comments is classified as happiness, rage, sadness, joy, anger, and disgust by using SVM. The reviews are analyzed and calculated based on their subjectivity and context level similarity using Related Relaxed Word Mover Distance (RRWMD) semantic similarity measure. With the advantage of the RRWMD algorithm, the reviews from the context containing deviated or irrelevant contents were removed before being applied to the classification algorithms, thereby reducing the execution time, which obtains a 3% improvement in accuracy. The disadvantage of the RRWMD algorithm is only one deep learning algorithm is compared. From the observed accuracy scores and classification reports, the LSTM has provided higher accuracy, despite the long execution time. The Naïve Bayes model has produced lower accuracy than the neural network model but was efficient, taking less time to fit and classify. The results from various experiments have proven that the semantic similarity measure provides more accurate results than the state-of-the-art model.\",\"PeriodicalId\":36718,\"journal\":{\"name\":\"Journal of Internet Services and Information Security\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Internet Services and Information Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.58346/jisis.2023.i2.003\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Internet Services and Information Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.58346/jisis.2023.i2.003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

意图分析和分类是为了识别给定文本中的意图表达。在本文中,通过利用机器学习模型SVM、双极分类、细粒度分析和Sarcasm检测,以及深度学习的Naïve Bayes和随机森林技术(包括长短期记忆),将数据集分类为情绪分类,以对社交媒体数据进行意向分析。然后使用细粒度或多类情感分析对五类进行进一步分类,即负面、强负面、中性、正面和强正面,检测电影数据集中的讽刺评论。使用支持向量机将评论背后的情感意图分为快乐、愤怒、悲伤、喜悦、愤怒和厌恶。基于评论的主观性和上下文水平的相似性,使用相关放松词移动距离(RRWMD)语义相似性度量对评论进行分析和计算。利用RRWMD算法的优势,在将包含偏离或无关内容的上下文中的评论应用于分类算法之前,将其删除,从而减少了执行时间,准确率提高了3%。RRWMD算法的缺点是只比较了一种深度学习算法。从观察到的准确性得分和分类报告来看,尽管执行时间很长,但LSTM提供了更高的准确性。Naïve Bayes模型的精度低于神经网络模型,但效率很高,拟合和分类时间更短。各种实验的结果已经证明,语义相似性度量比现有技术的模型提供了更准确的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Fine Grained Analysis of Intention for Social Media Reviews Using Distance Measure and Deep Learning Technique
Intent analysis and classification are performed to identify the expressions of intent in the given text. In this paper, the dataset is classified into emotion classifications by utilizing machine learning model SVM, Bipolar classification, Fine Grained Analysis, and Sarcasm detection, with Naïve Bayes and Random Forest techniques of deep learning, including Long Short-Term Memory to perform intention analysis on social media data. Then Fine-grained or Multi-Class Sentiment analysis is used for further classification of the five classes, viz. negative, strong negative, neutral, positive, and strong positive, which detects the sarcastic reviews in the movie dataset. The emotional intention behind the review comments is classified as happiness, rage, sadness, joy, anger, and disgust by using SVM. The reviews are analyzed and calculated based on their subjectivity and context level similarity using Related Relaxed Word Mover Distance (RRWMD) semantic similarity measure. With the advantage of the RRWMD algorithm, the reviews from the context containing deviated or irrelevant contents were removed before being applied to the classification algorithms, thereby reducing the execution time, which obtains a 3% improvement in accuracy. The disadvantage of the RRWMD algorithm is only one deep learning algorithm is compared. From the observed accuracy scores and classification reports, the LSTM has provided higher accuracy, despite the long execution time. The Naïve Bayes model has produced lower accuracy than the neural network model but was efficient, taking less time to fit and classify. The results from various experiments have proven that the semantic similarity measure provides more accurate results than the state-of-the-art model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Internet Services and Information Security
Journal of Internet Services and Information Security Computer Science-Computer Science (miscellaneous)
CiteScore
3.90
自引率
0.00%
发文量
0
审稿时长
8 weeks
期刊最新文献
Evaluating the Effectiveness of a Gan Fingerprint Removal Approach in Fooling Deepfake Face Detection CSA-Forecaster: Stacked Model for Forecasting Child Sexual Abuse A Nonredundant SVD-based Precoding Matrix for Blind Channel Estimation in CP-OFDM Systems Over Channels with Memory An Intelligent Health Surveillance System: Predictive Modeling of Cardiovascular Parameters through Machine Learning Algorithms Using LoRa Communication and Internet of Medical Things (IoMT) Identifying Large Young Hacker Concentration in Indonesia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1