Deep Aspect-Sentinet: Aspect Based Emotional Sentiment Analysis Using Hybrid Attention Deep Learning Assisted BILSTM

IF 1 | Zone 4, Computer Science | Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE | International Journal of Uncertainty Fuzziness and Knowledge-Based Systems | Pub Date: 2024-02-20 | DOI: 10.1142/s0218488524500028
S. J. R. K. Padminivalli V., M. V. P. Chandra Sekhara Rao
Citations: 0

Abstract


Data mining and natural language processing researchers have worked on sentiment analysis for the past decade, and deep neural networks (DNNs) have recently shown promising results on the task. Emotional sentiment analysis studies people's attitudes by classifying the emotions expressed in text generated by sources such as Twitter and social media reviews. This study proposes a deep learning technique for aspect-based emotional sentiment classification of text data that can handle a large amount of content. Text pre-processing applies stemming, segmentation, tokenization, case folding, and removal of stop words, nulls, and special characters. After pre-processing, three word-embedding approaches are used to extract relevant features: the Assimilated N-gram Approach (ANA), Boosted Term Frequency-Inverse Document Frequency (BT-IDF), and Enhanced Bidirectional Encoder Representations from Transformers (E-BERT). The features extracted by the three approaches are concatenated using the Feature Fusion Approach (FFA), and the optimal features are selected using the Intensified Hunger Games Search Optimization (I-HGSO) algorithm. Finally, aspect-based sentiment analysis is performed using the Senti-BILSTM (Deep Aspect-EMO SentiNet) autoencoder, which is based on the Hybrid Emotional Aspect Capsule autoencoder. Experiments were run on the Yelp reviews dataset, the IMDB movie reviews dataset, the Amazon reviews dataset, and the Twitter sentiment dataset. The results are evaluated and compared statistically in terms of accuracy, precision, specificity, F1-score, recall, and sensitivity. The reported accuracy is 99.26% on the Yelp reviews dataset, 99.46% on the IMDB movie reviews dataset, 99.26% on the Amazon reviews dataset, and 99.93% on the Twitter sentiment dataset.
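
The pre-processing and feature-fusion steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it uses plain bigram counts and standard smoothed TF-IDF as stand-ins for ANA and BT-IDF, omits stemming, segmentation, and E-BERT entirely, and the stop-word list, function names, and sample reviews are all hypothetical. It only shows how two feature sets computed from the same cleaned tokens can be concatenated per document.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"the", "a", "an", "is", "was", "and", "but", "of", "to"}


def preprocess(text):
    """Case folding, special-character removal, tokenization, stop-word removal."""
    text = text.lower()                        # case folding
    text = re.sub(r"[^a-z0-9\s]", " ", text)   # replace special characters with spaces
    tokens = text.split()                      # whitespace tokenization
    return [t for t in tokens if t not in STOP_WORDS]


def bigrams(tokens):
    """Adjacent token pairs joined by a space (stand-in for an n-gram approach)."""
    return [" ".join(pair) for pair in zip(tokens, tokens[1:])]


def count_vectors(docs, vocab):
    """Raw term counts per document, ordered by vocab."""
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocab])
    return vectors


def tfidf_vectors(docs, vocab):
    """Standard smoothed TF-IDF (NOT the paper's BT-IDF variant)."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero on empty documents
        vectors.append([
            (counts[term] / total)
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term in vocab
        ])
    return vectors


def build_features(reviews):
    """Preprocess reviews, then concatenate TF-IDF and bigram-count features."""
    tokens = [preprocess(r) for r in reviews]
    grams = [bigrams(t) for t in tokens]
    word_vocab = sorted({w for doc in tokens for w in doc})
    gram_vocab = sorted({g for doc in grams for g in doc})
    tfidf = tfidf_vectors(tokens, word_vocab)
    counts = count_vectors(grams, gram_vocab)
    # Feature fusion by simple per-document concatenation.
    fused = [a + b for a, b in zip(tfidf, counts)]
    return tokens, fused


# Hypothetical sample reviews.
sample_reviews = [
    "The food was GREAT, but the service was slow!",
    "Great service and great food.",
]
sample_tokens, sample_fused = build_features(sample_reviews)
```

Each fused vector here has one TF-IDF entry per distinct word followed by one count per distinct bigram. A feature-selection step such as the I-HGSO algorithm described above would then operate on these concatenated vectors; that step is not sketched here.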

Source journal
CiteScore: 2.70
Self-citation rate: 0.00%
Articles published: 48
Review time: 13.5 months
About the journal: The International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems is a forum for research on various methodologies for the management of imprecise, vague, uncertain or incomplete information. The aim of the journal is to promote theoretical or methodological works dealing with all kinds of methods to represent and manipulate imperfectly described pieces of knowledge, excluding results on pure mathematics or simple applications of existing theoretical results. It is published bimonthly, with worldwide distribution to researchers, engineers, decision-makers, and educators.