Effectiveness of Normalization Over Processing of Textual Data Using Hybrid Approach Sentiment Analysis

International Journal of Grid and High Performance Computing (IF 0.6, Q4, Computer Science, Theory & Methods). Pub Date: 2020-07-01. DOI: 10.4018/ijghpc.2020070103

Sukhnandan Kaur Johal, R. Mohana


Citations: 2

Abstract

Various natural language processing tasks feed into computerized decision support systems. Among these, sentiment analysis is gaining increasing attention. Most sentiment analysis relies on social media content, and this web content is highly un-normalized, which hinders the performance of decision support systems. Enhancing that performance requires processing the data efficiently. This article proposes a novel method for normalizing web data during the pre-processing phase, with the aim of improving results across different natural language processing tasks. The research applies the technique to data for sentiment analysis. The performance of different learning models is analysed using precision, recall, F-measure, and fallout for normalized and un-normalized sentiment analysis. The results show that after normalization some documents shift their polarity, i.e., from negative to positive. Experimental results show that normalized data processing outperforms un-normalized data processing, with better accuracy.
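The four evaluation measures named in the abstract can be illustrated with a minimal sketch. It uses the standard textbook definitions for a binary polarity task and assumes that "F-measure" means the balanced F1 score, which the abstract does not specify. The labels "pos" and "neg", the helper names, and the toy predictions are hypothetical and are not taken from the article.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    tp: int  # positive documents predicted positive
    fp: int  # negative documents predicted positive
    tn: int  # negative documents predicted negative
    fn: int  # positive documents predicted negative


def confusion(y_true, y_pred, positive="pos"):
    """Count the confusion-matrix cells for one polarity class (hypothetical helper)."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return Confusion(tp, fp, tn, fn)


def safe_div(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def evaluate(c):
    """Precision, recall, balanced F-measure (F1, an assumption), and fallout."""
    precision = safe_div(c.tp, c.tp + c.fp)
    recall = safe_div(c.tp, c.tp + c.fn)
    f_measure = safe_div(2 * precision * recall, precision + recall)
    fallout = safe_div(c.fp, c.fp + c.tn)  # false positive rate
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "fallout": fallout,
    }


# Toy labels, invented purely for illustration.
y_true = ["pos", "pos", "neg", "neg", "pos", "neg"]
y_pred = ["pos", "neg", "pos", "neg", "pos", "neg"]
scores = evaluate(confusion(y_true, y_pred))
```

Running the same evaluation on predictions from un-normalized and normalized input would give the kind of side-by-side comparison the abstract describes. Lower fallout is better, while higher values are better for the other three measures.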


