A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews

Harnani Mat Zin, N. Mustapha, M. A. Azmi Murad, Nurfadhlina Mohd Sharef
{"title":"A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews","authors":"Harnani Mat Zin, N. Mustapha, M. A. Azmi Murad, Nurfadhlina Mohd Sharef","doi":"10.32890/jict2022.21.4.5","DOIUrl":null,"url":null,"abstract":"Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal features can be challenging given the inherent complication in dealing with over-fitting issues. Most of the current studies used a heuristic method to perform the classification process that will result in selecting and examining only a single feature subset, while ignoring the other subsets that might give better results. This study explored the effect of using the meta-heuristic method together with the ensemble classification method in the sentiment classification of online reviews. Adding to that point, the extraction and selection of relevant features used feature ranking, hyper-parameter optimization, crossover, and mutation, while the classification process utilized the ensemble classifier. The proposed method was tested on the polarity movie review dataset v2.0 and product review dataset (books, electronics, kitchen, and music). The test results indicated that the proposed method significantly improved the classification results by 94%, which far exceeded the existing method. Therefore, the proposed feature extraction and selection method can help in improving the performance of SA in online reviews and, at the same time, reduce thenumber of extracted features. ","PeriodicalId":39396,"journal":{"name":"International Journal of Information and Communication Technology","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32890/jict2022.21.4.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

Abstract

Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal features can be challenging given the inherent complication in dealing with over-fitting issues. Most of the current studies used a heuristic method to perform the classification process that will result in selecting and examining only a single feature subset, while ignoring the other subsets that might give better results. This study explored the effect of using the meta-heuristic method together with the ensemble classification method in the sentiment classification of online reviews. Adding to that point, the extraction and selection of relevant features used feature ranking, hyper-parameter optimization, crossover, and mutation, while the classification process utilized the ensemble classifier. The proposed method was tested on the polarity movie review dataset v2.0 and product review dataset (books, electronics, kitchen, and music). The test results indicated that the proposed method significantly improved the classification results by 94%, which far exceeded the existing method. Therefore, the proposed feature extraction and selection method can help in improving the performance of SA in online reviews and, at the same time, reduce thenumber of extracted features. 
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于元启发式算法的在线评论最小质量特征提取
特征提取和选择在情感分析中至关重要,通过去除冗余的特征来提取和选择合适的特征。因此,该过程的成功实现可以提高分类的准确性。考虑到处理过拟合问题的固有复杂性,选择高质量的最小特征是不可避免的挑战。目前的大多数研究使用启发式方法来执行分类过程,这将导致只选择和检查单个特征子集,而忽略了可能给出更好结果的其他子集。本研究探讨了元启发式方法与集成分类方法在网络评论情感分类中的应用效果。此外,相关特征的提取和选择使用了特征排序、超参数优化、交叉和突变,分类过程使用了集成分类器。在极性电影评论数据集v2.0和产品评论数据集(书籍、电子、厨房和音乐)上对所提出的方法进行了测试。测试结果表明,本文提出的方法将分类结果显著提高了94%,远远超过了现有的方法。因此,本文提出的特征提取和选择方法有助于提高自动识别在在线评论中的性能,同时减少提取特征的数量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
0.70
自引率
0.00%
发文量
95
期刊介绍: IJICT is a refereed journal in the field of information and communication technology (ICT), providing an international forum for professionals, engineers and researchers. IJICT reports the new paradigms in this emerging field of technology and envisions the future developments in the frontier areas. The journal addresses issues for the vertical and horizontal applications in this area. Topics covered include: -Information theory/coding- Information/IT/network security, standards, applications- Internet/web based systems/products- Data mining/warehousing- Network planning, design, administration- Sensor/ad hoc networks- Human-computer intelligent interaction, AI- Computational linguistics, digital speech- Distributed/cooperative media- Interactive communication media/content- Social interaction, mobile communications- Signal representation/processing, image processing- Virtual reality, cyber law, e-governance- Microprocessor interfacing, hardware design- Control of industrial processes, ERP/CRM/SCM
期刊最新文献
A Huffman based short message service compression technique using adjacent distance array Machine Learning Models for Behavioural Diversity of Asian Elephants Prediction Using Satellite Collar Data Visually Impaired Usability Requirements for Accessible Mobile Applications: A Checklist for Mobile E-book Applications Dengue Outbreak Detection Model Using Artificial Immune System: A Malaysian Case Study Modelling and Forecasting the Trend in Cryptocurrency Prices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1