An Optimal Binary Particle Swarm Optimization Based Feature Selection Model for Big Data Analysis of Product Assessment

R. Sathya, L. Babu
Journal of Computational and Theoretical Nanoscience, vol. 18, pp. 1233-1238, published 2021-04-01 (journal article)
DOI: 10.1166/JCTN.2021.9384 (https://doi.org/10.1166/JCTN.2021.9384)

Abstract

Big data describes the situation in which the volume, velocity, and variety of data exceed the memory or processing capabilities needed for precise and timely decision-making. Big data analytics combines machine learning (ML) and statistical methods to process big data and identify the important data. At present, the number of online product reviews generated every second is increasing exponentially. These applications have produced large volumes of data that can be used for prediction and classification in decision-making. Among the various techniques applied to big data problems, feature selection (FS) is known to be an efficient method. FS operates by exploring subsets of features that are relevant to a precise definition of the existing datasets. Unfortunately, searching over such subsets is a combinatorial problem and is highly time-consuming. Meta-heuristic approaches are therefore typically employed to facilitate the choice of features. This paper presents an optimal extreme learning machine (ELM) based binary particle swarm optimization (BPSO) method to carry out the FS process. The proposed method builds a fitness function (FF) using ELM, and the best solution of the FF is searched for with the BPSO technique. For evaluation, a product review dataset derived from Amazon, including synthetic data and comprising a total of 235,000 positive and 147,000 negative review records, is used. The experimental results indicate that the ELM-BPSO technique performs comparatively best.
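
The abstract does not give the exact formulation, so the following is only a minimal sketch of the general idea: a binary PSO searches over feature masks, and each mask is scored by the accuracy of a basic ELM. It assumes the standard sigmoid-transfer BPSO update and a plain ELM with a random tanh hidden layer and pseudo-inverse output weights. All function names, hyperparameters (`n_hidden`, `w`, `c1`, `c2`, swarm size), and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def elm_accuracy(X_train, y_train, X_val, y_val, n_hidden=20, seed=0):
    """Fit a basic ELM and return validation accuracy. Labels must be 0 or 1."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(X_train.shape[1], n_hidden))  # random input weights, never trained
    b = rng.normal(size=n_hidden)                      # random hidden biases
    H = np.tanh(X_train @ W + b)                       # hidden-layer activations
    beta = np.linalg.pinv(H) @ (2.0 * y_train - 1.0)   # least-squares output weights, targets in {-1, +1}
    pred = (np.tanh(X_val @ W + b) @ beta > 0).astype(int)
    return float(np.mean(pred == y_val))

def fitness(mask, data):
    """Assumed fitness: ELM accuracy on the selected features (0.0 for an empty subset)."""
    mask = np.asarray(mask).astype(bool)
    if not mask.any():
        return 0.0
    X_tr, y_tr, X_va, y_va = data
    return elm_accuracy(X_tr[:, mask], y_tr, X_va[:, mask], y_va)

def bpso_select(data, n_features, n_particles=12, n_iters=20,
                w=0.7, c1=1.5, c2=1.5, seed=0):
    """Binary PSO over 0/1 feature masks; returns (best_mask, best_fitness)."""
    rng = np.random.default_rng(seed)
    pos = (rng.random((n_particles, n_features)) < 0.5).astype(int)
    vel = rng.uniform(-1.0, 1.0, (n_particles, n_features))
    pbest = pos.copy()
    pbest_fit = np.array([fitness(p, data) for p in pos])
    g = int(np.argmax(pbest_fit))
    gbest, gbest_fit = pbest[g].copy(), pbest_fit[g]
    for _ in range(n_iters):
        r1 = rng.random(vel.shape)
        r2 = rng.random(vel.shape)
        # Velocity update pulls each bit toward personal and global bests.
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        vel = np.clip(vel, -4.0, 4.0)
        # Sigmoid transfer: velocity becomes the probability that a bit is 1.
        pos = (rng.random(vel.shape) < 1.0 / (1.0 + np.exp(-vel))).astype(int)
        fit = np.array([fitness(p, data) for p in pos])
        improved = fit > pbest_fit
        pbest[improved] = pos[improved]
        pbest_fit[improved] = fit[improved]
        g = int(np.argmax(pbest_fit))
        if pbest_fit[g] > gbest_fit:
            gbest, gbest_fit = pbest[g].copy(), pbest_fit[g]
    return gbest.astype(bool), float(gbest_fit)

def make_toy_data(n=200, n_features=10, seed=0):
    """Synthetic stand-in for review features: only columns 0 and 1 carry signal."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, n_features))
    y = (X[:, 0] + X[:, 1] > 0).astype(int)
    half = n // 2
    return X[:half], y[:half], X[half:], y[half:]
```

Calling `bpso_select(make_toy_data(), n_features=10)` returns a boolean mask over the ten toy features together with the validation accuracy of the ELM trained on that subset. In practice the fitness would usually also penalize the number of selected features, and the score used for selection should come from a split that is separate from the final test data.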