Mobile Phone Price Class Prediction Using Different Classification Algorithms with Feature Selection and Parameter Optimization

Mustafa Çetın, Yunus Koç
{"title":"Mobile Phone Price Class Prediction Using Different Classification Algorithms with Feature Selection and Parameter Optimization","authors":"Mustafa Çetın, Yunus Koç","doi":"10.1109/ISMSIT52890.2021.9604550","DOIUrl":null,"url":null,"abstract":"Machine Learning (ML) algorithms are used in many fields such as finance, education, industry, medicine and e-commerce. ML algorithms show performance differences depending on the dataset and processing steps. Choosing the right algorithm, preprocessing and postprocessing methods has great importance to achieve good results. In this paper, Random Forest Classifier, Logistic Regression Classifier, Decision Tree Classifier, Linear Discriminant Analysis, K-Nearest Neighbor Classifier and SVC methods are compared to predict mobile phone price class. The \"Mobile Price Classification\" dataset which is taken from Kaggle.com is used to evaluate methods. Firstly, all values at the dataset are checked to verify that there are no missing values. After that, scaling is applied to dataset in order to obtain more relevant data for ML algorithms. Then, feature selection methods which reduce the computational cost by reducing the number of inputs are performed to get meaningful features. Finally, the parameters of classification algorithms are tuned to improve the system accuracy. According to obtained results, it is seen that ANOVA f-test feature selection method is more convenient for this dataset. It gives satisfying accuracy with a minimum number of features. It is also seen that the SVC classifier has the highest test accuracy compared to other models.","PeriodicalId":120997,"journal":{"name":"2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISMSIT52890.2021.9604550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Machine Learning (ML) algorithms are used in many fields such as finance, education, industry, medicine and e-commerce. ML algorithms show performance differences depending on the dataset and processing steps. Choosing the right algorithm, preprocessing and postprocessing methods has great importance to achieve good results. In this paper, Random Forest Classifier, Logistic Regression Classifier, Decision Tree Classifier, Linear Discriminant Analysis, K-Nearest Neighbor Classifier and SVC methods are compared to predict mobile phone price class. The "Mobile Price Classification" dataset which is taken from Kaggle.com is used to evaluate methods. Firstly, all values at the dataset are checked to verify that there are no missing values. After that, scaling is applied to dataset in order to obtain more relevant data for ML algorithms. Then, feature selection methods which reduce the computational cost by reducing the number of inputs are performed to get meaningful features. Finally, the parameters of classification algorithms are tuned to improve the system accuracy. According to obtained results, it is seen that ANOVA f-test feature selection method is more convenient for this dataset. It gives satisfying accuracy with a minimum number of features. It is also seen that the SVC classifier has the highest test accuracy compared to other models.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于特征选择和参数优化的不同分类算法的手机价格分类预测
机器学习(ML)算法应用于金融、教育、工业、医药和电子商务等诸多领域。机器学习算法根据数据集和处理步骤显示性能差异。选择合适的算法、预处理和后处理方法对获得良好的效果至关重要。本文比较了随机森林分类器、逻辑回归分类器、决策树分类器、线性判别分析、k近邻分类器和SVC方法对手机价格分类的预测效果。从Kaggle.com获取的“移动价格分类”数据集用于评估方法。首先,检查数据集上的所有值,以验证没有缺失值。之后,对数据集进行缩放,以获得更多与ML算法相关的数据。然后,采用特征选择方法,通过减少输入数量来降低计算成本,从而获得有意义的特征。最后,对分类算法的参数进行了调整,以提高系统的准确率。从得到的结果可以看出,方差分析f检验特征选择方法对于该数据集更为方便。它用最少的特征给出了令人满意的精度。还可以看出,与其他模型相比,SVC分类器具有最高的测试精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design Improvement of a Spoke-Type PMSG with Ferrite Magnets to Reduce Space Harmonics Fuzzy AHP-TOPSIS Hybrid Method for Indoor Positioning Technology Selection for Shipyards Assessment of Slime Mould Algorithm Based Real PID Plus Second-order Derivative Controller for Magnetic Levitation System ROS Validation for Fuzzy Logic Contro Implemented under Differential Drive Mobile Robot Physical and Digital Accessibility in Museums in the New Reality
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1