使用Relief-F和预算树随机森林(RFBTRF)方法分别添加新特征对准确率的性能进行分析

K. Deepika, M. S. Reddy, N. Pandian, R. D. Kumar
{"title":"使用Relief-F和预算树随机森林(RFBTRF)方法分别添加新特征对准确率的性能进行分析","authors":"K. Deepika, M. S. Reddy, N. Pandian, R. D. Kumar","doi":"10.22075/IJNAA.2021.24724.2808","DOIUrl":null,"url":null,"abstract":"The education is very important for improving the values of students in the society. Different types of features like school related features, student related features, parent related features and teacher related features are influencing the success rate of students in their education. Identification of best features from the huge set of features for analyzing the success or failure of a student is one important challenge to the research community and academicians. The set of features information is collected for preparing the student dataset also one difficult task in the prediction of student academic performance. We collected a student dataset of different schools that contains 4965 student’s information. The dataset contains information of 45 features of different categories such as school related features, student related features, parent related features and teacher related features. All features are not useful for predicting the academic performance of a student. The Data mining methods are applied in various research domains including education to extract hidden information from datasets. The feature selection algorithms are used to determine the best informative features by eliminating the irrelevant and redundant features. In this work, Relief-F Budget Tree Random Forest feature selection algorithm is used to identify the relevant features in the collected school dataset. Five different machine learning models are used to predict the efficiency of feature selection algorithm. The decision tree model shows best accuracy for student academic performance prediction compared with other models. The experimental results display that the RFBTRF algorithm identifies the best informative features for enhancing the accuracy of student academic performance prediction and also reduces the over-fitting issues. The experiment started with individual features and then continued with combination of different categories of features. It was observed that the accuracy of student academic performance prediction is decreased when some categories of features are added to other categories of features.","PeriodicalId":14240,"journal":{"name":"International Journal of Nonlinear Analysis and Applications","volume":"13 1","pages":"1239-1252"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of performance of accuracy by adding new features individually using Relief-F and Budget Tree Random Forest (RFBTRF) method\",\"authors\":\"K. Deepika, M. S. Reddy, N. Pandian, R. D. Kumar\",\"doi\":\"10.22075/IJNAA.2021.24724.2808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The education is very important for improving the values of students in the society. Different types of features like school related features, student related features, parent related features and teacher related features are influencing the success rate of students in their education. Identification of best features from the huge set of features for analyzing the success or failure of a student is one important challenge to the research community and academicians. The set of features information is collected for preparing the student dataset also one difficult task in the prediction of student academic performance. We collected a student dataset of different schools that contains 4965 student’s information. The dataset contains information of 45 features of different categories such as school related features, student related features, parent related features and teacher related features. All features are not useful for predicting the academic performance of a student. The Data mining methods are applied in various research domains including education to extract hidden information from datasets. The feature selection algorithms are used to determine the best informative features by eliminating the irrelevant and redundant features. In this work, Relief-F Budget Tree Random Forest feature selection algorithm is used to identify the relevant features in the collected school dataset. Five different machine learning models are used to predict the efficiency of feature selection algorithm. The decision tree model shows best accuracy for student academic performance prediction compared with other models. The experimental results display that the RFBTRF algorithm identifies the best informative features for enhancing the accuracy of student academic performance prediction and also reduces the over-fitting issues. The experiment started with individual features and then continued with combination of different categories of features. It was observed that the accuracy of student academic performance prediction is decreased when some categories of features are added to other categories of features.\",\"PeriodicalId\":14240,\"journal\":{\"name\":\"International Journal of Nonlinear Analysis and Applications\",\"volume\":\"13 1\",\"pages\":\"1239-1252\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Nonlinear Analysis and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22075/IJNAA.2021.24724.2808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Nonlinear Analysis and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22075/IJNAA.2021.24724.2808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

摘要

教育对于提高学生在社会中的价值观是非常重要的。学校相关特征、学生相关特征、家长相关特征、教师相关特征等不同类型的特征影响着学生的教育成功率。从大量的特征中识别出最佳特征来分析学生的成功或失败是研究团体和学者面临的一个重要挑战。收集特征信息集是为了准备学生数据集,这也是预测学生学习成绩的一项困难任务。我们收集了不同学校的学生数据集,其中包含4965名学生的信息。该数据集包含学校相关特征、学生相关特征、家长相关特征和教师相关特征等45个不同类别的特征信息。所有的特征都不能用来预测学生的学习成绩。数据挖掘方法被应用于包括教育在内的各个研究领域,从数据集中提取隐藏信息。特征选择算法通过剔除不相关和冗余的特征来确定最佳信息特征。在这项工作中,使用Relief-F预算树随机森林特征选择算法来识别收集到的学校数据集中的相关特征。使用五种不同的机器学习模型来预测特征选择算法的效率。与其他模型相比,决策树模型对学生学习成绩的预测精度最高。实验结果表明,RFBTRF算法识别出最佳的信息特征,提高了学生学业成绩预测的准确性,并减少了过拟合问题。实验从单个特征开始,然后进行不同类别特征的组合。研究发现,当某些类别的特征添加到其他类别的特征中时,学生学业成绩预测的准确性会降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Analysis of performance of accuracy by adding new features individually using Relief-F and Budget Tree Random Forest (RFBTRF) method
The education is very important for improving the values of students in the society. Different types of features like school related features, student related features, parent related features and teacher related features are influencing the success rate of students in their education. Identification of best features from the huge set of features for analyzing the success or failure of a student is one important challenge to the research community and academicians. The set of features information is collected for preparing the student dataset also one difficult task in the prediction of student academic performance. We collected a student dataset of different schools that contains 4965 student’s information. The dataset contains information of 45 features of different categories such as school related features, student related features, parent related features and teacher related features. All features are not useful for predicting the academic performance of a student. The Data mining methods are applied in various research domains including education to extract hidden information from datasets. The feature selection algorithms are used to determine the best informative features by eliminating the irrelevant and redundant features. In this work, Relief-F Budget Tree Random Forest feature selection algorithm is used to identify the relevant features in the collected school dataset. Five different machine learning models are used to predict the efficiency of feature selection algorithm. The decision tree model shows best accuracy for student academic performance prediction compared with other models. The experimental results display that the RFBTRF algorithm identifies the best informative features for enhancing the accuracy of student academic performance prediction and also reduces the over-fitting issues. The experiment started with individual features and then continued with combination of different categories of features. It was observed that the accuracy of student academic performance prediction is decreased when some categories of features are added to other categories of features.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
160
期刊最新文献
Classes of certain analytic functions defining by subordinations Some common fixed point theorems for strict contractions Some common fixed point results of multivalued mappings on fuzzy metric space Some coupled coincidence and coupled common fixed point result in dislocated quasi b-metric spaces for rational type contraction mappings Fixed Point Results in Complex Valued Metric Spaces and an Application
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1