Improving the Quality of the Clustering Process on Students' Performance using Feature Selection
Y. Yamasari, A. Qoiriah, H. P. Tjahyaningtijas, R. E. Putra, A. Prihanto, Asmunin
2020 International Seminar on Application for Technology of Information and Communication (iSemantic), 2020-09-19
DOI: 10.1109/iSemantic50169.2020.9234249
Abstract
The quality of student-performance clusters depends on how accurately students are grouped according to their performance. However, cluster quality sometimes needs to be improved because the clustering process involves features that are not dominant. Furthermore, previous work often measures cluster quality in unsupervised evaluation with only a single measure. This paper therefore focuses on enhancing cluster quality by eliminating irrelevant features with a feature selection method, the Gini Index. K-means is applied as the clustering method in the mining process. We then propose an evaluation process based on three metrics: the silhouette coefficient, ANOVA, and the t-test. The experimental results show that the Gini Index can improve cluster quality as measured by the three proposed metrics.
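The abstract does not say how the Gini Index scores are computed or which labels they use, so the following is only a minimal sketch under stated assumptions: features are discrete, a performance label is available for scoring, and relevance is measured as the reduction in Gini impurity. The function names and the toy data are hypothetical. The sketch also includes a plain silhouette coefficient for a given cluster assignment. K-means itself, ANOVA, and the t-test are not shown.

```python
from collections import Counter
from math import dist


def gini(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k ** 2)."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def gini_gain(feature, labels):
    """Impurity reduction obtained by splitting the labels on a discrete feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    weighted = sum(len(g) / n * gini(g) for g in groups.values())
    return gini(labels) - weighted


def silhouette(points, clusters):
    """Mean silhouette coefficient over all points, using Euclidean distance."""
    scores = []
    for i, p in enumerate(points):
        by_cluster = {}
        for j, q in enumerate(points):
            if i != j:
                by_cluster.setdefault(clusters[j], []).append(dist(p, q))
        own = by_cluster.pop(clusters[i], [])
        if not own or not by_cluster:
            scores.append(0.0)  # singleton cluster, or only one cluster present
            continue
        a = sum(own) / len(own)                                  # mean intra-cluster distance
        b = min(sum(d) / len(d) for d in by_cluster.values())    # nearest other cluster
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: "quiz" tracks the label, "seat" does not.
labels = ["fail", "fail", "pass", "pass", "fail", "pass"]
quiz = ["low", "low", "high", "high", "low", "high"]
seat = ["a", "b", "a", "b", "a", "b"]
ranking = sorted(
    {"quiz": gini_gain(quiz, labels), "seat": gini_gain(seat, labels)}.items(),
    key=lambda kv: kv[1],
    reverse=True,
)

# Same cluster assignment, with and without an uninformative third coordinate.
assignment = [0, 0, 1, 1]
clean = [(0, 0), (0, 1), (10, 0), (10, 1)]
noisy = [(0, 0, 5), (0, 1, -5), (10, 0, -5), (10, 1, 5)]
```

In this toy setup `ranking` places `quiz` ahead of `seat`, and `silhouette(clean, assignment)` is higher than `silhouette(noisy, assignment)`, which mirrors the abstract's claim that dropping a non-dominant feature can raise cluster quality. It says nothing about the paper's actual data or results.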