Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh
{"title":"利用k -均值聚类集成提高推荐系统的性能","authors":"Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh","doi":"10.1109/IDSTA55301.2022.9923070","DOIUrl":null,"url":null,"abstract":"Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.","PeriodicalId":268343,"journal":{"name":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems\",\"authors\":\"Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh\",\"doi\":\"10.1109/IDSTA55301.2022.9923070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.\",\"PeriodicalId\":268343,\"journal\":{\"name\":\"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)\",\"volume\":\"174 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IDSTA55301.2022.9923070\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDSTA55301.2022.9923070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems
Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.