Ali Arshad, Muhammad Hassam, Saman Riaz, Shahaboddin Shamshirband
{"title":"Empirical Study of Semi-Supervised Deep Fuzzy C-Mean Clustering Algorithm","authors":"Ali Arshad, Muhammad Hassam, Saman Riaz, Shahaboddin Shamshirband","doi":"10.1109/EUROCON52738.2021.9535541","DOIUrl":null,"url":null,"abstract":"Nowadays machine learning is most widely used in different applications to enhance the quality of system. In our study, we are going to propose an Empirical study of semi supervised Deep Fuzzy C-Mean clustering algorithm on different state-of-the-art pre-processing approach. However, the performance of the models depends on the quality of datasets. In this paper, we initially split our data into labeled and unlabeled data and simultaneously compare the feature between labeled and unlabeled data to extract the knowledge from unlabeled data. In second stage of feature selection and instance reduction, we apply Information Gain (IG) to conduct redundancy control and Random under sampling and Random over sampling to handle the imbalance problem. In our paper, we use the dataset (MNIST) to check the demonstration performance of our given approaches.","PeriodicalId":328338,"journal":{"name":"IEEE EUROCON 2021 - 19th International Conference on Smart Technologies","volume":"291 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE EUROCON 2021 - 19th International Conference on Smart Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUROCON52738.2021.9535541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays machine learning is most widely used in different applications to enhance the quality of system. In our study, we are going to propose an Empirical study of semi supervised Deep Fuzzy C-Mean clustering algorithm on different state-of-the-art pre-processing approach. However, the performance of the models depends on the quality of datasets. In this paper, we initially split our data into labeled and unlabeled data and simultaneously compare the feature between labeled and unlabeled data to extract the knowledge from unlabeled data. In second stage of feature selection and instance reduction, we apply Information Gain (IG) to conduct redundancy control and Random under sampling and Random over sampling to handle the imbalance problem. In our paper, we use the dataset (MNIST) to check the demonstration performance of our given approaches.
目前,机器学习被广泛应用于不同的应用中,以提高系统的质量。在我们的研究中,我们将在不同的最先进的预处理方法上对半监督深度模糊c均值聚类算法进行实证研究。然而,模型的性能取决于数据集的质量。在本文中,我们首先将数据分为有标记和未标记数据,同时比较有标记和未标记数据之间的特征,从未标记数据中提取知识。在第二阶段的特征选择和实例约简中,我们利用信息增益(Information Gain, IG)进行冗余控制,并利用随机下采样和随机上采样来处理不平衡问题。在我们的论文中,我们使用数据集(MNIST)来检查我们给定方法的演示性能。