{"title":"Hybrid Symbiotic Organism Search algorithms for Automatic Data Clustering","authors":"V. Rajah, Ezugwu E. Absalom","doi":"10.1109/ICTAS47918.2020.234001","DOIUrl":null,"url":null,"abstract":"Cluster analysis is an essential tool in data mining. Several clustering algorithms have been proposed and implemented for which most are able to find the good quality or optimal clustering solutions. However, most of these algorithms still depend on the number of a cluster being provided a priori. In dealing with real-life problems, the number of clusters is unknown and determining the optimal number of clusters for a large density and high dimensionality dataset is quite a difficult task to handle. This paper, therefore, proposes five new hybrid symbiotic organism search algorithms to automatically partition datasets without any prior information regarding the number of clusters. Furthermore, the hybrid algorithms will be evaluated in terms of solution quality using the Davies–Bouldin clustering validity index. The simulation results show that the performance of the hybrid symbiotic organisms search particle swarm optimization algorithm is superior to the other proposed hybrid algorithms.","PeriodicalId":431012,"journal":{"name":"2020 Conference on Information Communications Technology and Society (ICTAS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Conference on Information Communications Technology and Society (ICTAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAS47918.2020.234001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Cluster analysis is an essential tool in data mining. Several clustering algorithms have been proposed and implemented for which most are able to find the good quality or optimal clustering solutions. However, most of these algorithms still depend on the number of a cluster being provided a priori. In dealing with real-life problems, the number of clusters is unknown and determining the optimal number of clusters for a large density and high dimensionality dataset is quite a difficult task to handle. This paper, therefore, proposes five new hybrid symbiotic organism search algorithms to automatically partition datasets without any prior information regarding the number of clusters. Furthermore, the hybrid algorithms will be evaluated in terms of solution quality using the Davies–Bouldin clustering validity index. The simulation results show that the performance of the hybrid symbiotic organisms search particle swarm optimization algorithm is superior to the other proposed hybrid algorithms.