{"title":"Visual interactive evolutionary algorithm for high dimensional outlier detection and data clustering problems","authors":"Lydia Boudjeloud-Assala","doi":"10.1504/IJBIC.2012.044931","DOIUrl":null,"url":null,"abstract":"Usual visualisation techniques for multidimensional datasets, such as parallel coordinates and scatterplot matrices, do not scale well to high numbers of dimensions. A common approach to solve this problem is dimensionality selection. Existing dimensionality selection techniques usually select pertinent dimension subsets that are significant to the user without loose of information. We present concrete cooperation between automatic algorithms, interactive algorithms and visualisation tools: the evolutionary algorithm is used to obtain optimal dimension subsets which represent the original dataset without loosing information for unsupervised mode (clustering or outlier detection). The last effective cooperation is a visualisation tool used to present the user interactive evolutionary algorithm results and let him actively participate in evolutionary algorithm searching with more efficiency resulting in a faster evolutionary algorithm convergence. We have implemented our approach and applied it to real dataset to confirm this approach is effective for supporting the user in the exploration of high dimensional datasets.","PeriodicalId":49059,"journal":{"name":"International Journal of Bio-Inspired Computation","volume":"26 1","pages":"6-13"},"PeriodicalIF":1.7000,"publicationDate":"2012-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bio-Inspired Computation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1504/IJBIC.2012.044931","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 10
Abstract
Usual visualisation techniques for multidimensional datasets, such as parallel coordinates and scatterplot matrices, do not scale well to high numbers of dimensions. A common approach to solve this problem is dimensionality selection. Existing dimensionality selection techniques usually select pertinent dimension subsets that are significant to the user without loose of information. We present concrete cooperation between automatic algorithms, interactive algorithms and visualisation tools: the evolutionary algorithm is used to obtain optimal dimension subsets which represent the original dataset without loosing information for unsupervised mode (clustering or outlier detection). The last effective cooperation is a visualisation tool used to present the user interactive evolutionary algorithm results and let him actively participate in evolutionary algorithm searching with more efficiency resulting in a faster evolutionary algorithm convergence. We have implemented our approach and applied it to real dataset to confirm this approach is effective for supporting the user in the exploration of high dimensional datasets.
期刊介绍:
IJBIC discusses the new bio-inspired computation methodologies derived from the animal and plant world, such as new algorithms mimicking the wolf schooling, the plant survival process, etc.
Topics covered include:
-New bio-inspired methodologies coming from
creatures living in nature
artificial society-
physical/chemical phenomena-
New bio-inspired methodology analysis tools, e.g. rough sets, stochastic processes-
Brain-inspired methods: models and algorithms-
Bio-inspired computation with big data: algorithms and structures-
Applications associated with bio-inspired methodologies, e.g. bioinformatics.