{"title":"Effects of Population Initialization on Evolutionary Techniques for Subgroup Discovery in High Dimensional Datasets","authors":"Vitor de Albuquerque Torreao, Renato Vimieiro","doi":"10.1109/BRACIS.2018.00013","DOIUrl":null,"url":null,"abstract":"Many Evolutionary Algorithms have been proposed to solve the Subgroup Discovery task. Some of these, however, have been shown to work poorly in high dimensional problems. The best performing evolutionary algorithm for subgroup discovery in high dimensional datasets has a particular way to initialize its starting population, limiting the size of initial solutions to the lowest possible value. As with most population-based techniques, the outcome of evolutionary algorithms is usually dependent on the initial set of solutions, which are typically randomly generated. The impact of choosing one initialization technique over another in the final presented solution has been the topic of many published works in the broad area of evolutionary computation. However, to the best of our knowledge, it has not been the topic of study in the specific case of the Subgroup Discovery task, especially when considering high dimensional datasets. Therefore, this paper aims at studying whether or not it is possible to improve the performance of evolutionary algorithms in high dimensional subgroup discovery tasks by biasing the initial population to individuals with lower sizes.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":" 23","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2018.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Many Evolutionary Algorithms have been proposed to solve the Subgroup Discovery task. Some of these, however, have been shown to work poorly in high dimensional problems. The best performing evolutionary algorithm for subgroup discovery in high dimensional datasets has a particular way to initialize its starting population, limiting the size of initial solutions to the lowest possible value. As with most population-based techniques, the outcome of evolutionary algorithms is usually dependent on the initial set of solutions, which are typically randomly generated. The impact of choosing one initialization technique over another in the final presented solution has been the topic of many published works in the broad area of evolutionary computation. However, to the best of our knowledge, it has not been the topic of study in the specific case of the Subgroup Discovery task, especially when considering high dimensional datasets. Therefore, this paper aims at studying whether or not it is possible to improve the performance of evolutionary algorithms in high dimensional subgroup discovery tasks by biasing the initial population to individuals with lower sizes.