{"title":"Empowering Simultaneous Feature and Instance Selection in Classification Problems through the Adaptation of Two Selection Algorithms","authors":"R. D. Carmo, F. Freitas, J. Souza","doi":"10.1109/ICMLA.2010.121","DOIUrl":null,"url":null,"abstract":"This paper proposes a new approach to data selection, a key issue in classification problems. This approach, which is based on a feature selection algorithm and one instance selection algorithm, reduces the original dataset in two dimensions, selecting relevant features and retaining important instances simultaneously. The search processes for the best feature and instance subsets occur separately yet, due to the influence of features in the importance of instances and vice versa, they bias one another. The experiments validate the proposed approach showing that this existing relation between features and instances can be reproduced when constructing data selection algorithms and that it leads to a quality improval comparing to the sequential execution of both algorithms.","PeriodicalId":74528,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications","volume":"222 1","pages":"793-796"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2010.121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper proposes a new approach to data selection, a key issue in classification problems. This approach, which is based on a feature selection algorithm and one instance selection algorithm, reduces the original dataset in two dimensions, selecting relevant features and retaining important instances simultaneously. The search processes for the best feature and instance subsets occur separately yet, due to the influence of features in the importance of instances and vice versa, they bias one another. The experiments validate the proposed approach showing that this existing relation between features and instances can be reproduced when constructing data selection algorithms and that it leads to a quality improval comparing to the sequential execution of both algorithms.