{"title":"混合内容文档的复合选择降维策略","authors":"S. Raheel","doi":"10.1109/AIMS.2015.28","DOIUrl":null,"url":null,"abstract":"Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential. The advantage of this approach is that it allows for an important number of features that are highly relevant to their classes but statistically insignificant to participate in the learning process of the classifier. Results show that this new approach is promising and as good as the traditional one. Higher accuracy is reached when the number of the infrequent features increases. This approach is useful when we need for the infrequent features to be part of the predictive model since this, in turn, enforces the subjectivity of the decision made by the classifier.","PeriodicalId":121874,"journal":{"name":"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content\",\"authors\":\"S. Raheel\",\"doi\":\"10.1109/AIMS.2015.28\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential. The advantage of this approach is that it allows for an important number of features that are highly relevant to their classes but statistically insignificant to participate in the learning process of the classifier. Results show that this new approach is promising and as good as the traditional one. Higher accuracy is reached when the number of the infrequent features increases. This approach is useful when we need for the infrequent features to be part of the predictive model since this, in turn, enforces the subjectivity of the decision made by the classifier.\",\"PeriodicalId\":121874,\"journal\":{\"name\":\"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIMS.2015.28\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIMS.2015.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content
Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential. The advantage of this approach is that it allows for an important number of features that are highly relevant to their classes but statistically insignificant to participate in the learning process of the classifier. Results show that this new approach is promising and as good as the traditional one. Higher accuracy is reached when the number of the infrequent features increases. This approach is useful when we need for the infrequent features to be part of the predictive model since this, in turn, enforces the subjectivity of the decision made by the classifier.