Keyu Liu, Jingjing Song, Wendong Zhang, Xibei Yang
{"title":"Alleviating Over-Fitting in Attribute Reduction: An Early Stopping Strategy","authors":"Keyu Liu, Jingjing Song, Wendong Zhang, Xibei Yang","doi":"10.1109/ICWAPR.2018.8521316","DOIUrl":null,"url":null,"abstract":"In rough set theory, forward heuristic algorithm selects the most important attribute in the process of attribute reduction until the given constraint is satisfied. However, the attributes selected by such strategy may bring us over-fitting. To solve such problem, a new heuristic algorithm is designed: the importance of the attribute is obtained by cross validation and then the Early Stopping is employed to terminate the algorithm if over-fitting occurs. Based on the neighborhood rough set, the heuristic algorithm is compared with the new method over several UCI data sets. The experimental results show that: 1) the proposed algorithm can effectively alleviate over-fitting; 2) the reduct obtained by the new algorithm may offer us better classification performances.","PeriodicalId":385478,"journal":{"name":"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWAPR.2018.8521316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In rough set theory, forward heuristic algorithm selects the most important attribute in the process of attribute reduction until the given constraint is satisfied. However, the attributes selected by such strategy may bring us over-fitting. To solve such problem, a new heuristic algorithm is designed: the importance of the attribute is obtained by cross validation and then the Early Stopping is employed to terminate the algorithm if over-fitting occurs. Based on the neighborhood rough set, the heuristic algorithm is compared with the new method over several UCI data sets. The experimental results show that: 1) the proposed algorithm can effectively alleviate over-fitting; 2) the reduct obtained by the new algorithm may offer us better classification performances.