Jintao Huang, Wenbin Qian, Binglong Wu, Yinglong Wang
Cost-Sensitive Feature Selection Based on Label Significance and Positive Region
2019 International Conference on Machine Learning and Cybernetics (ICMLC), published 2019-07-01
DOI: https://doi.org/10.1109/ICMLC48188.2019.8949182
Citations: 2
Abstract
Cost-sensitive feature selection is an important research topic in machine learning and data mining. Current research on cost-sensitive feature selection is mainly oriented to single-label or multi-label data. In many application fields, however, the labels of multi-label data are correlated and carry different significance. To address this, the paper introduces label significance into cost-sensitive feature selection and proposes a test-cost-based feature selection algorithm built on label significance. The algorithm combines the positive region with test cost matrices generated from three distributions. Finally, experimental results on four Mulan data sets verify the effectiveness and feasibility of the algorithm.
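As a rough illustration of the two ingredients named in the abstract, the sketch below computes a classical rough-set positive region for a single decision label and the total test cost of a feature subset. It is not the paper's algorithm: the label-significance weighting, the multi-label extension, and the three cost distributions are not reproduced, and the names `positive_region`, `total_test_cost`, and the toy data are hypothetical.

```python
from collections import defaultdict


def positive_region(rows, feature_idx, labels):
    """Indices of samples whose equivalence class under the chosen
    features contains exactly one label (classical positive region)."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        # Samples with equal values on the chosen features are indiscernible.
        key = tuple(row[j] for j in feature_idx)
        blocks[key].append(i)

    pos = set()
    for members in blocks.values():
        # A block belongs to the positive region only if it is label-consistent.
        if len({labels[i] for i in members}) == 1:
            pos.update(members)
    return pos


def total_test_cost(feature_idx, costs):
    """Sum of per-feature test costs (assumed additive and independent)."""
    return sum(costs[j] for j in feature_idx)


# Toy data, invented for illustration only.
rows = [(0, 1), (0, 1), (1, 0), (1, 1)]
labels = ["a", "b", "a", "b"]
costs = [3, 5]  # hypothetical test cost of feature 0 and feature 1

for subset in ([0], [1], [0, 1]):
    pos = positive_region(rows, subset, labels)
    print(subset, sorted(pos), total_test_cost(subset, costs))
```

A cost-sensitive selector would then trade off the size of the positive region against the test cost of the subset; how the paper weighs the two is not stated in the abstract.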