Zhenxi Chen , Gong Chen , Can Gao , Jie Zhou , Jiajun Wen
{"title":"基于加权模糊边际的稳健特征选择与三向决策","authors":"Zhenxi Chen , Gong Chen , Can Gao , Jie Zhou , Jiajun Wen","doi":"10.1016/j.ijar.2024.109253","DOIUrl":null,"url":null,"abstract":"<div><p>Feature selection has shown noticeable benefits to the tasks of machine learning and data mining, and an extensive variety of feature selection methods has been proposed to remove redundant and irrelevant features. However, most of the existing methods aim to find a feature subset to perfectly fit data with the minimum empirical risk, thus causing the problems of overfitting and noise sensitivity. In this study, a robust weighted fuzzy margin-based feature selection is proposed for uncertain data with noise. Concretely, a robust weighted fuzzy margin based on fuzzy rough sets is first introduced to evaluate the significance of different features. Then, a gradient ascent algorithm based on the noise filtering strategy and three-way decision is developed to optimize the sample and feature weights to further enlarge the fuzzy margin. Finally, an adaptive feature selection algorithm based on the robust weighted fuzzy margin is presented to generate an optimal feature subset with a large margin. Extensive experiments on the UCI benchmark datasets show that the proposed method could obtain high-quality feature subsets and outperform other representative methods under different noise rates.</p></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"173 ","pages":"Article 109253"},"PeriodicalIF":3.2000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust weighted fuzzy margin-based feature selection with three-way decision\",\"authors\":\"Zhenxi Chen , Gong Chen , Can Gao , Jie Zhou , Jiajun Wen\",\"doi\":\"10.1016/j.ijar.2024.109253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Feature selection has shown noticeable benefits to the tasks of machine learning and data mining, and an extensive variety of feature selection methods has been proposed to remove redundant and irrelevant features. However, most of the existing methods aim to find a feature subset to perfectly fit data with the minimum empirical risk, thus causing the problems of overfitting and noise sensitivity. In this study, a robust weighted fuzzy margin-based feature selection is proposed for uncertain data with noise. Concretely, a robust weighted fuzzy margin based on fuzzy rough sets is first introduced to evaluate the significance of different features. Then, a gradient ascent algorithm based on the noise filtering strategy and three-way decision is developed to optimize the sample and feature weights to further enlarge the fuzzy margin. Finally, an adaptive feature selection algorithm based on the robust weighted fuzzy margin is presented to generate an optimal feature subset with a large margin. Extensive experiments on the UCI benchmark datasets show that the proposed method could obtain high-quality feature subsets and outperform other representative methods under different noise rates.</p></div>\",\"PeriodicalId\":13842,\"journal\":{\"name\":\"International Journal of Approximate Reasoning\",\"volume\":\"173 \",\"pages\":\"Article 109253\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Approximate Reasoning\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0888613X24001403\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Approximate Reasoning","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888613X24001403","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Robust weighted fuzzy margin-based feature selection with three-way decision
Feature selection has shown noticeable benefits to the tasks of machine learning and data mining, and an extensive variety of feature selection methods has been proposed to remove redundant and irrelevant features. However, most of the existing methods aim to find a feature subset to perfectly fit data with the minimum empirical risk, thus causing the problems of overfitting and noise sensitivity. In this study, a robust weighted fuzzy margin-based feature selection is proposed for uncertain data with noise. Concretely, a robust weighted fuzzy margin based on fuzzy rough sets is first introduced to evaluate the significance of different features. Then, a gradient ascent algorithm based on the noise filtering strategy and three-way decision is developed to optimize the sample and feature weights to further enlarge the fuzzy margin. Finally, an adaptive feature selection algorithm based on the robust weighted fuzzy margin is presented to generate an optimal feature subset with a large margin. Extensive experiments on the UCI benchmark datasets show that the proposed method could obtain high-quality feature subsets and outperform other representative methods under different noise rates.
期刊介绍:
The International Journal of Approximate Reasoning is intended to serve as a forum for the treatment of imprecision and uncertainty in Artificial and Computational Intelligence, covering both the foundations of uncertainty theories, and the design of intelligent systems for scientific and engineering applications. It publishes high-quality research papers describing theoretical developments or innovative applications, as well as review articles on topics of general interest.
Relevant topics include, but are not limited to, probabilistic reasoning and Bayesian networks, imprecise probabilities, random sets, belief functions (Dempster-Shafer theory), possibility theory, fuzzy sets, rough sets, decision theory, non-additive measures and integrals, qualitative reasoning about uncertainty, comparative probability orderings, game-theoretic probability, default reasoning, nonstandard logics, argumentation systems, inconsistency tolerant reasoning, elicitation techniques, philosophical foundations and psychological models of uncertain reasoning.
Domains of application for uncertain reasoning systems include risk analysis and assessment, information retrieval and database design, information fusion, machine learning, data and web mining, computer vision, image and signal processing, intelligent data analysis, statistics, multi-agent systems, etc.