{"title":"Zone Oriented Binary Multi-Objective Charged System Search Based Feature Selection Approach for Multi-Label Classification","authors":"Pradip Dhal, Chandrashekhar Azad","doi":"10.1111/exsy.13803","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Multi-label learning is used in situations when each instance has many labels. Due to the high-dimensional feature space and noise in multi-label datasets, multi-label learning algorithms face substantial problems. Researchers have researched multi-label FS techniques to minimise data dimensionality in multi-label classification (MLC) problems. Global optimization approaches, such as evolutionary algorithm (EA) optimizers, scale well to high-dimensional problems. This paper proposes a hybrid multi-objective FS approach based on the charged system search (CSS) and grey wolf optimization (GWO) methods for the MLC problem. The first objective is to minimise the hamming loss (HLoss) value, and the second objective is to minimise the features from the feature set. A novel concept feature zone based on informative and non-informative features has been added here. Here, we have added the Preference Ranking Organisation METHod for Enrichment of Evaluations (PROMETHEE) approach to the objective function in the FS approach. Here, we have added the new velocity equation for the updated charge particles in the CSS algorithm. The GWO property has been added to the new velocity equation to improve the exploration and exploiting property in the CSS algorithm. For experimental verification, we have utilised six publically accessible multi-label datasets: <i>CAL500</i>, <i>Emotions</i>, <i>Medical</i>, <i>Enron</i>, <i>Scene</i>, and the <i>Yeast</i>. The findings show that the proposed approach gets the best value regarding various performance metrics. The proposed method achieves optimal Jaccard Score (JC) and HLoss values of 0.4408 and 0.0645 for <i>CAL500</i>, 0.8169 and 0.0719 for <i>Emotions</i>, 0.9486 and 0.0019 for <i>Medical</i>, 0.5950 and 0.0205 for <i>Enron</i>, 0.7391 and 0.0495 for <i>Scene</i>, and 0.6452 and 0.0766 for <i>Yeast</i> datasets. In particular, according to empirical data on a popular six-label benchmark multi-label datasets, the proposed method obtains competitive performance when labels are constrained.</p>\n </div>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.13803","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-label learning is used in situations when each instance has many labels. Due to the high-dimensional feature space and noise in multi-label datasets, multi-label learning algorithms face substantial problems. Researchers have researched multi-label FS techniques to minimise data dimensionality in multi-label classification (MLC) problems. Global optimization approaches, such as evolutionary algorithm (EA) optimizers, scale well to high-dimensional problems. This paper proposes a hybrid multi-objective FS approach based on the charged system search (CSS) and grey wolf optimization (GWO) methods for the MLC problem. The first objective is to minimise the hamming loss (HLoss) value, and the second objective is to minimise the features from the feature set. A novel concept feature zone based on informative and non-informative features has been added here. Here, we have added the Preference Ranking Organisation METHod for Enrichment of Evaluations (PROMETHEE) approach to the objective function in the FS approach. Here, we have added the new velocity equation for the updated charge particles in the CSS algorithm. The GWO property has been added to the new velocity equation to improve the exploration and exploiting property in the CSS algorithm. For experimental verification, we have utilised six publically accessible multi-label datasets: CAL500, Emotions, Medical, Enron, Scene, and the Yeast. The findings show that the proposed approach gets the best value regarding various performance metrics. The proposed method achieves optimal Jaccard Score (JC) and HLoss values of 0.4408 and 0.0645 for CAL500, 0.8169 and 0.0719 for Emotions, 0.9486 and 0.0019 for Medical, 0.5950 and 0.0205 for Enron, 0.7391 and 0.0495 for Scene, and 0.6452 and 0.0766 for Yeast datasets. In particular, according to empirical data on a popular six-label benchmark multi-label datasets, the proposed method obtains competitive performance when labels are constrained.
期刊介绍:
Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper.
As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.