{"title":"Improved salp swarm optimization algorithm based on a robust search strategy and a novel local search algorithm for feature selection problems","authors":"Mahdieh Khorashadizade, Elham Abbasi, Seyed Abolfazl Shahzadeh Fazeli","doi":"10.1016/j.chemolab.2025.105343","DOIUrl":null,"url":null,"abstract":"<div><div>The enormous challenge in data science and data mining for knowledge extraction is confronting an expansive high number of data dimensions. Because the process of extracting knowledge from data can become more complex and memory-consuming. Not only the presence of all features doesn't help the learning process, but also it can sometimes decrease the model's efficiency. To enhance the model's efficiency and reduce the problem's complexity, various feature selection algorithms are designed and implemented. In this paper, a novel and highly effective algorithm based on the salp swarm optimization algorithm for solving complex problems is proposed for feature selection. In the proposed method, an unexpected event that causes the chain to break apart (such as hitting an obstacle or the death of the chain leader, etc.) is modeled which is not taken into account in the salp swarm optimization algorithm. Also, the exploration capability is improved by modifying the updating the position of the chain leader. Additionally, an innovative local search algorithm has been embedded into the proposed algorithm to enhance its exploitation. The proposed approach is implemented on 14 datasets, and the results are compared by two terms, classification accuracy and number of selected features. Additionally, the effectiveness of the proposed method is tested on 2 widely used chemical datasets. The modifications that are applied to the standard salp swarm algorithm reduce the probability of getting stuck in the local optimum and simultaneously, increase the diversity of solutions. The results show that the proposed algorithm has performed significantly better than other algorithms in solving the feature selection problem.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"258 ","pages":"Article 105343"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743925000280","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The enormous challenge in data science and data mining for knowledge extraction is confronting an expansive high number of data dimensions. Because the process of extracting knowledge from data can become more complex and memory-consuming. Not only the presence of all features doesn't help the learning process, but also it can sometimes decrease the model's efficiency. To enhance the model's efficiency and reduce the problem's complexity, various feature selection algorithms are designed and implemented. In this paper, a novel and highly effective algorithm based on the salp swarm optimization algorithm for solving complex problems is proposed for feature selection. In the proposed method, an unexpected event that causes the chain to break apart (such as hitting an obstacle or the death of the chain leader, etc.) is modeled which is not taken into account in the salp swarm optimization algorithm. Also, the exploration capability is improved by modifying the updating the position of the chain leader. Additionally, an innovative local search algorithm has been embedded into the proposed algorithm to enhance its exploitation. The proposed approach is implemented on 14 datasets, and the results are compared by two terms, classification accuracy and number of selected features. Additionally, the effectiveness of the proposed method is tested on 2 widely used chemical datasets. The modifications that are applied to the standard salp swarm algorithm reduce the probability of getting stuck in the local optimum and simultaneously, increase the diversity of solutions. The results show that the proposed algorithm has performed significantly better than other algorithms in solving the feature selection problem.
期刊介绍:
Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines.
Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data.
The journal deals with the following topics:
1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.)
2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered.
3) Development of new software that provides novel tools or truly advances the use of chemometrical methods.
4) Well characterized data sets to test performance for the new methods and software.
The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.