K. Janani , S.S. Mohanrasu , Ardak Kashkynbayev , R. Rakkiyappan
{"title":"Ensemble feature selection via CoCoSo method extended to interval-valued intuitionistic fuzzy environment","authors":"K. Janani , S.S. Mohanrasu , Ardak Kashkynbayev , R. Rakkiyappan","doi":"10.1016/j.matcom.2024.09.023","DOIUrl":null,"url":null,"abstract":"<div><div>Feature selection is a crucial step in the process of preparing and refining data. By identifying and retaining only the most informative and discriminative features, one can achieve several benefits, including faster training times, reduced risk of overfitting, improved model generalization, and enhanced interpretability. Ensemble feature selection has demonstrated its efficacy in improving the stability and generalization performance of models and is particularly valuable in high-dimensional datasets and complex machine learning tasks, contributing to the creation of more accurate and robust predictive models. This article presents an innovative ensemble feature selection technique through the development of a unique Multi-criteria decision making (MCDM) model, incorporating both rank aggregation principles and a filter-based algorithm. The proposed MCDM model combines the Combined Compromise Solution (CoCoSo) method and the Archimedean operator within interval-valued intuitionistic fuzzy environments, effectively addressing the challenges of vagueness and imprecision in datasets. A customizable feature selection model is introduced, allowing users to define the number of features, employing a sigmoidal function with a tuning parameter for fuzzification. The assignment of entropy weights in the Interval-valued intuitionistic fuzzy set (IVIFS) environment provides priorities to each column. The method’s effectiveness is assessed on real-world datasets, comparing it with existing approaches and validated through statistical tests such as the Friedman test and post-hoc Conover test, emphasizing its significance in comparison to current methodologies. Based on the results obtained, we inferred that our structured approach to ensemble feature selection, utilizing a specific case of the Archimedean operator, demonstrated superior performance across the datasets. This more generalized methodology enhances the robustness and effectiveness of feature selection by leveraging the strengths of the Archimedean operator, resulting in improved data analysis and model accuracy.</div></div>","PeriodicalId":4,"journal":{"name":"ACS Applied Energy Materials","volume":null,"pages":null},"PeriodicalIF":5.4000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Energy Materials","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378475424003781","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Feature selection is a crucial step in the process of preparing and refining data. By identifying and retaining only the most informative and discriminative features, one can achieve several benefits, including faster training times, reduced risk of overfitting, improved model generalization, and enhanced interpretability. Ensemble feature selection has demonstrated its efficacy in improving the stability and generalization performance of models and is particularly valuable in high-dimensional datasets and complex machine learning tasks, contributing to the creation of more accurate and robust predictive models. This article presents an innovative ensemble feature selection technique through the development of a unique Multi-criteria decision making (MCDM) model, incorporating both rank aggregation principles and a filter-based algorithm. The proposed MCDM model combines the Combined Compromise Solution (CoCoSo) method and the Archimedean operator within interval-valued intuitionistic fuzzy environments, effectively addressing the challenges of vagueness and imprecision in datasets. A customizable feature selection model is introduced, allowing users to define the number of features, employing a sigmoidal function with a tuning parameter for fuzzification. The assignment of entropy weights in the Interval-valued intuitionistic fuzzy set (IVIFS) environment provides priorities to each column. The method’s effectiveness is assessed on real-world datasets, comparing it with existing approaches and validated through statistical tests such as the Friedman test and post-hoc Conover test, emphasizing its significance in comparison to current methodologies. Based on the results obtained, we inferred that our structured approach to ensemble feature selection, utilizing a specific case of the Archimedean operator, demonstrated superior performance across the datasets. This more generalized methodology enhances the robustness and effectiveness of feature selection by leveraging the strengths of the Archimedean operator, resulting in improved data analysis and model accuracy.
期刊介绍:
ACS Applied Energy Materials is an interdisciplinary journal publishing original research covering all aspects of materials, engineering, chemistry, physics and biology relevant to energy conversion and storage. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrate knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important energy applications.