{"title":"SAMME.C2 algorithm for imbalanced multi-class classification","authors":"Banghee So, Emiliano A. Valdez","doi":"10.1007/s00500-024-09847-0","DOIUrl":null,"url":null,"abstract":"<p>Classification predictive modeling involves the accurate assignment of observations in a dataset to target classes or categories. Real-world classification problems with severely imbalanced class distributions have increased substantially in recent years. In such cases, significantly fewer observations are available for minority classes to learn from than for majority classes. Despite this sparsity, the minority class is often considered as the more interesting class, yet the development of a scientific learning algorithm that is suitable for these observations presents numerous challenges. In this study, we further explore the merits of an effective multi-class classification algorithm known as <span>SAMME.C2</span> that is specialized for handling severely imbalanced classes. This innovative method blends the flexible mechanics of the boosting techniques from the <span>SAMME</span> algorithm, which is a multi-class classifier, and the <span>Ada.C2</span> algorithm, which is a cost-sensitive binary classifier that is designed to address highly imbalanced classes. We establish a scientific and statistical formulation of the <span>SAMME.C2</span> algorithm, together with providing and explaining the resulting procedure. We demonstrate the consistently superior performance of this algorithm through numerical experiments as well as empirical studies.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"51 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09847-0","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Classification predictive modeling involves the accurate assignment of observations in a dataset to target classes or categories. Real-world classification problems with severely imbalanced class distributions have increased substantially in recent years. In such cases, significantly fewer observations are available for minority classes to learn from than for majority classes. Despite this sparsity, the minority class is often considered as the more interesting class, yet the development of a scientific learning algorithm that is suitable for these observations presents numerous challenges. In this study, we further explore the merits of an effective multi-class classification algorithm known as SAMME.C2 that is specialized for handling severely imbalanced classes. This innovative method blends the flexible mechanics of the boosting techniques from the SAMME algorithm, which is a multi-class classifier, and the Ada.C2 algorithm, which is a cost-sensitive binary classifier that is designed to address highly imbalanced classes. We establish a scientific and statistical formulation of the SAMME.C2 algorithm, together with providing and explaining the resulting procedure. We demonstrate the consistently superior performance of this algorithm through numerical experiments as well as empirical studies.
期刊介绍:
Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems.
Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.