{"title":"The cross-interval reconstruction and heuristic calculation to deal with the continuous-valued attribute in the learning process","authors":"Wei Zhou , Wenqiang Zhu , Jin Chen , Zeshui Xu","doi":"10.1016/j.asoc.2025.112897","DOIUrl":null,"url":null,"abstract":"<div><div>Discretization is a widely used technique for organizing and simplifying continuous-valued data in classification problems, facilitating subsequent analytical applications. However, the lack of an effective approach for addressing soft boundary classification can complicate the interpretation and accuracy of results. This paper focuses on discretization methods within soft boundary classification environments, aiming to improve classification outcomes and enhance interpretability for specific applications. Building on rule-based discretization, we propose a novel approach to enhance the handling of soft boundary classification. A detailed illustrative example is provided to demonstrate the effectiveness of the proposed Cross-Interval Recursion (CIR) method and Heuristic Cross-Interval Recursion (HCIR) algorithm. Our results show that the CIR-based rule discretization method and its evaluation mechanism effectively mitigate noise interference from class boundary points, improving interpretability and promoting greater generalization in soft boundary classification. The performance of our algorithm outperforms existing methods, including EqQua-CIR, EqVal-CIR, and other rule-based discretization techniques, particularly in terms of classification accuracy when dealing with boundary points at high granularity. When compared to classic classifiers and other rule-based discretization approaches, our method demonstrates that rule-based classifiers are more effective than direct approaches in handling soft boundary issues. Furthermore, the alignment between the classifier and the sample data plays a critical role in determining classification performance. Our approach offers significant potential as a breakthrough in addressing soft boundary classification of continuous-valued attributes, leveraging interval reconstruction, and enhancing classification robustness.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"172 ","pages":"Article 112897"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462500208X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Discretization is a widely used technique for organizing and simplifying continuous-valued data in classification problems, facilitating subsequent analytical applications. However, the lack of an effective approach for addressing soft boundary classification can complicate the interpretation and accuracy of results. This paper focuses on discretization methods within soft boundary classification environments, aiming to improve classification outcomes and enhance interpretability for specific applications. Building on rule-based discretization, we propose a novel approach to enhance the handling of soft boundary classification. A detailed illustrative example is provided to demonstrate the effectiveness of the proposed Cross-Interval Recursion (CIR) method and Heuristic Cross-Interval Recursion (HCIR) algorithm. Our results show that the CIR-based rule discretization method and its evaluation mechanism effectively mitigate noise interference from class boundary points, improving interpretability and promoting greater generalization in soft boundary classification. The performance of our algorithm outperforms existing methods, including EqQua-CIR, EqVal-CIR, and other rule-based discretization techniques, particularly in terms of classification accuracy when dealing with boundary points at high granularity. When compared to classic classifiers and other rule-based discretization approaches, our method demonstrates that rule-based classifiers are more effective than direct approaches in handling soft boundary issues. Furthermore, the alignment between the classifier and the sample data plays a critical role in determining classification performance. Our approach offers significant potential as a breakthrough in addressing soft boundary classification of continuous-valued attributes, leveraging interval reconstruction, and enhancing classification robustness.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.