首页 > 最新文献

Annals of Mathematics and Artificial Intelligence最新文献

英文 中文
35 years of math and AI
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-10 DOI: 10.1007/s10472-025-09969-7
Martin Charles Golumbic
{"title":"35 years of math and AI","authors":"Martin Charles Golumbic","doi":"10.1007/s10472-025-09969-7","DOIUrl":"10.1007/s10472-025-09969-7","url":null,"abstract":"","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"1 - 3"},"PeriodicalIF":1.2,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The future starts now
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-05 DOI: 10.1007/s10472-025-09970-0
Jürgen Dix, Michael Fisher
{"title":"The future starts now","authors":"Jürgen Dix, Michael Fisher","doi":"10.1007/s10472-025-09970-0","DOIUrl":"10.1007/s10472-025-09970-0","url":null,"abstract":"","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"5 - 6"},"PeriodicalIF":1.2,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10472-025-09970-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guest editorial: Revised selected papers from the LION 16 conference
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-06 DOI: 10.1007/s10472-024-09958-2
Ilias S. Kotsireas, Panos M. Pardalos
{"title":"Guest editorial: Revised selected papers from the LION 16 conference","authors":"Ilias S. Kotsireas, Panos M. Pardalos","doi":"10.1007/s10472-024-09958-2","DOIUrl":"10.1007/s10472-024-09958-2","url":null,"abstract":"","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"93 1","pages":"19 - 20"},"PeriodicalIF":1.2,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Common equivalence and size of forgetting from Horn formulae 霍恩公式中常见的等效和遗忘量
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-29 DOI: 10.1007/s10472-024-09955-5
Paolo Liberatore

Forgetting variables from a propositional formula may increase its size. Introducing new variables is a way to shorten it. Both operations can be expressed in terms of common equivalence, a weakened version of equivalence. In turn, common equivalence can be expressed in terms of forgetting. An algorithm for forgetting and checking common equivalence in polynomial space is given for the Horn case; it is polynomial-time for the subclass of single-head formulae. Minimizing after forgetting is polynomial-time if the formula is also acyclic and variables cannot be introduced, NP-hard when they can.

忘记命题公式中的变量可能会增加它的大小。引入新变量是缩短它的一种方法。这两个操作都可以用公共等价来表示,这是等价的弱化版本。反过来,一般的等价也可以用遗忘来表达。给出了Horn情况下多项式空间中公共等价的遗忘和检验算法;单头公式的子类是多项式时间的。如果公式也是无循环的,并且不能引入变量,则遗忘后最小化是多项式时间,如果可以引入变量,则是np困难。
{"title":"Common equivalence and size of forgetting from Horn formulae","authors":"Paolo Liberatore","doi":"10.1007/s10472-024-09955-5","DOIUrl":"10.1007/s10472-024-09955-5","url":null,"abstract":"<div><p>Forgetting variables from a propositional formula may increase its size. Introducing new variables is a way to shorten it. Both operations can be expressed in terms of common equivalence, a weakened version of equivalence. In turn, common equivalence can be expressed in terms of forgetting. An algorithm for forgetting and checking common equivalence in polynomial space is given for the Horn case; it is polynomial-time for the subclass of single-head formulae. Minimizing after forgetting is polynomial-time if the formula is also acyclic and variables cannot be introduced, NP-hard when they can.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 6","pages":"1545 - 1584"},"PeriodicalIF":1.2,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10472-024-09955-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142870292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-penalised trees (TpT): introducing a new tree-based data mining algorithm for time-varying covariates 时变树(TpT):为时变协变量引入一种新的基于树的数据挖掘算法
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-22 DOI: 10.1007/s10472-024-09950-w
Mathias Valla

This article introduces a new decision tree algorithm that accounts for time-varying covariates in the decision-making process. Traditional decision tree algorithms assume that the covariates are static and do not change over time, which can lead to inaccurate predictions in dynamic environments. Other existing methods suggest workaround solutions such as the pseudo-subject approach, discussed in the article. The proposed algorithm utilises a different structure and a time-penalised splitting criterion that allows a recursive partitioning of both the covariates space and time. Relevant historical trends are then inherently involved in the construction of a tree, and are visible and interpretable once it is fit. This approach allows for innovative and highly interpretable analysis in settings where the covariates are subject to change over time. The effectiveness of the algorithm is demonstrated through a real-world data application in life insurance. The results presented in this article can be seen as an introduction or proof-of-concept of our time-penalised approach, and the algorithm’s theoretical properties and comparison against existing approaches on datasets from various fields, including healthcare, finance, insurance, environmental monitoring, and data mining in general, will be explored in forthcoming work.

本文介绍了一种新的决策树算法,该算法在决策过程中考虑了随时间变化的协变量。传统的决策树算法假定协变量是静态的,不会随时间变化,这可能导致在动态环境中预测不准确。其他现有方法提出了变通的解决方案,如文章中讨论的伪主体方法。所提出的算法采用了不同的结构和时间分隔分割标准,允许对协变因素的空间和时间进行递归分割。这样,相关的历史趋势就会内在地参与到树的构建中,一旦树被拟合,这些趋势就会显现出来并可进行解释。在协变量随时间变化的情况下,这种方法可以进行创新的、可解释性强的分析。该算法的有效性通过人寿保险领域的实际数据应用得到了验证。本文介绍的结果可以看作是我们时间分隔方法的介绍或概念验证,而算法的理论特性以及与现有方法在医疗保健、金融、保险、环境监测和数据挖掘等不同领域数据集上的比较,将在接下来的工作中进行探讨。
{"title":"Time-penalised trees (TpT): introducing a new tree-based data mining algorithm for time-varying covariates","authors":"Mathias Valla","doi":"10.1007/s10472-024-09950-w","DOIUrl":"10.1007/s10472-024-09950-w","url":null,"abstract":"<div><p>This article introduces a new decision tree algorithm that accounts for time-varying covariates in the decision-making process. Traditional decision tree algorithms assume that the covariates are static and do not change over time, which can lead to inaccurate predictions in dynamic environments. Other existing methods suggest workaround solutions such as the pseudo-subject approach, discussed in the article. The proposed algorithm utilises a different structure and a time-penalised splitting criterion that allows a recursive partitioning of both the covariates space and time. Relevant historical trends are then inherently involved in the construction of a tree, and are visible and interpretable once it is fit. This approach allows for innovative and highly interpretable analysis in settings where the covariates are subject to change over time. The effectiveness of the algorithm is demonstrated through a real-world data application in life insurance. The results presented in this article can be seen as an introduction or proof-of-concept of our time-penalised approach, and the algorithm’s theoretical properties and comparison against existing approaches on datasets from various fields, including healthcare, finance, insurance, environmental monitoring, and data mining in general, will be explored in forthcoming work.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 6","pages":"1609 - 1661"},"PeriodicalIF":1.2,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conformal test martingales for hypergraphical models 超图模型的共形检验马氏体
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-03 DOI: 10.1007/s10472-024-09951-9
Ilia Nouretdinov

In this work, we study applications of the Conformal Prediction machine learning framework to the questions of statistical data testing. This technique is also known as Conformal Test Martingales. Earlier works on this topic used it to detect deviations from exchangeability assumptions (such as change points). Here we move to test popular hypergraphical models. We adopt and compare two versions of Conformal Testing Martingales. First: testing the data against exchangeability assumption, but using the elements of hypergraphical model for setting its parameters. Second: combining Conformal Testing Martingale with Hypergraphical On-Line Compression Models. The latter is an extension of the Conformal Prediction technique beyond exchangeability.

We show how these approaches help to accelerate the detection of data deviation from i.i.d. by making use of the knowledge about relations between the features embedded into a hypergraphical model.

在这项工作中,我们研究了共形预测机器学习框架在统计数据测试问题上的应用。这种技术也被称为共形测试马丁格尔。关于这一主题的早期研究将其用于检测可交换性假设的偏差(如变化点)。在这里,我们转而测试流行的超图模型。我们采用并比较了两个版本的马氏拟合检验(Conformal Testing Martingales)。第一种:根据可交换性假设测试数据,但使用超图模型的元素来设置参数。第二种:将共形检验马丁格尔与超图在线压缩模型相结合。我们展示了这些方法如何通过利用嵌入超图模型的特征之间关系的知识,帮助加速检测数据偏离 i.i.d.。
{"title":"Conformal test martingales for hypergraphical models","authors":"Ilia Nouretdinov","doi":"10.1007/s10472-024-09951-9","DOIUrl":"https://doi.org/10.1007/s10472-024-09951-9","url":null,"abstract":"<p>In this work, we study applications of the Conformal Prediction machine learning framework to the questions of statistical data testing. This technique is also known as Conformal Test Martingales. Earlier works on this topic used it to detect deviations from exchangeability assumptions (such as change points). Here we move to test popular hypergraphical models. We adopt and compare two versions of Conformal Testing Martingales. First: testing the data against exchangeability assumption, but using the elements of hypergraphical model for setting its parameters. Second: combining Conformal Testing Martingale with Hypergraphical On-Line Compression Models. The latter is an extension of the Conformal Prediction technique beyond exchangeability.</p><p>We show how these approaches help to accelerate the detection of data deviation from i.i.d. by making use of the knowledge about relations between the features embedded into a hypergraphical model.</p>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"15 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in preference handling: foreword 偏好处理的进展:前言
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-27 DOI: 10.1007/s10472-024-09954-6
Khaled Belahcène, Sébastien Destercke, Christophe Labreuche, Meltem Öztürk, Paolo Viappiani
{"title":"Advances in preference handling: foreword","authors":"Khaled Belahcène,&nbsp;Sébastien Destercke,&nbsp;Christophe Labreuche,&nbsp;Meltem Öztürk,&nbsp;Paolo Viappiani","doi":"10.1007/s10472-024-09954-6","DOIUrl":"10.1007/s10472-024-09954-6","url":null,"abstract":"","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 6","pages":"1377 - 1379"},"PeriodicalIF":1.2,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141797335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Costly information providing in binary contests 在二进制竞赛中提供昂贵的信息
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-27 DOI: 10.1007/s10472-024-09953-7
Noam Simon, Priel Levy, David Sarne

Contests are commonly used as a mechanism for eliciting effort and participation in multi-agent settings. Naturally, and much like with various other mechanisms, the information provided to the agents prior to and throughout the contest fundamentally influences its outcomes. In this paper we study the problem of information providing whenever the contest organizer does not initially hold the information and obtaining it is potentially costly. As the underlying contest mechanism for our model we use the binary contest, where contestants’ strategy is captured by their decision whether or not to participate in the contest in the first place. Here, it is often the case that the contest organizer can proactively obtain and provide contestants information related to their expected performance in the contest. We provide a comprehensive equilibrium analysis of the model, showing that even when such information is costless, it is not necessarily the case that the contest organizer will prefer to obtain and provide it to all agents, let alone when the information is costly.

在多代理环境中,竞赛通常被用作一种激发努力和参与的机制。自然,与其他各种机制一样,在竞赛之前和整个竞赛过程中向代理提供的信息会从根本上影响竞赛结果。在本文中,我们研究的是当竞赛组织者最初并不掌握信息,而获取信息又可能代价高昂时的信息提供问题。作为模型的基础竞赛机制,我们使用二元竞赛,参赛者的策略由他们是否参加竞赛的决定决定。在这种情况下,竞赛组织者往往可以主动获取并向参赛者提供与他们在竞赛中的预期表现相关的信息。我们对模型进行了全面的均衡分析,结果表明,即使这些信息是无成本的,比赛组织者也不一定会倾向于获取并向所有参赛者提供这些信息,更不用说这些信息是有成本的了。
{"title":"Costly information providing in binary contests","authors":"Noam Simon,&nbsp;Priel Levy,&nbsp;David Sarne","doi":"10.1007/s10472-024-09953-7","DOIUrl":"10.1007/s10472-024-09953-7","url":null,"abstract":"<div><p>Contests are commonly used as a mechanism for eliciting effort and participation in multi-agent settings. Naturally, and much like with various other mechanisms, the information provided to the agents prior to and throughout the contest fundamentally influences its outcomes. In this paper we study the problem of information providing whenever the contest organizer does not initially hold the information and obtaining it is potentially costly. As the underlying contest mechanism for our model we use the binary contest, where contestants’ strategy is captured by their decision whether or not to participate in the contest in the first place. Here, it is often the case that the contest organizer can proactively obtain and provide contestants information related to their expected performance in the contest. We provide a comprehensive equilibrium analysis of the model, showing that even when such information is costless, it is not necessarily the case that the contest organizer will prefer to obtain and provide it to all agents, let alone when the information is costly.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 5","pages":"1353 - 1375"},"PeriodicalIF":1.2,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10472-024-09953-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tumato 2.0 - a constraint-based planning approach for safe and robust robot behavior Tumato 2.0--一种基于约束的规划方法,可实现安全、稳健的机器人行为
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-26 DOI: 10.1007/s10472-024-09949-3
Jan Vermaelen, Tom Holvoet

Ensuring the safe and effective operation of autonomous systems is a complex undertaking that inherently relies on underlying decision-making processes. To rigorously analyze these processes, formal verification methods, such as model checking, offer a valuable means. However, the non-deterministic nature of realistic environments makes these approaches challenging and often impractical. This work explores the capabilities of a constraint-based planning approach, Tumato, in generating policies that guide the system to predefined goals while adhering to safety constraints. Constraint-based planning approaches are inherently able to provide guarantees of soundness and completeness. Our primary contribution lies in extending Tumato’s capabilities to accommodate non-deterministic outcomes of actions, enhancing the robustness of the behavior. Originally designed to accommodate only deterministic outcomes, actions can now be modeled to include alternative outcomes to address contingencies explicitly. The adapted solver generates policies that enable reaching the goals in a safe manner, even when such alternative outcomes of actions occur. Additionally, we introduce a purely declarative manner for specifying safety in Tumato to further enhance its expressiveness as well as to reduce the susceptibility to errors during specification. The incorporation of cost or duration values to actions enables the solver to restore safety in the most preferred manner when necessary. Finally, we highlight the overlap of Tumato’s safety-related capabilities with a systems-theoretic approach, STPA (Systems-Theoretic Process Analysis). The aim is to emphasize the ability to avoid unsafe control actions without their explicit identification, contributing to a more comprehensive and holistic understanding of safety.

确保自主系统安全有效地运行是一项复杂的工作,本质上依赖于潜在的决策过程。为了严格分析这些过程,模型检查等形式验证方法提供了宝贵的手段。然而,现实环境的非确定性使得这些方法具有挑战性,而且往往不切实际。这项工作探索了基于约束的规划方法 Tumato 在生成策略方面的能力,该策略可在遵守安全约束的同时引导系统实现预定目标。基于约束的规划方法本质上能够提供合理性和完整性保证。我们的主要贡献在于扩展了 Tumato 的功能,使其能够适应行动的非确定性结果,从而增强了行为的稳健性。图马图最初的设计只考虑确定性结果,现在可以对行动进行建模,使其包括替代性结果,以明确解决突发事件。调整后的求解器生成的策略,即使在行动出现这种替代结果时,也能以安全的方式实现目标。此外,我们还在 Tumato 中引入了一种纯粹的声明式安全指定方式,以进一步增强其表达能力,并降低指定过程中出错的可能性。在行动中加入成本或持续时间值,可使求解器在必要时以最理想的方式恢复安全性。最后,我们强调了 Tumato 的安全相关功能与系统理论方法 STPA(系统理论过程分析)的重叠之处。这样做的目的是强调在没有明确识别不安全控制行为的情况下避免这些行为的能力,从而促进对安全的更全面、更整体的理解。
{"title":"Tumato 2.0 - a constraint-based planning approach for safe and robust robot behavior","authors":"Jan Vermaelen, Tom Holvoet","doi":"10.1007/s10472-024-09949-3","DOIUrl":"https://doi.org/10.1007/s10472-024-09949-3","url":null,"abstract":"<p>Ensuring the safe and effective operation of autonomous systems is a complex undertaking that inherently relies on underlying decision-making processes. To rigorously analyze these processes, formal verification methods, such as model checking, offer a valuable means. However, the non-deterministic nature of realistic environments makes these approaches challenging and often impractical. This work explores the capabilities of a constraint-based planning approach, Tumato, in generating policies that guide the system to predefined goals while adhering to safety constraints. Constraint-based planning approaches are inherently able to provide guarantees of soundness and completeness. Our primary contribution lies in extending Tumato’s capabilities to accommodate non-deterministic outcomes of actions, enhancing the robustness of the behavior. Originally designed to accommodate only deterministic outcomes, actions can now be modeled to include alternative outcomes to address contingencies explicitly. The adapted solver generates policies that enable reaching the goals in a safe manner, even when such alternative outcomes of actions occur. Additionally, we introduce a purely declarative manner for specifying safety in Tumato to further enhance its expressiveness as well as to reduce the susceptibility to errors during specification. The incorporation of cost or duration values to actions enables the solver to restore safety in the most preferred manner when necessary. Finally, we highlight the overlap of Tumato’s safety-related capabilities with a systems-theoretic approach, STPA (Systems-Theoretic Process Analysis). The aim is to emphasize the ability to avoid unsafe control actions without their explicit identification, contributing to a more comprehensive and holistic understanding of safety.</p>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"44 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calibration methods in imbalanced binary classification 不平衡二元分类中的校准方法
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-19 DOI: 10.1007/s10472-024-09952-8
Théo Guilbert, Olivier Caelen, Andrei Chirita, Marco Saerens

The calibration problem in machine learning classification tasks arises when a model’s output score does not align with the ground truth observed probability of the target class. There exist several parametric and non-parametric post-processing methods that can help to calibrate an existing classifier. In this work, we focus on binary classification cases where the dataset is imbalanced, meaning that the negative target class significantly outnumbers the positive one. We propose new parametric calibration methods designed to this specific case and a new calibration measure focusing on the primary objective in imbalanced problems: detecting infrequent positive cases. Experiments on several datasets show that, for imbalanced problems, our approaches outperform state-of-the-art methods in many cases.

在机器学习分类任务中,当模型的输出得分与观察到的目标类别的基本真实概率不一致时,就会出现校准问题。有几种参数和非参数后处理方法可以帮助校准现有分类器。在这项工作中,我们将重点放在数据集不平衡的二元分类情况上,这意味着负目标类明显多于正目标类。我们针对这种特殊情况提出了新的参数校准方法,并针对不平衡问题的主要目标提出了新的校准方法:检测不常见的正向案例。在多个数据集上的实验表明,对于不平衡问题,我们的方法在很多情况下都优于最先进的方法。
{"title":"Calibration methods in imbalanced binary classification","authors":"Théo Guilbert,&nbsp;Olivier Caelen,&nbsp;Andrei Chirita,&nbsp;Marco Saerens","doi":"10.1007/s10472-024-09952-8","DOIUrl":"10.1007/s10472-024-09952-8","url":null,"abstract":"<div><p>The calibration problem in machine learning classification tasks arises when a model’s output score does not align with the ground truth observed probability of the target class. There exist several parametric and non-parametric post-processing methods that can help to calibrate an existing classifier. In this work, we focus on binary classification cases where the dataset is imbalanced, meaning that the negative target class significantly outnumbers the positive one. We propose new parametric calibration methods designed to this specific case and a new calibration measure focusing on the primary objective in imbalanced problems: detecting infrequent positive cases. Experiments on several datasets show that, for imbalanced problems, our approaches outperform state-of-the-art methods in many cases.</p></div>","PeriodicalId":7971,"journal":{"name":"Annals of Mathematics and Artificial Intelligence","volume":"92 5","pages":"1319 - 1352"},"PeriodicalIF":1.2,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of Mathematics and Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1