首页 > 最新文献

International Journal of Approximate Reasoning最新文献

英文 中文
Neural approaches to SAT solving: Design choices and interpretability 解决SAT的神经方法:设计选择和可解释性
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-05 DOI: 10.1016/j.ijar.2025.109609
David Mojžíšek , Jan Hůla , Ziwei Li , Ziyu Zhou , Mikoláš Janota
In this contribution, we provide a comprehensive evaluation of graph neural networks applied to Boolean satisfiability problems, accompanied by an intuitive explanation of the mechanisms enabling the model to generalize to different instances. We introduce several training improvements, particularly a novel closest assignment supervision method that dynamically adapts to the model’s current state, significantly enhancing performance on problems with larger solution spaces. Our experiments demonstrate the suitability of variable-clause graph representations with recurrent neural network updates, which achieve good accuracy on SAT assignment prediction while reducing computational demands. We extend the base graph neural network into a diffusion model that facilitates incremental sampling and can be effectively combined with classical techniques like unit propagation. Through analysis of embedding space patterns and optimization trajectories, we show how these networks implicitly perform a process very similar to continuous relaxations of MaxSAT, offering an interpretable view of their reasoning process. This understanding guides our design choices and explains the ability of recurrent architectures to scale effectively at inference time beyond their training distribution, which we demonstrate with test-time scaling experiments.
在这篇文章中,我们提供了应用于布尔可满足性问题的图神经网络的综合评估,并伴随着对使模型能够推广到不同实例的机制的直观解释。我们引入了一些训练改进,特别是一种新的最接近分配监督方法,它可以动态地适应模型的当前状态,显著提高了在具有较大解空间的问题上的性能。我们的实验证明了变量子句图表示与递归神经网络更新的适用性,在减少计算需求的同时,在SAT分配预测上取得了良好的准确性。我们将基图神经网络扩展为一个扩散模型,该模型便于增量采样,并且可以有效地与单元传播等经典技术相结合。通过对嵌入空间模式和优化轨迹的分析,我们展示了这些网络如何隐式地执行与MaxSAT连续松弛非常相似的过程,并提供了其推理过程的可解释视图。这种理解指导了我们的设计选择,并解释了循环架构在超出其训练分布的推理时间有效扩展的能力,我们通过测试时间扩展实验证明了这一点。
{"title":"Neural approaches to SAT solving: Design choices and interpretability","authors":"David Mojžíšek ,&nbsp;Jan Hůla ,&nbsp;Ziwei Li ,&nbsp;Ziyu Zhou ,&nbsp;Mikoláš Janota","doi":"10.1016/j.ijar.2025.109609","DOIUrl":"10.1016/j.ijar.2025.109609","url":null,"abstract":"<div><div>In this contribution, we provide a comprehensive evaluation of graph neural networks applied to Boolean satisfiability problems, accompanied by an intuitive explanation of the mechanisms enabling the model to generalize to different instances. We introduce several training improvements, particularly a novel closest assignment supervision method that dynamically adapts to the model’s current state, significantly enhancing performance on problems with larger solution spaces. Our experiments demonstrate the suitability of variable-clause graph representations with recurrent neural network updates, which achieve good accuracy on SAT assignment prediction while reducing computational demands. We extend the base graph neural network into a diffusion model that facilitates incremental sampling and can be effectively combined with classical techniques like unit propagation. Through analysis of embedding space patterns and optimization trajectories, we show how these networks implicitly perform a process very similar to continuous relaxations of MaxSAT, offering an interpretable view of their reasoning process. This understanding guides our design choices and explains the ability of recurrent architectures to scale effectively at inference time beyond their training distribution, which we demonstrate with test-time scaling experiments.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"190 ","pages":"Article 109609"},"PeriodicalIF":3.0,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145735354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sweep transform on imaginary matrices and its application to parameter estimation with Gaussian belief functions 虚矩阵的扫描变换及其在高斯信念函数参数估计中的应用
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-29 DOI: 10.1016/j.ijar.2025.109607
Liping Liu
Gaussian belief functions admit matrix representations, with their combination reduced to matrix summation under Dempster’ s rule. However, the incorporation of deterministic and logic knowledge under ignorance introduces division-by-zero issues, often requiring symbolic calculus—a method that becomes intractable for large matrices. This paper proposes the sweep transform on imaginary matricesas a solution to this challenge and demonstrates its utility for parameter estimation by combining data represented as Gaussian belief functions. It advances the theory of sweep transforms by showing that, when extended to imaginary numbers, a sweep transform with a symmetric matrix as the pivot can be decomposed into a sequence of sweeps using the matrix’ s leading diagonal elements as pivots. Notably, such reducibility does not hold when restricted to real numbers. The result gives rises to a novel approach to inverting structured symmetric matrices, which may not be positive definite, without requiring permutations even in the presence of zero pivots.
高斯信念函数允许矩阵表示,它们的组合在Dempster规则下简化为矩阵和。然而,在无知的情况下,确定性和逻辑知识的结合引入了除以零的问题,通常需要符号演算——一种对于大型矩阵变得难以处理的方法。本文提出了对虚矩阵的扫描变换作为解决这一挑战的方法,并通过将表示为高斯信念函数的数据组合在一起,证明了它在参数估计中的实用性。通过证明当扩展到虚数时,以对称矩阵为枢轴的扫描变换可以分解为以矩阵的前导对角元素为枢轴的扫描序列,提出了扫描变换的理论。值得注意的是,这种可约性在实数上并不成立。结果提出了一种新的方法来反演结构对称矩阵,它可能不是正定的,即使在零轴存在的情况下也不需要置换。
{"title":"Sweep transform on imaginary matrices and its application to parameter estimation with Gaussian belief functions","authors":"Liping Liu","doi":"10.1016/j.ijar.2025.109607","DOIUrl":"10.1016/j.ijar.2025.109607","url":null,"abstract":"<div><div>Gaussian belief functions admit matrix representations, with their combination reduced to matrix summation under Dempster’ s rule. However, the incorporation of deterministic and logic knowledge under ignorance introduces division-by-zero issues, often requiring symbolic calculus—a method that becomes intractable for large matrices. This paper proposes <em>the sweep transform on imaginary matrices</em>as a solution to this challenge and demonstrates its utility for parameter estimation by combining data represented as Gaussian belief functions. It advances the theory of sweep transforms by showing that, when extended to imaginary numbers, a sweep transform with a symmetric matrix as the pivot can be decomposed into a sequence of sweeps using the matrix’ s leading diagonal elements as pivots. Notably, such reducibility does not hold when restricted to real numbers. The result gives rises to a novel approach to inverting structured symmetric matrices, which may not be positive definite, without requiring permutations even in the presence of zero pivots.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"190 ","pages":"Article 109607"},"PeriodicalIF":3.0,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145683831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy rules with quantifiers as weights 以量词作为权重的模糊规则
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-28 DOI: 10.1016/j.ijar.2025.109600
Martina Daňková , Dana Hliněná
In this paper, we explore the use of General Unary Hypotheses Automaton quantifiers and provide representations for their specific subclasses. Furthermore, we focus explicitly on implicational quantifiers for analyzing specific relational dependencies. We discuss their suitability in fuzzy modeling and demonstrate their integration with appropriate fuzzy rules to create a new class of weighted fuzzy rules. This study contributes to the advancement of fuzzy modeling and offers a framework for further research and practical applications.
在本文中,我们探讨了一般一元假设自动机量词的使用,并为它们的特定子类提供了表示。此外,我们明确地关注用于分析特定关系依赖的隐含量词。讨论了它们在模糊建模中的适用性,并证明了它们与适当的模糊规则的集成,从而创建了一类新的加权模糊规则。本研究有助于模糊建模的发展,并为进一步的研究和实际应用提供框架。
{"title":"Fuzzy rules with quantifiers as weights","authors":"Martina Daňková ,&nbsp;Dana Hliněná","doi":"10.1016/j.ijar.2025.109600","DOIUrl":"10.1016/j.ijar.2025.109600","url":null,"abstract":"<div><div>In this paper, we explore the use of General Unary Hypotheses Automaton quantifiers and provide representations for their specific subclasses. Furthermore, we focus explicitly on implicational quantifiers for analyzing specific relational dependencies. We discuss their suitability in fuzzy modeling and demonstrate their integration with appropriate fuzzy rules to create a new class of weighted fuzzy rules. This study contributes to the advancement of fuzzy modeling and offers a framework for further research and practical applications.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"190 ","pages":"Article 109600"},"PeriodicalIF":3.0,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145665626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Connecting classical finite exchangeability to quantum theory and indistinguishability 将经典的有限互换性与量子理论和不可区分性联系起来
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1016/j.ijar.2025.109604
Alessio Benavoli , Alessandro Facchini , Marco Zaffalon
Exchangeability is a fundamental concept in probability theory and statistics. It allows to model situations where the order of observations does not matter. The classical de Finetti’s theorem provides a representation of infinitely exchangeable sequences of random variables as mixtures of independent and identically distributed variables. The quantum de Finetti theorem extends this result to symmetric quantum states on tensor product Hilbert spaces. It is well known that both theorems do not hold for finitely exchangeable sequences. The aim of this work is to investigate two lesser-known representation theorems, which were developed in classical probability theory to extend de Finetti’s theorem to finitely exchangeable sequences by using quasi-probabilities and quasi-expectations. With the aid of these theorems, we illustrate how a de Finetti-like representation theorem for finitely exchangeable sequences can be formulated through a mathematical representation which is formally equivalent to quantum theory (with boson-symmetric density matrices). We then show a promising application of this connection to the challenge of defining entanglement for indistinguishable bosons.
互换性是概率论和统计学中的一个基本概念。它允许对观察顺序无关紧要的情况进行建模。经典的de Finetti定理将随机变量的无限交换序列表示为独立的和同分布的变量的混合物。量子德菲内蒂定理将这一结果推广到张量积希尔伯特空间上的对称量子态。众所周知,对于有限可交换序列,这两个定理并不成立。本文的目的是研究两个不太为人所知的表示定理,它们是在经典概率论中发展起来的,通过准概率和准期望将de Finetti定理扩展到有限交换序列。借助这些定理,我们说明了有限可交换序列的类德菲内蒂表示定理是如何通过形式上等价于量子理论(具有玻色子对称密度矩阵)的数学表示来表述的。然后,我们展示了这种联系在定义不可区分玻色子纠缠的挑战中的一个有希望的应用。
{"title":"Connecting classical finite exchangeability to quantum theory and indistinguishability","authors":"Alessio Benavoli ,&nbsp;Alessandro Facchini ,&nbsp;Marco Zaffalon","doi":"10.1016/j.ijar.2025.109604","DOIUrl":"10.1016/j.ijar.2025.109604","url":null,"abstract":"<div><div>Exchangeability is a fundamental concept in probability theory and statistics. It allows to model situations where the order of observations does not matter. The classical de Finetti’s theorem provides a representation of infinitely exchangeable sequences of random variables as mixtures of independent and identically distributed variables. The quantum de Finetti theorem extends this result to symmetric quantum states on tensor product Hilbert spaces. It is well known that both theorems do not hold for finitely exchangeable sequences. The aim of this work is to investigate two lesser-known representation theorems, which were developed in classical probability theory to extend de Finetti’s theorem to finitely exchangeable sequences by using quasi-probabilities and quasi-expectations. With the aid of these theorems, we illustrate how a de Finetti-like representation theorem for finitely exchangeable sequences can be formulated through a mathematical representation which is formally equivalent to quantum theory (with boson-symmetric density matrices). We then show a promising application of this connection to the challenge of defining entanglement for indistinguishable bosons.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109604"},"PeriodicalIF":3.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature enhancement-based network for few-shot image classification 基于特征增强的少拍图像分类网络
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1016/j.ijar.2025.109605
Junbo Zhao , Lixin Han , Hong Yan
Few-shot image classification tackles recognizing new classes with limited training data. Despite the progress made by researchers in recent years, there are still challenges in dealing with data scarcity and improving model robustness. This paper presents FEICNet, a framework combining feature enhancement and adaptive weight control. Key innovations include: (1) MiniUNet (MUNet) — a compact network that enhances spatial and channel features through layered processing, and (2)Distribution of Feature Weights (DFW) – an adaptive weighting system that boosts critical features while filtering irrelevant patterns. Compatible with existing models without structural changes, the framework achieves strong performance across five benchmarks (MiniImageNet, tieredImageNet, StandfordDogs, CUB, StanfordCars). On StanfordCars, FEICNet outperforms Conv-64F methods by 13.4 % (1-shot) and 13.5 % (5-shot), surpassing ResNet-12 models by 9.7 % in 5-shot tests. Notably, when integrated as an embedding module, FEICNet elevates ProtoNets and Relation Networks by 19.6 %–28.8 % on StanfordDogs, demonstrating its effective plug-and-play capability. The framework exhibits consistent convergence across episodic training tasks, further evidencing its robustness. These advancements establish FEICNet as a significant contribution to few-shot image classification, particularly in resource-constrained environments and coarse- and fine-grained recognition scenarios. Overall, FEICNet provides a unified framework that bridges feature enhancement with adaptive weighting, offering new insights into few-shot image representation learning.
少拍图像分类解决了用有限的训练数据识别新类别的问题。尽管近年来研究人员取得了一些进展,但在处理数据稀缺性和提高模型鲁棒性方面仍然存在挑战。本文提出了一种结合特征增强和自适应权值控制的FEICNet框架。关键的创新包括:(1)MiniUNet (MUNet)——一个紧凑的网络,通过分层处理增强空间和通道特征;(2)特征权重分布(DFW)——一个自适应加权系统,在过滤无关模式的同时增强关键特征。该框架在不改变结构的情况下与现有模型兼容,在五个基准测试(MiniImageNet, tieredImageNet, StandfordDogs, CUB, StanfordCars)中实现了强大的性能。在StanfordCars上,FEICNet的性能比con - 64f高出13.4%(1次射击)和13.5%(5次射击),在5次射击测试中比ResNet-12模型高出9.7%。值得注意的是,当作为嵌入模块集成时,FEICNet将ProtoNets和Relation Networks在StanfordDogs上提升了19.6% - 28.8%,证明了其有效的即插即用能力。该框架在情景训练任务中表现出一致的收敛性,进一步证明了其鲁棒性。这些进步使FEICNet成为对少量图像分类的重要贡献,特别是在资源受限环境和粗粒度和细粒度识别场景中。总的来说,FEICNet提供了一个统一的框架,将特征增强与自适应加权连接起来,为少量图像表示学习提供了新的见解。
{"title":"Feature enhancement-based network for few-shot image classification","authors":"Junbo Zhao ,&nbsp;Lixin Han ,&nbsp;Hong Yan","doi":"10.1016/j.ijar.2025.109605","DOIUrl":"10.1016/j.ijar.2025.109605","url":null,"abstract":"<div><div>Few-shot image classification tackles recognizing new classes with limited training data. Despite the progress made by researchers in recent years, there are still challenges in dealing with data scarcity and improving model robustness. This paper presents FEICNet, a framework combining feature enhancement and adaptive weight control. Key innovations include: (1) MiniUNet (MUNet) — a compact network that enhances spatial and channel features through layered processing, and (2)Distribution of Feature Weights (DFW) – an adaptive weighting system that boosts critical features while filtering irrelevant patterns. Compatible with existing models without structural changes, the framework achieves strong performance across five benchmarks (MiniImageNet, tieredImageNet, StandfordDogs, CUB, StanfordCars). On StanfordCars, FEICNet outperforms Conv-64F methods by 13.4 % (1-shot) and 13.5 % (5-shot), surpassing ResNet-12 models by 9.7 % in 5-shot tests. Notably, when integrated as an embedding module, FEICNet elevates ProtoNets and Relation Networks by 19.6 %–28.8 % on StanfordDogs, demonstrating its effective plug-and-play capability. The framework exhibits consistent convergence across episodic training tasks, further evidencing its robustness. These advancements establish FEICNet as a significant contribution to few-shot image classification, particularly in resource-constrained environments and coarse- and fine-grained recognition scenarios. Overall, FEICNet provides a unified framework that bridges feature enhancement with adaptive weighting, offering new insights into few-shot image representation learning.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109605"},"PeriodicalIF":3.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145621174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How artificial intelligence leads to knowledge why: An inquiry inspired by Aristotle’s Posterior Analytics 人工智能如何导致知识为什么:受亚里士多德后验分析启发的探究
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1016/j.ijar.2025.109603
Guus Eelink , Kilian Rückschloß , Felix Weitkämper
Bayesian networks and causal models provide frameworks for reasoning about external interventions, enabling tasks that go beyond what probability distributions alone can support. Although these formalisms are often informally described as encoding causal knowledge, there is a lack of a formal theory that characterizes the kind of knowledge required to predict the effects of such interventions. This work introduces the theoretical framework of causal systems to implement Aristotle’s distinction between knowledge-that and knowledge-why within the setting of artificial intelligence. By interpreting existing AI technologies as causal systems, it examines the corresponding forms of knowledge they embody. Finally, it argues that predicting the effects of external interventions is possible only with knowledge-why, offering a more precise account of the assumptions underlying this capacity.
贝叶斯网络和因果模型为外部干预提供了推理框架,使任务超越了概率分布所能支持的范围。虽然这些形式主义经常被非正式地描述为编码因果知识,但缺乏一种正式的理论来描述预测此类干预措施的效果所需的知识。这项工作介绍了因果系统的理论框架,以实现亚里士多德在人工智能背景下对知识-那和知识-为什么的区分。通过将现有的人工智能技术解释为因果系统,它检查了它们所体现的相应形式的知识。最后,它认为预测外部干预的影响只有在“为什么”的知识下才有可能,并对这种能力背后的假设提供了更精确的解释。
{"title":"How artificial intelligence leads to knowledge why: An inquiry inspired by Aristotle’s Posterior Analytics","authors":"Guus Eelink ,&nbsp;Kilian Rückschloß ,&nbsp;Felix Weitkämper","doi":"10.1016/j.ijar.2025.109603","DOIUrl":"10.1016/j.ijar.2025.109603","url":null,"abstract":"<div><div>Bayesian networks and causal models provide frameworks for reasoning about external interventions, enabling tasks that go beyond what probability distributions alone can support. Although these formalisms are often informally described as encoding causal knowledge, there is a lack of a formal theory that characterizes the kind of knowledge required to predict the effects of such interventions. This work introduces the theoretical framework of <em>causal systems</em> to implement Aristotle’s distinction between knowledge-<em>that</em> and knowledge-<em>why</em> within the setting of artificial intelligence. By interpreting existing AI technologies as causal systems, it examines the corresponding forms of knowledge they embody. Finally, it argues that predicting the effects of external interventions is possible only with knowledge-<em>why</em>, offering a more precise account of the assumptions underlying this capacity.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"190 ","pages":"Article 109603"},"PeriodicalIF":3.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145788263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangled representations for continuous treatment effect estimation 连续处理效果估计的解纠缠表示
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-16 DOI: 10.1016/j.ijar.2025.109601
Ruijing Cui, Jianbin Sun, Bingyu He, Kewei Yang, Bingfeng Ge
We focus on estimating the causal effects of continuous treatments, also known as the dose-response function. Current methods typically learn a treatment-agnostic representation for all covariates, without distinguishing between instrumental, confounding, and adjustment variables among the covariates. Although some researchers disentangle covariates to estimate treatment effects, these methods are limited to the binary treatment setting and fail to obtain independent disentangled factors. So, learning the underlying disentangled factors precisely remains an open problem. In this paper, we incorporate the disentangled representation into the setting of continuous treatment and propose a novel model for dose-response curve estimation. Mutual information estimators and Integral Probability Metric distances effectively ensure the independence for disentangled factors. Extensive results on synthetic and semi-synthetic datasets demonstrate that our model outperforms current state-of-the-art methods.
我们的重点是估计连续治疗的因果效应,也被称为剂量-反应函数。目前的方法通常学习所有协变量的治疗不可知论表示,而不区分协变量中的工具变量、混杂变量和调整变量。尽管一些研究者通过解纠缠协变量来估计治疗效果,但这些方法仅限于二元治疗设置,无法获得独立的解纠缠因子。因此,准确地了解潜在的解开因素仍然是一个悬而未决的问题。在本文中,我们将解纠缠表示纳入到连续处理的设定中,并提出了一个新的剂量-响应曲线估计模型。互信息估计和积分概率度量距离有效地保证了解纠缠因子的独立性。在合成和半合成数据集上的广泛结果表明,我们的模型优于当前最先进的方法。
{"title":"Disentangled representations for continuous treatment effect estimation","authors":"Ruijing Cui,&nbsp;Jianbin Sun,&nbsp;Bingyu He,&nbsp;Kewei Yang,&nbsp;Bingfeng Ge","doi":"10.1016/j.ijar.2025.109601","DOIUrl":"10.1016/j.ijar.2025.109601","url":null,"abstract":"<div><div>We focus on estimating the causal effects of continuous treatments, also known as the dose-response function. Current methods typically learn a treatment-agnostic representation for all covariates, without distinguishing between instrumental, confounding, and adjustment variables among the covariates. Although some researchers disentangle covariates to estimate treatment effects, these methods are limited to the binary treatment setting and fail to obtain independent disentangled factors. So, learning the underlying disentangled factors precisely remains an open problem. In this paper, we incorporate the disentangled representation into the setting of continuous treatment and propose a novel model for dose-response curve estimation. Mutual information estimators and Integral Probability Metric distances effectively ensure the independence for disentangled factors. Extensive results on synthetic and semi-synthetic datasets demonstrate that our model outperforms current state-of-the-art methods.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109601"},"PeriodicalIF":3.0,"publicationDate":"2025-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145621226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning for quantifier selection in cvc5 cvc5中量词选择的机器学习
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-15 DOI: 10.1016/j.ijar.2025.109602
Jan Jakubův , Mikoláš Janota , Jelle Piepenbrock , Josef Urban
In this work we considerably improve the real-time performance of state-of-the-art SMT solving on first-order quantified problems by efficient machine learning guidance of quantifier selection. Quantifiers represent a significant challenge for SMT and are technically a source of undecidability. In our approach, we train an efficient machine learning model that informs the solver which quantifiers should be instantiated and which not. Each quantifier may be instantiated multiple times and the set of the currently active quantifiers changes as the solving progresses. Therefore, we invoke the ML predictor many times, during the whole run of the solver. To make this efficient, we use fast ML models based on gradient boosted decision trees. We integrate our approach into the state-of-the-art cvc5 SMT solver and show a considerable increase of the system’s holdout-set performance after training it on large sets of first-order problems. The method is tested in several ways, using both single-strategy and portfolio approaches. The evaluation is done on two large formal verification corpora: first-order problems created from the Mizar Mathematical Library, and first-order problems created from the HOL4 standard library.
在这项工作中,我们通过有效的机器学习指导量词选择,大大提高了最先进的SMT解决一阶量化问题的实时性。量词代表了SMT的一个重大挑战,并且在技术上是不可确定的来源。在我们的方法中,我们训练了一个有效的机器学习模型,该模型通知求解器哪些量词应该实例化,哪些不应该实例化。每个量词可以被实例化多次,并且当前活动量词的集合随着求解的进行而变化。因此,我们在求解器的整个运行过程中多次调用ML预测器。为了提高效率,我们使用基于梯度增强决策树的快速ML模型。我们将我们的方法集成到最先进的cvc5 SMT求解器中,并在对大的一阶问题集进行训练后,显示出系统的hold - out-set性能的显著提高。该方法经过了几种方法的测试,包括单一策略和投资组合方法。评估是在两个大型形式化验证语料库上完成的:从Mizar数学库创建的一阶问题,以及从HOL4标准库创建的一阶问题。
{"title":"Machine learning for quantifier selection in cvc5","authors":"Jan Jakubův ,&nbsp;Mikoláš Janota ,&nbsp;Jelle Piepenbrock ,&nbsp;Josef Urban","doi":"10.1016/j.ijar.2025.109602","DOIUrl":"10.1016/j.ijar.2025.109602","url":null,"abstract":"<div><div>In this work we considerably improve the real-time performance of state-of-the-art SMT solving on first-order quantified problems by efficient machine learning guidance of quantifier selection. Quantifiers represent a significant challenge for SMT and are technically a source of undecidability. In our approach, we train an efficient machine learning model that informs the solver which quantifiers should be instantiated and which not. Each quantifier may be instantiated multiple times and the set of the currently active quantifiers changes as the solving progresses. Therefore, we invoke the ML predictor many times, during the whole run of the solver. To make this efficient, we use fast ML models based on gradient boosted decision trees. We integrate our approach into the state-of-the-art cvc5 SMT solver and show a considerable increase of the system’s holdout-set performance after training it on large sets of first-order problems. The method is tested in several ways, using both single-strategy and portfolio approaches. The evaluation is done on two large formal verification corpora: first-order problems created from the Mizar Mathematical Library, and first-order problems created from the HOL4 standard library.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109602"},"PeriodicalIF":3.0,"publicationDate":"2025-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145621175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-granularity Knowledge Fusion for Feature Selection Using Granular-ball Entropy Uncertainty Measures 基于颗粒球熵不确定性测度的特征选择多粒度知识融合
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1016/j.ijar.2025.109590
Kehua Yuan , Yuji Bai , Duoqian Miao , Weiping Ding , Yiyu Yao , Hongyun Zhang , Witold Pedrycz
Multi-granularity computing for knowledge discovery has emerged as a remarkable paradigm in data mining and machine learning. As a representative method, granular-ball computing has attracted considerable attention due to its efficiency and adaptability in handling complex data distributions. However, most existing granularity-based approaches focus on intra-granular mutual information while neglecting the heterogeneity and overlapping phenomena across granularities. This limitation often leads to imprecise knowledge space construction and inaccurate uncertainty estimation in feature evaluation. To overcome this problem, this study proposes a novel and high-efficiency multi-granularity knowledge fusion framework for feature selection, incorporating an enhanced granular-ball generation mechanism and a newly designed granular-ball entropy (GB-E) uncertainty measure. Specifically, we first develop an enhanced granular-ball generation mechanism to construct multi-granularity knowledge space by incorporating class distribution information, thus achieving more accurate and flexible data partitioning. Subsequently, by jointly analyzing the separation and aggregation among granular balls, a novel granular-ball entropy is proposed to quantify uncertainty in the multi-granularity knowledge space. Compared with existing uncertainty measure methods, it provides a dual-perspective uncertainty characterization and effectively improves the accuracy of granularity information fusion. Furthermore, two feature significance measures based on the proposed GB-E measure are introduced for feature evaluation, and then a corresponding feature selection method is developed. Extensive experiments on multiple public datasets demonstrate the proposed method’s superior classification performance compared with several state-of-the-art approaches.
面向知识发现的多粒度计算已经成为数据挖掘和机器学习领域的一个重要范例。作为一种代表性的方法,颗粒球计算因其处理复杂数据分布的效率和适应性而受到广泛关注。然而,现有的基于粒度的方法大多侧重于粒度内的互信息,而忽略了粒度间的异质性和重叠现象。这种局限性往往导致特征评价中知识空间的构建不精确,不确定性估计不准确。为了克服这一问题,本研究提出了一种新的、高效的多粒度知识融合框架用于特征选择,该框架结合了增强的颗粒球生成机制和新设计的颗粒球熵(GB-E)不确定性测度。具体而言,我们首先开发了一种增强的颗粒球生成机制,通过结合类分布信息构建多粒度知识空间,从而实现更准确、更灵活的数据分区。随后,通过对颗粒球之间的分离和聚集进行分析,提出了一种新的颗粒球熵来量化多粒度知识空间中的不确定性。与现有的不确定性度量方法相比,该方法提供了双视角的不确定性表征,有效提高了粒度信息融合的精度。在此基础上,引入了基于GB-E测度的两种特征显著性测度进行特征评价,并提出了相应的特征选择方法。在多个公共数据集上进行的大量实验表明,与几种最先进的方法相比,该方法具有优越的分类性能。
{"title":"Multi-granularity Knowledge Fusion for Feature Selection Using Granular-ball Entropy Uncertainty Measures","authors":"Kehua Yuan ,&nbsp;Yuji Bai ,&nbsp;Duoqian Miao ,&nbsp;Weiping Ding ,&nbsp;Yiyu Yao ,&nbsp;Hongyun Zhang ,&nbsp;Witold Pedrycz","doi":"10.1016/j.ijar.2025.109590","DOIUrl":"10.1016/j.ijar.2025.109590","url":null,"abstract":"<div><div>Multi-granularity computing for knowledge discovery has emerged as a remarkable paradigm in data mining and machine learning. As a representative method, granular-ball computing has attracted considerable attention due to its efficiency and adaptability in handling complex data distributions. However, most existing granularity-based approaches focus on intra-granular mutual information while neglecting the heterogeneity and overlapping phenomena across granularities. This limitation often leads to imprecise knowledge space construction and inaccurate uncertainty estimation in feature evaluation. To overcome this problem, this study proposes a novel and high-efficiency multi-granularity knowledge fusion framework for feature selection, incorporating an enhanced granular-ball generation mechanism and a newly designed granular-ball entropy (GB-E) uncertainty measure. Specifically, we first develop an enhanced granular-ball generation mechanism to construct multi-granularity knowledge space by incorporating class distribution information, thus achieving more accurate and flexible data partitioning. Subsequently, by jointly analyzing the separation and aggregation among granular balls, a novel granular-ball entropy is proposed to quantify uncertainty in the multi-granularity knowledge space. Compared with existing uncertainty measure methods, it provides a dual-perspective uncertainty characterization and effectively improves the accuracy of granularity information fusion. Furthermore, two feature significance measures based on the proposed GB-E measure are introduced for feature evaluation, and then a corresponding feature selection method is developed. Extensive experiments on multiple public datasets demonstrate the proposed method’s superior classification performance compared with several state-of-the-art approaches.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109590"},"PeriodicalIF":3.0,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145621172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Matrix-based efficient methods to update three-way regions in neighborhood systems under varying attributes 基于矩阵的邻域系统三向区域更新方法
IF 3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-10 DOI: 10.1016/j.ijar.2025.109598
Yingqi Qi , Chengxiang Hu , Xiaoling Huang
Traditional set-based methods for computing three-way regions in neighborhood systems primarily rely on the inclusion relationships between target concepts and neighborhood classes to process continuous numerical data. However, these methods exhibit significant limitations when applied to time-varying neighborhood information systems, as they inherently lack the capability to accommodate dynamically evolving data, effectively. To overcome this challenge, our research presents novel matrix-based incremental methods that leverage previously computed results to enable more efficient updating and maintenance of three-way regions in neighborhood rough sets. Through comprehensive integration and analysis of neighborhood information systems with a focus on varying attributes, we develop matrix-based incremental mechanisms. Building on these mechanisms, we propose two incremental algorithms to effectively handle dynamic numerical data. Experimental results demonstrate the effectiveness and superior efficiency of the proposed methods compared to existing approaches. Specifically, the proposed algorithms exhibit lower computational time and higher speed-up ratio, highlighting their efficiency for updating neighborhood three-way regions.
传统的基于集的邻域系统三向区域计算方法主要依靠目标概念和邻域类之间的包含关系来处理连续数值数据。然而,当应用于时变邻域信息系统时,这些方法表现出明显的局限性,因为它们固有地缺乏有效适应动态发展数据的能力。为了克服这一挑战,我们的研究提出了新的基于矩阵的增量方法,利用先前计算的结果来更有效地更新和维护邻域粗糙集中的三向区域。通过对不同属性邻域信息系统的综合集成和分析,提出了基于矩阵的增量机制。在这些机制的基础上,我们提出了两种增量算法来有效地处理动态数值数据。实验结果表明,与现有方法相比,所提方法具有较高的有效性和效率。具体而言,该算法具有较低的计算时间和较高的加速比,突出了其更新邻域三向区域的效率。
{"title":"Matrix-based efficient methods to update three-way regions in neighborhood systems under varying attributes","authors":"Yingqi Qi ,&nbsp;Chengxiang Hu ,&nbsp;Xiaoling Huang","doi":"10.1016/j.ijar.2025.109598","DOIUrl":"10.1016/j.ijar.2025.109598","url":null,"abstract":"<div><div>Traditional set-based methods for computing three-way regions in neighborhood systems primarily rely on the inclusion relationships between target concepts and neighborhood classes to process continuous numerical data. However, these methods exhibit significant limitations when applied to time-varying neighborhood information systems, as they inherently lack the capability to accommodate dynamically evolving data, effectively. To overcome this challenge, our research presents novel matrix-based incremental methods that leverage previously computed results to enable more efficient updating and maintenance of three-way regions in neighborhood rough sets. Through comprehensive integration and analysis of neighborhood information systems with a focus on varying attributes, we develop matrix-based incremental mechanisms. Building on these mechanisms, we propose two incremental algorithms to effectively handle dynamic numerical data. Experimental results demonstrate the effectiveness and superior efficiency of the proposed methods compared to existing approaches. Specifically, the proposed algorithms exhibit lower computational time and higher speed-up ratio, highlighting their efficiency for updating neighborhood three-way regions.</div></div>","PeriodicalId":13842,"journal":{"name":"International Journal of Approximate Reasoning","volume":"189 ","pages":"Article 109598"},"PeriodicalIF":3.0,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145621173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Approximate Reasoning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1