首页 > 最新文献

Artificial Intelligence最新文献

英文 中文
Probabilistically robust counterfactual explanations under model changes 模型变化下的概率稳健反事实解释
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-08 DOI: 10.1016/j.artint.2025.104459
Luca Marzari , Francesco Leofante , Ferdinando Cicalese , Alessandro Farinelli
We study the problem of generating robust counterfactual explanations for deep learning models subject to model changes. We focus on plausible model changes altering model parameters and propose a novel framework to reason about the robustness property in this setting. To motivate our solution, we begin by showing for the first time that computing the robustness of counterfactuals with respect to model changes is NP-hard. As this (practically) rules out the existence of scalable algorithms for exactly computing robustness, we propose a novel probabilistic approach which is able to provide tight estimates of robustness with strong guarantees while preserving scalability. Remarkably, and differently from existing solutions targeting plausible model changes, our approach does not impose requirements on the network to be analysed, thus enabling robustness analysis on a wider range of architectures, including state-of-the-art tabular transformers. A thorough experimental analysis on four binary classification datasets reveals that our method improves the state of the art in generating robust explanations, outperforming existing methods.
我们研究了为受模型变化影响的深度学习模型生成鲁棒反事实解释的问题。我们将重点放在改变模型参数的合理模型变化上,并提出了一个新的框架来解释这种情况下的鲁棒性。为了激励我们的解决方案,我们首先展示了计算关于模型变化的反事实的鲁棒性是np困难的。由于这(实际上)排除了精确计算鲁棒性的可扩展算法的存在,我们提出了一种新的概率方法,该方法能够在保持可扩展性的同时,提供具有强保证的严格鲁棒性估计。值得注意的是,与现有的针对合理模型变化的解决方案不同,我们的方法不会对要分析的网络施加要求,因此可以在更广泛的体系结构上进行鲁棒性分析,包括最先进的表格变压器。对四个二元分类数据集的彻底实验分析表明,我们的方法在生成鲁棒性解释方面提高了技术水平,优于现有方法。
{"title":"Probabilistically robust counterfactual explanations under model changes","authors":"Luca Marzari ,&nbsp;Francesco Leofante ,&nbsp;Ferdinando Cicalese ,&nbsp;Alessandro Farinelli","doi":"10.1016/j.artint.2025.104459","DOIUrl":"10.1016/j.artint.2025.104459","url":null,"abstract":"<div><div>We study the problem of generating robust counterfactual explanations for deep learning models subject to model changes. We focus on <em>plausible model changes</em> altering model parameters and propose a novel framework to reason about the robustness property in this setting. To motivate our solution, we begin by showing for the first time that computing the robustness of counterfactuals with respect to model changes is NP-hard. As this (practically) rules out the existence of scalable algorithms for exactly computing robustness, we propose a novel probabilistic approach which is able to provide tight estimates of robustness with strong guarantees while preserving scalability. Remarkably, and differently from existing solutions targeting plausible model changes, our approach does not impose requirements on the network to be analysed, thus enabling robustness analysis on a wider range of architectures, including state-of-the-art tabular transformers. A thorough experimental analysis on four binary classification datasets reveals that our method improves the state of the art in generating robust explanations, outperforming existing methods.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104459"},"PeriodicalIF":4.6,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145731229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-objective reinforcement learning for provably incentivising alignment with value systems 多目标强化学习可证明激励与价值系统对齐
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-07 DOI: 10.1016/j.artint.2025.104460
Manel Rodriguez-Soto , Roxana Rădulescu , Filippo Bistaffa , Oriol Ricart , Arnau Mayoral-Macau , Maite Lopez-Sanchez , Juan A. Rodriguez-Aguilar , Ann Nowé
This paper addresses the problem of ensuring that autonomous learning agents align with multiple moral values. Specifically, we present the theoretical principles and algorithmic tools necessary for creating an environment where we ensure that the agent learns a behaviour aligned with multiple moral values while striving to achieve its individual objective. To address this value alignment problem, we adopt the Multi-Objective Reinforcement Learning framework and propose a novel algorithm that combines techniques from Multi-Objective Reinforcement Learning and Linear Programming. In addition, we illustrate our value alignment process with an example involving an autonomous vehicle. Here, we demonstrate that the agent learns to behave in alignment with the ethical values of safety, achievement, and comfort, with achievement representing the agent’s individual objective. Such ethical behaviour differs depending on the ordering between values. We also use a synthetic multi-objective environment to evaluate the computational costs of guaranteeing ethical learning as the number of values increases.
本文解决了确保自主学习代理与多种道德价值观保持一致的问题。具体来说,我们提出了创建一个环境所需的理论原则和算法工具,在这个环境中,我们确保代理在努力实现其个人目标的同时学习与多种道德价值观相一致的行为。为了解决这个值对齐问题,我们采用了多目标强化学习框架,并提出了一种结合多目标强化学习和线性规划技术的新算法。此外,我们用一个涉及自动驾驶汽车的例子来说明我们的价值校准过程。在这里,我们证明了代理学习与安全、成就和舒适的道德价值观一致的行为,而成就代表了代理的个人目标。这种道德行为的不同取决于价值观之间的顺序。我们还使用一个合成的多目标环境来评估随着值的增加而保证伦理学习的计算成本。
{"title":"Multi-objective reinforcement learning for provably incentivising alignment with value systems","authors":"Manel Rodriguez-Soto ,&nbsp;Roxana Rădulescu ,&nbsp;Filippo Bistaffa ,&nbsp;Oriol Ricart ,&nbsp;Arnau Mayoral-Macau ,&nbsp;Maite Lopez-Sanchez ,&nbsp;Juan A. Rodriguez-Aguilar ,&nbsp;Ann Nowé","doi":"10.1016/j.artint.2025.104460","DOIUrl":"10.1016/j.artint.2025.104460","url":null,"abstract":"<div><div>This paper addresses the problem of ensuring that autonomous learning agents align with multiple moral values. Specifically, we present the theoretical principles and algorithmic tools necessary for creating an environment where we ensure that the agent learns a behaviour aligned with multiple moral values while striving to achieve its individual objective. To address this value alignment problem, we adopt the Multi-Objective Reinforcement Learning framework and propose a novel algorithm that combines techniques from Multi-Objective Reinforcement Learning and Linear Programming. In addition, we illustrate our value alignment process with an example involving an autonomous vehicle. Here, we demonstrate that the agent learns to behave in alignment with the ethical values of safety, achievement, and comfort, with achievement representing the agent’s individual objective. Such ethical behaviour differs depending on the ordering between values. We also use a synthetic multi-objective environment to evaluate the computational costs of guaranteeing ethical learning as the number of values increases.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104460"},"PeriodicalIF":4.6,"publicationDate":"2025-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145689753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Defining Defense and Defeat in Abstract Argumentation From Scratch – A Generalizing Approach 在抽象论证中从零开始定义辩护和失败——一种一般化的方法
IF 14.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-05 DOI: 10.1016/j.artint.2025.104456
Lydia Blümel, Markus Ulbricht
{"title":"Defining Defense and Defeat in Abstract Argumentation From Scratch – A Generalizing Approach","authors":"Lydia Blümel, Markus Ulbricht","doi":"10.1016/j.artint.2025.104456","DOIUrl":"https://doi.org/10.1016/j.artint.2025.104456","url":null,"abstract":"","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"26 1","pages":""},"PeriodicalIF":14.4,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145689754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OBDDs, SDDs, and circuits of bounded width: Completeness matters obdd、sdd和有界宽度电路:完整性问题
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-30 DOI: 10.1016/j.artint.2025.104458
Alexis De Colnet , Sebastian Ordyniak , Stefan Szeider
Ordered Binary Decision Diagrams (OBDDs) are dynamic data structures with many application areas. The literature suggested that OBDDs of bounded width equate to Boolean circuits of bounded pathwidth. In this paper, we show that this relationship holds only for complete OBDDs. Additionally, we demonstrate that similar limitations affect the claimed equivalence between Sentential Decision Diagrams (SDDs) of bounded width and Boolean circuits of bounded treewidth.
有序二元决策图(obdd)是一种动态数据结构,具有广泛的应用领域。文献认为有界宽度的obdd等于有界径宽的布尔电路。在本文中,我们证明了这种关系只适用于完整的obdd。此外,我们证明了类似的限制影响了有界宽度的句子决策图(sdd)和有界树宽度的布尔电路之间的等价性。
{"title":"OBDDs, SDDs, and circuits of bounded width: Completeness matters","authors":"Alexis De Colnet ,&nbsp;Sebastian Ordyniak ,&nbsp;Stefan Szeider","doi":"10.1016/j.artint.2025.104458","DOIUrl":"10.1016/j.artint.2025.104458","url":null,"abstract":"<div><div>Ordered Binary Decision Diagrams (OBDDs) are dynamic data structures with many application areas. The literature suggested that OBDDs of bounded width equate to Boolean circuits of bounded pathwidth. In this paper, we show that this relationship holds only for complete OBDDs. Additionally, we demonstrate that similar limitations affect the claimed equivalence between Sentential Decision Diagrams (SDDs) of bounded width and Boolean circuits of bounded treewidth.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104458"},"PeriodicalIF":4.6,"publicationDate":"2025-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145619717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated neural nonparametric point processes 联邦神经非参数点过程
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-22 DOI: 10.1016/j.artint.2025.104454
Hui Chen , Xuhui Fan , Hengyu Liu , Yaqiong Li , Zhilin Zhao , Feng Zhou , Christopher John Quinn , Longbing Cao
Temporal point processes (TPPs) are effective for modeling event occurrences over time but struggle with sparse and uncertain events in federated systems, where privacy is a major concern. To address this, we propose FedPP, a federated neural nonparametric point process model. FedPP integrates neural embeddings into sigmoidal Gaussian Cox processes (SGCPs) on the client side. SGCPs is a flexible and expressive class of TPPs, allowing FedPP to generate highly flexible intensity functions that capture client-specific event dynamics and uncertainties while efficiently summarizing historical records. For global aggregation, FedPP introduces a divergence-based mechanism to communicate the distributions of kernel hyperparameters in SGCPs between the server and clients, while keeping client-specific parameters local to ensure privacy and personalization. FedPP effectively captures event uncertainty and sparsity. Extensive experiments demonstrate its superior performance in federated settings, showing global aggregation with the KL divergence and the Wasserstein distance.
时间点处理(TPPs)对于随着时间的推移而发生的事件建模是有效的,但是在联邦系统中难以处理稀疏和不确定的事件,其中隐私是一个主要问题。为了解决这个问题,我们提出了联邦神经非参数点过程模型FedPP。FedPP在客户端将神经嵌入集成到s型高斯Cox过程(SGCPs)中。sgcp是一种灵活且富有表现力的tpp类,允许FedPP生成高度灵活的强度函数,以捕获客户特定的事件动态和不确定性,同时有效地总结历史记录。对于全局聚合,FedPP引入了一种基于散度的机制,在服务器和客户端之间传递sgcp中内核超参数的分布,同时保持特定于客户端的参数在本地,以确保隐私和个性化。FedPP有效地捕获了事件的不确定性和稀疏性。大量的实验证明了它在联邦设置下的优越性能,显示出具有KL散度和Wasserstein距离的全局聚合。
{"title":"Federated neural nonparametric point processes","authors":"Hui Chen ,&nbsp;Xuhui Fan ,&nbsp;Hengyu Liu ,&nbsp;Yaqiong Li ,&nbsp;Zhilin Zhao ,&nbsp;Feng Zhou ,&nbsp;Christopher John Quinn ,&nbsp;Longbing Cao","doi":"10.1016/j.artint.2025.104454","DOIUrl":"10.1016/j.artint.2025.104454","url":null,"abstract":"<div><div>Temporal point processes (TPPs) are effective for modeling event occurrences over time but struggle with sparse and uncertain events in federated systems, where privacy is a major concern. To address this, we propose <em>FedPP</em>, a federated neural nonparametric point process model. FedPP integrates neural embeddings into sigmoidal Gaussian Cox processes (SGCPs) on the client side. SGCPs is a flexible and expressive class of TPPs, allowing FedPP to generate highly flexible intensity functions that capture client-specific event dynamics and uncertainties while efficiently summarizing historical records. For global aggregation, FedPP introduces a divergence-based mechanism to communicate the distributions of kernel hyperparameters in SGCPs between the server and clients, while keeping client-specific parameters local to ensure privacy and personalization. FedPP effectively captures event uncertainty and sparsity. Extensive experiments demonstrate its superior performance in federated settings, showing global aggregation with the KL divergence and the Wasserstein distance.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104454"},"PeriodicalIF":4.6,"publicationDate":"2025-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145575241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangling data distribution for optimal and communication-efficient federated learning 面向最优通信高效联邦学习的解纠缠数据分布
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-21 DOI: 10.1016/j.artint.2025.104455
Xinyuan Zhao , Hanlin Gu , Lixin Fan , Yuxing Han , Qiang Yang
Federated Learning (FL) facilitates collaborative training of a global model whose performance is boosted by private data owned by distributed clients, without compromising data privacy. Yet the wide applicability of FL is hindered by the entanglement of data distributions across different clients. This paper demonstrates for the first time that by disentangling data distributions, FL can in principle achieve efficiencies comparable to those of distributed systems, requiring only one round of communication. To this end, we propose a novel FedDistr algorithm, which employs diffusion models to decouple and recover data distributions. Empirical results on the CIFAR100, DomainNet, OfficeHome, and ISIC2020 datasets show that FedDistr significantly enhances model utility and efficiency in both disentangled and near-disentangled scenarios while ensuring privacy, outperforming traditional federated learning methods.
联邦学习(FL)促进了全局模型的协作训练,该模型的性能由分布式客户端拥有的私有数据提高,而不会损害数据隐私。然而,不同客户端之间数据分布的纠缠阻碍了FL的广泛适用性。本文首次证明,通过解开数据分布的纠缠,FL在原则上可以达到与分布式系统相当的效率,只需要一轮通信。为此,我们提出了一种新的FedDistr算法,该算法采用扩散模型来解耦和恢复数据分布。在CIFAR100、DomainNet、OfficeHome和ISIC2020数据集上的实证结果表明,FedDistr在确保隐私的同时,显著提高了模型在解纠缠和近解纠缠场景下的效用和效率,优于传统的联邦学习方法。
{"title":"Disentangling data distribution for optimal and communication-efficient federated learning","authors":"Xinyuan Zhao ,&nbsp;Hanlin Gu ,&nbsp;Lixin Fan ,&nbsp;Yuxing Han ,&nbsp;Qiang Yang","doi":"10.1016/j.artint.2025.104455","DOIUrl":"10.1016/j.artint.2025.104455","url":null,"abstract":"<div><div>Federated Learning (FL) facilitates collaborative training of a global model whose performance is boosted by private data owned by distributed clients, without compromising data privacy. Yet the wide applicability of FL is hindered by the entanglement of data distributions across different clients. This paper demonstrates for the first time that by disentangling data distributions, FL can in principle achieve efficiencies comparable to those of distributed systems, requiring only one round of communication. To this end, we propose a novel FedDistr algorithm, which employs diffusion models to decouple and recover data distributions. Empirical results on the CIFAR100, DomainNet, OfficeHome, and ISIC2020 datasets show that FedDistr significantly enhances model utility and efficiency in both disentangled and near-disentangled scenarios while ensuring privacy, outperforming traditional federated learning methods.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104455"},"PeriodicalIF":4.6,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145567483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human compliance with computational argumentation principles 人类对计算论证原则的遵从
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1016/j.artint.2025.104457
Predrag Teovanović , Srdjan Vesic , Bruno Yun
This paper presents a comprehensive examination of human compliance with normative principles of argumentation across two experimental studies. The first study investigated whether fundamental argumentation principles such as anonymity, independence, void precedence, and maximality align with human reasoning. Additionally, it explored whether graph-based representations of arguments facilitate better understanding and adherence to these principles compared to textual representations of arguments alone and examined the role of individual cognitive differences in compliance with these principles. Our experiments revealed that graph-based representations significantly improved compliance with argumentation principles, particularly among individuals with higher cognitive reflection. The second study replicated and extended the first study’s findings, introducing new principles such as skeptical precedence and simple reinstatement, and explored the effects of presenting arguments solely in graphical form, as well as the impact of a short tutorial on argumentation theory. The study also assessed participants’ ability to perform graphical tasks and how this influenced their compliance with normative principles. Results partially replicated the first study’s findings, confirming that graphical representations enhance compliance, but also revealed that the effect does not generalize to the new principles. We found evidence that in the absence of a graphical representation, performing graphical tasks can improve compliance with principles; especially drawing the argumentation graph. Moreover, a brief tutorial significantly improved performance on several principles, indicating that even minimal instruction can enhance understanding and compliance. However, the difficulties observed with the simple reinstatement principle hint that the participants’ intuition about the notion of defense diverges significantly from that of the researchers and that more careful thoughts must be put in crafting them. These studies collectively suggest that while argumentation principles can be intuitive to some extent, their comprehension and application are significantly influenced by the instruction given as well as by graphical representations and processes used to obtain them. These findings have important implications for the design of future argumentation-based tools and our understanding of how to bridge human reasoning and formal argumentation.
本文提出了一个全面的检查人类遵守规范性原则的论证跨越两个实验研究。第一项研究调查了基本的论证原则,如匿名性、独立性、无效优先性和最大化是否与人类推理一致。此外,它还探讨了与单独的文本表示相比,基于图形的论点表示是否有助于更好地理解和遵守这些原则,并检查了个人认知差异在遵守这些原则方面的作用。我们的实验表明,基于图形的表征显著提高了对论证原则的遵从性,特别是在认知反射较高的个体中。第二项研究复制并扩展了第一项研究的发现,引入了新的原则,如怀疑优先和简单还原,并探索了仅以图形形式呈现论点的效果,以及论证理论简短教程的影响。该研究还评估了参与者执行图形任务的能力,以及这对他们遵守规范原则的影响。结果部分重复了第一项研究的发现,证实了图形表示增强了依从性,但也揭示了效果并不适用于新原则。我们发现证据表明,在没有图形表示的情况下,执行图形任务可以提高对原则的遵从性;特别是绘制论证图。此外,一个简短的教程可以显著提高几个原则的性能,这表明即使是最小的指导也可以增强理解和遵从性。然而,用简单恢复原则观察到的困难暗示了参与者对防御概念的直觉与研究人员的直觉有很大的分歧,必须更仔细地考虑它们。这些研究共同表明,虽然论证原则在某种程度上是直观的,但它们的理解和应用在很大程度上受到所给予的指导以及用于获取它们的图形表示和过程的影响。这些发现对未来基于论证的工具的设计以及我们对如何将人类推理与正式论证联系起来的理解具有重要意义。
{"title":"Human compliance with computational argumentation principles","authors":"Predrag Teovanović ,&nbsp;Srdjan Vesic ,&nbsp;Bruno Yun","doi":"10.1016/j.artint.2025.104457","DOIUrl":"10.1016/j.artint.2025.104457","url":null,"abstract":"<div><div>This paper presents a comprehensive examination of human compliance with normative principles of argumentation across two experimental studies. The first study investigated whether fundamental argumentation principles such as anonymity, independence, void precedence, and maximality align with human reasoning. Additionally, it explored whether graph-based representations of arguments facilitate better understanding and adherence to these principles compared to textual representations of arguments alone and examined the role of individual cognitive differences in compliance with these principles. Our experiments revealed that graph-based representations significantly improved compliance with argumentation principles, particularly among individuals with higher cognitive reflection. The second study replicated and extended the first study’s findings, introducing new principles such as skeptical precedence and simple reinstatement, and explored the effects of presenting arguments solely in graphical form, as well as the impact of a short tutorial on argumentation theory. The study also assessed participants’ ability to perform graphical tasks and how this influenced their compliance with normative principles. Results partially replicated the first study’s findings, confirming that graphical representations enhance compliance, but also revealed that the effect does not generalize to the new principles. We found evidence that in the absence of a graphical representation, performing graphical tasks can improve compliance with principles; especially drawing the argumentation graph. Moreover, a brief tutorial significantly improved performance on several principles, indicating that even minimal instruction can enhance understanding and compliance. However, the difficulties observed with the simple reinstatement principle hint that the participants’ intuition about the notion of defense diverges significantly from that of the researchers and that more careful thoughts must be put in crafting them. These studies collectively suggest that while argumentation principles can be intuitive to some extent, their comprehension and application are significantly influenced by the instruction given as well as by graphical representations and processes used to obtain them. These findings have important implications for the design of future argumentation-based tools and our understanding of how to bridge human reasoning and formal argumentation.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"351 ","pages":"Article 104457"},"PeriodicalIF":4.6,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Many-objective problems where crossover is provably essential 多目标问题,其中交叉是必要的
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-07 DOI: 10.1016/j.artint.2025.104453
Andre Opris
This article addresses theory in evolutionary many-objective optimization and focuses on the role of crossover operators. The advantages of using crossover are hardly understood and rigorous runtime analyses with crossover are lagging far behind its use in practice, specifically in the case of more than two objectives. We present two many-objective problems RRMO and URRMO, and a theoretical runtime analysis of the GSEMO and the widely used NSGA‑III algorithm, to demonstrate that one point crossover on RRMO, as well as uniform crossover on URRMO, can yield an exponential speedup in the runtime. In particular, when the number of objectives is constant, this algorithms can find the Pareto set of both problems in expected polynomial time when using crossover, while without crossover they require exponential time to even find a single Pareto-optimal point. For either problem, we also demonstrate a significant performance gap in certain superconstant parameter regimes for the number of objectives. To the best of our knowledge, this is the first rigorous runtime analysis in many-objective optimization which demonstrates an exponential performance gap when using crossover for more than two objectives. Additionally, it is the first runtime analysis involving crossover in many-objective optimization where the number of objectives is not necessarily constant.
本文讨论了进化多目标优化中的理论,重点讨论了交叉算子的作用。使用交叉的优势很难被理解,严格的运行时分析与交叉在实践中的使用相差甚远,特别是在两个以上目标的情况下。我们提出了两个多目标问题RRMO 和URRMO,并对GSEMO和广泛使用的NSGA - III算法进行了理论运行时分析,证明了RRMO上的一点交叉以及URRMO上的均匀交叉可以在运行时产生指数级的加速。特别是,当目标数一定时,该算法在使用交叉时可以在预期的多项式时间内找到两个问题的Pareto集,而不使用交叉时甚至需要指数时间才能找到一个Pareto最优点。对于这两个问题,我们也证明了在目标数量的某些超常参数体系中存在显著的性能差距。据我们所知,这是多目标优化中第一个严格的运行时分析,它展示了当对两个以上目标使用交叉时的指数级性能差距。此外,它是第一个在多目标优化中涉及交叉的运行时分析,其中目标数量不一定是恒定的。
{"title":"Many-objective problems where crossover is provably essential","authors":"Andre Opris","doi":"10.1016/j.artint.2025.104453","DOIUrl":"10.1016/j.artint.2025.104453","url":null,"abstract":"<div><div>This article addresses theory in evolutionary many-objective optimization and focuses on the role of crossover operators. The advantages of using crossover are hardly understood and rigorous runtime analyses with crossover are lagging far behind its use in practice, specifically in the case of more than two objectives. We present two many-objective problems <span><math><msub><mtext>RR</mtext><mrow><mi>MO</mi></mrow></msub></math></span> and <span><math><msub><mtext>URR</mtext><mrow><mi>MO</mi></mrow></msub></math></span>, and a theoretical runtime analysis of the GSEMO and the widely used NSGA‑III algorithm, to demonstrate that one point crossover on <span><math><msub><mtext>RR</mtext><mrow><mi>MO</mi></mrow></msub></math></span>, as well as uniform crossover on <span><math><msub><mtext>URR</mtext><mrow><mi>MO</mi></mrow></msub></math></span>, can yield an exponential speedup in the runtime. In particular, when the number of objectives is constant, this algorithms can find the Pareto set of both problems in expected polynomial time when using crossover, while without crossover they require exponential time to even find a single Pareto-optimal point. For either problem, we also demonstrate a significant performance gap in certain superconstant parameter regimes for the number of objectives. To the best of our knowledge, this is the first rigorous runtime analysis in many-objective optimization which demonstrates an exponential performance gap when using crossover for more than two objectives. Additionally, it is the first runtime analysis involving crossover in many-objective optimization where the number of objectives is not necessarily constant.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"350 ","pages":"Article 104453"},"PeriodicalIF":4.6,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145461589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bridging sparse domain semantics via an asymmetric siamese framework with virtual anchor guidance for domain-specific multimodal translation 针对特定领域的多模态翻译,采用非对称Siamese框架和虚拟锚引导架桥稀疏领域语义
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-31 DOI: 10.1016/j.artint.2025.104443
Junjun Guo , Yifan Liu , Zhengtao Yu
Domain-specific Multimodal Neural Machine Translation (DMNMT) aims to translate text in specialized domains by leveraging both linguistic context and associated visual information to resolve domain-specific ambiguities and enhance terminological accuracy. Although accompanying images often provide sparse and fragmented visual cues that could potentially anchor critical domain semantics, the semantic mapping from images to textual domain semantics typically exhibits sparse multi-focal alignment challenges. Existing general-domain multimodal neural machine translation (MNMT) models and large language models (LLMs) struggle to achieve accurate aggregation of domain-salient information, often resulting in near-equivalent yet imprecise terminology translations or outright errors. To bridge this sparse domain semantic correspondence gap, we introduce the Asymmetric Siamese Multimodal Fusion (ASMF) framework, which decouples domain representation learning into two complementary branches that both consume text: a domain-specific virtual visual content generation (DVVG) branch and a terminology-aware textual (TAT) branch. The DVVG branch distills sparse, localized visual features into modality-agnostic semantic anchors through mask-constrained multi-focal distillation, while the TAT branch captures terminology-dense textual context. We introduce a novel Domain-Virtualized Pivot-driven Hierarchical Fusion (DVPH) strategy that progressively injects distilled visual anchors across encoder layers. This asymmetric dual-branch design effectively couples spatially fragmented visual details with terminology-rich text, enabling accurate and domain-consistent translations even for low-frequency terms. Extensive experiments were conducted on four benchmark datasets covering three distinct scenarios: two domain-specific datasets (Fashion-MMT and EMMT), one general-domain dataset (Multi30K), and one multi-domain dataset (WIT). Comprehensive evaluations demonstrate that the proposed approach outperforms existing MNMT, DMNMT and LLMs, achieving state-of-the-art (SOTA) results across all datasets. In-depth analyses validate its robustness and generalization capabilities across diverse scenarios, including visually noisy or image-free conditions.
特定领域的多模态神经机器翻译(DMNMT)旨在通过利用语言上下文和相关的视觉信息来解决特定领域的歧义并提高术语的准确性。虽然伴随图像通常提供稀疏和碎片化的视觉线索,可能潜在地锚定关键领域语义,但从图像到文本领域语义的语义映射通常表现出稀疏的多焦点对齐挑战。现有的通用领域多模态神经机器翻译(MNMT)模型和大型语言模型(llm)难以实现领域显著性信息的准确聚合,经常导致近乎等同但不精确的术语翻译或完全错误。为了弥合这种稀疏的领域语义对应差距,我们引入了非对称暹罗多模态融合(ASMF)框架,该框架将领域表示学习解耦为两个互补的分支,这两个分支都消耗文本:特定于领域的虚拟视觉内容生成(DVVG)分支和术语感知文本(TAT)分支。DVVG分支通过掩模约束的多焦点蒸馏将稀疏的局部视觉特征提炼成模态不确定的语义锚,而TAT分支捕获术语密集的文本上下文。我们引入了一种新的域虚拟化轴驱动分层融合(DVPH)策略,该策略在编码器层之间逐步注入精炼的视觉锚点。这种不对称的双分支设计有效地将空间碎片化的视觉细节与术语丰富的文本结合在一起,即使对于低频术语也能实现准确和领域一致的翻译。在涵盖三种不同场景的四个基准数据集上进行了大量实验:两个特定于领域的数据集(Fashion-MMT和EMMT),一个通用领域数据集(Multi30K)和一个多领域数据集(WIT)。综合评估表明,所提出的方法优于现有的MNMT、DMNMT和llm,在所有数据集上都取得了最先进的(SOTA)结果。深入分析验证了其在不同场景下的鲁棒性和泛化能力,包括视觉噪声或无图像条件。
{"title":"Bridging sparse domain semantics via an asymmetric siamese framework with virtual anchor guidance for domain-specific multimodal translation","authors":"Junjun Guo ,&nbsp;Yifan Liu ,&nbsp;Zhengtao Yu","doi":"10.1016/j.artint.2025.104443","DOIUrl":"10.1016/j.artint.2025.104443","url":null,"abstract":"<div><div>Domain-specific Multimodal Neural Machine Translation (DMNMT) aims to translate text in specialized domains by leveraging both linguistic context and associated visual information to resolve domain-specific ambiguities and enhance terminological accuracy. Although accompanying images often provide sparse and fragmented visual cues that could potentially anchor critical domain semantics, the semantic mapping from images to textual domain semantics typically exhibits sparse multi-focal alignment challenges. Existing general-domain multimodal neural machine translation (MNMT) models and large language models (LLMs) struggle to achieve accurate aggregation of domain-salient information, often resulting in near-equivalent yet imprecise terminology translations or outright errors. To bridge this sparse domain semantic correspondence gap, we introduce the Asymmetric Siamese Multimodal Fusion (ASMF) framework, which decouples domain representation learning into two complementary branches that both consume text: a domain-specific virtual visual content generation (DVVG) branch and a terminology-aware textual (TAT) branch. The DVVG branch distills sparse, localized visual features into modality-agnostic semantic anchors through mask-constrained multi-focal distillation, while the TAT branch captures terminology-dense textual context. We introduce a novel Domain-Virtualized Pivot-driven Hierarchical Fusion (DVPH) strategy that progressively injects distilled visual anchors across encoder layers. This asymmetric dual-branch design effectively couples spatially fragmented visual details with terminology-rich text, enabling accurate and domain-consistent translations even for low-frequency terms. Extensive experiments were conducted on four benchmark datasets covering three distinct scenarios: two domain-specific datasets (Fashion-MMT and EMMT), one general-domain dataset (Multi30K), and one multi-domain dataset (WIT). Comprehensive evaluations demonstrate that the proposed approach outperforms existing MNMT, DMNMT and LLMs, achieving state-of-the-art (SOTA) results across all datasets. In-depth analyses validate its robustness and generalization capabilities across diverse scenarios, including visually noisy or image-free conditions.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"350 ","pages":"Article 104443"},"PeriodicalIF":4.6,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145404964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A General Theoretical Framework for Learning Smallest Interpretable Models 学习最小可解释模型的一般理论框架
IF 4.6 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-26 DOI: 10.1016/j.artint.2025.104441
Sebastian Ordyniak , Giacomo Paesani , Mateusz Rychlicki , Stefan Szeider
We develop a general algorithmic framework that allows us to obtain fixed-parameter tractability for computing smallest symbolic models that represent given data. Our framework applies to all ML model types that admit a certain extension property. By establishing this extension property for decision trees, decision sets, decision lists, and binary decision diagrams, we obtain that minimizing these fundamental model types is fixed-parameter tractable. Our framework even applies to ensembles, which combine individual models by majority decision.
我们开发了一个通用的算法框架,使我们能够获得固定参数的可追溯性,用于计算表示给定数据的最小符号模型。我们的框架适用于所有承认某种扩展属性的ML模型类型。通过建立决策树、决策集、决策列表和二元决策图的可拓性,我们得到最小化这些基本模型类型是固定参数可处理的。我们的框架甚至适用于集合,它通过多数决策来组合单个模型。
{"title":"A General Theoretical Framework for Learning Smallest Interpretable Models","authors":"Sebastian Ordyniak ,&nbsp;Giacomo Paesani ,&nbsp;Mateusz Rychlicki ,&nbsp;Stefan Szeider","doi":"10.1016/j.artint.2025.104441","DOIUrl":"10.1016/j.artint.2025.104441","url":null,"abstract":"<div><div>We develop a general algorithmic framework that allows us to obtain fixed-parameter tractability for computing smallest symbolic models that represent given data. Our framework applies to all ML model types that admit a certain extension property. By establishing this extension property for decision trees, decision sets, decision lists, and binary decision diagrams, we obtain that minimizing these fundamental model types is fixed-parameter tractable. Our framework even applies to ensembles, which combine individual models by majority decision.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"350 ","pages":"Article 104441"},"PeriodicalIF":4.6,"publicationDate":"2025-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145382554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1