Discovering Agents

Artif. Intell. Pub Date : 2022-08-17 DOI:10.48550/arXiv.2208.08345

Z. Kenton, Ramana Kumar, Sebastian Farquhar, Jonathan G. Richens, Matt MacDermott, Tom Everitt

引用次数: 11

Abstract

Causal models of agents have been used to analyse the safety aspects of machine learning systems. But identifying agents is non-trivial -- often the causal model is just assumed by the modeler without much justification -- and modelling failures can lead to mistakes in the safety analysis. This paper proposes the first formal causal definition of agents -- roughly that agents are systems that would adapt their policy if their actions influenced the world in a different way. From this we derive the first causal discovery algorithm for discovering agents from empirical data, and give algorithms for translating between causal models and game-theoretic influence diagrams. We demonstrate our approach by resolving some previous confusions caused by incorrect causal modelling of agents.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

发现代理

代理的因果模型已被用于分析机器学习系统的安全方面。但识别代理人并非易事——通常因果模型只是由建模者在没有太多理由的情况下假设的——而建模失败可能导致安全分析中的错误。本文提出了agent的第一个正式的因果定义——粗略地说，agent是一种系统，如果它们的行为以不同的方式影响世界，它们会调整自己的政策。在此基础上，我们推导了第一个从经验数据中发现代理的因果发现算法，并给出了在因果模型和博弈论影响图之间转换的算法。我们通过解决先前由不正确的代理因果建模引起的一些混淆来演示我们的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Artif. Intell.

自引率

0.00%

发文量

期刊最新文献

Entropy Estimation via Uniformization Task-Guided IRL in POMDPs that Scales Measuring power in coalitional games with friends, enemies and allies Defense coordination in security games: Equilibrium analysis and mechanism design Reasoning about general preference relations