揭示和未揭示偏好的（非）信息理论：通过心理理论出现的欺骗和怀疑论。

Q1 Social Sciences Open Mind Pub Date : 2023-08-20 eCollection Date: 2023-01-01 DOI:10.1162/opmi_a_00097

Nitay Alon, Lion Schulz, Jeffrey S Rosenschein, Peter Dayan

{"title":"揭示和未揭示偏好的（非）信息理论：通过心理理论出现的欺骗和怀疑论。","authors":"Nitay Alon, Lion Schulz, Jeffrey S Rosenschein, Peter Dayan","doi":"10.1162/opmi_a_00097","DOIUrl":null,"url":null,"abstract":"In complex situations involving communication, agents might attempt to mask their intentions, exploiting Shannon's theory of information as a theory of misinformation. Here, we introduce and analyze a simple multiagent reinforcement learning task where a buyer sends signals to a seller via its actions, and in which both agents are endowed with a recursive theory of mind. We show that this theory of mind, coupled with pure reward-maximization, gives rise to agents that selectively distort messages and become skeptical towards one another. Using information theory to analyze these interactions, we show how savvy buyers reduce mutual information between their preferences and actions, and how suspicious sellers learn to reinterpret or discard buyers' signals in a strategic manner.","PeriodicalId":32558,"journal":{"name":"Open Mind","volume":"7 ","pages":"608-624"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575559/pdf/","citationCount":"4","resultStr":"{\"title\":\"A (Dis-)information Theory of Revealed and Unrevealed Preferences: Emerging Deception and Skepticism via Theory of Mind.\",\"authors\":\"Nitay Alon, Lion Schulz, Jeffrey S Rosenschein, Peter Dayan\",\"doi\":\"10.1162/opmi_a_00097\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In complex situations involving communication, agents might attempt to mask their intentions, exploiting Shannon's theory of information as a theory of misinformation. Here, we introduce and analyze a simple multiagent reinforcement learning task where a buyer sends signals to a seller via its actions, and in which both agents are endowed with a recursive theory of mind. We show that this theory of mind, coupled with pure reward-maximization, gives rise to agents that selectively distort messages and become skeptical towards one another. Using information theory to analyze these interactions, we show how savvy buyers reduce mutual information between their preferences and actions, and how suspicious sellers learn to reinterpret or discard buyers' signals in a strategic manner.\",\"PeriodicalId\":32558,\"journal\":{\"name\":\"Open Mind\",\"volume\":\"7 \",\"pages\":\"608-624\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575559/pdf/\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Open Mind\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1162/opmi_a_00097\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open Mind","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1162/opmi_a_00097","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 4

摘要

在涉及沟通的复杂情况下，特工可能会试图掩盖他们的意图，利用香农的信息理论作为错误信息理论。在这里，我们介绍并分析了一个简单的多智能体强化学习任务，其中买方通过其行为向卖方发送信号，并且两个智能体都被赋予了递归思维理论。我们表明，这种心理理论，再加上纯粹的回报最大化，会产生选择性地扭曲信息并对彼此持怀疑态度的代理人。利用信息理论分析这些互动，我们展示了精明的买家如何减少他们的偏好和行为之间的相互信息，以及可疑的卖家如何学会以战略方式重新解释或丢弃买家的信号。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A (Dis-)information Theory of Revealed and Unrevealed Preferences: Emerging Deception and Skepticism via Theory of Mind.

In complex situations involving communication, agents might attempt to mask their intentions, exploiting Shannon's theory of information as a theory of misinformation. Here, we introduce and analyze a simple multiagent reinforcement learning task where a buyer sends signals to a seller via its actions, and in which both agents are endowed with a recursive theory of mind. We show that this theory of mind, coupled with pure reward-maximization, gives rise to agents that selectively distort messages and become skeptical towards one another. Using information theory to analyze these interactions, we show how savvy buyers reduce mutual information between their preferences and actions, and how suspicious sellers learn to reinterpret or discard buyers' signals in a strategic manner.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊