踪迹强化蚂蚁的过程不会找到最短路径

Journal de l’École polytechnique — Mathématiques Pub Date : 2021-06-19 DOI:10.5802/jep.188

Daniel Kious, Cécile Mailler, Bruno Schapira

{"title":"踪迹强化蚂蚁的过程不会找到最短路径","authors":"Daniel Kious, Cécile Mailler, Bruno Schapira","doi":"10.5802/jep.188","DOIUrl":null,"url":null,"abstract":"In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\\it the ants find the shortest path} if, almost surely as the number of ants grow to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without loosing time on other edges of the graph. \nOur contribution is three-fold: (1) We prove that, if $\\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converge to a {\\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.","PeriodicalId":106406,"journal":{"name":"Journal de l’École polytechnique — Mathématiques","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"The trace-reinforced ants process does not find shortest paths\",\"authors\":\"Daniel Kious, Cécile Mailler, Bruno Schapira\",\"doi\":\"10.5802/jep.188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\\\\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\\\\it the ants find the shortest path} if, almost surely as the number of ants grow to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without loosing time on other edges of the graph. \\nOur contribution is three-fold: (1) We prove that, if $\\\\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converge to a {\\\\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.\",\"PeriodicalId\":106406,\"journal\":{\"name\":\"Journal de l’École polytechnique — Mathématiques\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal de l’École polytechnique — Mathématiques\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5802/jep.188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal de l’École polytechnique — Mathématiques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5802/jep.188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在本文中，我们研究了蚂蚁寻找巢穴和食物来源之间最短路径的概率强化学习模型。在这个模型中，巢和食物来源是有限图中两个不同的节点$N$和$F$。蚂蚁在这张图上执行一系列随机行走，从巢穴开始，当第一次到达食物来源时停止。在随机行走的每一步中，第n只蚂蚁选择穿过相邻的边，其概率与之前至少一次穿过该边的蚂蚁的数量成正比。我们说，如果蚂蚁找到了最短路径，几乎可以肯定，当蚂蚁的数量增长到无穷大时，几乎所有的蚂蚁都通过一条最短路径从巢穴到食物来源，而不会在图的其他边损失时间。我们的贡献有三方面:(1)我们证明了，如果$\mathcal G$是一棵扎根于$N$的树，它的叶子被合并到$F$节点中，并且在$N$和$F$之间有一条边，那么蚂蚁确实找到了最短路径。(2)相比之下，我们提供了三个蚂蚁不能找到最短路径的图的例子，这表明在这个模型和大多数图中，蚂蚁不能找到最短路径。(3)在所有这些情况下，我们证明了归一化边权序列收敛于{\it确定性}极限，尽管存在线性强化机制，并且我们推测这是一个在所有有限图上有效的一般事实。为了证明这些结果，我们使用了随机逼近方法，特别是ODE方法。一个困难来自于这样一个事实，即这种方法依赖于对非线性、多维ODE解的大时间行为的理解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

The trace-reinforced ants process does not find shortest paths

In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\it the ants find the shortest path} if, almost surely as the number of ants grow to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without loosing time on other edges of the graph. Our contribution is three-fold: (1) We prove that, if $\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converge to a {\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助