Schrödinger's Control and Estimation Paradigm With Spatio-Temporal Distributions on Graphs

IF 7 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automatic Control Pub Date : 2024-10-23 DOI:10.1109/TAC.2024.3485537

Asmaa Eldesoukey;Tryphon T. Georgiou

{"title":"Schrödinger's Control and Estimation Paradigm With Spatio-Temporal Distributions on Graphs","authors":"Asmaa Eldesoukey;Tryphon T. Georgiou","doi":"10.1109/TAC.2024.3485537","DOIUrl":null,"url":null,"abstract":"The problem of reconciling a prior probability law on paths with data was introduced by Schrödinger in 1931 and 1932. It represents an early formulation of a maximum likelihood problem. This specific formulation can also be seen as the control problem to modify the law of a diffusion process so as to match specifications on marginal distributions at given times. Thereby, in recent years, this so-called <italic>Schrödinger's bridge problem</i> has been at the center of the uncertainty control development. However, an understudied facet of this program has been to address uncertainty in <italic>space</i> (state) and <italic>time</i>, modeling the effect of tasks being completed contingent on meeting a certain condition at some random time instead of imposing specifications at fixed times. The present work is a study to extend Schrödinger's paradigm on such an issue, and herein, it is tackled in the context of random walks on directed graphs. Specifically, we study the case where one marginal is the initial probability distribution on a Markov chain, while others are marginals of stopping (first-arrival) times at absorbing states, signifying completion of tasks. We show when the prior law on paths is Markov, a Markov policy is once again optimal to satisfy those marginal constraints with respect to a likelihood cost following Schrödinger's dictum. Based on this, we present the mathematical formulation involving a <italic>Sinkhorn</i>-type iteration to construct the optimal probability law on paths matching the spatio-temporal marginals.","PeriodicalId":13201,"journal":{"name":"IEEE Transactions on Automatic Control","volume":"70 4","pages":"2466-2478"},"PeriodicalIF":7.0000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automatic Control","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10731575/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The problem of reconciling a prior probability law on paths with data was introduced by Schrödinger in 1931 and 1932. It represents an early formulation of a maximum likelihood problem. This specific formulation can also be seen as the control problem to modify the law of a diffusion process so as to match specifications on marginal distributions at given times. Thereby, in recent years, this so-called Schrödinger's bridge problem has been at the center of the uncertainty control development. However, an understudied facet of this program has been to address uncertainty in space (state) and time, modeling the effect of tasks being completed contingent on meeting a certain condition at some random time instead of imposing specifications at fixed times. The present work is a study to extend Schrödinger's paradigm on such an issue, and herein, it is tackled in the context of random walks on directed graphs. Specifically, we study the case where one marginal is the initial probability distribution on a Markov chain, while others are marginals of stopping (first-arrival) times at absorbing states, signifying completion of tasks. We show when the prior law on paths is Markov, a Markov policy is once again optimal to satisfy those marginal constraints with respect to a likelihood cost following Schrödinger's dictum. Based on this, we present the mathematical formulation involving a Sinkhorn-type iteration to construct the optimal probability law on paths matching the spatio-temporal marginals.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用图上时空分布的薛定谔控制和估计范式

调和路径上的先验概率律和数据的问题是由Schrödinger在1931年和1932年提出的。它代表了极大似然问题的一个早期形式。这个特殊的公式也可以看作是控制问题，用来修改扩散过程的规律，以匹配给定时间的边际分布规范。因此，近年来，这个所谓的Schrödinger桥梁问题一直是不确定性控制发展的中心问题。然而，该程序的一个未被充分研究的方面是解决空间（状态）和时间的不确定性，建模任务完成的影响取决于在某个随机时间满足某个条件，而不是在固定时间强加规范。目前的工作是一项研究，以扩展Schrödinger的范式在这样一个问题上，在这里，它是在有向图上随机行走的背景下处理的。具体来说，我们研究了一种情况，其中一个边缘是马尔可夫链上的初始概率分布，而其他边缘是在吸收状态下停止（首次到达）时间的边缘，表示任务的完成。我们展示了当路径上的先验律是马尔可夫律时，一个马尔可夫策略再一次是最优的，以满足那些边际约束的可能性成本，遵循Schrödinger的格言。在此基础上，我们提出了一个包含sinkhorn型迭代的数学公式来构造匹配时空边缘的路径的最优概率律。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automatic Control 工程技术-工程：电子与电气

CiteScore

11.30

自引率

5.90%

发文量

824

审稿时长

9 months

期刊介绍： In the IEEE Transactions on Automatic Control, the IEEE Control Systems Society publishes high-quality papers on the theory, design, and applications of control engineering. Two types of contributions are regularly considered: 1) Papers: Presentation of significant research, development, or application of control concepts. 2) Technical Notes and Correspondence: Brief technical notes, comments on published areas or established control topics, corrections to papers and notes published in the Transactions. In addition, special papers (tutorials, surveys, and perspectives on the theory and applications of control systems topics) are solicited.