Quentin Jacquet;Wim van Ackooij;Clémence Alasseur;Stéphane Gaubert
{"title":"Ergodic Control of a Heterogeneous Population and Application to Electricity Pricing","authors":"Quentin Jacquet;Wim van Ackooij;Clémence Alasseur;Stéphane Gaubert","doi":"10.1109/TAC.2025.3532178","DOIUrl":null,"url":null,"abstract":"We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a “mean-field” Markov decision process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we prove that the infinite-dimensional ergodic eigenproblem admits a solution and show that the latter is in general nonunique. This allows us to obtain optimal strategies and to quantify the gap between steady-state strategies and optimal ones. In particular, we prove in the 1-D case that there exist cyclic policies—alternating between discount and profit-taking stages—which secure a greater gain than constant-price policies. On numerical aspects, we develop a policy iteration algorithm with “on-the-fly” generated transitions, specifically adapted to decomposable models, leading to substantial memory savings. We finally apply our results to realistic instances coming from an electricity pricing problem encountered in the retail markets and numerically observe the emergence of cyclic promotions for sufficient inertia in customer behavior.","PeriodicalId":13201,"journal":{"name":"IEEE Transactions on Automatic Control","volume":"70 7","pages":"4640-4654"},"PeriodicalIF":7.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automatic Control","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848191/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a “mean-field” Markov decision process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we prove that the infinite-dimensional ergodic eigenproblem admits a solution and show that the latter is in general nonunique. This allows us to obtain optimal strategies and to quantify the gap between steady-state strategies and optimal ones. In particular, we prove in the 1-D case that there exist cyclic policies—alternating between discount and profit-taking stages—which secure a greater gain than constant-price policies. On numerical aspects, we develop a policy iteration algorithm with “on-the-fly” generated transitions, specifically adapted to decomposable models, leading to substantial memory savings. We finally apply our results to realistic instances coming from an electricity pricing problem encountered in the retail markets and numerically observe the emergence of cyclic promotions for sufficient inertia in customer behavior.
期刊介绍:
In the IEEE Transactions on Automatic Control, the IEEE Control Systems Society publishes high-quality papers on the theory, design, and applications of control engineering. Two types of contributions are regularly considered:
1) Papers: Presentation of significant research, development, or application of control concepts.
2) Technical Notes and Correspondence: Brief technical notes, comments on published areas or established control topics, corrections to papers and notes published in the Transactions.
In addition, special papers (tutorials, surveys, and perspectives on the theory and applications of control systems topics) are solicited.