Ergodic Control of a Heterogeneous Population and Application to Electricity Pricing

IF 7 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automatic Control Pub Date : 2025-01-20 DOI:10.1109/TAC.2025.3532178

Quentin Jacquet;Wim van Ackooij;Clémence Alasseur;Stéphane Gaubert

{"title":"Ergodic Control of a Heterogeneous Population and Application to Electricity Pricing","authors":"Quentin Jacquet;Wim van Ackooij;Clémence Alasseur;Stéphane Gaubert","doi":"10.1109/TAC.2025.3532178","DOIUrl":null,"url":null,"abstract":"We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a “mean-field” Markov decision process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we prove that the infinite-dimensional ergodic eigenproblem admits a solution and show that the latter is in general nonunique. This allows us to obtain optimal strategies and to quantify the gap between steady-state strategies and optimal ones. In particular, we prove in the 1-D case that there exist cyclic policies—alternating between discount and profit-taking stages—which secure a greater gain than constant-price policies. On numerical aspects, we develop a policy iteration algorithm with “on-the-fly” generated transitions, specifically adapted to decomposable models, leading to substantial memory savings. We finally apply our results to realistic instances coming from an electricity pricing problem encountered in the retail markets and numerically observe the emergence of cyclic promotions for sufficient inertia in customer behavior.","PeriodicalId":13201,"journal":{"name":"IEEE Transactions on Automatic Control","volume":"70 7","pages":"4640-4654"},"PeriodicalIF":7.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automatic Control","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848191/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a “mean-field” Markov decision process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we prove that the infinite-dimensional ergodic eigenproblem admits a solution and show that the latter is in general nonunique. This allows us to obtain optimal strategies and to quantify the gap between steady-state strategies and optimal ones. In particular, we prove in the 1-D case that there exist cyclic policies—alternating between discount and profit-taking stages—which secure a greater gain than constant-price policies. On numerical aspects, we develop a policy iteration algorithm with “on-the-fly” generated transitions, specifically adapted to decomposable models, leading to substantial memory savings. We finally apply our results to realistic instances coming from an electricity pricing problem encountered in the retail markets and numerically observe the emergence of cyclic promotions for sufficient inertia in customer behavior.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

异质人口的遍历控制及其在电价中的应用

我们考虑一个由能够在任何时间在不同选项之间切换的代理组成的异质群体的控制问题。控制器的目标是最大化每时间单位的平均增益，假设种群的大小是无限的。这导致了一个“平均域”马尔可夫决策过程的遍历控制问题，其中状态空间是简单函数的乘积，种群根据受控线性动力学进化。利用希尔伯特射影度规中动力学的收缩性质，证明了无穷维遍历本征问题存在解，并证明了后者一般是非唯一的。这使我们能够获得最优策略，并量化稳态策略与最优策略之间的差距。特别地，我们证明了在一维情况下存在循环政策-在折扣和获利回盘阶段之间交替-比不变价格政策获得更大的收益。在数值方面，我们开发了一种具有“动态”生成转换的策略迭代算法，特别适用于可分解模型，从而节省了大量内存。最后，我们将我们的结果应用于零售市场中遇到的电价问题的实际实例，并通过数值观察客户行为中足够惯性的循环促销的出现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automatic Control 工程技术-工程：电子与电气

CiteScore

11.30

自引率

5.90%

发文量

824

审稿时长

9 months

期刊介绍： In the IEEE Transactions on Automatic Control, the IEEE Control Systems Society publishes high-quality papers on the theory, design, and applications of control engineering. Two types of contributions are regularly considered: 1) Papers: Presentation of significant research, development, or application of control concepts. 2) Technical Notes and Correspondence: Brief technical notes, comments on published areas or established control topics, corrections to papers and notes published in the Transactions. In addition, special papers (tutorials, surveys, and perspectives on the theory and applications of control systems topics) are solicited.