首页 > 最新文献

Applied Mathematics and Optimization最新文献

英文 中文
On the Time Consistent Solution to Optimal Stopping Problems with Expectation Constraint 带期望约束的最优停车问题的时间一致解
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-19 DOI: 10.1007/s00245-024-10202-w
S. Christensen, M. Klein, B. Schultz

We study the (weak) equilibrium problem arising from the problem of optimally stopping a one-dimensional diffusion subject to an expectation constraint on the time until stopping. The weak equilibrium problem is realized with a set of randomized but purely state dependent stopping times as admissible strategies. We derive a verification theorem and necessary conditions for equilibria, which together basically characterize all equilibria. Furthermore, additional structural properties of equilibria are obtained to feed a possible guess-and-verify approach, which is then illustrated by an example.

我们研究了在期望时间约束下最优停止一维扩散问题所引起的(弱)平衡问题。用一组随机但纯粹依赖于状态的停止时间作为允许策略来实现弱均衡问题。我们导出了均衡的验证定理和必要条件,它们共同基本表征了所有均衡。此外,还获得了平衡态的附加结构性质,以提供一种可能的猜测-验证方法,然后通过一个例子说明了这一点。
{"title":"On the Time Consistent Solution to Optimal Stopping Problems with Expectation Constraint","authors":"S. Christensen,&nbsp;M. Klein,&nbsp;B. Schultz","doi":"10.1007/s00245-024-10202-w","DOIUrl":"10.1007/s00245-024-10202-w","url":null,"abstract":"<div><p>We study the (weak) equilibrium problem arising from the problem of optimally stopping a one-dimensional diffusion subject to an expectation constraint on the time until stopping. The weak equilibrium problem is realized with a set of randomized but purely state dependent stopping times as admissible strategies. We derive a verification theorem and necessary conditions for equilibria, which together basically characterize all equilibria. Furthermore, additional structural properties of equilibria are obtained to feed a possible guess-and-verify approach, which is then illustrated by an example.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10202-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142858602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous Time q-Learning for Mean-Field Control Problems 平均场控制问题的连续时间 q 学习
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-17 DOI: 10.1007/s00245-024-10205-7
Xiaoli Wei, Xiang Yu

This paper studies the q-learning, recently coined as the continuous time counterpart of Q-learning by Jia and Zhou (J Mach Learn Res 24:1–61, 2023), for continuous time mean-field control problems in the setting of entropy-regularized reinforcement learning. In contrast to the single agent’s control problem in Jia and Zhou (J Mach Learn Res 24:1–61, 2023), we reveal that two different q-functions naturally arise in mean-field control problems: (i) the integrated q-function (denoted by q) as the first-order approximation of the integrated Q-function introduced in Gu et al. (Oper Res 71(4):1040–1054, 2023), which can be learnt by a weak martingale condition using all test policies; and (ii) the essential q-function (denoted by (q_e)) that is employed in the policy improvement iterations. We show that two q-functions are related via an integral representation. Based on the weak martingale condition and our proposed searching method of test policies, some model-free learning algorithms are devised. In two examples, one in LQ control framework and one beyond LQ control framework, we can obtain the exact parameterization of the optimal value function and q-functions and illustrate our algorithms with simulation experiments.

本文研究了在熵规则化强化学习背景下连续时间均场控制问题的q-learning(最近被Jia和Zhou称为Q-learning的连续时间对应物)(J Mach Learn Res 24:1-61, 2023)。与 Jia 和 Zhou (J Mach Learn Res 24:1-61, 2023) 中的单代理控制问题不同,我们发现均场控制问题中自然会出现两种不同的 q 函数:(i)综合 q 函数(用 q 表示),即 Gu 等人 (Oper Res 71(4:1-61, 2023) 中引入的综合 Q 函数的一阶近似值。(Oper Res 71(4):1040-1054, 2023)中引入的一阶近似综合 Q 函数,它可以通过使用所有测试策略的弱马丁格尔条件来学习;(ii) 在策略改进迭代中使用的本质 q 函数(用 (q_e) 表示)。我们通过积分表示法证明了两个 q 函数的关系。基于弱鞅条件和我们提出的测试策略搜索方法,我们设计了一些无模型学习算法。在两个例子(一个在 LQ 控制框架内,一个在 LQ 控制框架之外)中,我们可以得到最优值函数和 q 函数的精确参数化,并通过仿真实验说明了我们的算法。
{"title":"Continuous Time q-Learning for Mean-Field Control Problems","authors":"Xiaoli Wei,&nbsp;Xiang Yu","doi":"10.1007/s00245-024-10205-7","DOIUrl":"10.1007/s00245-024-10205-7","url":null,"abstract":"<div><p>This paper studies the q-learning, recently coined as the continuous time counterpart of Q-learning by Jia and Zhou (J Mach Learn Res 24:1–61, 2023), for continuous time mean-field control problems in the setting of entropy-regularized reinforcement learning. In contrast to the single agent’s control problem in Jia and Zhou (J Mach Learn Res 24:1–61, 2023), we reveal that two different q-functions naturally arise in mean-field control problems: (i) the integrated q-function (denoted by <i>q</i>) as the first-order approximation of the integrated Q-function introduced in Gu et al. (Oper Res 71(4):1040–1054, 2023), which can be learnt by a weak martingale condition using all test policies; and (ii) the essential q-function (denoted by <span>(q_e)</span>) that is employed in the policy improvement iterations. We show that two q-functions are related via an integral representation. Based on the weak martingale condition and our proposed searching method of test policies, some model-free learning algorithms are devised. In two examples, one in LQ control framework and one beyond LQ control framework, we can obtain the exact parameterization of the optimal value function and q-functions and illustrate our algorithms with simulation experiments.\u0000</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142826395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching 政策梯度的控制随机化方法及其在最优切换强化学习中的应用
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-16 DOI: 10.1007/s00245-024-10207-5
Robert Denkert, Huyên Pham, Xavier Warin

We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enabling applications across various classes of Markovian continuous time control problems, beyond diffusion models, including e.g. regular, impulse and optimal stopping/switching problems. By utilizing change of measure in the control randomisation technique, we derive a new policy gradient representation for these randomised problems, featuring parametrised intensity policies. We further develop actor-critic algorithms specifically designed to address general Markovian stochastic control issues. Our framework is demonstrated through its application to optimal switching problems, with two numerical case studies in the energy sector focusing on real options.

我们提出了一个针对连续时间强化学习的政策梯度方法综合框架。该框架基于随机控制问题和随机化问题之间的联系,可应用于扩散模型之外的各类马尔可夫连续时间控制问题,包括常规问题、脉冲问题和最优停止/切换问题。通过利用控制随机化技术中的度量变化,我们为这些随机化问题推导出了一种新的策略梯度表示法,具有参数化强度策略的特点。我们还进一步开发了专门用于解决一般马尔可夫随机控制问题的行为批判算法。通过将我们的框架应用于最优转换问题,并在能源领域进行了两个以实物期权为重点的数值案例研究,我们的框架得到了验证。
{"title":"Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching","authors":"Robert Denkert,&nbsp;Huyên Pham,&nbsp;Xavier Warin","doi":"10.1007/s00245-024-10207-5","DOIUrl":"10.1007/s00245-024-10207-5","url":null,"abstract":"<div><p>We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enabling applications across various classes of Markovian continuous time control problems, beyond diffusion models, including e.g. regular, impulse and optimal stopping/switching problems. By utilizing change of measure in the control randomisation technique, we derive a new policy gradient representation for these randomised problems, featuring parametrised intensity policies. We further develop actor-critic algorithms specifically designed to address general Markovian stochastic control issues. Our framework is demonstrated through its application to optimal switching problems, with two numerical case studies in the energy sector focusing on real options.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142826391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coarse Correlated Equilibria in Linear Quadratic Mean Field Games and Application to an Emission Abatement Game 线性二次均值场博弈中的粗相关均衡及其在减排博弈中的应用
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-14 DOI: 10.1007/s00245-024-10198-3
Luciano Campi, Federico Cannerozzi, Fanny Cartellier

Coarse correlated equilibria (CCE) are a good alternative to Nash equilibria (NE), as they arise more naturally as outcomes of learning algorithms and as they may exhibit higher payoffs than NE. CCEs include a device which allows players’ strategies to be correlated without any cooperation, only through information sent by a mediator. We develop a methodology to concretely compute mean field CCEs in a linear-quadratic mean field game (MFG) framework. We compare their performance to mean field control solutions and mean field NE (usually named MFG solutions). Our approach is implemented in the mean field version of an emission abatement game between greenhouse gas emitters. In particular, we exhibit a simple and tractable class of mean field CCEs which allows to outperform very significantly the mean field NE payoff and abatement levels, bridging the gap between the mean field NE and the social optimum obtained by mean field control.

粗相关均衡(CCE)是纳什均衡(NE)的一个很好的替代方案,因为粗相关均衡作为学习算法的结果出现得更自然,而且可能比纳什均衡显示出更高的收益。CCE 包含一种装置,它允许博弈者的策略在没有任何合作的情况下,仅通过中介发送的信息相互关联。我们开发了一种在线性-二次均值场博弈(MFG)框架内具体计算均值场 CCE 的方法。我们将其性能与均值场控制解决方案和均值场 NE(通常称为 MFG 解决方案)进行了比较。我们的方法是在温室气体排放者之间的均值场减排博弈中实现的。特别是,我们展示了一类简单易行的均值场 CCE,它可以大大优于均值场 NE 的报酬和减排水平,弥补了均值场 NE 与均值场控制所获得的社会最优之间的差距。
{"title":"Coarse Correlated Equilibria in Linear Quadratic Mean Field Games and Application to an Emission Abatement Game","authors":"Luciano Campi,&nbsp;Federico Cannerozzi,&nbsp;Fanny Cartellier","doi":"10.1007/s00245-024-10198-3","DOIUrl":"10.1007/s00245-024-10198-3","url":null,"abstract":"<div><p>Coarse correlated equilibria (CCE) are a good alternative to Nash equilibria (NE), as they arise more naturally as outcomes of learning algorithms and as they may exhibit higher payoffs than NE. CCEs include a device which allows players’ strategies to be correlated without any cooperation, only through information sent by a mediator. We develop a methodology to concretely compute mean field CCEs in a linear-quadratic mean field game (MFG) framework. We compare their performance to mean field control solutions and mean field NE (usually named MFG solutions). Our approach is implemented in the mean field version of an emission abatement game between greenhouse gas emitters. In particular, we exhibit a simple and tractable class of mean field CCEs which allows to outperform very significantly the mean field NE payoff and abatement levels, bridging the gap between the mean field NE and the social optimum obtained by mean field control.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10198-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142821384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Well-Posed Uniform Solvability of Convex Optimization Problems on a Uniform Differentiable Closed Convex Set 一致可微闭凸集上凸优化问题的适定一致可解性
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-12 DOI: 10.1007/s00245-024-10206-6
Shaoqiang Shang

In this paper, we first give the definition of uniformly differentiable set and give the definitions of sets (P(A,eta , r)) and (P_{A,delta }(f)). Secondly, we prove that if the set A is bounded closed convex set, then A is uniformly differentiable if and only if for any (varepsilon , eta , r>0), there exists (delta =delta (varepsilon ,eta ,r )>0) such that (Vert x-yVert <varepsilon ) whenever (fin P(A,eta , r)), (yin P_{A,delta }(f)) and (xin P_{A}(f)). Moreover, we also prove that if A is a bounded closed convex set in a finite-dimensional space X, then A is differentiable if and only if A is uniformly differentiable. Finally, we give some examples of uniformly differentiable set. Therefore, we extend some conclusions (SIAM J. Optim. Vol. 30, No. 1, pp. 490–512).

本文首先给出了一致可微集的定义,并给出了集合(P(A,eta , r))和(P_{A,delta }(f))的定义。其次,证明了如果集合A是有界闭凸集,则A是一致可微的当且仅当对于任意(varepsilon , eta , r>0),存在(delta =delta (varepsilon ,eta ,r )>0)使得(Vert x-yVert <varepsilon )无论(fin P(A,eta , r)), (yin P_{A,delta }(f)), (xin P_{A}(f))。此外,我们还证明了如果A是有限维空间X中的有界闭凸集,则当且仅当A一致可微时,A是可微的。最后给出了一致可微集的一些例子。因此,我们推广了一些结论(SIAM J. Optim。第30卷第1期,490-512页)。
{"title":"Well-Posed Uniform Solvability of Convex Optimization Problems on a Uniform Differentiable Closed Convex Set","authors":"Shaoqiang Shang","doi":"10.1007/s00245-024-10206-6","DOIUrl":"10.1007/s00245-024-10206-6","url":null,"abstract":"<div><p>In this paper, we first give the definition of uniformly differentiable set and give the definitions of sets <span>(P(A,eta , r))</span> and <span>(P_{A,delta }(f))</span>. Secondly, we prove that if the set <i>A</i> is bounded closed convex set, then <i>A</i> is uniformly differentiable if and only if for any <span>(varepsilon , eta , r&gt;0)</span>, there exists <span>(delta =delta (varepsilon ,eta ,r )&gt;0)</span> such that <span>(Vert x-yVert &lt;varepsilon )</span> whenever <span>(fin P(A,eta , r))</span>, <span>(yin P_{A,delta }(f))</span> and <span>(xin P_{A}(f))</span>. Moreover, we also prove that if <i>A</i> is a bounded closed convex set in a finite-dimensional space <i>X</i>, then <i>A</i> is differentiable if and only if <i>A</i> is uniformly differentiable. Finally, we give some examples of uniformly differentiable set. Therefore, we extend some conclusions (SIAM J. Optim. Vol. 30, No. 1, pp. 490–512).</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142811195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Boundedness of Solutions to a Food Chain Model with Nonlinear Taxis Sensitivity 一类具有非线性导向敏感性的食物链模型解的全局有界性
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-11 DOI: 10.1007/s00245-024-10208-4
Enhui Pan, Changchun Liu

In this paper, we investigate the initial boundary value problem of a three-species spatial food chain model with nonlinear taxis sensitivity in a bounded domain (Omega subset {mathbb {R}}^2) with a smooth boundary and homogeneous Neumann boundary conditions. By establishing a new energy functional, we demonstrate the global boundedness of classical solutions under appropriate initial data.

在光滑边界和齐次Neumann边界条件下,研究了一类具有非线性趋向性灵敏度的三物种空间食物链模型在有界区域(Omega subset {mathbb {R}}^2)上的初边值问题。通过建立一个新的能量泛函,我们证明了经典解在适当初始数据下的全局有界性。
{"title":"Global Boundedness of Solutions to a Food Chain Model with Nonlinear Taxis Sensitivity","authors":"Enhui Pan,&nbsp;Changchun Liu","doi":"10.1007/s00245-024-10208-4","DOIUrl":"10.1007/s00245-024-10208-4","url":null,"abstract":"<div><p>In this paper, we investigate the initial boundary value problem of a three-species spatial food chain model with nonlinear taxis sensitivity in a bounded domain <span>(Omega subset {mathbb {R}}^2)</span> with a smooth boundary and homogeneous Neumann boundary conditions. By establishing a new energy functional, we demonstrate the global boundedness of classical solutions under appropriate initial data.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal Consumption–Investment with Constraints in a Regime Switching Market with Random Coefficients 随机系数制度交换市场中有约束的最优消费-投资
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-10 DOI: 10.1007/s00245-024-10203-9
Ying Hu, Xiaomin Shi, Zuo Quan Xu

This paper studies finite-time optimal consumption–investment problems with power, logarithmic and exponential utilities, in a regime switching market with random coefficients and possibly subject to non-convex constraints. Compared to the existing models, one distinguish feature of our model is that the trading constraints put on the consumption and investment strategies are coupled together in the cases of power and logarithmic utilities, leading to new technical challenges. We provide explicit optimal consumption–investment strategies and optimal values for these consumption–investment problems, which are expressed in terms of the solutions to some multi-dimensional diagonally quadratic backward stochastic differential equation (BSDE) and linear BSDE with unbound coefficients. These BSDEs are new in the literature and solving them is one of the main theoretical contributions of this paper. We accomplish the latter by applying the truncation, approximation technique to get some a priori uniformly lower and upper bounds for their solutions.

本文研究了具有随机系数且可能受非凸约束的状态切换市场中具有功率、对数和指数效用的有限时间最优消费-投资问题。与现有模型相比,我们的模型的一个显著特征是,在电力和对数公用事业的情况下,对消费和投资策略的交易约束是耦合在一起的,这导致了新的技术挑战。我们给出了这些消费-投资问题的显式最优消费-投资策略和最优值,这些最优值用一些多维对角二次倒向随机微分方程(BSDE)和无界系数线性BSDE的解来表示。这些bsde在文献中是新的,求解它们是本文的主要理论贡献之一。我们采用截断、近似的方法,得到了它们解的先验一致的下界和上界。
{"title":"Optimal Consumption–Investment with Constraints in a Regime Switching Market with Random Coefficients","authors":"Ying Hu,&nbsp;Xiaomin Shi,&nbsp;Zuo Quan Xu","doi":"10.1007/s00245-024-10203-9","DOIUrl":"10.1007/s00245-024-10203-9","url":null,"abstract":"<div><p>This paper studies finite-time optimal consumption–investment problems with power, logarithmic and exponential utilities, in a regime switching market with random coefficients and possibly subject to non-convex constraints. Compared to the existing models, one distinguish feature of our model is that the trading constraints put on the consumption and investment strategies are coupled together in the cases of power and logarithmic utilities, leading to new technical challenges. We provide explicit optimal consumption–investment strategies and optimal values for these consumption–investment problems, which are expressed in terms of the solutions to some multi-dimensional diagonally quadratic backward stochastic differential equation (BSDE) and linear BSDE with unbound coefficients. These BSDEs are new in the literature and solving them is one of the main theoretical contributions of this paper. We accomplish the latter by applying the truncation, approximation technique to get some a priori uniformly lower and upper bounds for their solutions.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimality Conditions for Parabolic Stochastic Optimal Control Problems with Boundary Controls 具有边界控制的抛物型随机最优控制问题的最优性条件
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-09 DOI: 10.1007/s00245-024-10204-8
Piero Visconti

In this paper, we study optimality conditions for a class of control problems driven by a cylindrical Wiener process, resulting in a stochastic maximum principle in differential form. The control acts on both the drift and volatility, potentially as unbounded operators, allowing for SPDEs with boundary control and/or noise. Through the factorization method, we establish a regularity property for the state equation, which, by duality, extends to the backward costate equation, understood in the transposition sense. Finally, we show that the cost functional is Gâteaux differentiable, with its derivative represented by the costate. The optimality condition is derived using results from set-valued analysis.

本文研究了一类由圆柱形Wiener过程驱动的控制问题的最优性条件,得到了微分形式的随机极大值原理。控制作用于漂移和波动,可能作为无界算子,允许具有边界控制和/或噪声的spde。通过因式分解方法,我们建立了状态方程的正则性,并通过对偶性将其推广到转置意义上的后向状态方程。最后,我们证明了代价泛函是g teaux可微的,其导数由代价状态表示。利用集值分析的结果,导出了最优性条件。
{"title":"Optimality Conditions for Parabolic Stochastic Optimal Control Problems with Boundary Controls","authors":"Piero Visconti","doi":"10.1007/s00245-024-10204-8","DOIUrl":"10.1007/s00245-024-10204-8","url":null,"abstract":"<div><p>In this paper, we study optimality conditions for a class of control problems driven by a cylindrical Wiener process, resulting in a stochastic maximum principle in differential form. The control acts on both the drift and volatility, potentially as unbounded operators, allowing for SPDEs with boundary control and/or noise. Through the factorization method, we establish a regularity property for the state equation, which, by duality, extends to the backward costate equation, understood in the transposition sense. Finally, we show that the cost functional is Gâteaux differentiable, with its derivative represented by the costate. The optimality condition is derived using results from set-valued analysis.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stopper vs. Singular Controller Games With Degenerate Diffusions 具有退化扩散的Stopper vs. Singular Controller Games
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-05 DOI: 10.1007/s00245-024-10199-2
Andrea Bovo, Tiziano De Angelis, Jan Palczewski

We study zero-sum stochastic games between a singular controller and a stopper when the (state-dependent) diffusion matrix of the underlying controlled diffusion process is degenerate. In particular, we show the existence of a value for the game and determine an optimal strategy for the stopper. The degeneracy of the dynamics prevents the use of analytical methods based on solution in Sobolev spaces of suitable variational problems. Therefore we adopt a probabilistic approach based on a perturbation of the underlying diffusion modulated by a parameter (gamma >0). For each (gamma >0) the approximating game is non-degenerate and admits a value (u^gamma ) and an optimal strategy (tau ^gamma _*) for the stopper. Letting (gamma rightarrow 0) we prove convergence of (u^gamma ) to a function v, which identifies the value of the original game. We also construct explicitly optimal stopping times (theta ^gamma _*) for (u^gamma ), related but not equal to (tau ^gamma _*), which converge almost surely to an optimal stopping time (theta _*) for the game with degenerate dynamics.

研究了当受控扩散过程的(状态相关)扩散矩阵退化时,奇异控制器与阻塞器之间的零和随机对策。特别地,我们展示了游戏的存在值,并确定了一个最优的策略。动力学的退化性阻碍了基于Sobolev空间中合适变分问题解的解析方法的使用。因此,我们采用一种基于由参数(gamma >0)调制的底层扩散扰动的概率方法。对于每个(gamma >0),近似对策是非退化的,并承认一个值(u^gamma )和一个最优策略(tau ^gamma _*)。让(gamma rightarrow 0)我们证明(u^gamma )收敛于一个函数v,它识别了原始游戏的值。我们还明确构造了(u^gamma )的最优停止时间(theta ^gamma _*),与(tau ^gamma _*)相关但不等于,对于具有退化动力学的博弈,它几乎肯定收敛于最优停止时间(theta _*)。
{"title":"Stopper vs. Singular Controller Games With Degenerate Diffusions","authors":"Andrea Bovo,&nbsp;Tiziano De Angelis,&nbsp;Jan Palczewski","doi":"10.1007/s00245-024-10199-2","DOIUrl":"10.1007/s00245-024-10199-2","url":null,"abstract":"<div><p>We study zero-sum stochastic games between a singular controller and a stopper when the (state-dependent) diffusion matrix of the underlying controlled diffusion process is degenerate. In particular, we show the existence of a value for the game and determine an optimal strategy for the stopper. The degeneracy of the dynamics prevents the use of analytical methods based on solution in Sobolev spaces of suitable variational problems. Therefore we adopt a probabilistic approach based on a perturbation of the underlying diffusion modulated by a parameter <span>(gamma &gt;0)</span>. For each <span>(gamma &gt;0)</span> the approximating game is non-degenerate and admits a value <span>(u^gamma )</span> and an optimal strategy <span>(tau ^gamma _*)</span> for the stopper. Letting <span>(gamma rightarrow 0)</span> we prove convergence of <span>(u^gamma )</span> to a function <i>v</i>, which identifies the value of the original game. We also construct explicitly optimal stopping times <span>(theta ^gamma _*)</span> for <span>(u^gamma )</span>, related but not equal to <span>(tau ^gamma _*)</span>, which converge almost surely to an optimal stopping time <span>(theta _*)</span> for the game with degenerate dynamics.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10199-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142778199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Almost Sectorial Operators in Fractional Superdiffusion Equations 分数阶超扩散方程中的概扇区算子
IF 1.6 2区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-12-02 DOI: 10.1007/s00245-024-10201-x
Eduardo Cuesta, Rodrigo Ponce

In this paper the resolvent family ({S_{alpha ,beta }(t)}_{tge 0}subset mathcal {L}(X,Y)) generated by an almost sectorial operator A,  where (alpha ,beta >0,) XY are complex Banach spaces and its Laplace transform satisfies (hat{S}_{alpha ,beta }(z)=z^{alpha -beta }(z^alpha -A)^{-1}) is studied. This family of operators allows to write the solution to an abstract initial value problem of time fractional type of order (1<alpha <2) as a variation of constants formula. Estimates of the norm (Vert S_{alpha ,beta }(t)Vert ,) as well as the continuity and compactness of (S_{alpha ,beta }(t)), for (t>0), are shown. Moreover, the Hölder regularity of its solutions is also studied.

本文研究了由近似扇形算子A生成的解族({S_{alpha ,beta }(t)}_{tge 0}subset mathcal {L}(X,Y)),其中(alpha ,beta >0,) X, Y为复巴那赫空间,其拉普拉斯变换满足(hat{S}_{alpha ,beta }(z)=z^{alpha -beta }(z^alpha -A)^{-1})。该算子族允许将阶为(1<alpha <2)的抽象时间分数型初值问题的解写成常数公式的变化形式。给出了对(t>0)的范数(Vert S_{alpha ,beta }(t)Vert ,)以及(S_{alpha ,beta }(t))的连续性和紧致性的估计。此外,还研究了其解的Hölder正则性。
{"title":"Almost Sectorial Operators in Fractional Superdiffusion Equations","authors":"Eduardo Cuesta,&nbsp;Rodrigo Ponce","doi":"10.1007/s00245-024-10201-x","DOIUrl":"10.1007/s00245-024-10201-x","url":null,"abstract":"<div><p>In this paper the resolvent family <span>({S_{alpha ,beta }(t)}_{tge 0}subset mathcal {L}(X,Y))</span> generated by an almost sectorial operator <i>A</i>,  where <span>(alpha ,beta &gt;0,)</span> <i>X</i>, <i>Y</i> are complex Banach spaces and its Laplace transform satisfies <span>(hat{S}_{alpha ,beta }(z)=z^{alpha -beta }(z^alpha -A)^{-1})</span> is studied. This family of operators allows to write the solution to an abstract initial value problem of time fractional type of order <span>(1&lt;alpha &lt;2)</span> as a variation of constants formula. Estimates of the norm <span>(Vert S_{alpha ,beta }(t)Vert ,)</span> as well as the continuity and compactness of <span>(S_{alpha ,beta }(t))</span>, for <span>(t&gt;0)</span>, are shown. Moreover, the Hölder regularity of its solutions is also studied.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142761958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Mathematics and Optimization
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1