{"title":"Average Cost Brownian Drift Control with Proportional Changeover Costs","authors":"John H. Vande Vate","doi":"10.1287/stsy.2021.0071","DOIUrl":null,"url":null,"abstract":"This paper considers the problem of optimally controlling the drift of a Brownian motion with a finite set of possible drift rates so as to minimize the long-run average cost, consisting of fixed costs for changing the drift rate, processing costs for maintaining the drift rate, holding costs on the state of the process, and costs for instantaneous controls to keep the process within a prescribed range. We show that, under mild assumptions on the processing costs and the fixed costs for changing the drift rate, there is a strongly ordered optimal policy, that is, an optimal policy that limits the use of each drift rate to a single interval; when the process reaches the upper limit of that interval, the policy either changes to the next lower drift rate deterministically or resorts to instantaneous controls to keep the process within the prescribed range, and when the process reaches the lower limit of the interval, the policy either changes to the next higher drift rate deterministically or again resorts to instantaneous controls to keep the process within the prescribed range. We prove the optimality of such a policy by constructing smooth relative value functions satisfying the associated simplified optimality criteria. This paper shows that, under the proportional changeover cost assumption, each drift rate is active in at most one contiguous range and that the transitions between drift rates are strongly ordered. The results reduce the complexity of proving the optimality of such a policy by proving the existence of optimal relative value functions that constitute a nondecreasing sequence of functions. As a consequence, the constructive arguments lead to a practical procedure for solving the problem that is tens of thousands of times faster than previously reported methods.","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/stsy.2021.0071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 4
Abstract
This paper considers the problem of optimally controlling the drift of a Brownian motion with a finite set of possible drift rates so as to minimize the long-run average cost, consisting of fixed costs for changing the drift rate, processing costs for maintaining the drift rate, holding costs on the state of the process, and costs for instantaneous controls to keep the process within a prescribed range. We show that, under mild assumptions on the processing costs and the fixed costs for changing the drift rate, there is a strongly ordered optimal policy, that is, an optimal policy that limits the use of each drift rate to a single interval; when the process reaches the upper limit of that interval, the policy either changes to the next lower drift rate deterministically or resorts to instantaneous controls to keep the process within the prescribed range, and when the process reaches the lower limit of the interval, the policy either changes to the next higher drift rate deterministically or again resorts to instantaneous controls to keep the process within the prescribed range. We prove the optimality of such a policy by constructing smooth relative value functions satisfying the associated simplified optimality criteria. This paper shows that, under the proportional changeover cost assumption, each drift rate is active in at most one contiguous range and that the transitions between drift rates are strongly ordered. The results reduce the complexity of proving the optimality of such a policy by proving the existence of optimal relative value functions that constitute a nondecreasing sequence of functions. As a consequence, the constructive arguments lead to a practical procedure for solving the problem that is tens of thousands of times faster than previously reported methods.