{"title":"Stochastic Approximation for Multi-period Simulation Optimization with Streaming Input Data","authors":"Linyun He, U. Shanbhag, Eunhye Song","doi":"10.1145/3617595","DOIUrl":null,"url":null,"abstract":"We consider a continuous-valued simulation optimization (SO) problem, where a simulator is built to optimize an expected performance measure of a real-world system while parameters of the simulator are estimated from streaming data collected periodically from the system. At each period, a new batch of data is combined with the cumulative data and the parameters are re-estimated with higher precision. The system requires the decision variable to be selected in all periods. Therefore, it is sensible for the decision-maker to update the decision variable at each period by solving a more precise SO problem with the updated parameter estimate to reduce the performance loss with respect to the target system. We define this decision-making process as the multi-period SO problem and introduce a multi-period stochastic approximation (SA) framework that generates a sequence of solutions. Two algorithms are proposed: Re-start SA (ReSA) reinitializes the stepsize sequence in each period, whereas Warm-start SA (WaSA) carefully tunes the stepsizes, taking both fewer and shorter gradient-descent steps in later periods as parameter estimates become increasingly more precise. We show that under suitable strong convexity and regularity conditions, ReSA and WaSA achieve the best possible convergence rate in expected sub-optimality either when an unbiased or a simultaneous perturbation gradient estimator is employed, while WaSA accrues significantly lower computational cost as the number of periods increases. In addition, we present the regularized ReSA which obviates the need to know the strong convexity constant and achieves the same convergence rate at the expense of additional computation.","PeriodicalId":50943,"journal":{"name":"ACM Transactions on Modeling and Computer Simulation","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Modeling and Computer Simulation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3617595","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Citations: 0
Abstract
We consider a continuous-valued simulation optimization (SO) problem, where a simulator is built to optimize an expected performance measure of a real-world system while the parameters of the simulator are estimated from streaming data collected periodically from the system. At each period, a new batch of data is combined with the cumulative data and the parameters are re-estimated with higher precision. The system requires a decision variable to be selected in every period. Therefore, it is sensible for the decision-maker to update the decision variable at each period by solving a more precise SO problem with the updated parameter estimate, thereby reducing the performance loss with respect to the target system. We define this decision-making process as the multi-period SO problem and introduce a multi-period stochastic approximation (SA) framework that generates a sequence of solutions. Two algorithms are proposed: Re-start SA (ReSA) reinitializes the stepsize sequence in each period, whereas Warm-start SA (WaSA) carefully tunes the stepsizes, taking both fewer and shorter gradient-descent steps in later periods as the parameter estimates become increasingly precise. We show that under suitable strong convexity and regularity conditions, ReSA and WaSA achieve the best possible convergence rate in expected sub-optimality when either an unbiased or a simultaneous perturbation gradient estimator is employed, while WaSA incurs significantly lower computational cost as the number of periods increases. In addition, we present regularized ReSA, which obviates the need to know the strong convexity constant and achieves the same convergence rate at the expense of additional computation.
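The abstract only sketches how the two stepsize strategies differ, so the toy Python loop below illustrates the general idea under stated assumptions: the function names (multi_period_sa, noisy_gradient), the quadratic objective, and the generic 1/k stepsize are illustrative placeholders, not the paper's actual ReSA/WaSA tunings or its gradient estimators.

```python
import numpy as np

def noisy_gradient(x, theta_hat, rng):
    # Placeholder stochastic gradient at solution x under the current
    # parameter estimate theta_hat; a toy quadratic objective with noise
    # stands in for the paper's unbiased or simultaneous perturbation estimators.
    return 2.0 * (x - theta_hat) + rng.normal(scale=0.5, size=x.shape)

def multi_period_sa(theta_estimates, x0, steps_per_period=100, warm_start=False, seed=0):
    """Run SA across periods; each period uses a refined parameter estimate.

    warm_start=False mimics the ReSA idea: the stepsize counter restarts each period.
    warm_start=True mimics the WaSA idea: the counter carries over, so later periods
    take effectively shorter steps as the estimates become more precise.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    global_k = 0
    solutions = []
    for theta_hat in theta_estimates:            # one entry per data period
        for k in range(1, steps_per_period + 1):
            global_k += 1
            counter = global_k if warm_start else k
            gamma = 1.0 / counter                # generic 1/k stepsize (assumption)
            x = x - gamma * noisy_gradient(x, theta_hat, rng)
        solutions.append(x.copy())               # decision deployed in this period
    return solutions

# Example: parameter estimates converging toward a "true" value of 1.0 over 5 periods.
estimates = [np.array([1.0 + 1.0 / np.sqrt(t)]) for t in range(1, 6)]
print(multi_period_sa(estimates, x0=np.zeros(1), warm_start=True))
```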
Journal description:
The ACM Transactions on Modeling and Computer Simulation (TOMACS) provides a single archival source for the publication of high-quality research and developmental results referring to all phases of the modeling and simulation life cycle. The subjects of emphasis are discrete event simulation, combined discrete and continuous simulation, as well as Monte Carlo methods.
The use of simulation techniques is pervasive, extending to virtually all the sciences. TOMACS serves to enhance the understanding, improve the practice, and increase the utilization of computer simulation. Submissions should contribute to the realization of these objectives, and papers treating applications should stress their contributions vis-à-vis these objectives.