首页 > 最新文献

INFORMS journal on optimization最新文献

英文 中文
A Stochastic Inexact Sequential Quadratic Optimization Algorithm for Nonlinear Equality-Constrained Optimization 非线性等式约束优化的随机非精确序列二次优化算法
Pub Date : 2024-05-24 DOI: 10.1287/ijoo.2022.0008
Frank E. Curtis, Daniel P. Robinson, Baoyu Zhou
A stochastic algorithm is proposed, analyzed, and tested experimentally for solving continuous optimization problems with nonlinear equality constraints. It is assumed that constraint function and derivative values can be computed but that only stochastic approximations are available for the objective function and its derivatives. The algorithm is of the sequential quadratic optimization variety. Distinguishing features of the algorithm are that it only employs stochastic objective gradient estimates that satisfy a relatively weak set of assumptions (while using neither objective function values nor estimates of them) and that it allows inexact subproblem solutions to be employed, the latter of which is particularly useful in large-scale settings when the matrices defining the subproblems are too large to form and/or factorize. Conditions are imposed on the inexact subproblem solutions that account for the fact that only stochastic objective gradient estimates are employed. Convergence results are established for the method. Numerical experiments show that the proposed method vastly outperforms a stochastic subgradient method and can outperform an alternative sequential quadratic programming algorithm that employs highly accurate subproblem solutions in every iteration. Funding: This material is based upon work supported by the National Science Foundation [Awards CCF-1740796 and CCF-2139735] and the Office of Naval Research [Award N00014-21-1-2532].
本文提出了一种随机算法,并对其进行了分析和实验测试,以解决具有非线性相等约束条件的连续优化问题。假设可以计算约束函数和导数值,但目标函数及其导数只有随机近似值。该算法属于顺序二次优化算法。该算法的显著特点是,它只采用满足一组相对较弱假设的随机目标梯度估计值(同时既不使用目标函数值,也不使用其估计值),而且允许采用不精确的子问题解决方案,后者在大规模环境中特别有用,因为定义子问题的矩阵太大,无法形成和/或因式分解。考虑到只采用随机目标梯度估计,对非精确子问题解施加了条件。确定了该方法的收敛结果。数值实验表明,所提出的方法大大优于随机子梯度方法,并且优于在每次迭代中都采用高精度子问题解的另一种顺序二次编程算法。资助:本资料基于美国国家科学基金会 [CCF-1740796 和 CCF-2139735] 以及海军研究办公室 [N00014-21-1-2532] 的资助。
{"title":"A Stochastic Inexact Sequential Quadratic Optimization Algorithm for Nonlinear Equality-Constrained Optimization","authors":"Frank E. Curtis, Daniel P. Robinson, Baoyu Zhou","doi":"10.1287/ijoo.2022.0008","DOIUrl":"https://doi.org/10.1287/ijoo.2022.0008","url":null,"abstract":"A stochastic algorithm is proposed, analyzed, and tested experimentally for solving continuous optimization problems with nonlinear equality constraints. It is assumed that constraint function and derivative values can be computed but that only stochastic approximations are available for the objective function and its derivatives. The algorithm is of the sequential quadratic optimization variety. Distinguishing features of the algorithm are that it only employs stochastic objective gradient estimates that satisfy a relatively weak set of assumptions (while using neither objective function values nor estimates of them) and that it allows inexact subproblem solutions to be employed, the latter of which is particularly useful in large-scale settings when the matrices defining the subproblems are too large to form and/or factorize. Conditions are imposed on the inexact subproblem solutions that account for the fact that only stochastic objective gradient estimates are employed. Convergence results are established for the method. Numerical experiments show that the proposed method vastly outperforms a stochastic subgradient method and can outperform an alternative sequential quadratic programming algorithm that employs highly accurate subproblem solutions in every iteration. Funding: This material is based upon work supported by the National Science Foundation [Awards CCF-1740796 and CCF-2139735] and the Office of Naval Research [Award N00014-21-1-2532].","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":"5 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141100955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scenario-Based Robust Optimization for Two-Stage Decision Making Under Binary Uncertainty 二元不确定性下基于场景的两阶段决策鲁棒优化
Pub Date : 2023-10-31 DOI: 10.1287/ijoo.2020.0038
Kai Wang, Mehmet Aydemir, Alexandre Jacquillat
This paper addresses problems of two-stage optimization under binary uncertainty. We define a scenario-based robust optimization (ScRO) formulation that combines principles of stochastic optimization (by constructing probabilistic scenarios) and robust optimization (by protecting against adversarial perturbations within discrete uncertainty sets). To solve it, we develop a sparse row generation algorithm that iterates between a master problem (which provides a lower bound based on minimal uncertainty sets) and a history-based subproblem (which generates an upper bound and updates minimal uncertainty sets). We generate scenarios and uncertainty sets from element-wise probabilities using a deviation likelihood method or from historical samples using a sample clustering approach. Using public data sets, results suggest that (i) our ScRO formulation outperforms benchmarks based on deterministic, stochastic, and robust optimization; (ii) our deviation likelihood and sample clustering approaches outperform scenario generation baselines; and (iii) our sparse row generation algorithm outperforms off-the-shelf implementation and state-of-the-art cutting plane benchmarks. An application to a real-world ambulance dispatch case study suggests that the proposed modeling and algorithmic approach can reduce the number of late responses by more than 25%. Funding: K. Wang’s research was supported by the National Natural Science Foundation of China [Grants 72322002, 52221005 and 52220105001]
本文研究了二元不确定性下的两阶段优化问题。我们定义了一个基于场景的鲁棒优化(ScRO)公式,该公式结合了随机优化(通过构建概率场景)和鲁棒优化(通过防止离散不确定性集中的对抗性扰动)的原理。为了解决这个问题,我们开发了一种稀疏行生成算法,该算法在主问题(基于最小不确定性集提供下界)和基于历史的子问题(生成上界并更新最小不确定性集)之间迭代。我们使用偏差似然方法从元素概率或使用样本聚类方法从历史样本中生成场景和不确定性集。使用公共数据集,结果表明:(i)我们的ScRO公式优于基于确定性、随机和鲁棒优化的基准;(ii)我们的偏差似然和样本聚类方法优于情景生成基线;(iii)我们的稀疏行生成算法优于现成的实现和最先进的切割平面基准。应用于现实世界的救护车调度案例研究表明,所提出的建模和算法方法可以减少延迟响应的数量超过25%。基金资助:王k .国家自然科学基金资助项目[no . 72322002,52221005和52220105001]
{"title":"Scenario-Based Robust Optimization for Two-Stage Decision Making Under Binary Uncertainty","authors":"Kai Wang, Mehmet Aydemir, Alexandre Jacquillat","doi":"10.1287/ijoo.2020.0038","DOIUrl":"https://doi.org/10.1287/ijoo.2020.0038","url":null,"abstract":"This paper addresses problems of two-stage optimization under binary uncertainty. We define a scenario-based robust optimization (ScRO) formulation that combines principles of stochastic optimization (by constructing probabilistic scenarios) and robust optimization (by protecting against adversarial perturbations within discrete uncertainty sets). To solve it, we develop a sparse row generation algorithm that iterates between a master problem (which provides a lower bound based on minimal uncertainty sets) and a history-based subproblem (which generates an upper bound and updates minimal uncertainty sets). We generate scenarios and uncertainty sets from element-wise probabilities using a deviation likelihood method or from historical samples using a sample clustering approach. Using public data sets, results suggest that (i) our ScRO formulation outperforms benchmarks based on deterministic, stochastic, and robust optimization; (ii) our deviation likelihood and sample clustering approaches outperform scenario generation baselines; and (iii) our sparse row generation algorithm outperforms off-the-shelf implementation and state-of-the-art cutting plane benchmarks. An application to a real-world ambulance dispatch case study suggests that the proposed modeling and algorithmic approach can reduce the number of late responses by more than 25%. Funding: K. Wang’s research was supported by the National Natural Science Foundation of China [Grants 72322002, 52221005 and 52220105001]","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":"119 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135872794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Hardness of Learning from Censored and Nonstationary Demand 从删减和非平稳需求中学习的困难
Pub Date : 2023-09-28 DOI: 10.1287/ijoo.2022.0017
Gábor Lugosi, Mihalis G. Markakis, Gergely Neu
We consider a repeated newsvendor problem in which the inventory manager has no prior information about the demand and can access only censored/sales data. In analogy to multiarmed bandit problems, the manager needs to simultaneously “explore” and “exploit” with inventory decisions in order to minimize the cumulative cost. Our goal is to understand the hardness of the problem disentangled from any probabilistic assumptions on the demand sequence—importantly, independence or time stationarity—and, correspondingly, to develop policies that perform well with respect to the regret criterion. We design a cost estimator that is tailored to the special structure of the censoring problem, and we show that, if coupled with the classic exponentially weighted forecaster, it achieves optimal scaling of the expected regret (up to logarithmic factors) with respect to both the number of time periods and available actions. This result also leads to two important insights: the benefit from “information stalking” as well as the cost of censoring are both negligible, at least in terms of the regret. We demonstrate the flexibility of our technique by combining it with the fixed share forecaster to provide strong guarantees in terms of tracking regret, a powerful notion of regret that uses a large class of time-varying action sequences as benchmark. Numerical experiments suggest that the resulting policy outperforms existing policies (that are tailored to or facilitated by time stationarity) on nonstationary demand models with time-varying noise, trend, and seasonality components. Finally, we consider the “combinatorial” version of the repeated newsvendor problem, that is, single-warehouse, multiretailer inventory management of a perishable product. We extend the proposed approach so that, again, it achieves near-optimal performance in terms of the regret. Funding: G. Lugosi was supported by the Spanish Ministry of Economy, Industry and Competitiveness [Grant MTM2015-67304-P (AEI/FEDER, UE)]. M. G. Markakis was supported by the Spanish Ministry of Economy and Competitiveness [Grant ECO2016-75905-R (AEI/FEDER, UE)] and a Juan de la Cierva fellowship as well as the Spanish Ministry of Science and Innovation through a Ramón y Cajal fellowship. G. Neu was supported by the UPFellows Fellowship (Marie Curie COFUND program) [Grant 600387]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/ijoo.2022.0017 .
我们考虑一个重复报贩问题,其中库存管理人员没有关于需求的先验信息,只能访问审查/销售数据。与多武装盗匪问题类似,管理者需要同时“探索”和“利用”库存决策,以最小化累积成本。我们的目标是理解从需求序列的任何概率假设(重要的是独立性或时间平稳性)中解脱出来的问题的难度,并相应地制定相对于后悔标准表现良好的策略。我们设计了一个针对审查问题的特殊结构量身定制的成本估计器,并且我们表明,如果与经典的指数加权预测器相结合,它可以根据时间周期和可用操作的数量实现预期后悔(高达对数因子)的最佳缩放。这个结果还带来了两个重要的启示:“信息跟踪”的好处和审查的成本都可以忽略不计,至少在后悔方面是这样。我们通过将其与固定份额预测器相结合来展示我们技术的灵活性,从而在跟踪后悔方面提供强有力的保证,这是一种使用大量时变动作序列作为基准的强大的后悔概念。数值实验表明,在具有时变噪声、趋势和季节性成分的非平稳需求模型上,所得政策优于现有政策(根据时间平稳性量身定制或促进)。最后,我们考虑重复报贩问题的“组合”版本,即易腐产品的单仓库、多零售商库存管理。我们扩展了所提出的方法,因此,再一次,它在遗憾方面达到了接近最佳的性能。资助:G. Lugosi由西班牙经济、工业和竞争力部资助[Grant MTM2015-67304-P (AEI/FEDER, UE)]。m.g. Markakis得到了西班牙经济与竞争力部[Grant ECO2016-75905-R (AEI/FEDER, UE)]、Juan de la Cierva奖学金以及西班牙科学与创新部Ramón y Cajal奖学金的支持。G. Neu得到了UPFellows Fellowship (Marie Curie COFUND program)的资助[Grant 600387]。补充材料:电子伴侣可在https://doi.org/10.1287/ijoo.2022.0017上获得。
{"title":"On the Hardness of Learning from Censored and Nonstationary Demand","authors":"Gábor Lugosi, Mihalis G. Markakis, Gergely Neu","doi":"10.1287/ijoo.2022.0017","DOIUrl":"https://doi.org/10.1287/ijoo.2022.0017","url":null,"abstract":"We consider a repeated newsvendor problem in which the inventory manager has no prior information about the demand and can access only censored/sales data. In analogy to multiarmed bandit problems, the manager needs to simultaneously “explore” and “exploit” with inventory decisions in order to minimize the cumulative cost. Our goal is to understand the hardness of the problem disentangled from any probabilistic assumptions on the demand sequence—importantly, independence or time stationarity—and, correspondingly, to develop policies that perform well with respect to the regret criterion. We design a cost estimator that is tailored to the special structure of the censoring problem, and we show that, if coupled with the classic exponentially weighted forecaster, it achieves optimal scaling of the expected regret (up to logarithmic factors) with respect to both the number of time periods and available actions. This result also leads to two important insights: the benefit from “information stalking” as well as the cost of censoring are both negligible, at least in terms of the regret. We demonstrate the flexibility of our technique by combining it with the fixed share forecaster to provide strong guarantees in terms of tracking regret, a powerful notion of regret that uses a large class of time-varying action sequences as benchmark. Numerical experiments suggest that the resulting policy outperforms existing policies (that are tailored to or facilitated by time stationarity) on nonstationary demand models with time-varying noise, trend, and seasonality components. Finally, we consider the “combinatorial” version of the repeated newsvendor problem, that is, single-warehouse, multiretailer inventory management of a perishable product. We extend the proposed approach so that, again, it achieves near-optimal performance in terms of the regret. Funding: G. Lugosi was supported by the Spanish Ministry of Economy, Industry and Competitiveness [Grant MTM2015-67304-P (AEI/FEDER, UE)]. M. G. Markakis was supported by the Spanish Ministry of Economy and Competitiveness [Grant ECO2016-75905-R (AEI/FEDER, UE)] and a Juan de la Cierva fellowship as well as the Spanish Ministry of Science and Innovation through a Ramón y Cajal fellowship. G. Neu was supported by the UPFellows Fellowship (Marie Curie COFUND program) [Grant 600387]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/ijoo.2022.0017 .","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135386695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Temporal Bin Packing with Half-Capacity Jobs 具有半容量作业的临时装箱
Pub Date : 2023-09-25 DOI: 10.1287/ijoo.2023.0002
Christopher Muir, Luke Marshall, Alejandro Toriello
Motivated by applications in cloud computing, we study a temporal bin packing problem with jobs that occupy half of a bin’s capacity. An instance is given by a set of jobs, each with a start and end time during which it must be processed (i.e., assigned to a bin). A bin can accommodate two jobs simultaneously, and the objective is an assignment that minimizes the time-averaged number of open or active bins over the horizon; this problem is known to be NP hard. We demonstrate that a well-known “static” lower bound may have a significant gap even in relatively simple instances, which motivates us to introduce a novel combinatorial lower bound and an integer programming formulation, both based on an interpretation of the model as a series of connected matching problems. We theoretically compare the static bound, the new matching-based bounds, and various linear programming bounds. We perform a computational study using both synthetic and application-based instances and show that our bounds offer significant improvement over existing methods, particularly for sparse instances. Funding: This work was supported by the National Science Foundation [Grants CMMI-1552479 and NSF GRFP]. Supplemental Material: The online appendix is available at https://doi.org/10.1287/ijoo.2023.0002 .
受云计算应用的启发,我们研究了一个作业占用垃圾箱容量一半的临时装箱问题。实例由一组作业给出,每个作业都有一个开始和结束时间,在此期间必须对其进行处理(即,分配给一个bin)。一个垃圾箱可以同时容纳两个作业,目标是最小化打开或活动垃圾箱的时间平均数量;这个问题被称为NP困难。我们证明,即使在相对简单的情况下,众所周知的“静态”下界也可能有显著的差距,这促使我们引入一个新的组合下界和一个整数规划公式,两者都基于将模型解释为一系列相连的匹配问题。我们从理论上比较了静态界、新的基于匹配的界和各种线性规划界。我们使用合成实例和基于应用程序的实例进行计算研究,并表明我们的边界比现有方法提供了显着改进,特别是对于稀疏实例。资助:本研究由美国国家科学基金会支持[赠款CMMI-1552479和NSF GRFP]。补充材料:在线附录可在https://doi.org/10.1287/ijoo.2023.0002上获得。
{"title":"Temporal Bin Packing with Half-Capacity Jobs","authors":"Christopher Muir, Luke Marshall, Alejandro Toriello","doi":"10.1287/ijoo.2023.0002","DOIUrl":"https://doi.org/10.1287/ijoo.2023.0002","url":null,"abstract":"Motivated by applications in cloud computing, we study a temporal bin packing problem with jobs that occupy half of a bin’s capacity. An instance is given by a set of jobs, each with a start and end time during which it must be processed (i.e., assigned to a bin). A bin can accommodate two jobs simultaneously, and the objective is an assignment that minimizes the time-averaged number of open or active bins over the horizon; this problem is known to be NP hard. We demonstrate that a well-known “static” lower bound may have a significant gap even in relatively simple instances, which motivates us to introduce a novel combinatorial lower bound and an integer programming formulation, both based on an interpretation of the model as a series of connected matching problems. We theoretically compare the static bound, the new matching-based bounds, and various linear programming bounds. We perform a computational study using both synthetic and application-based instances and show that our bounds offer significant improvement over existing methods, particularly for sparse instances. Funding: This work was supported by the National Science Foundation [Grants CMMI-1552479 and NSF GRFP]. Supplemental Material: The online appendix is available at https://doi.org/10.1287/ijoo.2023.0002 .","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135817386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Bisection Protocol for Political Redistricting 一种用于政治选区重新划分的二分协议
Pub Date : 2023-07-01 DOI: 10.1287/ijoo.2022.0084
Ian G. Ludden, Rahul Swamy, D. King, S. Jacobson
The authors conceived of the bisection protocol during a research meeting discussing recent political redistricting literature, in particular, the I-cut-you-freeze protocol preprint. After establishing the theoretical results for the continuous nongeometric setting, they discussed ways to implement both protocols on real-world data, culminating in the Iowa case study and computational experiments with 17 other states.
作者在一次讨论最近政治选区重新划分文献的研究会议上构思了平分协议,特别是I-cut-you-freeze协议预印本。在确定了连续非几何设置的理论结果后,他们讨论了在真实世界数据上实现这两种协议的方法,最终在爱荷华州的案例研究和与其他17个州的计算实验中达到了高潮。
{"title":"A Bisection Protocol for Political Redistricting","authors":"Ian G. Ludden, Rahul Swamy, D. King, S. Jacobson","doi":"10.1287/ijoo.2022.0084","DOIUrl":"https://doi.org/10.1287/ijoo.2022.0084","url":null,"abstract":"The authors conceived of the bisection protocol during a research meeting discussing recent political redistricting literature, in particular, the I-cut-you-freeze protocol preprint. After establishing the theoretical results for the continuous nongeometric setting, they discussed ways to implement both protocols on real-world data, culminating in the Iowa case study and computational experiments with 17 other states.","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47544022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning for Commodity Procurement: Nonlinear Data-Driven Optimization of Hedging Decisions 商品采购的深度学习:非线性数据驱动的对冲决策优化
Pub Date : 2023-07-01 DOI: 10.1287/ijoo.2022.0086
Nicolas Busch, Tobias Crönert, Stefan Minner, Moritz Rettinger, Burakhan Sel
As the number of exchange-traded commodity contracts and their volatility increase, risk management through financial hedging gains importance for commodity-purchasing firms. Existing data-driven optimization approaches for hedging decisions include linear regression-based techniques. As such, they assume linear price–feature relationships and, thus, do not automatically detect nonlinear feature effects. We propose an alternative, nonlinear data-driven approach to commodity procurement based on deep learning. The prescriptive algorithm uses artificial neural networks to allow for universal approximation and requires no a priori knowledge regarding underlying price processes. We reformulate the periodic review procurement problem as a multilabel time series classification problem as the optimal bang-bang type procurement policy allows us to treat the hedging decision for each demand period as an individual subproblem that is independent of the other periods. Thereby, we are differentiating between optimal and suboptimal hedging decisions in each period and introduce a novel opportunity cost–sensitive loss function. We train maximum likelihood classifiers based on different deep learning architectures and test their performance in numerical experiments and case studies for natural gas, crude oil, nickel, and copper procurement. We show comparable performance to the state of the art for linear price–feature relationships and considerable advantages in the nonlinear case. Funding: Financial support received through the DFG as part of the AdONE GRK2201 [Grant 277991500] is gratefully acknowledged.
随着交易所交易的大宗商品合约数量及其波动性的增加,通过金融对冲进行风险管理对大宗商品采购公司来说变得越来越重要。现有的数据驱动的对冲决策优化方法包括基于线性回归的技术。因此,它们假设线性价格-特征关系,因此不能自动检测非线性特征效应。我们提出了一种基于深度学习的非线性数据驱动的商品采购方法。规定性算法使用人工神经网络来允许普遍近似,并且不需要关于基础价格过程的先验知识。我们将定期审查采购问题重新表述为一个多标签时间序列分类问题,因为最优的bang-bang型采购政策允许我们将每个需求期的对冲决策视为独立于其他时期的单个子问题。因此,我们区分了每个时期的最优和次优对冲决策,并引入了一个新的机会成本敏感损失函数。我们基于不同的深度学习架构训练最大似然分类器,并在天然气、原油、镍和铜采购的数值实验和案例研究中测试其性能。对于线性价格特征关系,我们展示了与最先进的性能相当的性能,并且在非线性情况下具有相当大的优势。资金:作为AdONE GRK2201 [Grant 277991500]的一部分,通过DFG获得的资金支持得到了感谢。
{"title":"Deep Learning for Commodity Procurement: Nonlinear Data-Driven Optimization of Hedging Decisions","authors":"Nicolas Busch, Tobias Crönert, Stefan Minner, Moritz Rettinger, Burakhan Sel","doi":"10.1287/ijoo.2022.0086","DOIUrl":"https://doi.org/10.1287/ijoo.2022.0086","url":null,"abstract":"As the number of exchange-traded commodity contracts and their volatility increase, risk management through financial hedging gains importance for commodity-purchasing firms. Existing data-driven optimization approaches for hedging decisions include linear regression-based techniques. As such, they assume linear price–feature relationships and, thus, do not automatically detect nonlinear feature effects. We propose an alternative, nonlinear data-driven approach to commodity procurement based on deep learning. The prescriptive algorithm uses artificial neural networks to allow for universal approximation and requires no a priori knowledge regarding underlying price processes. We reformulate the periodic review procurement problem as a multilabel time series classification problem as the optimal bang-bang type procurement policy allows us to treat the hedging decision for each demand period as an individual subproblem that is independent of the other periods. Thereby, we are differentiating between optimal and suboptimal hedging decisions in each period and introduce a novel opportunity cost–sensitive loss function. We train maximum likelihood classifiers based on different deep learning architectures and test their performance in numerical experiments and case studies for natural gas, crude oil, nickel, and copper procurement. We show comparable performance to the state of the art for linear price–feature relationships and considerable advantages in the nonlinear case. Funding: Financial support received through the DFG as part of the AdONE GRK2201 [Grant 277991500] is gratefully acknowledged.","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136309435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial Board 编辑委员会
Pub Date : 2023-07-01 DOI: 10.1287/ijoo.2023.eb.v5n3
{"title":"Editorial Board","authors":"","doi":"10.1287/ijoo.2023.eb.v5n3","DOIUrl":"https://doi.org/10.1287/ijoo.2023.eb.v5n3","url":null,"abstract":"","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43975707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Routing and Wavelength Assignment with Reinforcement Learning 基于强化学习的动态路由和波长分配
Pub Date : 2023-06-08 DOI: 10.1287/ijoo.2023.0092
Peyman Kafaei, Quentin Cappart, Nicolas Chapados, H. Pouya, Louis-Martin Rousseau
With the rapid developments in communication systems, and considering their dynamic nature, all-optical networks are becoming increasingly complex. This study proposes a novel method based on deep reinforcement learning for the routing and wavelength assignment problem in all-optical wavelength-decision-multiplexing networks. We consider dynamic incoming requests, in which their arrival and holding times are not known in advance. The objective is to devise a strategy that minimizes the number of rejected packages due to the lack of resources in the long term. We use graph neural networks to capture crucial latent information from the graph-structured input to develop the optimal strategy. The proposed deep reinforcement learning algorithm selects a route and a wavelength simultaneously for each incoming traffic connection as they arrive. The results demonstrate that the learned agent outperforms the methods used in practice and can be generalized on network topologies that did not participate in training.
随着通信系统的快速发展,考虑到其动态特性,全光网络变得越来越复杂。提出了一种基于深度强化学习的全光波长决策复用网络路由和波长分配问题的新方法。我们考虑动态传入请求,其中它们的到达和保持时间是未知的。目标是设计一种策略,以尽量减少由于长期缺乏资源而被拒绝的包的数量。我们使用图神经网络从图结构输入中捕获关键的潜在信息,以制定最优策略。提出的深度强化学习算法在每个进入的流量连接到达时同时选择路由和波长。结果表明,学习后的智能体优于实践中使用的方法,可以在未参与训练的网络拓扑上进行泛化。
{"title":"Dynamic Routing and Wavelength Assignment with Reinforcement Learning","authors":"Peyman Kafaei, Quentin Cappart, Nicolas Chapados, H. Pouya, Louis-Martin Rousseau","doi":"10.1287/ijoo.2023.0092","DOIUrl":"https://doi.org/10.1287/ijoo.2023.0092","url":null,"abstract":"With the rapid developments in communication systems, and considering their dynamic nature, all-optical networks are becoming increasingly complex. This study proposes a novel method based on deep reinforcement learning for the routing and wavelength assignment problem in all-optical wavelength-decision-multiplexing networks. We consider dynamic incoming requests, in which their arrival and holding times are not known in advance. The objective is to devise a strategy that minimizes the number of rejected packages due to the lack of resources in the long term. We use graph neural networks to capture crucial latent information from the graph-structured input to develop the optimal strategy. The proposed deep reinforcement learning algorithm selects a route and a wavelength simultaneously for each incoming traffic connection as they arrive. The results demonstrate that the learned agent outperforms the methods used in practice and can be generalized on network topologies that did not participate in training.","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42660672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geometric Analysis of Noisy Low-Rank Matrix Recovery in the Exact Parametrized and the Overparametrized Regimes 精确参数化和过参数化条件下噪声低秩矩阵恢复的几何分析
Pub Date : 2023-04-14 DOI: 10.1287/ijoo.2023.0090
Ziye Ma, Yingjie Bi, J. Lavaei, S. Sojoudi
The matrix sensing problem is an important low-rank optimization problem that has found a wide range of applications, such as matrix completion, phase synchornization/retrieval, robust principal component analysis (PCA), and power system state estimation. In this work, we focus on the general matrix sensing problem with linear measurements that are corrupted by random noise. We investigate the scenario where the search rank r is equal to the true rank [Formula: see text] of the unknown ground truth (the exact parametrized case), as well as the scenario where r is greater than [Formula: see text] (the overparametrized case). We quantify the role of the restricted isometry property (RIP) in shaping the landscape of the nonconvex factorized formulation and assisting with the success of local search algorithms. First, we develop a global guarantee on the maximum distance between an arbitrary local minimizer of the nonconvex problem and the ground truth under the assumption that the RIP constant is smaller than [Formula: see text]. We then present a local guarantee for problems with an arbitrary RIP constant, which states that any local minimizer is either considerably close to the ground truth or far away from it. More importantly, we prove that this noisy, overparametrized problem exhibits the strict saddle property, which leads to the global convergence of perturbed gradient descent algorithm in polynomial time. The results of this work provide a comprehensive understanding of the geometric landscape of the matrix sensing problem in the noisy and overparametrized regime. Funding: This work was supported by grants from the National Science Foundation, Office of Naval Research, Air Force Office of Scientific Research, and Army Research Office.
矩阵感知问题是一个重要的低阶优化问题,在矩阵完备、相位同步/检索、鲁棒主成分分析(PCA)和电力系统状态估计等方面有着广泛的应用。在这项工作中,我们专注于具有被随机噪声破坏的线性测量的一般矩阵传感问题。我们研究了搜索秩r等于未知基本事实的真实秩[公式:见文本]的情况(确切的参数化情况),以及r大于[公式:参见文本]的场景(过度框化情况)。我们量化了限制等距性质(RIP)在塑造非凸因子化公式的景观和帮助局部搜索算法取得成功方面的作用。首先,我们在假设RIP常数小于[公式:见正文]的情况下,对非凸问题的任意局部极小值与基本事实之间的最大距离进行了全局保证。然后,我们为具有任意RIP常数的问题提供了一个局部保证,该保证表明任何局部极小值要么非常接近地面实况,要么远离地面实况。更重要的是,我们证明了这个有噪声的、过框架化的问题表现出严格的鞍性质,这导致了扰动梯度下降算法在多项式时间内的全局收敛性。这项工作的结果提供了对噪声和过度帧化情况下矩阵传感问题的几何景观的全面理解。资助:这项工作得到了国家科学基金会、海军研究办公室、空军科学研究办公室和陆军研究办公室的资助。
{"title":"Geometric Analysis of Noisy Low-Rank Matrix Recovery in the Exact Parametrized and the Overparametrized Regimes","authors":"Ziye Ma, Yingjie Bi, J. Lavaei, S. Sojoudi","doi":"10.1287/ijoo.2023.0090","DOIUrl":"https://doi.org/10.1287/ijoo.2023.0090","url":null,"abstract":"The matrix sensing problem is an important low-rank optimization problem that has found a wide range of applications, such as matrix completion, phase synchornization/retrieval, robust principal component analysis (PCA), and power system state estimation. In this work, we focus on the general matrix sensing problem with linear measurements that are corrupted by random noise. We investigate the scenario where the search rank r is equal to the true rank [Formula: see text] of the unknown ground truth (the exact parametrized case), as well as the scenario where r is greater than [Formula: see text] (the overparametrized case). We quantify the role of the restricted isometry property (RIP) in shaping the landscape of the nonconvex factorized formulation and assisting with the success of local search algorithms. First, we develop a global guarantee on the maximum distance between an arbitrary local minimizer of the nonconvex problem and the ground truth under the assumption that the RIP constant is smaller than [Formula: see text]. We then present a local guarantee for problems with an arbitrary RIP constant, which states that any local minimizer is either considerably close to the ground truth or far away from it. More importantly, we prove that this noisy, overparametrized problem exhibits the strict saddle property, which leads to the global convergence of perturbed gradient descent algorithm in polynomial time. The results of this work provide a comprehensive understanding of the geometric landscape of the matrix sensing problem in the noisy and overparametrized regime. Funding: This work was supported by grants from the National Science Foundation, Office of Naval Research, Air Force Office of Scientific Research, and Army Research Office.","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46590619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Data-Driven Distributionally Robust Optimization over Time 随时间变化的数据驱动分布式鲁棒优化
Pub Date : 2023-04-11 DOI: 10.1287/ijoo.2023.0091
Kevin-Martin Aigner, Andreas Bärmann, Kristin Braun, F. Liers, S. Pokutta, Oskar Schneider, Kartikey Sharma, Sebastian Tschuppik
Stochastic optimization (SO) is a classical approach for optimization under uncertainty that typically requires knowledge about the probability distribution of uncertain parameters. Because the latter is often unknown, distributionally robust optimization (DRO) provides a strong alternative that determines the best guaranteed solution over a set of distributions (ambiguity set). In this work, we present an approach for DRO over time that uses online learning and scenario observations arriving as a data stream to learn more about the uncertainty. Our robust solutions adapt over time and reduce the cost of protection with shrinking ambiguity. For various kinds of ambiguity sets, the robust solutions converge to the SO solution. Our algorithm achieves the optimization and learning goals without solving the DRO problem exactly at any step. We also provide a regret bound for the quality of the online strategy that converges at a rate of [Formula: see text], where T is the number of iterations. Furthermore, we illustrate the effectiveness of our procedure by numerical experiments on mixed-integer optimization instances from popular benchmark libraries and give practical examples stemming from telecommunications and routing. Our algorithm is able to solve the DRO over time problem significantly faster than standard reformulations. Funding: This work was supported by Deutsche Forschungsgemeinschaft (DFG): Projects B06 and B10 in CRC TRR 154 and Project-ID 416229255 - SFB 1411 and Federal Ministry for Economic Affairs and Energy, Germany [Grant 03EI1036A]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/ijoo.2023.0091 .
随机优化(SO)是一种在不确定条件下进行优化的经典方法,通常需要了解不确定参数的概率分布。由于后者通常是未知的,分布式鲁棒优化(DRO)提供了一个强大的替代方案,可以确定一组分布(模糊集)上的最佳保证解。在这项工作中,我们提出了一种随时间变化的DRO方法,该方法使用在线学习和作为数据流到达的场景观察来了解更多关于不确定性的信息。我们强大的解决方案会随着时间的推移进行调整,并在减少模糊性的情况下降低保护成本。对于各种模糊集,鲁棒解收敛到SO解。我们的算法实现了优化和学习目标,而不需要在任何步骤精确地解决DRO问题。我们还为在线策略的质量提供了一个遗憾界,该策略以[公式:见正文]的速率收敛,其中T是迭代次数。此外,我们通过对来自流行基准库的混合整数优化实例的数值实验来说明我们的过程的有效性,并给出了来自电信和路由的实际例子。我们的算法能够比标准公式更快地解决DRO随时间变化的问题。资金:这项工作得到了德国政府(DFG)的支持:CRC TRR 154中的B06和B10项目以及项目ID 416229255-SFB 1411和德国联邦经济事务和能源部【拨款03EI1036A】。补充材料:电子公司可在https://doi.org/10.1287/ijoo.2023.0091。
{"title":"Data-Driven Distributionally Robust Optimization over Time","authors":"Kevin-Martin Aigner, Andreas Bärmann, Kristin Braun, F. Liers, S. Pokutta, Oskar Schneider, Kartikey Sharma, Sebastian Tschuppik","doi":"10.1287/ijoo.2023.0091","DOIUrl":"https://doi.org/10.1287/ijoo.2023.0091","url":null,"abstract":"Stochastic optimization (SO) is a classical approach for optimization under uncertainty that typically requires knowledge about the probability distribution of uncertain parameters. Because the latter is often unknown, distributionally robust optimization (DRO) provides a strong alternative that determines the best guaranteed solution over a set of distributions (ambiguity set). In this work, we present an approach for DRO over time that uses online learning and scenario observations arriving as a data stream to learn more about the uncertainty. Our robust solutions adapt over time and reduce the cost of protection with shrinking ambiguity. For various kinds of ambiguity sets, the robust solutions converge to the SO solution. Our algorithm achieves the optimization and learning goals without solving the DRO problem exactly at any step. We also provide a regret bound for the quality of the online strategy that converges at a rate of [Formula: see text], where T is the number of iterations. Furthermore, we illustrate the effectiveness of our procedure by numerical experiments on mixed-integer optimization instances from popular benchmark libraries and give practical examples stemming from telecommunications and routing. Our algorithm is able to solve the DRO over time problem significantly faster than standard reformulations. Funding: This work was supported by Deutsche Forschungsgemeinschaft (DFG): Projects B06 and B10 in CRC TRR 154 and Project-ID 416229255 - SFB 1411 and Federal Ministry for Economic Affairs and Energy, Germany [Grant 03EI1036A]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/ijoo.2023.0091 .","PeriodicalId":73382,"journal":{"name":"INFORMS journal on optimization","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47231198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
INFORMS journal on optimization
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1