首页 > 最新文献

arXiv - MATH - Optimization and Control最新文献

英文 中文
Contextual Stochastic Optimization for Omnichannel Multi-Courier Order Fulfillment Under Delivery Time Uncertainty 交货时间不确定情况下全渠道多快递员订单履行的情境随机优化
Pub Date : 2024-09-11 DOI: arxiv-2409.06918
Tinghan Ye, Sikai Cheng, Amira Hijazi, Pascal Van Hentenryck
The paper studies a large-scale order fulfillment problem for a leadinge-commerce company in the United States. The challenge involves selectingfulfillment centers and shipping carriers with observational data only toefficiently process orders from a vast network of physical stores andwarehouses. The company's current practice relies on heuristic rules thatchoose the cheapest fulfillment and shipping options for each unit, withoutconsidering opportunities for batching items or the reliability of carriers inmeeting expected delivery dates. The paper develops a data-driven ContextualStochastic Optimization (CSO) framework that integrates distributionalforecasts of delivery time deviations with stochastic and robust orderfulfillment optimization models. The framework optimizes the selection offulfillment centers and carriers, accounting for item consolidation anddelivery time uncertainty. Validated on a real-world data set containing tensof thousands of products, each with hundreds of fulfillment options, theproposed CSO framework significantly enhances the accuracy of meetingcustomer-expected delivery dates compared to current practices. It provides aflexible balance between reducing fulfillment costs and managing delivery timedeviation risks, emphasizing the importance of contextual information anddistributional forecasts in order fulfillment. This is the first paper thatstudies the omnichannel multi-courier order fulfillment problem with deliverytime uncertainty through the lens of contextual optimization, fusing machinelearning and optimization.
本文研究了美国一家领先电子商务公司的大规模订单履行问题。该难题涉及仅利用观察数据选择履约中心和运输公司,以有效处理来自庞大实体店和仓库网络的订单。该公司目前的做法依赖于启发式规则,即为每个单位选择最便宜的履行和运输方案,而不考虑分批发货的机会或承运商在满足预期交货日期方面的可靠性。本文开发了一个数据驱动的上下文随机优化(CSO)框架,该框架将交货时间偏差的分布预测与随机和稳健的订单履行优化模型整合在一起。该框架优化了配送中心和承运商的选择,同时考虑了项目合并和交付时间的不确定性。拟议的 CSO 框架在包含数万种产品(每种产品都有数百种履行选项)的真实数据集上进行了验证,与当前的做法相比,该框架显著提高了满足客户预期交货日期的准确性。它在降低执行成本和管理交货时间偏差风险之间实现了灵活的平衡,强调了上下文信息和配送预测在订单执行中的重要性。这是第一篇通过上下文优化的视角研究具有交货时间不确定性的全渠道多快递员订单履行问题的论文,将机器学习和优化融为一体。
{"title":"Contextual Stochastic Optimization for Omnichannel Multi-Courier Order Fulfillment Under Delivery Time Uncertainty","authors":"Tinghan Ye, Sikai Cheng, Amira Hijazi, Pascal Van Hentenryck","doi":"arxiv-2409.06918","DOIUrl":"https://doi.org/arxiv-2409.06918","url":null,"abstract":"The paper studies a large-scale order fulfillment problem for a leading\u0000e-commerce company in the United States. The challenge involves selecting\u0000fulfillment centers and shipping carriers with observational data only to\u0000efficiently process orders from a vast network of physical stores and\u0000warehouses. The company's current practice relies on heuristic rules that\u0000choose the cheapest fulfillment and shipping options for each unit, without\u0000considering opportunities for batching items or the reliability of carriers in\u0000meeting expected delivery dates. The paper develops a data-driven Contextual\u0000Stochastic Optimization (CSO) framework that integrates distributional\u0000forecasts of delivery time deviations with stochastic and robust order\u0000fulfillment optimization models. The framework optimizes the selection of\u0000fulfillment centers and carriers, accounting for item consolidation and\u0000delivery time uncertainty. Validated on a real-world data set containing tens\u0000of thousands of products, each with hundreds of fulfillment options, the\u0000proposed CSO framework significantly enhances the accuracy of meeting\u0000customer-expected delivery dates compared to current practices. It provides a\u0000flexible balance between reducing fulfillment costs and managing delivery time\u0000deviation risks, emphasizing the importance of contextual information and\u0000distributional forecasts in order fulfillment. This is the first paper that\u0000studies the omnichannel multi-courier order fulfillment problem with delivery\u0000time uncertainty through the lens of contextual optimization, fusing machine\u0000learning and optimization.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Indirect Dynamic Negotiation in the Nash Demand Game 纳什需求博弈中的间接动态谈判
Pub Date : 2024-09-10 DOI: arxiv-2409.06566
Tatiana V. Guy, Jitka Homolová, Aleksej Gaj
The paper addresses a problem of sequential bilateral bargaining withincomplete information. We proposed a decision model that helps agents tosuccessfully bargain by performing indirect negotiation and learning theopponent's model. Methodologically the paper casts heuristically-motivatedbargaining of a self-interested independent player into a framework of Bayesianlearning and Markov decision processes. The special form of the rewardimplicitly motivates the players to negotiate indirectly, via closed-loopinteraction. We illustrate the approach by applying our model to the Nashdemand game, which is an abstract model of bargaining. The results indicatethat the established negotiation: i) leads to coordinating players' actions;ii) results in maximising success rate of the game and iii) brings moreindividual profit to the players.
本文探讨了在信息不完全的情况下进行有序双边谈判的问题。我们提出了一个决策模型,通过执行间接谈判和学习对手的模型来帮助代理成功地进行讨价还价。在方法论上,本文将一个自利的独立参与者的启发式讨价还价纳入贝叶斯学习和马尔可夫决策过程的框架中。奖励的特殊形式通过闭环互动间接地激励参与者进行谈判。我们将模型应用于纳什需求博弈,这是一个抽象的讨价还价模型,以此来说明我们的方法。结果表明,既定的谈判:i) 能协调博弈者的行动;ii) 能使博弈的成功率最大化;iii) 能为博弈者带来更多的个人利益。
{"title":"Indirect Dynamic Negotiation in the Nash Demand Game","authors":"Tatiana V. Guy, Jitka Homolová, Aleksej Gaj","doi":"arxiv-2409.06566","DOIUrl":"https://doi.org/arxiv-2409.06566","url":null,"abstract":"The paper addresses a problem of sequential bilateral bargaining with\u0000incomplete information. We proposed a decision model that helps agents to\u0000successfully bargain by performing indirect negotiation and learning the\u0000opponent's model. Methodologically the paper casts heuristically-motivated\u0000bargaining of a self-interested independent player into a framework of Bayesian\u0000learning and Markov decision processes. The special form of the reward\u0000implicitly motivates the players to negotiate indirectly, via closed-loop\u0000interaction. We illustrate the approach by applying our model to the Nash\u0000demand game, which is an abstract model of bargaining. The results indicate\u0000that the established negotiation: i) leads to coordinating players' actions;\u0000ii) results in maximising success rate of the game and iii) brings more\u0000individual profit to the players.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proceedings of the XIII International Workshop on Locational Analysis and Related Problems 第十三届定位分析及相关问题国际研讨会论文集
Pub Date : 2024-09-10 DOI: arxiv-2409.06397
Marta Baldomero-Naranjo, Ricardo Gázquez, Miguel Martínez-Antón, Luisa I. Martínez-Merino, Juan M. Muñoz-Ocaña, Francisco Temprano, Alberto Torrejón, Carlos Valverde, Nicolás Zerega
The topics of interest are location analysis and related problems. Thisincludes location models, networks, transportation, logistics, exact andheuristic solution methods, and computational geometry, among many others.
感兴趣的主题是位置分析和相关问题。其中包括位置模型、网络、运输、物流、精确和启发式求解方法以及计算几何等。
{"title":"Proceedings of the XIII International Workshop on Locational Analysis and Related Problems","authors":"Marta Baldomero-Naranjo, Ricardo Gázquez, Miguel Martínez-Antón, Luisa I. Martínez-Merino, Juan M. Muñoz-Ocaña, Francisco Temprano, Alberto Torrejón, Carlos Valverde, Nicolás Zerega","doi":"arxiv-2409.06397","DOIUrl":"https://doi.org/arxiv-2409.06397","url":null,"abstract":"The topics of interest are location analysis and related problems. This\u0000includes location models, networks, transportation, logistics, exact and\u0000heuristic solution methods, and computational geometry, among many others.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frequency range non-Lipschitz parametric optimization of a noise absorption 噪声吸收的频率范围非 Lipschitz 参数优化
Pub Date : 2024-09-10 DOI: arxiv-2409.06292
Frederic MagoulesMICS, Mathieu MenouxMICS, Anna Rozanova-PierratMICS
In the framework of the optimal wave energy absorption, we solvetheoretically and numerically a parametric shape optimization problem to findthe optimal distribution of absorbing material in the reflexive one defined bya characteristic function in the Robin-type boundary condition associated withthe Helmholtz equation. Robin boundary condition can be given on a part or theall boundary of a bounded ($epsilon$, $infty$)-domain of R n . The geometryof the partially absorbing boundary is fixed, but allowed to be non-Lipschitz,for example, fractal. It is defined as the support of a d-upper regular measurewith d $in$]n -2, n[. Using the well-posedness properties of the model, forany fixed volume fraction of the absorbing material, we establish the existenceof at least one optimal distribution minimizing the acoustical energy on afixed frequency range of the relaxation problem. Thanks to the shape derivativeof the energy functional, also existing for non-Lipschitz boundaries, weimplement (in the two-dimensional case) the gradient descent method and findthe optimal distribution with 50% of the absorbent material on a frequencyrange with better performances than the 100% absorbent boundary. The same typeof performance is also obtained by the genetic method.
在最优波能吸收的框架下,我们从理论和数值上求解了一个参数形状优化问题,以找到吸收材料在与亥姆霍兹方程相关的罗宾型边界条件的特征函数所定义的反射一中的最优分布。罗宾边界条件可以在 R n 的有界($epsilon$, $infty$)域的部分或全部边界上给出。部分吸收边界的几何形状是固定的,但允许是非 Lipschitz 的,例如分形。它被定义为具有 d $in$]n -2, n[ 的 d 上正则量的支持。利用该模型的好求解特性,对于任何固定体积分数的吸声材料,我们都能确定至少存在一种最优分布,能使松弛问题的固定频率范围内的声能最小化。由于能量函数的形状导数也存在于非 Lipschitz 边界,我们(在二维情况下)实施了梯度下降法,并在一个频率范围内找到了 50%吸声材料的最佳分布,其性能优于 100%吸声边界。遗传方法也获得了相同的性能。
{"title":"Frequency range non-Lipschitz parametric optimization of a noise absorption","authors":"Frederic MagoulesMICS, Mathieu MenouxMICS, Anna Rozanova-PierratMICS","doi":"arxiv-2409.06292","DOIUrl":"https://doi.org/arxiv-2409.06292","url":null,"abstract":"In the framework of the optimal wave energy absorption, we solve\u0000theoretically and numerically a parametric shape optimization problem to find\u0000the optimal distribution of absorbing material in the reflexive one defined by\u0000a characteristic function in the Robin-type boundary condition associated with\u0000the Helmholtz equation. Robin boundary condition can be given on a part or the\u0000all boundary of a bounded ($epsilon$, $infty$)-domain of R n . The geometry\u0000of the partially absorbing boundary is fixed, but allowed to be non-Lipschitz,\u0000for example, fractal. It is defined as the support of a d-upper regular measure\u0000with d $in$]n -2, n[. Using the well-posedness properties of the model, for\u0000any fixed volume fraction of the absorbing material, we establish the existence\u0000of at least one optimal distribution minimizing the acoustical energy on a\u0000fixed frequency range of the relaxation problem. Thanks to the shape derivative\u0000of the energy functional, also existing for non-Lipschitz boundaries, we\u0000implement (in the two-dimensional case) the gradient descent method and find\u0000the optimal distribution with 50% of the absorbent material on a frequency\u0000range with better performances than the 100% absorbent boundary. The same type\u0000of performance is also obtained by the genetic method.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KANtrol: A Physics-Informed Kolmogorov-Arnold Network Framework for Solving Multi-Dimensional and Fractional Optimal Control Problems KANtrol:用于解决多维和分数最优控制问题的物理信息型科尔莫戈罗夫-阿诺德网络框架
Pub Date : 2024-09-10 DOI: arxiv-2409.06649
Alireza Afzal Aghaei
In this paper, we introduce the KANtrol framework, which utilizesKolmogorov-Arnold Networks (KANs) to solve optimal control problems involvingcontinuous time variables. We explain how Gaussian quadrature can be employedto approximate the integral parts within the problem, particularly forintegro-differential state equations. We also demonstrate how automaticdifferentiation is utilized to compute exact derivatives for integer-orderdynamics, while for fractional derivatives of non-integer order, we employmatrix-vector product discretization within the KAN framework. We tacklemulti-dimensional problems, including the optimal control of a 2D heat partialdifferential equation. The results of our simulations, which cover both forwardand parameter identification problems, show that the KANtrol frameworkoutperforms classical MLPs in terms of accuracy and efficiency.
本文介绍了 KANtrol 框架,该框架利用 Kolmogorov-Arnold 网络(KAN)来解决涉及连续时间变量的最优控制问题。我们解释了如何利用高斯正交来逼近问题中的积分部分,特别是对于积分微分状态方程。我们还演示了如何利用自动微分来计算整数阶动力学的精确导数,而对于非整数阶的分数导数,我们则在 KAN 框架内采用矩阵向量积离散化。我们解决了多维问题,包括二维热偏微分方程的优化控制。模拟结果表明,KAN 控制框架在精度和效率方面都优于经典 MLP。
{"title":"KANtrol: A Physics-Informed Kolmogorov-Arnold Network Framework for Solving Multi-Dimensional and Fractional Optimal Control Problems","authors":"Alireza Afzal Aghaei","doi":"arxiv-2409.06649","DOIUrl":"https://doi.org/arxiv-2409.06649","url":null,"abstract":"In this paper, we introduce the KANtrol framework, which utilizes\u0000Kolmogorov-Arnold Networks (KANs) to solve optimal control problems involving\u0000continuous time variables. We explain how Gaussian quadrature can be employed\u0000to approximate the integral parts within the problem, particularly for\u0000integro-differential state equations. We also demonstrate how automatic\u0000differentiation is utilized to compute exact derivatives for integer-order\u0000dynamics, while for fractional derivatives of non-integer order, we employ\u0000matrix-vector product discretization within the KAN framework. We tackle\u0000multi-dimensional problems, including the optimal control of a 2D heat partial\u0000differential equation. The results of our simulations, which cover both forward\u0000and parameter identification problems, show that the KANtrol framework\u0000outperforms classical MLPs in terms of accuracy and efficiency.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks Learn2Aggregate:使用图神经网络监督生成 Chvátal-Gomory 剪切
Pub Date : 2024-09-10 DOI: arxiv-2409.06559
Arnaud Deza, Elias B. Khalil, Zhenan Fan, Zirui Zhou, Yong Zhang
We present $textit{Learn2Aggregate}$, a machine learning (ML) framework foroptimizing the generation of Chv'atal-Gomory (CG) cuts in mixed integer linearprogramming (MILP). The framework trains a graph neural network to classifyuseful constraints for aggregation in CG cut generation. The ML-driven CGseparator selectively focuses on a small set of impactful constraints,improving runtimes without compromising the strength of the generated cuts. Keyto our approach is the formulation of a constraint classification task whichfavours sparse aggregation of constraints, consistent with empirical findings.This, in conjunction with a careful constraint labeling scheme and a hybrid ofdeep learning and feature engineering, results in enhanced CG cut generationacross five diverse MILP benchmarks. On the largest test sets, our methodcloses roughly $textit{twice}$ as much of the integrality gap as the standardCG method while running 40$% faster. This performance improvement is due to ourmethod eliminating 75% of the constraints prior to aggregation.
我们提出了 $textit{Learn2Aggregate}$,这是一个用于优化混合整数线性编程(MILP)中 Chv'atal-Gomory (CG) 切分生成的机器学习(ML)框架。该框架训练一个图神经网络,对有用的约束条件进行分类,以便在生成 CG 切分时进行聚合。ML 驱动的 CG 分割器会有选择性地关注一小部分有影响的约束,从而在不影响生成的切割强度的情况下提高运行时间。我们方法的关键在于制定了一个约束分类任务,该任务倾向于稀疏聚集约束,这与实证研究结果是一致的。这与谨慎的约束标记方案以及深度学习和特征工程的混合方法相结合,在五个不同的 MILP 基准中增强了 CG 切分生成。在最大的测试集上,我们的方法缩小的积分差距大约是标准CG方法的两倍,同时运行速度提高了40%。这一性能提升归功于我们的方法在聚合之前消除了 75% 的约束。
{"title":"Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks","authors":"Arnaud Deza, Elias B. Khalil, Zhenan Fan, Zirui Zhou, Yong Zhang","doi":"arxiv-2409.06559","DOIUrl":"https://doi.org/arxiv-2409.06559","url":null,"abstract":"We present $textit{Learn2Aggregate}$, a machine learning (ML) framework for\u0000optimizing the generation of Chv'atal-Gomory (CG) cuts in mixed integer linear\u0000programming (MILP). The framework trains a graph neural network to classify\u0000useful constraints for aggregation in CG cut generation. The ML-driven CG\u0000separator selectively focuses on a small set of impactful constraints,\u0000improving runtimes without compromising the strength of the generated cuts. Key\u0000to our approach is the formulation of a constraint classification task which\u0000favours sparse aggregation of constraints, consistent with empirical findings.\u0000This, in conjunction with a careful constraint labeling scheme and a hybrid of\u0000deep learning and feature engineering, results in enhanced CG cut generation\u0000across five diverse MILP benchmarks. On the largest test sets, our method\u0000closes roughly $textit{twice}$ as much of the integrality gap as the standard\u0000CG method while running 40$% faster. This performance improvement is due to our\u0000method eliminating 75% of the constraints prior to aggregation.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bi-level regularization via iterative mesh refinement for aeroacoustics 通过迭代网格细化实现航空声学的双级正则化
Pub Date : 2024-09-10 DOI: arxiv-2409.06854
Christian Aarset, Tram Thi Ngoc Nguyen
In this work, we illustrate the connection between adaptive mesh refinementfor finite element discretized PDEs and the recently developed emph{bi-levelregularization algorithm}. By adaptive mesh refinement according to data noise,regularization effect and convergence are immediate consequences. We moreoverdemonstrate its numerical advantages to the classical Landweber algorithm interm of time and reconstruction quality for the example of the Helmholtzequation in an aeroacoustic setting.
在这项工作中,我们阐述了自适应网格细化有限元离散 PDE 与最近开发的 emph{bi-levelregularization algithm} 之间的联系。通过根据数据噪声进行自适应网格细化,正则化效果和收敛性是立竿见影的。此外,我们还以航空声学中的 Helmholtzequation 为例,在时间和重构质量方面证明了该算法在数值上优于经典的 Landweber 算法。
{"title":"Bi-level regularization via iterative mesh refinement for aeroacoustics","authors":"Christian Aarset, Tram Thi Ngoc Nguyen","doi":"arxiv-2409.06854","DOIUrl":"https://doi.org/arxiv-2409.06854","url":null,"abstract":"In this work, we illustrate the connection between adaptive mesh refinement\u0000for finite element discretized PDEs and the recently developed emph{bi-level\u0000regularization algorithm}. By adaptive mesh refinement according to data noise,\u0000regularization effect and convergence are immediate consequences. We moreover\u0000demonstrate its numerical advantages to the classical Landweber algorithm in\u0000term of time and reconstruction quality for the example of the Helmholtz\u0000equation in an aeroacoustic setting.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Policy Iteration Method for Inverse Mean Field Games 逆均值场博弈的策略迭代法
Pub Date : 2024-09-10 DOI: arxiv-2409.06184
Kui Ren, Nathan Soedjak, Shanyin Tong
We propose a policy iteration method to solve an inverse problem for amean-field game model, specifically to reconstruct the obstacle function in thegame from the partial observation data of value functions, which represent theoptimal costs for agents. The proposed approach decouples this complex inverseproblem, which is an optimization problem constrained by a coupled nonlinearforward and backward PDE system in the MFG, into several iterations of solvinglinear PDEs and linear inverse problems. This method can also be viewed as afixed-point iteration that simultaneously solves the MFG system and inversion.We further prove its linear rate of convergence. In addition, numericalexamples in 1D and 2D, along with performance comparisons to a directleast-squares method, demonstrate the superior efficiency and accuracy of theproposed method for solving inverse MFGs.
我们提出了一种策略迭代方法来解决均场博弈模型的逆问题,具体来说,就是从价值函数的部分观测数据重建博弈中的障碍函数,而价值函数代表了代理的最优成本。所提出的方法将这个复杂的逆问题(即受 MFG 中耦合非线性前向和后向 PDE 系统约束的优化问题)解耦为求解线性 PDE 和线性逆问题的多次迭代。我们进一步证明了该方法的线性收敛速率。此外,一维和二维的数值示例,以及与直接最小二乘法的性能比较,都证明了所提方法在求解反 MFG 方面的卓越效率和准确性。
{"title":"A Policy Iteration Method for Inverse Mean Field Games","authors":"Kui Ren, Nathan Soedjak, Shanyin Tong","doi":"arxiv-2409.06184","DOIUrl":"https://doi.org/arxiv-2409.06184","url":null,"abstract":"We propose a policy iteration method to solve an inverse problem for a\u0000mean-field game model, specifically to reconstruct the obstacle function in the\u0000game from the partial observation data of value functions, which represent the\u0000optimal costs for agents. The proposed approach decouples this complex inverse\u0000problem, which is an optimization problem constrained by a coupled nonlinear\u0000forward and backward PDE system in the MFG, into several iterations of solving\u0000linear PDEs and linear inverse problems. This method can also be viewed as a\u0000fixed-point iteration that simultaneously solves the MFG system and inversion.\u0000We further prove its linear rate of convergence. In addition, numerical\u0000examples in 1D and 2D, along with performance comparisons to a direct\u0000least-squares method, demonstrate the superior efficiency and accuracy of the\u0000proposed method for solving inverse MFGs.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fourier series-based algorithm for control optimization in pendulum capsule drive: an integrated computational and experimental study 基于傅立叶级数的摆式胶囊驱动控制优化算法:计算与实验综合研究
Pub Date : 2024-09-10 DOI: arxiv-2409.06824
Sandra Zarychta, Marek Balcerzak, Katarzyna Wojdalska, Rafał Dolny, Jerzy Wojewoda
Pendulum-driven systems have emerged as a notable modification ofvibro-impact mechanisms, replacing the conventional mass-on-spring oscillatorwith a pendulum. Such systems exhibit intricate behavior resulting from theinterplay of directional dynamics, pendulum motion, and contact forces betweenthe designed device and the underlying surface. This paper delves into theapplication of a Fourier series-based greedy algorithm for control optimizationin pendulum capsule drives, which hold potential for diverse scenarios,including endoscopy capsule robots, pipeline inspection, and rescue operationsin confined spaces. The emphasis is placed on experimental studies involvingprototype development to validate the system's efficacy with previouscomputational simulations. Empirical findings closely align (<2% loss) withnumerical investigations, showcasing the pendulum capsule drive's ability toachieve average speeds of 2.48 cm/s and 2.58 cm/s for three and six harmonics,respectively. These results are reinforced by high-quality signal-trackingaccuracy, which demonstrates resilience against potential disturbances duringmotion. The authors envision the Fourier series-based control optimizationmethod as a significant step towards ensuring enhanced locomotion performancein discontinuous systems, effectively handling the non-linearities arising fromdry friction.
摆锤驱动系统是对振动撞击装置的一种显著改进,用摆锤取代了传统的质量-弹簧振荡器。由于方向动力学、摆锤运动以及设计装置与底层表面之间的接触力的相互作用,此类系统表现出错综复杂的行为。本文深入探讨了基于傅立叶级数的贪婪算法在摆式胶囊驱动器控制优化中的应用,该算法在内窥镜胶囊机器人、管道检测和密闭空间救援行动等多种应用场景中都具有潜力。重点放在涉及原型开发的实验研究上,以验证系统与之前计算模拟的有效性。实证研究结果与数值研究结果非常吻合(损失小于 2%),表明摆式胶囊驱动装置能够在三次谐波和六次谐波中分别达到 2.48 厘米/秒和 2.58 厘米/秒的平均速度。高质量的信号跟踪精度加强了这些结果,证明了在运动过程中对潜在干扰的适应能力。作者认为基于傅立叶级数的控制优化方法是确保提高非连续系统运动性能的重要一步,它能有效处理干摩擦产生的非线性问题。
{"title":"Fourier series-based algorithm for control optimization in pendulum capsule drive: an integrated computational and experimental study","authors":"Sandra Zarychta, Marek Balcerzak, Katarzyna Wojdalska, Rafał Dolny, Jerzy Wojewoda","doi":"arxiv-2409.06824","DOIUrl":"https://doi.org/arxiv-2409.06824","url":null,"abstract":"Pendulum-driven systems have emerged as a notable modification of\u0000vibro-impact mechanisms, replacing the conventional mass-on-spring oscillator\u0000with a pendulum. Such systems exhibit intricate behavior resulting from the\u0000interplay of directional dynamics, pendulum motion, and contact forces between\u0000the designed device and the underlying surface. This paper delves into the\u0000application of a Fourier series-based greedy algorithm for control optimization\u0000in pendulum capsule drives, which hold potential for diverse scenarios,\u0000including endoscopy capsule robots, pipeline inspection, and rescue operations\u0000in confined spaces. The emphasis is placed on experimental studies involving\u0000prototype development to validate the system's efficacy with previous\u0000computational simulations. Empirical findings closely align (<2% loss) with\u0000numerical investigations, showcasing the pendulum capsule drive's ability to\u0000achieve average speeds of 2.48 cm/s and 2.58 cm/s for three and six harmonics,\u0000respectively. These results are reinforced by high-quality signal-tracking\u0000accuracy, which demonstrates resilience against potential disturbances during\u0000motion. The authors envision the Fourier series-based control optimization\u0000method as a significant step towards ensuring enhanced locomotion performance\u0000in discontinuous systems, effectively handling the non-linearities arising from\u0000dry friction.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constant Payoff Property in Zero-Sum Stochastic Games with a Finite Horizon 有限地平线零和随机博弈中的恒定回报特性
Pub Date : 2024-09-09 DOI: arxiv-2409.05683
Thomas Ragel, Bruno Ziliotto
This paper examines finite zero-sum stochastic games and demonstrates thatwhen the game's duration is sufficiently long, there exists a pair ofapproximately optimal strategies such that the expected average payoff at anypoint in the game remains close to the value. This property, known as thetextit{constant payoff property}, was previously established only forabsorbing games and discounted stochastic games.
本文研究了有限零和随机博弈,并证明当博弈持续时间足够长时,存在一对近似最优的策略,使得博弈中任何一点的期望平均报酬都保持在接近值的水平。这一性质被称为 "不变报酬性质"(the/textit{constant payoff property}),以前只在吸收博弈和贴现随机博弈中被证实。
{"title":"Constant Payoff Property in Zero-Sum Stochastic Games with a Finite Horizon","authors":"Thomas Ragel, Bruno Ziliotto","doi":"arxiv-2409.05683","DOIUrl":"https://doi.org/arxiv-2409.05683","url":null,"abstract":"This paper examines finite zero-sum stochastic games and demonstrates that\u0000when the game's duration is sufficiently long, there exists a pair of\u0000approximately optimal strategies such that the expected average payoff at any\u0000point in the game remains close to the value. This property, known as the\u0000textit{constant payoff property}, was previously established only for\u0000absorbing games and discounted stochastic games.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - MATH - Optimization and Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1