Stochastic approximation for uncapacitated assortment optimization under the multinomial logit model

Naval Research Logistics (NRL). Pub Date: 2022-05-23. DOI: 10.1002/nav.22068

Yannik Peeters, Arnoud V. den Boer

Citations: 1

Abstract

We consider dynamic assortment optimization with incomplete information under the uncapacitated multinomial logit choice model. We propose an anytime stochastic approximation policy and prove that the regret (the cumulative expected revenue loss caused by offering suboptimal assortments) after $T$ time periods is bounded by $\sqrt{T}$ times a constant that is independent of the number of products. In addition, we prove a matching lower bound on the regret for any policy that is valid for arbitrary model parameters, slightly generalizing a recent regret lower bound derived for specific revenue parameters. Numerical illustrations suggest that our policy outperforms alternatives by a significant margin when $T$ and the number of products $N$ are not too small.
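To make the regret notion in the abstract concrete, the following is a minimal Python sketch of the full-information benchmark, not the authors' stochastic approximation policy. It assumes the usual multinomial logit parameterization in which product i has a preference weight v_i and the no-purchase option has weight 1. The names `expected_revenue`, `best_revenue_ordered`, `regret`, and the example numbers are hypothetical and chosen only for illustration. The sketch relies on the standard result that, without a capacity constraint, an optimal assortment can be found among the sets made up of the k highest-revenue products.

```python
def expected_revenue(assortment, revenues, weights):
    """Expected per-period revenue of an assortment under the MNL model.

    weights[i] is the (assumed) preference weight v_i of product i;
    the no-purchase option is given weight 1.
    """
    denom = 1.0 + sum(weights[i] for i in assortment)
    return sum(revenues[i] * weights[i] for i in assortment) / denom


def best_revenue_ordered(revenues, weights):
    """Search the 'top-k products by revenue' sets and return the best one."""
    order = sorted(range(len(revenues)), key=lambda i: revenues[i], reverse=True)
    best_set, best_val = (), 0.0  # the empty assortment earns nothing
    for k in range(1, len(order) + 1):
        candidate = tuple(order[:k])
        value = expected_revenue(candidate, revenues, weights)
        if value > best_val:
            best_set, best_val = candidate, value
    return best_set, best_val


def regret(offered, revenues, weights):
    """Cumulative expected revenue loss of a sequence of offered assortments."""
    _, optimum = best_revenue_ordered(revenues, weights)
    return sum(optimum - expected_revenue(s, revenues, weights) for s in offered)


# Hypothetical three-product instance, for illustration only.
example_revenues = [1.0, 0.8, 0.3]
example_weights = [0.5, 0.5, 1.0]
example_offers = [(0,), (0, 1, 2), (0, 1)]

if __name__ == "__main__":
    print(best_revenue_ordered(example_revenues, example_weights))
    print(regret(example_offers, example_revenues, example_weights))
```

In this setting a learning policy does not know the weights and must estimate them from purchase observations, so its regret measures how much revenue is lost relative to this benchmark while it learns.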



