Dynamic Demand Management for Parcel Lockers

Daniela Sailer, Robert Klein, Claudius Steinhardt
{"title":"Dynamic Demand Management for Parcel Lockers","authors":"Daniela Sailer, Robert Klein, Claudius Steinhardt","doi":"arxiv-2409.05061","DOIUrl":null,"url":null,"abstract":"In pursuit of a more sustainable and cost-efficient last mile, parcel lockers\nhave gained a firm foothold in the parcel delivery landscape. To fully exploit\ntheir potential and simultaneously ensure customer satisfaction, successful\nmanagement of the locker's limited capacity is crucial. This is challenging as\nfuture delivery requests and pickup times are stochastic from the provider's\nperspective. In response, we propose to dynamically control whether the locker\nis presented as an available delivery option to each incoming customer with the\ngoal of maximizing the number of served requests weighted by their priority.\nAdditionally, we take different compartment sizes into account, which entails a\nsecond type of decision as parcels scheduled for delivery must be allocated. We\nformalize the problem as an infinite-horizon sequential decision problem and\nfind that exact methods are intractable due to the curses of dimensionality. In\nlight of this, we develop a solution framework that orchestrates multiple\nalgorithmic techniques rooted in Sequential Decision Analytics and\nReinforcement Learning, namely cost function approximation and an offline\ntrained parametric value function approximation together with a truncated\nonline rollout. Our innovative approach to combine these techniques enables us\nto address the strong interrelations between the two decision types. As a\ngeneral methodological contribution, we enhance the training of our value\nfunction approximation with a modified version of experience replay that\nenforces structure in the value function. Our computational study shows that\nour method outperforms a myopic benchmark by 13.7% and an industry-inspired\npolicy by 12.6%.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In pursuit of a more sustainable and cost-efficient last mile, parcel lockers have gained a firm foothold in the parcel delivery landscape. To fully exploit their potential and simultaneously ensure customer satisfaction, successful management of the locker's limited capacity is crucial. This is challenging as future delivery requests and pickup times are stochastic from the provider's perspective. In response, we propose to dynamically control whether the locker is presented as an available delivery option to each incoming customer with the goal of maximizing the number of served requests weighted by their priority. Additionally, we take different compartment sizes into account, which entails a second type of decision as parcels scheduled for delivery must be allocated. We formalize the problem as an infinite-horizon sequential decision problem and find that exact methods are intractable due to the curses of dimensionality. In light of this, we develop a solution framework that orchestrates multiple algorithmic techniques rooted in Sequential Decision Analytics and Reinforcement Learning, namely cost function approximation and an offline trained parametric value function approximation together with a truncated online rollout. Our innovative approach to combine these techniques enables us to address the strong interrelations between the two decision types. As a general methodological contribution, we enhance the training of our value function approximation with a modified version of experience replay that enforces structure in the value function. Our computational study shows that our method outperforms a myopic benchmark by 13.7% and an industry-inspired policy by 12.6%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
包裹柜的动态需求管理
为了实现更具可持续性和成本效益的 "最后一公里",包裹柜在包裹递送领域站稳了脚跟。为了充分挖掘其潜力,同时确保客户满意度,成功管理储物柜的有限容量至关重要。从提供商的角度来看,未来的投递请求和取件时间都是随机的,因此这具有挑战性。为此,我们建议动态控制储物柜是否向每位顾客提供可用的递送选项,目标是根据优先级加权最大化已服务请求的数量。此外,我们还考虑到了不同的储物箱大小,这就需要做出第二类决策,因为必须对预定递送的包裹进行分配。我们将该问题形式化为一个无限视距的顺序决策问题,并发现由于维度的限制,精确方法难以解决。有鉴于此,我们开发了一个解决方案框架,该框架协调了源于序列决策分析和强化学习的多种算法技术,即成本函数近似、离线训练的参数值函数近似和截断在线推出。我们结合这些技术的创新方法使我们能够解决这两种决策类型之间的密切联系。作为方法论上的一般贡献,我们通过改进版的经验重放加强了价值函数近似的训练,从而强化了价值函数的结构。我们的计算研究表明,我们的方法比近视基准优胜 13.7%,比行业启发政策优胜 12.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Abductive explanations of classifiers under constraints: Complexity and properties Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach A Metric Hybrid Planning Approach to Solving Pandemic Planning Problems with Simple SIR Models Neural Networks for Vehicle Routing Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1