包裹柜的动态需求管理

arXiv - CS - Artificial Intelligence Pub Date : 2024-09-08 DOI:arxiv-2409.05061

Daniela Sailer, Robert Klein, Claudius Steinhardt

{"title":"包裹柜的动态需求管理","authors":"Daniela Sailer, Robert Klein, Claudius Steinhardt","doi":"arxiv-2409.05061","DOIUrl":null,"url":null,"abstract":"In pursuit of a more sustainable and cost-efficient last mile, parcel lockers\nhave gained a firm foothold in the parcel delivery landscape. To fully exploit\ntheir potential and simultaneously ensure customer satisfaction, successful\nmanagement of the locker's limited capacity is crucial. This is challenging as\nfuture delivery requests and pickup times are stochastic from the provider's\nperspective. In response, we propose to dynamically control whether the locker\nis presented as an available delivery option to each incoming customer with the\ngoal of maximizing the number of served requests weighted by their priority.\nAdditionally, we take different compartment sizes into account, which entails a\nsecond type of decision as parcels scheduled for delivery must be allocated. We\nformalize the problem as an infinite-horizon sequential decision problem and\nfind that exact methods are intractable due to the curses of dimensionality. In\nlight of this, we develop a solution framework that orchestrates multiple\nalgorithmic techniques rooted in Sequential Decision Analytics and\nReinforcement Learning, namely cost function approximation and an offline\ntrained parametric value function approximation together with a truncated\nonline rollout. Our innovative approach to combine these techniques enables us\nto address the strong interrelations between the two decision types. As a\ngeneral methodological contribution, we enhance the training of our value\nfunction approximation with a modified version of experience replay that\nenforces structure in the value function. Our computational study shows that\nour method outperforms a myopic benchmark by 13.7% and an industry-inspired\npolicy by 12.6%.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic Demand Management for Parcel Lockers\",\"authors\":\"Daniela Sailer, Robert Klein, Claudius Steinhardt\",\"doi\":\"arxiv-2409.05061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In pursuit of a more sustainable and cost-efficient last mile, parcel lockers\\nhave gained a firm foothold in the parcel delivery landscape. To fully exploit\\ntheir potential and simultaneously ensure customer satisfaction, successful\\nmanagement of the locker's limited capacity is crucial. This is challenging as\\nfuture delivery requests and pickup times are stochastic from the provider's\\nperspective. In response, we propose to dynamically control whether the locker\\nis presented as an available delivery option to each incoming customer with the\\ngoal of maximizing the number of served requests weighted by their priority.\\nAdditionally, we take different compartment sizes into account, which entails a\\nsecond type of decision as parcels scheduled for delivery must be allocated. We\\nformalize the problem as an infinite-horizon sequential decision problem and\\nfind that exact methods are intractable due to the curses of dimensionality. In\\nlight of this, we develop a solution framework that orchestrates multiple\\nalgorithmic techniques rooted in Sequential Decision Analytics and\\nReinforcement Learning, namely cost function approximation and an offline\\ntrained parametric value function approximation together with a truncated\\nonline rollout. Our innovative approach to combine these techniques enables us\\nto address the strong interrelations between the two decision types. As a\\ngeneral methodological contribution, we enhance the training of our value\\nfunction approximation with a modified version of experience replay that\\nenforces structure in the value function. Our computational study shows that\\nour method outperforms a myopic benchmark by 13.7% and an industry-inspired\\npolicy by 12.6%.\",\"PeriodicalId\":501479,\"journal\":{\"name\":\"arXiv - CS - Artificial Intelligence\",\"volume\":\"36 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.05061\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了实现更具可持续性和成本效益的 "最后一公里"，包裹柜在包裹递送领域站稳了脚跟。为了充分挖掘其潜力，同时确保客户满意度，成功管理储物柜的有限容量至关重要。从提供商的角度来看，未来的投递请求和取件时间都是随机的，因此这具有挑战性。为此，我们建议动态控制储物柜是否向每位顾客提供可用的递送选项，目标是根据优先级加权最大化已服务请求的数量。此外，我们还考虑到了不同的储物箱大小，这就需要做出第二类决策，因为必须对预定递送的包裹进行分配。我们将该问题形式化为一个无限视距的顺序决策问题，并发现由于维度的限制，精确方法难以解决。有鉴于此，我们开发了一个解决方案框架，该框架协调了源于序列决策分析和强化学习的多种算法技术，即成本函数近似、离线训练的参数值函数近似和截断在线推出。我们结合这些技术的创新方法使我们能够解决这两种决策类型之间的密切联系。作为方法论上的一般贡献，我们通过改进版的经验重放加强了价值函数近似的训练，从而强化了价值函数的结构。我们的计算研究表明，我们的方法比近视基准优胜 13.7%，比行业启发政策优胜 12.6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Dynamic Demand Management for Parcel Lockers

In pursuit of a more sustainable and cost-efficient last mile, parcel lockers have gained a firm foothold in the parcel delivery landscape. To fully exploit their potential and simultaneously ensure customer satisfaction, successful management of the locker's limited capacity is crucial. This is challenging as future delivery requests and pickup times are stochastic from the provider's perspective. In response, we propose to dynamically control whether the locker is presented as an available delivery option to each incoming customer with the goal of maximizing the number of served requests weighted by their priority. Additionally, we take different compartment sizes into account, which entails a second type of decision as parcels scheduled for delivery must be allocated. We formalize the problem as an infinite-horizon sequential decision problem and find that exact methods are intractable due to the curses of dimensionality. In light of this, we develop a solution framework that orchestrates multiple algorithmic techniques rooted in Sequential Decision Analytics and Reinforcement Learning, namely cost function approximation and an offline trained parametric value function approximation together with a truncated online rollout. Our innovative approach to combine these techniques enables us to address the strong interrelations between the two decision types. As a general methodological contribution, we enhance the training of our value function approximation with a modified version of experience replay that enforces structure in the value function. Our computational study shows that our method outperforms a myopic benchmark by 13.7% and an industry-inspired policy by 12.6%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Artificial Intelligence

自引率

0.00%

发文量