Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2024-11-25 DOI:10.1016/j.eswa.2024.125792
Yao Jing , Bin Guo , Nuo Li , Yasan Ding , Yan Liu , Zhiwen Yu
{"title":"Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning","authors":"Yao Jing ,&nbsp;Bin Guo ,&nbsp;Nuo Li ,&nbsp;Yasan Ding ,&nbsp;Yan Liu ,&nbsp;Zhiwen Yu","doi":"10.1016/j.eswa.2024.125792","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"264 ","pages":"Article 125792"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026599","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过联合多代理深度强化学习实现可扩展订单调度
高效的订单调度对在线叫车系统至关重要,直接影响用户体验和平台收入。传统方法通常侧重于通过对单个车辆的局部观察来实现即时收益最大化,而忽视了长期潜在收益、调度系统的动态性质以及分布式车辆间协作的重要性。这通常会导致性能不达标。为了解决这些问题,我们提出了 FedMARL4OD,这是一个新颖的联合多代理深度强化学习框架,旨在优化订单调度。该框架通过多代理强化学习(MARL)对单个车辆进行局部学习,并通过联邦多代理强化学习(FedMARL)对所有车辆进行全局学习。具体来说,我们在局部学习中引入了一种创新奖励机制,既考虑每个订单的当前收入,又考虑与潜在未来收入相关的系统供需动态,从而提高调度性能。此外,我们还在全局学习中引入了一种可扩展的模型聚合方法,该方法对分布式车辆之间的互动进行了明确建模,以促进协作学习。通过平均参数聚合逐步整合本地和全局洞察力,该方法不仅降低了通信开销,提高了代理的学习效率,还确保了系统的可扩展性并维护了数据隐私。大量实际模拟证明,FedMARL4OD 的性能优于基线方法,累计驱动收入(ADI)提高了 9.17%,订单响应率(ORR)提高了 7.75%。ADI 的提高证明了该框架在提高收入方面的有效性,而 ORR 的提高则表明用户的请求得到了更快的满足,从而改善了用户体验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
Maximizing failure occurrence in water distribution Systems: A Multi-Objective approach considering Reliability, Economic, and environmental aspects Multi-objective grey wolf optimizer based on reinforcement learning for distributed hybrid flowshop scheduling towards mass personalized manufacturing Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning Deep multi-negative supervised hashing for large-scale image retrieval Machine-agnostic automated lumbar MRI segmentation using a cascaded model based on generative neurons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1