Yao Jing , Bin Guo , Nuo Li , Yasan Ding , Yan Liu , Zhiwen Yu
{"title":"Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning","authors":"Yao Jing , Bin Guo , Nuo Li , Yasan Ding , Yan Liu , Zhiwen Yu","doi":"10.1016/j.eswa.2024.125792","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"264 ","pages":"Article 125792"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026599","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.