Multi-Agent Reinforcement Learning for Urban Crowd Sensing with For-Hire Vehicles

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Pub Date : 2021-05-10 DOI:10.1109/INFOCOM42981.2021.9488713

Rong Ding, Zhaoxing Yang, Yifei Wei, Haiming Jin, Xinbing Wang

{"title":"Multi-Agent Reinforcement Learning for Urban Crowd Sensing with For-Hire Vehicles","authors":"Rong Ding, Zhaoxing Yang, Yifei Wei, Haiming Jin, Xinbing Wang","doi":"10.1109/INFOCOM42981.2021.9488713","DOIUrl":null,"url":null,"abstract":"Recently, vehicular crowd sensing (VCS) that leverages sensor-equipped urban vehicles to collect city-scale sensory data has emerged as a promising paradigm for urban sensing. Nowadays, a wide spectrum of VCS tasks are carried out by for-hire vehicles (FHVs) due to various hardware and software constraints that are difficult for private vehicles to satisfy. However, such FHV-enabled VCS systems face a fundamental yet unsolved problem of striking a balance between the order-serving and sensing outcomes. To address this problem, we propose a novel graph convolutional cooperative multi-agent reinforcement learning (GCC-MARL) framework, which helps FHVs make distributed routing decisions that cooperatively optimize the system-wide global objective. Specifically, GCC-MARL meticulously assigns credits to agents in the training process to effectively stimulate cooperation, represents agents’ actions by a carefully chosen statistics to cope with the variable agent scales, and integrates graph convolution to capture useful spatial features from complex large-scale urban road networks. We conduct extensive experiments with a real-world dataset collected in Shenzhen, China, containing around 1 million trajectories and 50 thousand orders of 553 taxis per-day from June 1st to 30th, 2017. Our experiment results show that GCC-MARL outperforms state-of-the-art baseline methods in order-serving revenue, as well as sensing coverage and quality.","PeriodicalId":293079,"journal":{"name":"IEEE INFOCOM 2021 - IEEE Conference on Computer Communications","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2021 - IEEE Conference on Computer Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOM42981.2021.9488713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Recently, vehicular crowd sensing (VCS) that leverages sensor-equipped urban vehicles to collect city-scale sensory data has emerged as a promising paradigm for urban sensing. Nowadays, a wide spectrum of VCS tasks are carried out by for-hire vehicles (FHVs) due to various hardware and software constraints that are difficult for private vehicles to satisfy. However, such FHV-enabled VCS systems face a fundamental yet unsolved problem of striking a balance between the order-serving and sensing outcomes. To address this problem, we propose a novel graph convolutional cooperative multi-agent reinforcement learning (GCC-MARL) framework, which helps FHVs make distributed routing decisions that cooperatively optimize the system-wide global objective. Specifically, GCC-MARL meticulously assigns credits to agents in the training process to effectively stimulate cooperation, represents agents’ actions by a carefully chosen statistics to cope with the variable agent scales, and integrates graph convolution to capture useful spatial features from complex large-scale urban road networks. We conduct extensive experiments with a real-world dataset collected in Shenzhen, China, containing around 1 million trajectories and 50 thousand orders of 553 taxis per-day from June 1st to 30th, 2017. Our experiment results show that GCC-MARL outperforms state-of-the-art baseline methods in order-serving revenue, as well as sensing coverage and quality.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于出租车辆的城市人群感知多智能体强化学习

最近，利用配备传感器的城市车辆收集城市尺度感官数据的车辆人群传感(VCS)已成为城市传感的一个有前途的范例。如今，由于各种硬件和软件的限制，许多VCS任务都是由出租车辆(fhv)执行的，而私家车很难满足这些限制。然而，这种fhv驱动的VCS系统面临着一个基本的尚未解决的问题，即如何在订单服务和感知结果之间取得平衡。为了解决这个问题，我们提出了一个新的图卷积协作多智能体强化学习(GCC-MARL)框架，该框架帮助fhv做出分布式路由决策，协同优化系统范围的全局目标。具体而言，GCC-MARL在训练过程中细致地为智能体分配积分，以有效地激发合作;通过精心选择的统计数据来表示智能体的行为，以应对智能体规模的变化;通过集成图卷积来捕获复杂的大规模城市道路网络中有用的空间特征。我们使用在中国深圳收集的真实数据集进行了广泛的实验，该数据集包含2017年6月1日至30日期间每天约100万条轨迹和553辆出租车的5万份订单。我们的实验结果表明，GCC-MARL在订单服务收入、传感覆盖范围和质量方面优于最先进的基线方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications

自引率

0.00%

发文量

期刊最新文献

Message from the TPC Chairs Enabling Edge-Cloud Video Analytics for Robotics Applications Practical Analysis of Replication-Based Systems Towards Minimum Fleet for Ridesharing-Aware Mobility-on-Demand Systems Beyond Value Perturbation: Local Differential Privacy in the Temporal Setting