{"title":"Constrained Multi-Agent Reinforcement Learning for Managing Electric Self-Driving Taxis","authors":"Zhaoxing Yang, Guiyun Fan, Haiming Jin","doi":"10.1109/ICPADS53394.2021.00079","DOIUrl":null,"url":null,"abstract":"Electric self-driving taxis (es-taxis) draw great attention nowadays and hold the promise for future transportation due to their convenient and environment-friendly nature. However efficiently managing large-scale es-taxis remains an open problem. In this paper, we focus on scheduling es-taxis under charging budget constraint. Specifically, we design safe-controller to guarantee the satisfaction of budget constraint, and propose HAT framework to enlarge the sight for decision-making on deactivating es-taxis. As for the non-stationary induced by HAT, we analyze and limit its influence with theoretical guarantees. The overall framework Safe-HAT achieves superior performance in real-world data against other strong baselines.","PeriodicalId":309508,"journal":{"name":"2021 IEEE 27th International Conference on Parallel and Distributed Systems (ICPADS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 27th International Conference on Parallel and Distributed Systems (ICPADS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPADS53394.2021.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Electric self-driving taxis (es-taxis) draw great attention nowadays and hold the promise for future transportation due to their convenient and environment-friendly nature. However efficiently managing large-scale es-taxis remains an open problem. In this paper, we focus on scheduling es-taxis under charging budget constraint. Specifically, we design safe-controller to guarantee the satisfaction of budget constraint, and propose HAT framework to enlarge the sight for decision-making on deactivating es-taxis. As for the non-stationary induced by HAT, we analyze and limit its influence with theoretical guarantees. The overall framework Safe-HAT achieves superior performance in real-world data against other strong baselines.