{"title":"考虑出行者偏好的路线选择强化学习","authors":"","doi":"10.1080/19427867.2023.2231689","DOIUrl":null,"url":null,"abstract":"<div><p>Travelers always perform some preference during the decision-making process. The preference will affect the decision results and can be improved by continuously learning. In order to understand the influence of individual preference on travel behavior choice , two individual preferences, including indifference preference and compulsive preference are considered in the paper. Two updating mechanisms of compulsive preference are proposed to obtain the choosing probability of all alternatives. Reinforcement learning models are established integrating the gain stimulating and loss stimulating considering expected utility. Nguyen Dupuis network is adopted for numerical simulation to study the updating process. Simulation results denote that the equilibrium state is much more efficient when preference learning mechanism is considered comparing with the traditional stochastic user equilibrium model, and can decrease the total travel time greatly, which can be applied for urban traffic management. Personalized traffic guidance is the effective solution to traffic congestion in the future</p></div>","PeriodicalId":48974,"journal":{"name":"Transportation Letters-The International Journal of Transportation Research","volume":"16 7","pages":"Pages 658-671"},"PeriodicalIF":3.3000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning of route choice considering traveler’s preference\",\"authors\":\"\",\"doi\":\"10.1080/19427867.2023.2231689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Travelers always perform some preference during the decision-making process. The preference will affect the decision results and can be improved by continuously learning. In order to understand the influence of individual preference on travel behavior choice , two individual preferences, including indifference preference and compulsive preference are considered in the paper. Two updating mechanisms of compulsive preference are proposed to obtain the choosing probability of all alternatives. Reinforcement learning models are established integrating the gain stimulating and loss stimulating considering expected utility. Nguyen Dupuis network is adopted for numerical simulation to study the updating process. Simulation results denote that the equilibrium state is much more efficient when preference learning mechanism is considered comparing with the traditional stochastic user equilibrium model, and can decrease the total travel time greatly, which can be applied for urban traffic management. Personalized traffic guidance is the effective solution to traffic congestion in the future</p></div>\",\"PeriodicalId\":48974,\"journal\":{\"name\":\"Transportation Letters-The International Journal of Transportation Research\",\"volume\":\"16 7\",\"pages\":\"Pages 658-671\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Letters-The International Journal of Transportation Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/org/science/article/pii/S1942786723002242\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Letters-The International Journal of Transportation Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/org/science/article/pii/S1942786723002242","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Reinforcement learning of route choice considering traveler’s preference
Travelers always perform some preference during the decision-making process. The preference will affect the decision results and can be improved by continuously learning. In order to understand the influence of individual preference on travel behavior choice , two individual preferences, including indifference preference and compulsive preference are considered in the paper. Two updating mechanisms of compulsive preference are proposed to obtain the choosing probability of all alternatives. Reinforcement learning models are established integrating the gain stimulating and loss stimulating considering expected utility. Nguyen Dupuis network is adopted for numerical simulation to study the updating process. Simulation results denote that the equilibrium state is much more efficient when preference learning mechanism is considered comparing with the traditional stochastic user equilibrium model, and can decrease the total travel time greatly, which can be applied for urban traffic management. Personalized traffic guidance is the effective solution to traffic congestion in the future
期刊介绍:
Transportation Letters: The International Journal of Transportation Research is a quarterly journal that publishes high-quality peer-reviewed and mini-review papers as well as technical notes and book reviews on the state-of-the-art in transportation research.
The focus of Transportation Letters is on analytical and empirical findings, methodological papers, and theoretical and conceptual insights across all areas of research. Review resource papers that merge descriptions of the state-of-the-art with innovative and new methodological, theoretical, and conceptual insights spanning all areas of transportation research are invited and of particular interest.