Investigating relationships between ridesourcing and public transit using big data analysis and nonlinear machine learning: A case study of Shanghai, China
Xinghua Liu , Qian Ye , Ye Li , Kaidi Yang , Xuan Shao
{"title":"Investigating relationships between ridesourcing and public transit using big data analysis and nonlinear machine learning: A case study of Shanghai, China","authors":"Xinghua Liu , Qian Ye , Ye Li , Kaidi Yang , Xuan Shao","doi":"10.1016/j.tra.2024.104339","DOIUrl":null,"url":null,"abstract":"<div><div>Ridesourcing has transformed the landscape of passenger transportation systems in many cities worldwide, but whether it competes with or complements public transport (PT) is still debated, and the literature is limited. Therefore, this study aims to address this knowledge gap by measuring the relationships between the two systems and examining their determinants using a multisource big data analysis and nonlinear machine learning approach, with Shanghai, China, as the study case. First, we used the observed ridesourcing data in Shanghai to compute the fastest PT alternative for each ridesourcing trip based on the Amap open platform and subsequently compared the travel patterns (i.e., distance, duration, and generalized cost) of the two systems. Second, we propose a technical framework that considers the spatiotemporal availability and generalized cost acceptability of PT services, as well as the inclusivity of ridesourcing services, to accurately classify and identify the relationship between ridesourcing and PT systems. Finally, we explored the importance of four types of determinants, namely, ridesourcing characteristics, PT service, built environment, and weather, and their nonlinear effects on different relationships based on extreme gradient boosting and Shapley additive explanations. Our results show that the fastest PT alternative involves an average travel distance, generalized travel time, and generalized cost that are 1.16, 2.13, and 1.15 times greater, respectively, than those of ridesourcing. Competitive trips account for 36% of urban areas but only 16% in the suburbs. Furthermore, more than 70% and 10% of the ridesourcing trips in suburban areas are used to complement and integrate PT, respectively. The nonlinear machine learning framework identified the top three determinants of integration as travel cost, distance to the CBD, and travel time. Notably, determinants such as the distance to the CBD and temperature have nonlinear effects on these relationships. These findings offer valuable insights for designing multimodal transportation options that integrate the benefits of ridesourcing and PT.</div></div>","PeriodicalId":49421,"journal":{"name":"Transportation Research Part A-Policy and Practice","volume":"192 ","pages":"Article 104339"},"PeriodicalIF":6.3000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part A-Policy and Practice","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965856424003872","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Ridesourcing has transformed the landscape of passenger transportation systems in many cities worldwide, but whether it competes with or complements public transport (PT) is still debated, and the literature is limited. Therefore, this study aims to address this knowledge gap by measuring the relationships between the two systems and examining their determinants using a multisource big data analysis and nonlinear machine learning approach, with Shanghai, China, as the study case. First, we used the observed ridesourcing data in Shanghai to compute the fastest PT alternative for each ridesourcing trip based on the Amap open platform and subsequently compared the travel patterns (i.e., distance, duration, and generalized cost) of the two systems. Second, we propose a technical framework that considers the spatiotemporal availability and generalized cost acceptability of PT services, as well as the inclusivity of ridesourcing services, to accurately classify and identify the relationship between ridesourcing and PT systems. Finally, we explored the importance of four types of determinants, namely, ridesourcing characteristics, PT service, built environment, and weather, and their nonlinear effects on different relationships based on extreme gradient boosting and Shapley additive explanations. Our results show that the fastest PT alternative involves an average travel distance, generalized travel time, and generalized cost that are 1.16, 2.13, and 1.15 times greater, respectively, than those of ridesourcing. Competitive trips account for 36% of urban areas but only 16% in the suburbs. Furthermore, more than 70% and 10% of the ridesourcing trips in suburban areas are used to complement and integrate PT, respectively. The nonlinear machine learning framework identified the top three determinants of integration as travel cost, distance to the CBD, and travel time. Notably, determinants such as the distance to the CBD and temperature have nonlinear effects on these relationships. These findings offer valuable insights for designing multimodal transportation options that integrate the benefits of ridesourcing and PT.
期刊介绍:
Transportation Research: Part A contains papers of general interest in all passenger and freight transportation modes: policy analysis, formulation and evaluation; planning; interaction with the political, socioeconomic and physical environment; design, management and evaluation of transportation systems. Topics are approached from any discipline or perspective: economics, engineering, sociology, psychology, etc. Case studies, survey and expository papers are included, as are articles which contribute to unification of the field, or to an understanding of the comparative aspects of different systems. Papers which assess the scope for technological innovation within a social or political framework are also published. The journal is international, and places equal emphasis on the problems of industrialized and non-industrialized regions.
Part A''s aims and scope are complementary to Transportation Research Part B: Methodological, Part C: Emerging Technologies and Part D: Transport and Environment. Part E: Logistics and Transportation Review. Part F: Traffic Psychology and Behaviour. The complete set forms the most cohesive and comprehensive reference of current research in transportation science.