利用大数据分析和非线性机器学习研究拼车与公共交通之间的关系:以中国上海为例

IF 6.3 1区 工程技术 Q1 ECONOMICS Transportation Research Part A-Policy and Practice Pub Date : 2024-11-29 DOI:10.1016/j.tra.2024.104339
Xinghua Liu , Qian Ye , Ye Li , Kaidi Yang , Xuan Shao
{"title":"利用大数据分析和非线性机器学习研究拼车与公共交通之间的关系:以中国上海为例","authors":"Xinghua Liu ,&nbsp;Qian Ye ,&nbsp;Ye Li ,&nbsp;Kaidi Yang ,&nbsp;Xuan Shao","doi":"10.1016/j.tra.2024.104339","DOIUrl":null,"url":null,"abstract":"<div><div>Ridesourcing has transformed the landscape of passenger transportation systems in many cities worldwide, but whether it competes with or complements public transport (PT) is still debated, and the literature is limited. Therefore, this study aims to address this knowledge gap by measuring the relationships between the two systems and examining their determinants using a multisource big data analysis and nonlinear machine learning approach, with Shanghai, China, as the study case. First, we used the observed ridesourcing data in Shanghai to compute the fastest PT alternative for each ridesourcing trip based on the Amap open platform and subsequently compared the travel patterns (i.e., distance, duration, and generalized cost) of the two systems. Second, we propose a technical framework that considers the spatiotemporal availability and generalized cost acceptability of PT services, as well as the inclusivity of ridesourcing services, to accurately classify and identify the relationship between ridesourcing and PT systems. Finally, we explored the importance of four types of determinants, namely, ridesourcing characteristics, PT service, built environment, and weather, and their nonlinear effects on different relationships based on extreme gradient boosting and Shapley additive explanations. Our results show that the fastest PT alternative involves an average travel distance, generalized travel time, and generalized cost that are 1.16, 2.13, and 1.15 times greater, respectively, than those of ridesourcing. Competitive trips account for 36% of urban areas but only 16% in the suburbs. Furthermore, more than 70% and 10% of the ridesourcing trips in suburban areas are used to complement and integrate PT, respectively. The nonlinear machine learning framework identified the top three determinants of integration as travel cost, distance to the CBD, and travel time. Notably, determinants such as the distance to the CBD and temperature have nonlinear effects on these relationships. These findings offer valuable insights for designing multimodal transportation options that integrate the benefits of ridesourcing and PT.</div></div>","PeriodicalId":49421,"journal":{"name":"Transportation Research Part A-Policy and Practice","volume":"192 ","pages":"Article 104339"},"PeriodicalIF":6.3000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Investigating relationships between ridesourcing and public transit using big data analysis and nonlinear machine learning: A case study of Shanghai, China\",\"authors\":\"Xinghua Liu ,&nbsp;Qian Ye ,&nbsp;Ye Li ,&nbsp;Kaidi Yang ,&nbsp;Xuan Shao\",\"doi\":\"10.1016/j.tra.2024.104339\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Ridesourcing has transformed the landscape of passenger transportation systems in many cities worldwide, but whether it competes with or complements public transport (PT) is still debated, and the literature is limited. Therefore, this study aims to address this knowledge gap by measuring the relationships between the two systems and examining their determinants using a multisource big data analysis and nonlinear machine learning approach, with Shanghai, China, as the study case. First, we used the observed ridesourcing data in Shanghai to compute the fastest PT alternative for each ridesourcing trip based on the Amap open platform and subsequently compared the travel patterns (i.e., distance, duration, and generalized cost) of the two systems. Second, we propose a technical framework that considers the spatiotemporal availability and generalized cost acceptability of PT services, as well as the inclusivity of ridesourcing services, to accurately classify and identify the relationship between ridesourcing and PT systems. Finally, we explored the importance of four types of determinants, namely, ridesourcing characteristics, PT service, built environment, and weather, and their nonlinear effects on different relationships based on extreme gradient boosting and Shapley additive explanations. Our results show that the fastest PT alternative involves an average travel distance, generalized travel time, and generalized cost that are 1.16, 2.13, and 1.15 times greater, respectively, than those of ridesourcing. Competitive trips account for 36% of urban areas but only 16% in the suburbs. Furthermore, more than 70% and 10% of the ridesourcing trips in suburban areas are used to complement and integrate PT, respectively. The nonlinear machine learning framework identified the top three determinants of integration as travel cost, distance to the CBD, and travel time. Notably, determinants such as the distance to the CBD and temperature have nonlinear effects on these relationships. These findings offer valuable insights for designing multimodal transportation options that integrate the benefits of ridesourcing and PT.</div></div>\",\"PeriodicalId\":49421,\"journal\":{\"name\":\"Transportation Research Part A-Policy and Practice\",\"volume\":\"192 \",\"pages\":\"Article 104339\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part A-Policy and Practice\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0965856424003872\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part A-Policy and Practice","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965856424003872","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

拼车已经改变了全球许多城市客运系统的格局,但它是否与公共交通(PT)竞争或互补仍存在争议,而且文献有限。因此,本研究旨在通过测量两个系统之间的关系并使用多源大数据分析和非线性机器学习方法检查其决定因素来解决这一知识差距,并以中国上海为研究案例。首先,我们利用上海观测到的拼车数据,在高德地图开放平台上计算每趟拼车行程的最快PT替代方案,然后比较两种系统的出行模式(即距离、持续时间和广义成本)。其次,我们提出了一个技术框架,考虑了PT服务的时空可用性和广义成本可接受性,以及拼车服务的包容性,以准确分类和识别拼车与PT系统之间的关系。最后,基于极端梯度增强和Shapley加性解释,探讨了拼车特征、公交服务、建筑环境和天气四类决定因素的重要性,以及它们对不同关系的非线性影响。研究结果表明,最快速公交方案的平均出行距离、广义出行时间和广义成本分别是拼车方案的1.16倍、2.13倍和1.15倍。竞争性出行占城市地区的36%,但在郊区仅占16%。此外,在郊区,超过70%和10%的拼车出行分别用于补充和整合PT。非线性机器学习框架确定了出行成本、到CBD的距离和出行时间这三个决定因素。值得注意的是,诸如到CBD的距离和温度等决定因素对这些关系具有非线性影响。这些发现为设计多式联运方案提供了有价值的见解,这些方案将拼车和PT的好处结合起来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Investigating relationships between ridesourcing and public transit using big data analysis and nonlinear machine learning: A case study of Shanghai, China
Ridesourcing has transformed the landscape of passenger transportation systems in many cities worldwide, but whether it competes with or complements public transport (PT) is still debated, and the literature is limited. Therefore, this study aims to address this knowledge gap by measuring the relationships between the two systems and examining their determinants using a multisource big data analysis and nonlinear machine learning approach, with Shanghai, China, as the study case. First, we used the observed ridesourcing data in Shanghai to compute the fastest PT alternative for each ridesourcing trip based on the Amap open platform and subsequently compared the travel patterns (i.e., distance, duration, and generalized cost) of the two systems. Second, we propose a technical framework that considers the spatiotemporal availability and generalized cost acceptability of PT services, as well as the inclusivity of ridesourcing services, to accurately classify and identify the relationship between ridesourcing and PT systems. Finally, we explored the importance of four types of determinants, namely, ridesourcing characteristics, PT service, built environment, and weather, and their nonlinear effects on different relationships based on extreme gradient boosting and Shapley additive explanations. Our results show that the fastest PT alternative involves an average travel distance, generalized travel time, and generalized cost that are 1.16, 2.13, and 1.15 times greater, respectively, than those of ridesourcing. Competitive trips account for 36% of urban areas but only 16% in the suburbs. Furthermore, more than 70% and 10% of the ridesourcing trips in suburban areas are used to complement and integrate PT, respectively. The nonlinear machine learning framework identified the top three determinants of integration as travel cost, distance to the CBD, and travel time. Notably, determinants such as the distance to the CBD and temperature have nonlinear effects on these relationships. These findings offer valuable insights for designing multimodal transportation options that integrate the benefits of ridesourcing and PT.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
13.20
自引率
7.80%
发文量
257
审稿时长
9.8 months
期刊介绍: Transportation Research: Part A contains papers of general interest in all passenger and freight transportation modes: policy analysis, formulation and evaluation; planning; interaction with the political, socioeconomic and physical environment; design, management and evaluation of transportation systems. Topics are approached from any discipline or perspective: economics, engineering, sociology, psychology, etc. Case studies, survey and expository papers are included, as are articles which contribute to unification of the field, or to an understanding of the comparative aspects of different systems. Papers which assess the scope for technological innovation within a social or political framework are also published. The journal is international, and places equal emphasis on the problems of industrialized and non-industrialized regions. Part A''s aims and scope are complementary to Transportation Research Part B: Methodological, Part C: Emerging Technologies and Part D: Transport and Environment. Part E: Logistics and Transportation Review. Part F: Traffic Psychology and Behaviour. The complete set forms the most cohesive and comprehensive reference of current research in transportation science.
期刊最新文献
A multi-objective model for fair location of stations and geofence parking area to address disorderly parking in free-floating micromobility systems Home delivery vs. out-of-home delivery: Syncretic value-based strategies for urban last-mile e-commerce logistics Pricing parking for fairness — A simulation study based on an empirically calibrated model of parking behavior The relationship between individual employment probabilities and accessibility to matching jobs: A study of the Netherlands Exploring the relationship between perceived bikeability and gender-inclusive micromobility usage: A study across 53 French cities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1