Pengxiang Zhao, Aoyong Li, P. Pilesjö, A. Mansourian
{"title":"A machine learning based approach for predicting usage efficiency of shared e-scooters using vehicle availability data","authors":"Pengxiang Zhao, Aoyong Li, P. Pilesjö, A. Mansourian","doi":"10.5194/agile-giss-3-20-2022","DOIUrl":null,"url":null,"abstract":"Abstract. Shared electric scooters (e-scooters) have been rapidly growing in popularity across Europe over the past three years, which can bring various environmental and socioeconomic benefits. However, how to further improve the usage efficiency of shared e-scooters is still a major concern for micro-mobility operators and city planners. This paper proposes a machine learning based approach to predict the usage efficiency of shared e-scooters using GPS-based vehicle availability data. First, the usage efficiency of shared e-scooters is measured with the indicator Time to Booking at the trip level. Second, ten exploratory variables in time and space are calculated as features for the prediction based on the e-scooter trips and other related data. Last, three typical machine learning methods, including logistical regression, artificial neural network and random forest are applied to predict the usage efficiency by inputting the features. Besides, the variable importance is evaluated by taking the random forest model as an example. The results show that the random forest model yields the best prediction performance (accuracy = 71.2%, F1 = 78.0%), and the variables like the hour of day and POI density present high variable importance. The findings of this study will be beneficial for micro-mobility operators and city planners to design policies and strategies for further improving the usage efficiency of e-scooter sharing services.\n","PeriodicalId":116168,"journal":{"name":"AGILE: GIScience Series","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AGILE: GIScience Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/agile-giss-3-20-2022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Abstract. Shared electric scooters (e-scooters) have been rapidly growing in popularity across Europe over the past three years, which can bring various environmental and socioeconomic benefits. However, how to further improve the usage efficiency of shared e-scooters is still a major concern for micro-mobility operators and city planners. This paper proposes a machine learning based approach to predict the usage efficiency of shared e-scooters using GPS-based vehicle availability data. First, the usage efficiency of shared e-scooters is measured with the indicator Time to Booking at the trip level. Second, ten exploratory variables in time and space are calculated as features for the prediction based on the e-scooter trips and other related data. Last, three typical machine learning methods, including logistical regression, artificial neural network and random forest are applied to predict the usage efficiency by inputting the features. Besides, the variable importance is evaluated by taking the random forest model as an example. The results show that the random forest model yields the best prediction performance (accuracy = 71.2%, F1 = 78.0%), and the variables like the hour of day and POI density present high variable importance. The findings of this study will be beneficial for micro-mobility operators and city planners to design policies and strategies for further improving the usage efficiency of e-scooter sharing services.
摘要共享电动滑板车(e-scooters)在过去三年中在欧洲迅速普及,它可以带来各种环境和社会经济效益。然而,如何进一步提高共享电动滑板车的使用效率仍然是微出行运营商和城市规划者关注的主要问题。本文提出了一种基于机器学习的方法,利用基于gps的车辆可用性数据预测共享电动滑板车的使用效率。首先,以出行层面的Time to Booking指标衡量共享电动滑板车的使用效率。其次,基于电动滑板车出行等相关数据,计算10个时间和空间上的探索性变量作为特征进行预测。最后,应用逻辑回归、人工神经网络和随机森林三种典型的机器学习方法,通过输入特征来预测使用效率。并以随机森林模型为例,对变量重要性进行了评价。结果表明,随机森林模型的预测效果最好(准确率为71.2%,F1 = 78.0%),且小时数、POI密度等变量具有较高的变量重要性。本研究结果将有助于微出行运营商和城市规划者制定政策和策略,进一步提高电动滑板车共享服务的使用效率。