{"title":"Passengers' requests clustering with k-prototype algorithm for the first-mile and last-mile (FMLM) shared-ride taxi service","authors":"Azimah Mohd , Lay Eng Teoh , Hooi Ling Khoo","doi":"10.1016/j.multra.2024.100132","DOIUrl":null,"url":null,"abstract":"<div><p>Shared mobility services are evolving globally. However, the first-mile and last-mile (FMLM) shared-ride taxi service poses a complex problem due to its large-scale nature and mixed-type variables (numeric and categorical features). As the input size of the problem increases exponentially, the absence of a known polynomial-time algorithm further complicates the finding of an optimal solution. Consequently, exploring potential solutions becomes computationally infeasible for more significant instances. Thus, this paper proposes using the k-prototype algorithm, an unsupervised learning approach, to cluster passengers' requests for FMLM shared-ride taxi service, which can reduce the problem's complexity via feasible clustering. Notably, the k-prototype algorithm is suitable for data sets with both numeric and categorical variables. It demonstrates a promising ability to handle large data sets effectively. As presented in this paper, the FMLM shared-ride taxi service prototypes and their unique characteristics could be optimally identified using the k-prototype algorithm with the Silhouette coefficient (as a performance index). By examining an illustrative case study with ten mixed-type variables of 946 passengers' requests, the results demonstrate the effective clustering of passengers' requests into three distinct prototypes, which can be characterized uniquely based on the temporal factors (pickup time of individual requests) and trip characteristics (including traveled distance, taxi type, as well as pickup and drop-off locations) that are significant in operating a competitive shared-ride taxi service. This paper is anticipated to reveal useful practical implications for the relevant stakeholders, especially the taxi service providers, in managing the FMLM shared-ride taxi services optimally to ensure an efficient and effective operating system.</p></div>","PeriodicalId":100933,"journal":{"name":"Multimodal Transportation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772586324000133/pdfft?md5=128170411b4f4b7fcb38a6a108487011&pid=1-s2.0-S2772586324000133-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimodal Transportation","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772586324000133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Shared mobility services are evolving globally. However, the first-mile and last-mile (FMLM) shared-ride taxi service poses a complex problem due to its large-scale nature and mixed-type variables (numeric and categorical features). As the input size of the problem increases exponentially, the absence of a known polynomial-time algorithm further complicates the finding of an optimal solution. Consequently, exploring potential solutions becomes computationally infeasible for more significant instances. Thus, this paper proposes using the k-prototype algorithm, an unsupervised learning approach, to cluster passengers' requests for FMLM shared-ride taxi service, which can reduce the problem's complexity via feasible clustering. Notably, the k-prototype algorithm is suitable for data sets with both numeric and categorical variables. It demonstrates a promising ability to handle large data sets effectively. As presented in this paper, the FMLM shared-ride taxi service prototypes and their unique characteristics could be optimally identified using the k-prototype algorithm with the Silhouette coefficient (as a performance index). By examining an illustrative case study with ten mixed-type variables of 946 passengers' requests, the results demonstrate the effective clustering of passengers' requests into three distinct prototypes, which can be characterized uniquely based on the temporal factors (pickup time of individual requests) and trip characteristics (including traveled distance, taxi type, as well as pickup and drop-off locations) that are significant in operating a competitive shared-ride taxi service. This paper is anticipated to reveal useful practical implications for the relevant stakeholders, especially the taxi service providers, in managing the FMLM shared-ride taxi services optimally to ensure an efficient and effective operating system.