{"title":"Enhancing transport mode classification benchmark by integrating spatial independence with multimodal dataset","authors":"","doi":"10.1016/j.tbs.2024.100929","DOIUrl":null,"url":null,"abstract":"<div><div>The transport network is a complex system that benefits from detailed data on user mobility. Analyzing user trajectories through clustering or classification methods can provide valuable insights into mobility patterns. Extracting transport modes from these trajectories using classification methods enhances the understanding of user mobility. The complexity of classification methods varies, with some classifying a few transport modes, such as walking, running, bicycling, and driving. In contrast, others classify up to seven modes or use private, unpublished datasets. A key challenge in transport mode classification is ensuring the comparability of different methods across various contexts. Additionally, comparing results is further complicated by the insufficient use of existing standardized benchmark, which in the case of transport mode classification, must contain a structured testing framework and a dataset on which the testing will be conducted. This research introduces a process for collecting data to develop a new transport mode classification dataset. The goal is to enhance the benchmark by evaluating classification methods across diverse traffic patterns and geographic areas, thereby assessing their spatial independence. Spatial independence is crucial because it ensures that classification methods remain accurate regardless of geographic variations. This improves comparability by enabling consistent evaluation of methods across regions, as the improved benchmark addresses spatial independence and ensures robustness for real-world deployment. The current benchmark in literature examines three types of independence: user, position, and time independence. Our tests employ a multilevel method based on Transition State Matrices (TSMs) and the Random Forest (RF) algorithm for transport mode classification. The results demonstrate that the multilevel method maintains spatial independence and achieves higher accuracy compared to the original benchmark problem.</div></div>","PeriodicalId":51534,"journal":{"name":"Travel Behaviour and Society","volume":null,"pages":null},"PeriodicalIF":5.1000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Travel Behaviour and Society","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214367X24001923","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
Abstract
The transport network is a complex system that benefits from detailed data on user mobility. Analyzing user trajectories through clustering or classification methods can provide valuable insights into mobility patterns. Extracting transport modes from these trajectories using classification methods enhances the understanding of user mobility. The complexity of classification methods varies, with some classifying a few transport modes, such as walking, running, bicycling, and driving. In contrast, others classify up to seven modes or use private, unpublished datasets. A key challenge in transport mode classification is ensuring the comparability of different methods across various contexts. Additionally, comparing results is further complicated by the insufficient use of existing standardized benchmark, which in the case of transport mode classification, must contain a structured testing framework and a dataset on which the testing will be conducted. This research introduces a process for collecting data to develop a new transport mode classification dataset. The goal is to enhance the benchmark by evaluating classification methods across diverse traffic patterns and geographic areas, thereby assessing their spatial independence. Spatial independence is crucial because it ensures that classification methods remain accurate regardless of geographic variations. This improves comparability by enabling consistent evaluation of methods across regions, as the improved benchmark addresses spatial independence and ensures robustness for real-world deployment. The current benchmark in literature examines three types of independence: user, position, and time independence. Our tests employ a multilevel method based on Transition State Matrices (TSMs) and the Random Forest (RF) algorithm for transport mode classification. The results demonstrate that the multilevel method maintains spatial independence and achieves higher accuracy compared to the original benchmark problem.
期刊介绍:
Travel Behaviour and Society is an interdisciplinary journal publishing high-quality original papers which report leading edge research in theories, methodologies and applications concerning transportation issues and challenges which involve the social and spatial dimensions. In particular, it provides a discussion forum for major research in travel behaviour, transportation infrastructure, transportation and environmental issues, mobility and social sustainability, transportation geographic information systems (TGIS), transportation and quality of life, transportation data collection and analysis, etc.