Yang Zhou, Quan Yuan, Chao Yang, Tangyi Guo, Xiaoyi Ma, Wenyong Sun, Tianren Yang
{"title":"Residence-Workplace Identification and Validation Based on Mobile Phone Data: A Case Study in a Large-Scale Urban Agglomeration in China","authors":"Yang Zhou, Quan Yuan, Chao Yang, Tangyi Guo, Xiaoyi Ma, Wenyong Sun, Tianren Yang","doi":"10.1177/03611981241270163","DOIUrl":null,"url":null,"abstract":"Residence-workplace identification is a fundamental task in mobile phone data analysis, but it faces certain challenges in sparse data processing and results validation because of the lack of ground-truth labels. Previous studies have generally relied on frequency-based methods for inference and trip-based metrics for validation, posing limitations in reliability and efficiency. This paper aims to fill this gap by developing a systematic approach that ranges from data error categorization and processing, feature relevance examination and parameter optimization, and the development of performance metrics considering both residence and workplace validation. For residence-workplace identification, we use a spatiotemporal closeness criterion to deal with the sparsity of data and develop effective dwelling time to enhance frequency-based methods, using one-month cellular signaling records from nine cities in the Yangtze River Delta urban agglomeration in China. For validation, we propose a residence-workplace pair metric based on the population-adjusted number of users, enabling more efficient evaluation of home and work locations than trip-based metrics. Results show that the mean absolute percentage errors (MAPEs) for the Nanjing and Shanghai cases are 5.04% and 8.46%, respectively. Adopted and verified in the large-scale urban agglomeration, the proposed method would be reliable for extracting residence and workplace from low-resolution mobile phone data and contributing to a more accurate identification of urban commuting patterns.","PeriodicalId":517391,"journal":{"name":"Transportation Research Record: Journal of the Transportation Research Board","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Record: Journal of the Transportation Research Board","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/03611981241270163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Residence-workplace identification is a fundamental task in mobile phone data analysis, but it faces certain challenges in sparse data processing and results validation because of the lack of ground-truth labels. Previous studies have generally relied on frequency-based methods for inference and trip-based metrics for validation, posing limitations in reliability and efficiency. This paper aims to fill this gap by developing a systematic approach that ranges from data error categorization and processing, feature relevance examination and parameter optimization, and the development of performance metrics considering both residence and workplace validation. For residence-workplace identification, we use a spatiotemporal closeness criterion to deal with the sparsity of data and develop effective dwelling time to enhance frequency-based methods, using one-month cellular signaling records from nine cities in the Yangtze River Delta urban agglomeration in China. For validation, we propose a residence-workplace pair metric based on the population-adjusted number of users, enabling more efficient evaluation of home and work locations than trip-based metrics. Results show that the mean absolute percentage errors (MAPEs) for the Nanjing and Shanghai cases are 5.04% and 8.46%, respectively. Adopted and verified in the large-scale urban agglomeration, the proposed method would be reliable for extracting residence and workplace from low-resolution mobile phone data and contributing to a more accurate identification of urban commuting patterns.