W. C. Ku, G. Jagadeesh, Alok Prakash, T. Srikanthan
{"title":"A clustering-based approach for data-driven imputation of missing traffic data","authors":"W. C. Ku, G. Jagadeesh, Alok Prakash, T. Srikanthan","doi":"10.1109/FISTS.2016.7552320","DOIUrl":null,"url":null,"abstract":"The problem of missing samples in road traffic data undermines the performance of intelligent transportation applications. This paper proposes a data-driven imputation method that exploits the spatial and temporal relationships existing between the traffic flows of multiple road segments that are correlated with each other. The K-means clustering technique is used to group together road segments with similar traffic flow patterns. Next, a deep-learning model based on stacked denoising autoencoders is constructed for each group of road segments to extract their spatial-temporal relationships and use them for imputing the missing data points. Experiments conducted with real traffic data demonstrate that the imputation accuracy of the proposed method is robust under different missing data rates.","PeriodicalId":179987,"journal":{"name":"2016 IEEE Forum on Integrated and Sustainable Transportation Systems (FISTS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"43","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Forum on Integrated and Sustainable Transportation Systems (FISTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FISTS.2016.7552320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 43
Abstract
The problem of missing samples in road traffic data undermines the performance of intelligent transportation applications. This paper proposes a data-driven imputation method that exploits the spatial and temporal relationships existing between the traffic flows of multiple road segments that are correlated with each other. The K-means clustering technique is used to group together road segments with similar traffic flow patterns. Next, a deep-learning model based on stacked denoising autoencoders is constructed for each group of road segments to extract their spatial-temporal relationships and use them for imputing the missing data points. Experiments conducted with real traffic data demonstrate that the imputation accuracy of the proposed method is robust under different missing data rates.