A clustering-based approach for data-driven imputation of missing traffic data

2016 IEEE Forum on Integrated and Sustainable Transportation Systems (FISTS) Pub Date : 2016-07-10 DOI:10.1109/FISTS.2016.7552320

W. C. Ku, G. Jagadeesh, Alok Prakash, T. Srikanthan

引用次数: 43

Abstract

The problem of missing samples in road traffic data undermines the performance of intelligent transportation applications. This paper proposes a data-driven imputation method that exploits the spatial and temporal relationships existing between the traffic flows of multiple road segments that are correlated with each other. The K-means clustering technique is used to group together road segments with similar traffic flow patterns. Next, a deep-learning model based on stacked denoising autoencoders is constructed for each group of road segments to extract their spatial-temporal relationships and use them for imputing the missing data points. Experiments conducted with real traffic data demonstrate that the imputation accuracy of the proposed method is robust under different missing data rates.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种基于聚类的数据驱动的缺失交通数据补全方法

道路交通数据的样本缺失问题严重影响了智能交通应用的性能。本文提出了一种数据驱动的插值方法，该方法利用了多个相互关联的路段交通流之间存在的时空关系。使用K-means聚类技术将具有相似交通流模式的路段分组在一起。接下来，对每组路段构建基于堆叠去噪自编码器的深度学习模型，提取其时空关系，并利用它们来输入缺失的数据点。用实际交通数据进行的实验表明，在不同缺失率下，该方法具有较好的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE Forum on Integrated and Sustainable Transportation Systems (FISTS)

自引率

0.00%

发文量