Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset

Dongchang Liu, Shih-Fen Cheng, Yiping Yang
{"title":"Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset","authors":"Dongchang Liu, Shih-Fen Cheng, Yiping Yang","doi":"10.1109/ITSC.2015.297","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands.","PeriodicalId":124818,"journal":{"name":"2015 IEEE 18th International Conference on Intelligent Transportation Systems","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 18th International Conference on Intelligent Transportation Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC.2015.297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

In this paper, we introduce a variant of the density peaks clustering (DPC) approach for discovering demand hot spots from a low-frequency, low-quality taxi fleet operational dataset. From the literature, the DPC approach mainly uses density peaks as features to discover potential cluster centers, and this requires distances between all pairs of data points to be calculated. This implies that the DPC approach can only be applied to cases with relatively small numbers of data points. For the domain of urban taxi operations that we are interested in, we could have millions of demand points per day, and calculating all-pair distances between all demand points would be practically impossible, thus making DPC approach not applicable. To address this issue, we project all points to a density image and execute our variant of the DPC algorithm on the processed image. Experiment results show that our proposed DPC variant could get similar results as original DPC, yet with much shorter execution time and lower memory consumption. By running our DPC variant on a real-world dataset collected in Singapore, we show that there are indeed recurrent demand hot spots within the central business district that are not covered by the current taxi stand design. Our approach could be of use to both taxi fleet operator and traffic planners in guiding drivers and setting up taxi stands.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
城市规模出租车车队数据集中需求热点发现的密度峰聚类方法
在本文中,我们引入了密度峰值聚类(DPC)方法的一种变体,用于从低频、低质量出租车车队运营数据集中发现需求热点。从文献来看,DPC方法主要使用密度峰作为特征来发现潜在的聚类中心,这需要计算所有数据点对之间的距离。这意味着DPC方法只能应用于数据点相对较少的情况。对于我们感兴趣的城市出租车运营领域,我们每天可能有数百万个需求点,并且计算所有需求点之间的全对距离实际上是不可能的,因此使得DPC方法不适用。为了解决这个问题,我们将所有点投影到密度图像上,并在处理后的图像上执行我们的DPC算法变体。实验结果表明,我们提出的DPC变体可以获得与原始DPC相似的结果,但执行时间更短,内存消耗更低。通过在新加坡收集的真实数据集上运行我们的DPC变体,我们表明,在中央商务区确实存在当前出租车车站设计未涵盖的经常性需求热点。我们的方法可为的士车队营办商及交通规划人员指引司机及设立的士候站提供参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Blind Area Traffic Prediction Using High Definition Maps and LiDAR for Safe Driving Assist ZEM 2 All Project (Zero Emission Mobility to All) Economic Analysis Based on the Interrelationships of the OLEV System Components Intelligent Driver Monitoring Based on Physiological Sensor Signals: Application Using Camera On Identifying Dynamic Intersections in Large Cities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1