公交行程模拟,开发公共交通预测算法

Thilo Reich , Marcin Budka , David Hulbert
{"title":"公交行程模拟,开发公共交通预测算法","authors":"Thilo Reich ,&nbsp;Marcin Budka ,&nbsp;David Hulbert","doi":"10.1016/j.socl.2021.100029","DOIUrl":null,"url":null,"abstract":"<div><p>Encouraging the use of public transport is essential to combat congestion and pollution in an urban environment. To achieve this, the reliability of arrival time prediction should be improved as this is one area of improvement frequently requested by passengers. The development of accurate predictive algorithms requires good quality data, which is often not available. Here we demonstrate a method to synthesise data using a reference curve approach derived from very limited real world data without reliable ground truth. This approach allows the controlled introduction of artefacts and noise to simulate their impact on prediction accuracy. To illustrate these impacts, a recurrent neural network next-step prediction is used to compare different scenarios in two different UK cities. The results show that a realistic data synthesis is possible, allowing for controlled testing of predictive algorithms. It also highlights the importance of reliable data transmission to gain such data from real world sources. Our main contribution is the demonstration of a synthetic data generator for public transport data, which can be used to compensate for low data quality. We further show that this data generator can be used to develop and enhance predictive algorithms in the context of urban bus networks if high-quality data is limited, by mixing synthetic and real data.</p></div>","PeriodicalId":101169,"journal":{"name":"Soft Computing Letters","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666222121000174/pdfft?md5=ac9f2e119a87da7041d08bee1f99991e&pid=1-s2.0-S2666222121000174-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Bus journey simulation to develop public transport predictive algorithms\",\"authors\":\"Thilo Reich ,&nbsp;Marcin Budka ,&nbsp;David Hulbert\",\"doi\":\"10.1016/j.socl.2021.100029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Encouraging the use of public transport is essential to combat congestion and pollution in an urban environment. To achieve this, the reliability of arrival time prediction should be improved as this is one area of improvement frequently requested by passengers. The development of accurate predictive algorithms requires good quality data, which is often not available. Here we demonstrate a method to synthesise data using a reference curve approach derived from very limited real world data without reliable ground truth. This approach allows the controlled introduction of artefacts and noise to simulate their impact on prediction accuracy. To illustrate these impacts, a recurrent neural network next-step prediction is used to compare different scenarios in two different UK cities. The results show that a realistic data synthesis is possible, allowing for controlled testing of predictive algorithms. It also highlights the importance of reliable data transmission to gain such data from real world sources. Our main contribution is the demonstration of a synthetic data generator for public transport data, which can be used to compensate for low data quality. We further show that this data generator can be used to develop and enhance predictive algorithms in the context of urban bus networks if high-quality data is limited, by mixing synthetic and real data.</p></div>\",\"PeriodicalId\":101169,\"journal\":{\"name\":\"Soft Computing Letters\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666222121000174/pdfft?md5=ac9f2e119a87da7041d08bee1f99991e&pid=1-s2.0-S2666222121000174-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Soft Computing Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666222121000174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing Letters","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666222121000174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

鼓励使用公共交通工具是解决城市环境拥挤和污染问题的关键。为了实现这一目标,应提高到达时间预测的可靠性,因为这是乘客经常要求改进的一个领域。开发准确的预测算法需要高质量的数据,而这些数据通常是无法获得的。在这里,我们展示了一种使用参考曲线方法合成数据的方法,该方法来源于非常有限的真实世界数据,没有可靠的地面真值。这种方法允许可控地引入伪影和噪声来模拟它们对预测精度的影响。为了说明这些影响,使用递归神经网络下一步预测来比较两个不同英国城市的不同情景。结果表明,现实的数据合成是可能的,允许对预测算法进行控制测试。它还强调了可靠的数据传输对于从真实世界来源获得此类数据的重要性。我们的主要贡献是演示了公共交通数据的合成数据生成器,它可以用来弥补数据质量低的问题。我们进一步表明,在高质量数据有限的情况下,通过混合合成数据和真实数据,该数据生成器可用于开发和增强城市公交网络背景下的预测算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Bus journey simulation to develop public transport predictive algorithms

Encouraging the use of public transport is essential to combat congestion and pollution in an urban environment. To achieve this, the reliability of arrival time prediction should be improved as this is one area of improvement frequently requested by passengers. The development of accurate predictive algorithms requires good quality data, which is often not available. Here we demonstrate a method to synthesise data using a reference curve approach derived from very limited real world data without reliable ground truth. This approach allows the controlled introduction of artefacts and noise to simulate their impact on prediction accuracy. To illustrate these impacts, a recurrent neural network next-step prediction is used to compare different scenarios in two different UK cities. The results show that a realistic data synthesis is possible, allowing for controlled testing of predictive algorithms. It also highlights the importance of reliable data transmission to gain such data from real world sources. Our main contribution is the demonstration of a synthetic data generator for public transport data, which can be used to compensate for low data quality. We further show that this data generator can be used to develop and enhance predictive algorithms in the context of urban bus networks if high-quality data is limited, by mixing synthetic and real data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Editorial: Socio-cultural inspired Metaheuristics A fuzzy optimization model for methane gas production from municipal solid waste A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method Analysis of French phonetic idiosyncrasies for accent recognition An ensemble machine learning model for the prediction of danger zones: Towards a global counter-terrorism
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1