Deep Probabilistic Forecasting of Multivariate Count Data With “Sums and Shares” Distributions: A Case Study on Pedestrian Counts in a Multimodal Transport Hub
Paul de Nailly;Etienne Côme;Latifa Oukhellou;Allou Samé;Jacques Ferrière;Yasmine Merad-Boudia
{"title":"Deep Probabilistic Forecasting of Multivariate Count Data With “Sums and Shares” Distributions: A Case Study on Pedestrian Counts in a Multimodal Transport Hub","authors":"Paul de Nailly;Etienne Côme;Latifa Oukhellou;Allou Samé;Jacques Ferrière;Yasmine Merad-Boudia","doi":"10.1109/TITS.2024.3447282","DOIUrl":null,"url":null,"abstract":"Forecasting counts data in transportation areas can enrich passenger information for public transport passengers, who may thus better plan their trips. Moreover, forecasting with uncertainty is particularly important in the transportation domain, where the risk of poorly managed high demand is to be avoided. In this paper, we propose a new probabilistic prediction model well-suited for multivariate, overdispersed, and possibly correlated count data. This model combines the strength of the deep learning framework with the modeling of counts data allowed by “sums and shares” distributions. Indeed, deep learning models can handle uncertainty by relying on an abstraction of contextual data and by assuming output distributions. Our model learns a latent representation of the input data with the help of a recurrent neural network and then translates it into multivariate count predictions with a “sums and shares” distribution, well suited to tackle multivariate overdispersed and correlated count data. An extensive benchmark of the proposed model is carried out. We compare this model with seven others from the state-of-the-art probabilistic forecasting models using five open-source data (bikes, taxis, railways, traffic, wikipedia) and a specific use case on pedestrian counts within a multimodal transport hub in the Paris Region. Our model outperforms other models in situations where the data present temporal regularities. The results also highlight the potential of our model in the specific use case. Moreover, this forecasting represents an interesting way to predict short-term pedestrian counts in response to different events, such as concerts or transport disruptions.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"25 11","pages":"15687-15701"},"PeriodicalIF":7.9000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10666917/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Forecasting counts data in transportation areas can enrich passenger information for public transport passengers, who may thus better plan their trips. Moreover, forecasting with uncertainty is particularly important in the transportation domain, where the risk of poorly managed high demand is to be avoided. In this paper, we propose a new probabilistic prediction model well-suited for multivariate, overdispersed, and possibly correlated count data. This model combines the strength of the deep learning framework with the modeling of counts data allowed by “sums and shares” distributions. Indeed, deep learning models can handle uncertainty by relying on an abstraction of contextual data and by assuming output distributions. Our model learns a latent representation of the input data with the help of a recurrent neural network and then translates it into multivariate count predictions with a “sums and shares” distribution, well suited to tackle multivariate overdispersed and correlated count data. An extensive benchmark of the proposed model is carried out. We compare this model with seven others from the state-of-the-art probabilistic forecasting models using five open-source data (bikes, taxis, railways, traffic, wikipedia) and a specific use case on pedestrian counts within a multimodal transport hub in the Paris Region. Our model outperforms other models in situations where the data present temporal regularities. The results also highlight the potential of our model in the specific use case. Moreover, this forecasting represents an interesting way to predict short-term pedestrian counts in response to different events, such as concerts or transport disruptions.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.