Y. E. Tan, Kai Sheng Teong, Mehlam Shabbir, Lee Kien Foo, Sook-Ling Chua
{"title":"Modelling Flight Delays in the Presence of Class Imbalance","authors":"Y. E. Tan, Kai Sheng Teong, Mehlam Shabbir, Lee Kien Foo, Sook-Ling Chua","doi":"10.1145/3299819.3299847","DOIUrl":null,"url":null,"abstract":"Flight delay is one of the common problems faced by many air passengers. Delays in flights not only bring about inconvenience to passengers, but also cost the airlines. To streamline travel experience, airlines have been leveraging on data analytics to predict flight delays. Although many prediction models have been proposed, they perform poorly especially on data that have imbalanced class distributions. Often, these models pay less attention to the minority 'delay' class, which are usually more relevant and important. In this paper, we address the issue of imbalanced class distributions to improve the overall classification performance in predicting flight delays. We validated our approach on a public airline on-time performance dataset.","PeriodicalId":119217,"journal":{"name":"Artificial Intelligence and Cloud Computing Conference","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3299819.3299847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Flight delay is one of the common problems faced by many air passengers. Delays in flights not only bring about inconvenience to passengers, but also cost the airlines. To streamline travel experience, airlines have been leveraging on data analytics to predict flight delays. Although many prediction models have been proposed, they perform poorly especially on data that have imbalanced class distributions. Often, these models pay less attention to the minority 'delay' class, which are usually more relevant and important. In this paper, we address the issue of imbalanced class distributions to improve the overall classification performance in predicting flight delays. We validated our approach on a public airline on-time performance dataset.