{"title":"基于CatBoost的销售预测","authors":"Jingyi Ding, Ziqing Chen, Li Xiaolong, Baoxin Lai","doi":"10.1109/ITCA52113.2020.00138","DOIUrl":null,"url":null,"abstract":"Sales forecasting is a vital technology nowadays in the retail industry. With the help of advanced machine learning and deep learning algorithms, business owners can accurately predict the sales of thousands of products and make optimum decisions based on them. In this paper, we proposed a sales forecasting system based on CatBoosting. The algorithm is trained on the Walmart sales dataset, by far the largest dataset in this field. We performed effective feature engineering to boost prediction accuracy and speed. In the experiments, our model outperforms traditional machine learning methods like Linear Regression and SVM, reaching an RMSE of 0. 605. Our method doesn't need as much finetuning as other methods thus improving its generalization ability on other custom datasets, expanding its potential use.","PeriodicalId":103309,"journal":{"name":"2020 2nd International Conference on Information Technology and Computer Application (ITCA)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Sales Forecasting Based on CatBoost\",\"authors\":\"Jingyi Ding, Ziqing Chen, Li Xiaolong, Baoxin Lai\",\"doi\":\"10.1109/ITCA52113.2020.00138\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sales forecasting is a vital technology nowadays in the retail industry. With the help of advanced machine learning and deep learning algorithms, business owners can accurately predict the sales of thousands of products and make optimum decisions based on them. In this paper, we proposed a sales forecasting system based on CatBoosting. The algorithm is trained on the Walmart sales dataset, by far the largest dataset in this field. We performed effective feature engineering to boost prediction accuracy and speed. In the experiments, our model outperforms traditional machine learning methods like Linear Regression and SVM, reaching an RMSE of 0. 605. Our method doesn't need as much finetuning as other methods thus improving its generalization ability on other custom datasets, expanding its potential use.\",\"PeriodicalId\":103309,\"journal\":{\"name\":\"2020 2nd International Conference on Information Technology and Computer Application (ITCA)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Conference on Information Technology and Computer Application (ITCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITCA52113.2020.00138\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Information Technology and Computer Application (ITCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITCA52113.2020.00138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sales forecasting is a vital technology nowadays in the retail industry. With the help of advanced machine learning and deep learning algorithms, business owners can accurately predict the sales of thousands of products and make optimum decisions based on them. In this paper, we proposed a sales forecasting system based on CatBoosting. The algorithm is trained on the Walmart sales dataset, by far the largest dataset in this field. We performed effective feature engineering to boost prediction accuracy and speed. In the experiments, our model outperforms traditional machine learning methods like Linear Regression and SVM, reaching an RMSE of 0. 605. Our method doesn't need as much finetuning as other methods thus improving its generalization ability on other custom datasets, expanding its potential use.