{"title":"信用卡客户端违约预测分析","authors":"Alžbeta Bačová, F. Babič","doi":"10.1109/SAMI50585.2021.9378671","DOIUrl":null,"url":null,"abstract":"Predictive analytics has a significant potential to support different decision processes. We aimed to compare various machine learning algorithms for the selected task, which predicts credit card clients' default based on the free available data. We chose Random Forest, AdaBoost, XGBoost, and Gradient Boosting algorithm and applied them to a prepared data sample. We experimentally evaluated the classification models within metrics like accuracy, precision, recall, ROC, and AUC. The results show a very similar performance of the selected algorithms on this dataset. The Gradient boosting (0.7828) achieved the best performance within AUC, but the best precision for target class 1 reached the Bagging algorithm (0.72). The simple data processing brought only minimal improvements in individual metrics. Our results are comparable to the mentioned studies instead of MCC metrics that resulted in better value (0.4111) achieved by the Gradient Boosting model.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Predictive Analytics for Default of Credit Card Clients\",\"authors\":\"Alžbeta Bačová, F. Babič\",\"doi\":\"10.1109/SAMI50585.2021.9378671\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predictive analytics has a significant potential to support different decision processes. We aimed to compare various machine learning algorithms for the selected task, which predicts credit card clients' default based on the free available data. We chose Random Forest, AdaBoost, XGBoost, and Gradient Boosting algorithm and applied them to a prepared data sample. We experimentally evaluated the classification models within metrics like accuracy, precision, recall, ROC, and AUC. The results show a very similar performance of the selected algorithms on this dataset. The Gradient boosting (0.7828) achieved the best performance within AUC, but the best precision for target class 1 reached the Bagging algorithm (0.72). The simple data processing brought only minimal improvements in individual metrics. Our results are comparable to the mentioned studies instead of MCC metrics that resulted in better value (0.4111) achieved by the Gradient Boosting model.\",\"PeriodicalId\":402414,\"journal\":{\"name\":\"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAMI50585.2021.9378671\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMI50585.2021.9378671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predictive Analytics for Default of Credit Card Clients
Predictive analytics has a significant potential to support different decision processes. We aimed to compare various machine learning algorithms for the selected task, which predicts credit card clients' default based on the free available data. We chose Random Forest, AdaBoost, XGBoost, and Gradient Boosting algorithm and applied them to a prepared data sample. We experimentally evaluated the classification models within metrics like accuracy, precision, recall, ROC, and AUC. The results show a very similar performance of the selected algorithms on this dataset. The Gradient boosting (0.7828) achieved the best performance within AUC, but the best precision for target class 1 reached the Bagging algorithm (0.72). The simple data processing brought only minimal improvements in individual metrics. Our results are comparable to the mentioned studies instead of MCC metrics that resulted in better value (0.4111) achieved by the Gradient Boosting model.