{"title":"Decision Analysis and Prediction Based on Credit Card Fraud Data","authors":"Deshan Huang, Yu Lin, Zhaoxing Weng, Jiajie Xiong","doi":"10.1145/3478301.3478305","DOIUrl":null,"url":null,"abstract":"With the common use of credit cards in today's transactions, the related fraudulent behavior inevitably occurs and causes considerable loss of money. To solve this problem, our work used a dataset that contains legal credit card transactions as well as fraud transactions to find an effective solution. In this paper, through processing and analyzing the transaction data, the data was discovered to be unbalanced, so stratified sampling and oversampling were performed to achieve a more reliable analysis of the unbalanced dataset. Meanwhile, due to the randomness of sampling, the cross-validated were used for the final model selection. Further, we utilized five algorithms to build models which contains statistical machine learning model and deep learning model. To obtain optimal performance of the models, hyperparameter tuning was performed for the five classifiers. Finally, the results indicate that the optimal model was XGBoost, and its performance can be verified in a real-life scenario in the future.","PeriodicalId":338866,"journal":{"name":"The 2nd European Symposium on Computer and Communications","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2nd European Symposium on Computer and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3478301.3478305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
With the common use of credit cards in today's transactions, the related fraudulent behavior inevitably occurs and causes considerable loss of money. To solve this problem, our work used a dataset that contains legal credit card transactions as well as fraud transactions to find an effective solution. In this paper, through processing and analyzing the transaction data, the data was discovered to be unbalanced, so stratified sampling and oversampling were performed to achieve a more reliable analysis of the unbalanced dataset. Meanwhile, due to the randomness of sampling, the cross-validated were used for the final model selection. Further, we utilized five algorithms to build models which contains statistical machine learning model and deep learning model. To obtain optimal performance of the models, hyperparameter tuning was performed for the five classifiers. Finally, the results indicate that the optimal model was XGBoost, and its performance can be verified in a real-life scenario in the future.