{"title":"Applying and comparing multiple machine learning techniques to detect fraudulent credit card transactions","authors":"D. Berezkin, Shi Runfang, Liu Tengjiao","doi":"10.18127/j19997493-202102-01","DOIUrl":null,"url":null,"abstract":"This experiment compared the performance of four machine learning algorithms in detecting bank card fraud. At the same time, the strong imbalance of the classes in the training sample was taken into account, as well as the difference in transaction amounts, and the ability of different machine learning methods to recognize fraudulent behavior was assessed taking into account these features. It has been found that a method that works well with indicators for assessing a classification is not necessarily the best in terms of assessing the magnitude of economic losses. Logistic regression is a good proof of this. The results of this work show that the problem of detecting fraud with bank cards cannot be regarded as a simple classification problem. AUC data is not the most appropriate metric for fraud detection tasks. The final choice of the model depends on the needs of the bank, that is, it is necessary to take into account which of the two types of errors (FN, FP) will lead to large economic losses for the bank. If the bank believes that the loss caused by identifying fraudulent transactions as regular transactions is the main one, it should choose the algorithm with the lowest FN value, which in this experiment is Adaboost. If the bank believes that the negative impact of identifying regular transactions as fraudulent is also very important, it should choose an algorithm with relatively small FN and FP data. In this experiment, the overall performance of the random forest is better. Further, by evaluating the economic losses caused by false positives (identifying an ordinary transaction as fraudulent), a quantitative analysis of the economic losses caused by each algorithm can be used to select the optimal algorithm model.","PeriodicalId":370962,"journal":{"name":"Dynamics of Complex Systems - XXI century","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dynamics of Complex Systems - XXI century","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18127/j19997493-202102-01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This experiment compared the performance of four machine learning algorithms in detecting bank card fraud. At the same time, the strong imbalance of the classes in the training sample was taken into account, as well as the difference in transaction amounts, and the ability of different machine learning methods to recognize fraudulent behavior was assessed taking into account these features. It has been found that a method that works well with indicators for assessing a classification is not necessarily the best in terms of assessing the magnitude of economic losses. Logistic regression is a good proof of this. The results of this work show that the problem of detecting fraud with bank cards cannot be regarded as a simple classification problem. AUC data is not the most appropriate metric for fraud detection tasks. The final choice of the model depends on the needs of the bank, that is, it is necessary to take into account which of the two types of errors (FN, FP) will lead to large economic losses for the bank. If the bank believes that the loss caused by identifying fraudulent transactions as regular transactions is the main one, it should choose the algorithm with the lowest FN value, which in this experiment is Adaboost. If the bank believes that the negative impact of identifying regular transactions as fraudulent is also very important, it should choose an algorithm with relatively small FN and FP data. In this experiment, the overall performance of the random forest is better. Further, by evaluating the economic losses caused by false positives (identifying an ordinary transaction as fraudulent), a quantitative analysis of the economic losses caused by each algorithm can be used to select the optimal algorithm model.