{"title":"基于cu - rf的数据不平衡信用卡欺诈检测","authors":"Wei Li, Cheng-shu Wu, Sumei Ruan","doi":"10.54560/jracr.v12i3.332","DOIUrl":null,"url":null,"abstract":"With the continuous expansion of the banks' credit card businesses, credit card fraud has become a serious threat to banking financial institutions. So, the automatic and real-time credit card fraud detection is the meaningful research work. Because machine learning has the characteristics of non-linearity, automation, and intelligence, so that credit card fraud detection can improve the detection efficiency and accuracy. In view of this, this paper proposes a credit card fraud detection model based on heterogeneous ensemble, namely CUS-RF (cluster-based under-sampling boosting and random forest), based on clustering under-sampling and random forest algorithm. CUS-RF-based credit card fraud detection model has the following advantages. Firstly, the CUS-RF model can better overcome the issue of data imbalance. Secondly, based on the idea of heterogeneous ensemble learning, the clustering under-sampling method and random forest model are fused to achieve a better performance for credit card fraud detection. Finally, through the verification of real credit card fraud dataset, the CUS-RF model proposed in this paper has achieved better performance in credit card fraud detection compared with the benchmark model.","PeriodicalId":31887,"journal":{"name":"Journal of Risk Analysis and Crisis Response JRACR","volume":"39 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CUS-RF-Based Credit Card Fraud Detection with Imbalanced Data\",\"authors\":\"Wei Li, Cheng-shu Wu, Sumei Ruan\",\"doi\":\"10.54560/jracr.v12i3.332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the continuous expansion of the banks' credit card businesses, credit card fraud has become a serious threat to banking financial institutions. So, the automatic and real-time credit card fraud detection is the meaningful research work. Because machine learning has the characteristics of non-linearity, automation, and intelligence, so that credit card fraud detection can improve the detection efficiency and accuracy. In view of this, this paper proposes a credit card fraud detection model based on heterogeneous ensemble, namely CUS-RF (cluster-based under-sampling boosting and random forest), based on clustering under-sampling and random forest algorithm. CUS-RF-based credit card fraud detection model has the following advantages. Firstly, the CUS-RF model can better overcome the issue of data imbalance. Secondly, based on the idea of heterogeneous ensemble learning, the clustering under-sampling method and random forest model are fused to achieve a better performance for credit card fraud detection. Finally, through the verification of real credit card fraud dataset, the CUS-RF model proposed in this paper has achieved better performance in credit card fraud detection compared with the benchmark model.\",\"PeriodicalId\":31887,\"journal\":{\"name\":\"Journal of Risk Analysis and Crisis Response JRACR\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Risk Analysis and Crisis Response JRACR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54560/jracr.v12i3.332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Risk Analysis and Crisis Response JRACR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54560/jracr.v12i3.332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
随着银行信用卡业务的不断扩大,信用卡诈骗已成为银行业金融机构面临的严重威胁。因此,信用卡欺诈的自动实时检测是一项有意义的研究工作。由于机器学习具有非线性、自动化、智能化的特点,使得信用卡欺诈检测可以提高检测效率和准确性。鉴于此,本文提出了一种基于异构集成的信用卡欺诈检测模型,即基于聚类欠采样和随机森林算法的us - rf (cluster-based undersampling boosting and random forest)。基于cu - rf的信用卡欺诈检测模型具有以下优点:首先,CUS-RF模型可以较好地克服数据不平衡的问题。其次,基于异构集成学习的思想,将聚类欠采样方法与随机森林模型相融合,提高信用卡欺诈检测的性能;最后,通过对真实信用卡欺诈数据集的验证,与基准模型相比,本文提出的cu - rf模型在信用卡欺诈检测方面取得了更好的性能。
CUS-RF-Based Credit Card Fraud Detection with Imbalanced Data
With the continuous expansion of the banks' credit card businesses, credit card fraud has become a serious threat to banking financial institutions. So, the automatic and real-time credit card fraud detection is the meaningful research work. Because machine learning has the characteristics of non-linearity, automation, and intelligence, so that credit card fraud detection can improve the detection efficiency and accuracy. In view of this, this paper proposes a credit card fraud detection model based on heterogeneous ensemble, namely CUS-RF (cluster-based under-sampling boosting and random forest), based on clustering under-sampling and random forest algorithm. CUS-RF-based credit card fraud detection model has the following advantages. Firstly, the CUS-RF model can better overcome the issue of data imbalance. Secondly, based on the idea of heterogeneous ensemble learning, the clustering under-sampling method and random forest model are fused to achieve a better performance for credit card fraud detection. Finally, through the verification of real credit card fraud dataset, the CUS-RF model proposed in this paper has achieved better performance in credit card fraud detection compared with the benchmark model.