Ensemble Techniques for Credit Card Fraud Detection

Satya Dileep Penmetsa, Sabah Mohammed
{"title":"Ensemble Techniques for Credit Card Fraud Detection","authors":"Satya Dileep Penmetsa, Sabah Mohammed","doi":"10.21742/ijsbt.2021.9.2.03","DOIUrl":null,"url":null,"abstract":"Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.","PeriodicalId":448069,"journal":{"name":"International Journal of Smart Business and Technology","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Smart Business and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/ijsbt.2021.9.2.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
信用卡欺诈检测的集成技术
信用卡诈骗是一个日益严重的问题,对金融行业产生了巨大的影响。信用卡诈骗面临的挑战是公共数据的可用性、数据的高度不平衡以及诈骗性质的波动性。多年来,集成学习越来越受到重视,并被证明具有更好的性能。在这里,我们尝试对使用信用卡欺诈数据的各种学习算法的各种集成方法进行比较研究,并使用SMOTE平衡技术来理解基于各种评估和性能指标的多个模型。机器学习算法提出了几种标准模型,包括NB、SVM和DL。他们使用了一个公开可用的信用卡数据集,使用个人(标准)模型和使用AdaBoost和多数投票组合方法的混合模型进行评估。MCC指标被用作绩效衡量标准,因为它考虑了预测结果的真、假阳性和阴性。最佳MCC得分为0.823,采用多数决法。使用AdaBoost和多数投票方法获得了1分的完美MCC得分。为了进一步评价混合模型,在数据样本中加入了10% ~ 30%的噪声。当数据集中加入30%的噪声时,多数投票法的MCC得分为0.942。这表明多数投票方法在存在噪声的情况下具有鲁棒性。集成技术的使用对于从正常的信用卡交易中预测错误的信用卡交易是非常重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Implementation of the QRQC Method as a Quick Response to Reduce the Number of Non-conforming Pieces in an Industrial Enterprise Application of Selected Digital Enterprise Tools in Small and Medium-Sized Industrial Enterprises Impact of Size and Market Competition on Risk-taking and Profitability of GCC Bank. - An Empirical Study through GMM Estimator The Credit Crunch: More of a Needed Adjustment than an Overreaction Forecasting Canadian Dollar against the US Dollar via Combined Approaches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1