Fang Zhao, Gang Li, Yanxia Lyu, Hong-Dong Ma, Xiaoqian Zhu
{"title":"A cost-sensitive ensemble deep forest approach for extremely imbalanced credit fraud detection","authors":"Fang Zhao, Gang Li, Yanxia Lyu, Hong-Dong Ma, Xiaoqian Zhu","doi":"10.1080/14697688.2023.2230264","DOIUrl":null,"url":null,"abstract":"Credit fraud detection modeling helps prevent default risks and reduce economic losses, and increasingly sophisticated methods have been designed for predicting the default probability of clients. In such problems, the fact that the class of fraud clients is much smaller than the class of good clients makes it a challenge to detect the fraud class. To minimize the financial losses in extremely imbalanced datasets, this paper delivers a novel cost-sensitive ensemble model under the framework of deep forest. The model first introduces a cost-sensitive strategy to assign a higher cost to the fraud class, thereby improving the attention of the model to the fraud samples. As everyone knows, for the basic classifiers of ensemble learning, the greater their differences, the better the performance after ensemble. So the model adds superior cost-sensitive base classifiers into the cascade structure to improve the overall performance. The model also introduces Type II error as the convergence index to automatically adjust the depth of the cascade structure. The experiments conducted on the European credit dataset and a private electronic transaction dataset are presented to demonstrate the performance of the proposed method. The results indicate that the proposed model outperforms most benchmarks in detecting fraud samples.","PeriodicalId":20747,"journal":{"name":"Quantitative Finance","volume":"15 1","pages":"1397 - 1409"},"PeriodicalIF":1.5000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Finance","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1080/14697688.2023.2230264","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0
Abstract
Credit fraud detection modeling helps prevent default risks and reduce economic losses, and increasingly sophisticated methods have been designed for predicting the default probability of clients. In such problems, the fact that the class of fraud clients is much smaller than the class of good clients makes it a challenge to detect the fraud class. To minimize the financial losses in extremely imbalanced datasets, this paper delivers a novel cost-sensitive ensemble model under the framework of deep forest. The model first introduces a cost-sensitive strategy to assign a higher cost to the fraud class, thereby improving the attention of the model to the fraud samples. As everyone knows, for the basic classifiers of ensemble learning, the greater their differences, the better the performance after ensemble. So the model adds superior cost-sensitive base classifiers into the cascade structure to improve the overall performance. The model also introduces Type II error as the convergence index to automatically adjust the depth of the cascade structure. The experiments conducted on the European credit dataset and a private electronic transaction dataset are presented to demonstrate the performance of the proposed method. The results indicate that the proposed model outperforms most benchmarks in detecting fraud samples.
期刊介绍:
The frontiers of finance are shifting rapidly, driven in part by the increasing use of quantitative methods in the field. Quantitative Finance welcomes original research articles that reflect the dynamism of this area. The journal provides an interdisciplinary forum for presenting both theoretical and empirical approaches and offers rapid publication of original new work with high standards of quality. The readership is broad, embracing researchers and practitioners across a range of specialisms and within a variety of organizations. All articles should aim to be of interest to this broad readership.