The effects of data preprocessing on probability of default model fairness

arXiv - ECON - Econometrics Pub Date : 2024-08-28 DOI:arxiv-2408.15452

Di Wu

引用次数: 0

Abstract

In the context of financial credit risk evaluation, the fairness of machine learning models has become a critical concern, especially given the potential for biased predictions that disproportionately affect certain demographic groups. This study investigates the impact of data preprocessing, with a specific focus on Truncated Singular Value Decomposition (SVD), on the fairness and performance of probability of default models. Using a comprehensive dataset sourced from Kaggle, various preprocessing techniques, including SVD, were applied to assess their effect on model accuracy, discriminatory power, and fairness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

数据预处理对违约概率模型公平性的影响

在金融信用风险评估方面，机器学习模型的公平性已成为一个重要问题，特别是考虑到有可能出现偏差的预测，对某些人口群体造成不成比例的影响。本研究探讨了数据预处理对违约概率模型的公平性和性能的影响，重点是截断奇异值分解（SVD）。利用从 Kaggle 获取的综合数据集，应用了包括 SVD 在内的各种预处理技术，以评估它们对模型准确性、判别力和公平性的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - ECON - Econometrics

自引率

0.00%

发文量