Research and application of random forest model in mining automobile insurance fraud

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) Pub Date : 2016-08-01 DOI:10.1109/FSKD.2016.7603443

Yaqi Li, Chun Yan, W. Liu, Maozhen Li


Citations: 20

Abstract

Automobile insurance fraud is spreading worldwide, and detecting it is attracting growing public attention. Because real automobile insurance claims data are both class-imbalanced and large, real data from an automobile insurance company were used to build a random forest fraud mining model grounded in the theory of automobile insurance fraud mining. The data were preprocessed to screen the indicators, and the importance of each input variable to the output variable was analyzed. The error of the model was also analyzed, and the method was verified empirically. The empirical results show that, compared with the traditional model, the fraud mining model based on Random Forest is well suited to large and imbalanced data sets. It is better suited to classifying and predicting automobile insurance claims data and to mining fraud rules, and it achieves better accuracy and robustness.
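The abstract gives no implementation details, so the following is only a minimal pure-Python sketch of the kind of pipeline it describes: fit a random forest to imbalanced claims data, rank the input variables by importance, and estimate the model error. Everything specific here is an assumption rather than the authors' method: the synthetic data, the hypothetical features (`amount`, `tenure`, `noise`, each a band from 0 to 9), the fraud rule, and all hyperparameters. Importance is measured as the size-weighted decrease in Gini impurity and error as the out-of-bag estimate, both common random forest choices that the paper may or may not have used.

```python
import random

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def best_split(X, y, features):
    """Best (gain, feature, threshold) over the candidate features, or None."""
    n, parent, best = len(y), gini(y), None
    for f in features:
        for t in sorted({row[f] for row in X})[:-1]:  # keeps both sides non-empty
            left = [y[i] for i in range(n) if X[i][f] <= t]
            right = [y[i] for i in range(n) if X[i][f] > t]
            gain = parent - (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or gain > best[0]:
                best = (gain, f, t)
    return best

def build_tree(X, y, rng, importance, depth=0, max_depth=4, n_try=2):
    """Grow one tree. A leaf is an int class; a node is (feature, threshold, left, right)."""
    leaf = 1 if 2 * sum(y) > len(y) else 0  # majority class, ties go to 0
    if depth == max_depth or len(set(y)) == 1:
        return leaf
    # Random feature subset at every node: the "random" in random forest.
    split = best_split(X, y, rng.sample(range(len(X[0])), n_try))
    if split is None or split[0] <= 1e-12:
        return leaf
    gain, f, t = split
    importance[f] += gain * len(y)  # size-weighted decrease in Gini impurity
    lo = [i for i in range(len(y)) if X[i][f] <= t]
    hi = [i for i in range(len(y)) if X[i][f] > t]
    return (f, t,
            build_tree([X[i] for i in lo], [y[i] for i in lo], rng, importance,
                       depth + 1, max_depth, n_try),
            build_tree([X[i] for i in hi], [y[i] for i in hi], rng, importance,
                       depth + 1, max_depth, n_try))

def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's class."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

def random_forest(X, y, n_trees=30, seed=0):
    """Return (trees, out-of-bag error, normalized feature importances)."""
    rng = random.Random(seed)
    n, importance, trees = len(y), [0.0] * len(X[0]), []
    votes = [[0, 0] for _ in range(n)]  # out-of-bag votes per sample
    for _ in range(n_trees):
        bag = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        tree = build_tree([X[i] for i in bag], [y[i] for i in bag], rng, importance)
        trees.append(tree)
        for i in set(range(n)) - set(bag):  # samples this tree never saw
            votes[i][predict(tree, X[i])] += 1
    scored = [i for i in range(n) if sum(votes[i]) > 0]
    oob_error = sum((votes[i][1] > votes[i][0]) != y[i] for i in scored) / len(scored)
    total = sum(importance)
    return trees, oob_error, [v / total for v in importance]

def make_claims(n, rng):
    """Synthetic, imbalanced claims: roughly one fraud label in ten (assumed rule)."""
    X, y = [], []
    for _ in range(n):
        amount, tenure, noise = rng.randrange(10), rng.randrange(10), rng.randrange(10)
        fraud = 1 if (amount >= 7 and tenure <= 2) else 0
        if rng.random() < 0.02:  # a little label noise
            fraud = 1 - fraud
        X.append([amount, tenure, noise])
        y.append(fraud)
    return X, y

X, y = make_claims(400, random.Random(42))
trees, oob_error, importances = random_forest(X, y)
print("fraud rate:", sum(y) / len(y))
print("out-of-bag error:", round(oob_error, 3))
print("importance (amount, tenure, noise):", [round(v, 3) for v in importances])
```

The features are coarse bands so that the exhaustive split search stays cheap. On real claims data a maintained library implementation would normally be the practical choice, and with a fraud rate this low the overall error should be read alongside a per-class measure such as recall on the fraud class.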



