使用机器学习算法研究和分析客户流失预测

Dr. Sonali Nemade, Dr. Sujata Patil, Mrs. Deepashree Mehendale, Mrs. Vidya Shinde, Mrs. Reshma Masurekar
{"title":"使用机器学习算法研究和分析客户流失预测","authors":"Dr. Sonali Nemade, Dr. Sujata Patil, Mrs. Deepashree Mehendale, Mrs. Vidya Shinde, Mrs. Reshma Masurekar","doi":"10.32628/ijsrset241143","DOIUrl":null,"url":null,"abstract":"The customer churn prediction (CCP) is one of the challenging problems in the E-Commerce industry. With the advancement in the field of machine learning and artificial intelligence, the possibilities to predict customer churn has increased significantly. Our proposed methodology, consists of six phases. In the first two phases, data pre-processing and feature analysis is performed. In the third phase, feature selection is taken into consideration. Next, the data has been split into two parts train and test set in the ratio of 80% and 20% respectively. In the prediction process, most popular predictive models have been applied, namely, logistic regression, random forest classifier etc. on train set are applied to see the effect on accuracy of models. In addition, K-fold cross validation has been used over train set for hyper parameter tuning and to prevent overfitting of models. Finally, the obtained results on test set have been evaluated using confusion matrix and AUC curve.","PeriodicalId":14228,"journal":{"name":"International Journal of Scientific Research in Science, Engineering and Technology","volume":" 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"To Study and Analyse the Customer Churn Prediction using Machine Learning Algorithm\",\"authors\":\"Dr. Sonali Nemade, Dr. Sujata Patil, Mrs. Deepashree Mehendale, Mrs. Vidya Shinde, Mrs. Reshma Masurekar\",\"doi\":\"10.32628/ijsrset241143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The customer churn prediction (CCP) is one of the challenging problems in the E-Commerce industry. With the advancement in the field of machine learning and artificial intelligence, the possibilities to predict customer churn has increased significantly. Our proposed methodology, consists of six phases. In the first two phases, data pre-processing and feature analysis is performed. In the third phase, feature selection is taken into consideration. Next, the data has been split into two parts train and test set in the ratio of 80% and 20% respectively. In the prediction process, most popular predictive models have been applied, namely, logistic regression, random forest classifier etc. on train set are applied to see the effect on accuracy of models. In addition, K-fold cross validation has been used over train set for hyper parameter tuning and to prevent overfitting of models. Finally, the obtained results on test set have been evaluated using confusion matrix and AUC curve.\",\"PeriodicalId\":14228,\"journal\":{\"name\":\"International Journal of Scientific Research in Science, Engineering and Technology\",\"volume\":\" 9\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Scientific Research in Science, Engineering and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.32628/ijsrset241143\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Scientific Research in Science, Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32628/ijsrset241143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

客户流失预测(CCP)是电子商务行业中极具挑战性的问题之一。随着机器学习和人工智能领域的进步,预测客户流失的可能性大大增加。我们提出的方法包括六个阶段。在前两个阶段,进行数据预处理和特征分析。第三阶段是特征选择。接下来,数据被分成训练集和测试集两部分,比例分别为 80% 和 20%。在预测过程中,在训练集上应用了最流行的预测模型,即逻辑回归、随机森林分类器等,以了解模型对准确率的影响。此外,还在训练集上使用了 K 折交叉验证来进行超参数调整,防止模型过度拟合。最后,使用混淆矩阵和 AUC 曲线对测试集上获得的结果进行评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
To Study and Analyse the Customer Churn Prediction using Machine Learning Algorithm
The customer churn prediction (CCP) is one of the challenging problems in the E-Commerce industry. With the advancement in the field of machine learning and artificial intelligence, the possibilities to predict customer churn has increased significantly. Our proposed methodology, consists of six phases. In the first two phases, data pre-processing and feature analysis is performed. In the third phase, feature selection is taken into consideration. Next, the data has been split into two parts train and test set in the ratio of 80% and 20% respectively. In the prediction process, most popular predictive models have been applied, namely, logistic regression, random forest classifier etc. on train set are applied to see the effect on accuracy of models. In addition, K-fold cross validation has been used over train set for hyper parameter tuning and to prevent overfitting of models. Finally, the obtained results on test set have been evaluated using confusion matrix and AUC curve.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
UGC Guidelines on Sustainable and Vibrant University- Industry Linkage System for Indian Universities, 2024 Leachate as a Fertilizer Artificial Intelligence in Healthcare : A Review Advancements in Quadcopter Development through Additive Manufacturing: A Comprehensive Review Sensing Human Emotion using Emerging Machine Learning Techniques
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1