Michael Peter , Hawa Mofi , Said Likoko , Julius Sabas , Ramadhani Mbura , Neema Mduma
{"title":"Predicting customer subscription in bank telemarketing campaigns using ensemble learning models","authors":"Michael Peter , Hawa Mofi , Said Likoko , Julius Sabas , Ramadhani Mbura , Neema Mduma","doi":"10.1016/j.mlwa.2025.100618","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the use of ensemble learning models bagging, boosting, and stacking to enhance the accuracy and reliability of predicting customer subscriptions in bank telemarketing campaigns. Recognizing the challenges posed by class imbalance and complex customer behaviors, we employ multiple ensemble techniques to build a robust predictive framework. Our analysis demonstrates that stacking models achieve the best overall performance, with an accuracy of 91.88% and an Receiver Operating Characteristic Area Under the Curve (ROC-AUC) score of 0.9491, indicating a strong capability to differentiate between subscribers and non-subscribers. Additionally, feature importance analysis reveals that contact duration, economic indicators like the Euro interbank offered (Euribor) rate, and customer age are the most influential factors in predicting subscription likelihood. These findings suggest that by focusing on customer engagement and economic trends, banks can improve telemarketing campaign effectiveness. We recommend the integration of advanced balancing techniques and real-time prediction systems to further enhance model performance and adaptability. Future work could explore deep learning models and interpretability techniques to gain deeper insights into customer behavior patterns. Overall, this study highlights the potential of ensemble models in predictive modeling for telemarketing, providing a data-driven foundation for more targeted and efficient customer acquisition strategies.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"19 ","pages":"Article 100618"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/7 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates the use of ensemble learning models bagging, boosting, and stacking to enhance the accuracy and reliability of predicting customer subscriptions in bank telemarketing campaigns. Recognizing the challenges posed by class imbalance and complex customer behaviors, we employ multiple ensemble techniques to build a robust predictive framework. Our analysis demonstrates that stacking models achieve the best overall performance, with an accuracy of 91.88% and an Receiver Operating Characteristic Area Under the Curve (ROC-AUC) score of 0.9491, indicating a strong capability to differentiate between subscribers and non-subscribers. Additionally, feature importance analysis reveals that contact duration, economic indicators like the Euro interbank offered (Euribor) rate, and customer age are the most influential factors in predicting subscription likelihood. These findings suggest that by focusing on customer engagement and economic trends, banks can improve telemarketing campaign effectiveness. We recommend the integration of advanced balancing techniques and real-time prediction systems to further enhance model performance and adaptability. Future work could explore deep learning models and interpretability techniques to gain deeper insights into customer behavior patterns. Overall, this study highlights the potential of ensemble models in predictive modeling for telemarketing, providing a data-driven foundation for more targeted and efficient customer acquisition strategies.
本研究探讨了在银行电话营销活动中使用集成学习模型装袋、提升和堆叠来提高预测客户订阅的准确性和可靠性。认识到类不平衡和复杂的客户行为所带来的挑战,我们采用多种集成技术来构建一个健壮的预测框架。我们的分析表明,叠加模型获得了最佳的整体性能,准确率为91.88%,Receiver Operating Characteristic Area Under the Curve (ROC-AUC)得分为0.9491,表明有很强的区分用户和非用户的能力。此外,特征重要性分析显示,联系时间、欧元银行间同业拆借利率等经济指标和客户年龄是预测认购可能性的最重要因素。这些发现表明,通过关注客户参与度和经济趋势,银行可以提高电话营销活动的有效性。我们建议将先进的平衡技术与实时预测系统相结合,以进一步提高模型的性能和适应性。未来的工作可以探索深度学习模型和可解释性技术,以更深入地了解客户行为模式。总体而言,本研究强调了集成模型在电话营销预测建模中的潜力,为更有针对性和更有效的客户获取策略提供了数据驱动的基础。