An Ensemble Learning Approach to Enhance Customer Churn Prediction in Telecom Industry

Authors

  • Revati M. Wahul, Department of Computer Engineering, Modern Education Society's College of Engineering, Pune, Maharashtra, India
  • Archana P. Kale, Department of Computer Engineering, Modern Education Society's College of Engineering, Pune, Maharashtra, India
  • Prabhakar N. Kota, Department of Electronics and Telecommunication Engineering, Modern Education Society's College of Engineering, Pune, Maharashtra, India

Keywords:

Ensemble learning, customer churn prediction, telecom industry, classification algorithms, feature engineering, performance evaluation

Abstract

Customer churn, which occurs when a customer leaves one service provider for another, is a major challenge for businesses in the telecommunications industry. Accurate churn prediction can help providers reduce customer attrition and improve profitability. This study uses the Telecom Churn dataset collected by Orange to improve churn prediction accuracy. The dataset contains customer details such as demographics and usage patterns, along with a label indicating whether or not the customer has churned. Through exploratory analysis of the data, we identify correlations, patterns, and candidate predictive features for churn. We then develop an ensemble learning framework that combines multiple classifiers: Stochastic Gradient Boost (SGD), Random Forest (RF), Gradient Boosting (GB), and AdaBoost. The aim is to improve prediction accuracy by capturing customer behavior more precisely. The classifiers are tested with cross-validation and evaluated with several performance metrics, such as accuracy, recall, and F1 score. In our experiments, the GB ensemble outperforms the other classifiers at predicting the likelihood that a customer will leave a service provider. By combining the base classifiers' predictions, the ensembles achieve robust and accurate predictions. This approach can help businesses identify potential churners and implement effective retention strategies. The findings demonstrate the utility of ensembles for churn prediction in telecommunications and can inform the development of more reliable churn prediction models, which in turn can help improve customer retention and the business performance of service providers.
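
The abstract describes two steps that a short example can make concrete: combining base classifiers' predictions, and scoring the result with accuracy, recall, and F1. The sketch below is illustrative only and is not the authors' implementation. It assumes binary labels (1 = churned, 0 = retained), uses simple majority voting as the combination rule (the abstract does not state which rule the study used), and breaks ties in favor of churn as an arbitrary choice. The function names `majority_vote` and `binary_metrics` and all toy predictions are hypothetical.

```python
def majority_vote(predictions_per_model):
    """Combine per-model 0/1 predictions into one label per sample.

    Assumption: a sample is labeled churn (1) when at least half of
    the models vote 1, so ties resolve to churn.
    """
    n_models = len(predictions_per_model)
    combined = []
    for votes in zip(*predictions_per_model):
        combined.append(1 if 2 * sum(votes) >= n_models else 0)
    return combined


def binary_metrics(y_true, y_pred):
    """Return accuracy, recall, and F1 for the positive (churn) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = (tp + tn) / len(pairs)
    # Guard against empty denominators when a class is never predicted.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Made-up labels and predictions, for illustration only.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    base_predictions = {
        "sgd": [1, 0, 0, 1, 0, 1, 1, 0],
        "rf":  [1, 0, 1, 0, 0, 0, 1, 0],
        "gb":  [1, 0, 1, 1, 0, 0, 0, 0],
        "ada": [0, 1, 1, 1, 0, 0, 1, 0],
    }
    for name, preds in base_predictions.items():
        print(name, binary_metrics(y_true, preds))
    ensemble = majority_vote(list(base_predictions.values()))
    print("vote", binary_metrics(y_true, ensemble))
```

Recall is reported alongside accuracy because churn datasets are often imbalanced, and a model can score high accuracy while missing most churners. In practice, each base model's predictions would come from held-out folds of a cross-validation split rather than from hand-written lists.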


References

A. K. Ahmad, A. Jafar, and K. Aljoumaa, “Customer churn prediction in telecom using machine learning in big data platform,” J. Big Data, vol. 6, no. 1, 2019, doi: 10.1186/s40537-019-0191-6.

A. A. Q. Ahmed and D. Maheswari, “Churn prediction on huge telecom data using hybrid firefly based classification,” Egypt. Informatics J., vol. 18, no. 3, pp. 215–220, 2017, doi: 10.1016/j.eij.2017.02.002.

J. Vijaya and E. Sivasankar, “An efficient system for customer churn prediction through particle swarm optimization based feature selection model with simulated annealing,” Cluster Comput., vol. 22, no. s5, pp. 10757–10768, 2019, doi: 10.1007/s10586-017-1172-1.

H. Jain, A. Khunteta, and S. Srivastava, “Churn Prediction in Telecommunication using Logistic Regression and Logit Boost,” Procedia Comput. Sci., vol. 167, no. 2019, pp. 101–112, 2020, doi: 10.1016/j.procs.2020.03.187.

D. Das Adhikary and D. Gupta, “Applying over 100 classifiers for churn prediction in telecom companies,” Multimed. Tools Appl., vol. 80, no. 28–29, pp. 35123–35144, 2021, doi: 10.1007/s11042-020-09658-z.

A. Idris, M. Rizwan, and A. Khan, “Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies,” Comput. Electr. Eng., vol. 38, no. 6, pp. 1808–1819, 2012, doi: 10.1016/j.compeleceng.2012.09.001.

L. Zhao, Q. Gao, X. J. Dong, A. Dong, and X. Dong, “K-local maximum margin feature extraction algorithm for churn prediction in telecom,” Cluster Comput., vol. 20, no. 2, pp. 1401–1409, 2017, doi: 10.1007/s10586-017-0843-2.

D. L. García, À. Nebot, and A. Vellido, “Intelligent data analysis approaches to churn as a business problem: a survey,” Knowl. Inf. Syst., vol. 51, no. 3, pp. 719–774, 2017, doi: 10.1007/s10115-016-0995-z.

A. Idris, A. Iftikhar, and Z. ur Rehman, “Intelligent churn prediction for telecom using GP-AdaBoost learning and PSO undersampling,” Cluster Comput., vol. 22, no. s3, pp. 7241–7255, 2019, doi: 10.1007/s10586-017-1154-3.

K. Lu, X. Zhao, and B. Wang, “A Study on Mobile Customer Churn Based on Learning from Soft Label Proportions,” Procedia Comput. Sci., vol. 162, no. Itqm 2019, pp. 413–420, 2019, doi: 10.1016/j.procs.2019.12.005.

W. N. Wassouf, R. Alkhatib, K. Salloum, and S. Balloul, “Predictive analytics using big data for increased customer loyalty: Syriatel Telecom Company case study,” J. Big Data, vol. 7, no. 1, 2020, doi: 10.1186/s40537-020-00290-0.

P. Sulikowski and T. Zdziebko, “Churn factors identification from real-world data in the telecommunications industry: Case study,” Procedia Comput. Sci., vol. 192, pp. 4800–4809, 2021, doi: 10.1016/j.procs.2021.09.258.

T. W. Cenggoro, R. A. Wirastari, E. Rudianto, M. I. Mohadi, D. Ratj, and B. Pardamean, “Deep Learning as a Vector Embedding Model for Customer Churn,” Procedia Comput. Sci., vol. 179, no. 2019, pp. 624–631, 2021, doi: 10.1016/j.procs.2021.01.048.

Y. Liu, J. Fan, J. Zhang, X. Yin, and Z. Song, “Research on telecom customer churn prediction based on ensemble learning,” J. Intell. Inf. Syst., 2022, doi: 10.1007/s10844-022-00739-z.

M. T. Quasim, A. Sulaiman, A. Shaikh, and M. Younus, “Blockchain in churn prediction based telecommunication system on climatic weather application,” Sustain. Comput. Informatics Syst., vol. 35, no. December 2021, p. 100705, 2022, doi: 10.1016/j.suscom.2022.100705.

S. M. Shrestha and A. Shakya, “A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal,” Procedia Comput. Sci., vol. 215, pp. 652–661, 2022, doi: 10.1016/j.procs.2022.12.067.

A. Amin, A. Adnan, and S. Anwar, “An adaptive learning approach for customer churn prediction in the telecommunication industry using evolutionary computation and Naïve Bayes,” Appl. Soft Comput., vol. 137, p. 110103, 2023, doi: 10.1016/j.asoc.2023.110103.

Orange, “Telecom Churn Dataset | Kaggle.” [Online]. Available: https://www.kaggle.com/mnassrib/telecom-churn-datasets.

Purnima, T., & Rao, C. K. (2023). CROD: Context Aware Role based Offensive Detection using NLP/DL Approaches. International Journal on Recent and Innovation Trends in Computing and Communication, 11(1), 01–11. https://doi.org/10.17762/ijritcc.v11i1.5981

Brown, R., Brown, J., Rodriguez, C., Garcia, J., & Herrera, J. Predictive Analytics for Effective Resource Allocation in Engineering Education. Kuwait Journal of Machine Learning, 1(1). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/91

Published

11.07.2023

How to Cite

Wahul, R. M., Kale, A. P., & Kota, P. N. (2023). An Ensemble Learning Approach to Enhance Customer Churn Prediction in Telecom Industry. International Journal of Intelligent Systems and Applications in Engineering, 11(9s), 258–266. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3116

Section

Research Article