(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Technology and Engineering Exploration (IJATEE)

ISSN (Print):2394-5443    ISSN (Online):2394-7454
Volume-11 Issue-110 January-2024
Full-Text PDF
Paper Title : Customer churn prediction model in enterprises using machine learning
Author Name : Yamini B, K. Venkata Ramana, M. Nalini, D. Chitra Devi, B. Maheswari and Siva Subramanian. R
Abstract :

The customer is a vital and integral component of all organizations worldwide. The success of any business hinges on its ability to engage customers effectively in terms of the products or services offered. Research into customer churn is a critical element in the operations of any enterprise. Customer churn prediction (CCP) helps to understand customer interactions with a business and often to identify when customers are likely to cease doing business with the company. In this study, Machine Learning (ML) algorithms are utilized for effective CCP. This study considers various supervised learning models, as the dataset used is labeled. The ML models employed include logistic regression (LR), k-nearest neighbors (k-NN), decision tree (DT), random forest (RF), XGBoost (XGB), light gradient boosting machine (LightGBM), and CatBoost. The ML model is applied to a dataset of customer churn sourced from the Kaggle repository. The results of the experiments are evaluated using several validity metrics, such as accuracy, recall, precision, area under the curve (AUC), and F1-score. The experimental data reveal that LR excels in terms of recall (0.5275) and accuracy (0.581), while the CatBoost model leads in AUC (0.8415), precision (0.6564), and F1-score (0.581). Moreover, LightGBM achieves results close to those of LR and CatBoost. The research findings indicate that the use of ML contributes to the prediction of customer churn. Additionally, the experimental results suggest that LR, CatBoost, and LightGBM outperform other ML models. Utilizing this knowledge, enterprises can develop more effective strategies for customer retention and enhance their business's financial performance.

Keywords : Customer churn, Enterprises, Prediction, ML, CatBoost, LightGBM, Logistic regression.
Cite this article : Yamini B, Ramana KV, Nalini M, Devi DC, Maheswari B, Subramanian. SR. Customer churn prediction model in enterprises using machine learning. International Journal of Advanced Technology and Engineering Exploration. 2024; 11(110):94-107. DOI:10.19101/IJATEE.2023.10101581.
References :
[1]Mishra A, Reddy US. A comparative study of customer churn prediction in telecom industry using ensemble based classifiers. In international conference on inventive computing and informatics 2017 (pp. 721-5). IEEE.
[Crossref] [Google Scholar]
[2]Qi J, Zhang Y, Zhang Y, Shi S. TreeLogit model for customer churn prediction. In Asia-Pacific conference on services computing 2006 (pp. 70-5). IEEE.
[Crossref] [Google Scholar]
[3]Prabha D, Subramanian RS. A survey on customer relationship management. In 4th international conference on advanced computing and communication systems 2017 (pp. 1-5). IEEE.
[Crossref] [Google Scholar]
[4]Hopkins E. Machine learning tools, algorithms, and techniques. Journal of Self-Governance and Management Economics. 2022; 10(1):43-55.
[Google Scholar]
[5]Muneer A, Ali RF, Alghamdi A, Taib SM, Almaghthawi A, Ghaleb EA. Predicting customers churning in banking industry: a machine learning approach. Indonesian Journal of Electrical Engineering and Computer Science. 2022; 26(1):539-49.
[Crossref] [Google Scholar]
[6]Sudharsan R, Ganesh EN. A swish RNN based customer churn prediction for the telecom industry with a novel feature selection strategy. Connection Science. 2022; 34(1):1855-76.
[Crossref] [Google Scholar]
[7]Lalwani P, Mishra MK, Chadha JS, Sethi P. Customer churn prediction system: a machine learning approach. Computing. 2022:1-24.
[Crossref] [Google Scholar]
[8]Ahmad AK, Jafar A, Aljoumaa K. Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data. 2019; 6(1):1-24.
[Crossref] [Google Scholar]
[9]Vo NN, Liu S, Li X, Xu G. Leveraging unstructured call log data for customer churn prediction. Knowledge-Based Systems. 2021; 212:106586.
[Crossref] [Google Scholar]
[10]Xiahou X, Harada Y. B2C E-commerce customer churn prediction based on K-means and SVM. Journal of Theoretical and Applied Electronic Commerce Research. 2022; 17(2):458-75.
[Crossref] [Google Scholar]
[11]Zhang T, Moro S, Ramos RF. A data-driven approach to improve customer churn prediction based on telecom customer segmentation. Future Internet. 2022; 14(3):1-19.
[Crossref] [Google Scholar]
[12]AL-najjar D, Al-rousan N, AL-najjar H. Machine learning to develop credit card customer churn prediction. Journal of Theoretical and Applied Electronic Commerce Research. 2022; 17(4):1529-42.
[Crossref] [Google Scholar]
[13]Fujo SW, Subramanian S, Khder MA. Customer churn prediction in telecommunication industry using deep learning. Information Sciences Letters. 2022; 11(1):185-98.
[Google Scholar]
[14]Rahman M, Kumar V. Machine learning based customer churn prediction in banking. In 4th international conference on electronics, communication and aerospace technology 2020 (pp. 1196-201). IEEE.
[Crossref] [Google Scholar]
[15]Saheed YK, Hambali MA. Customer churn prediction in telecom sector with machine learning and information gain filter feature selection algorithms. In international conference on data analytics for business and industry 2021 (pp. 208-13). IEEE.
[Crossref] [Google Scholar]
[16]Thorat MA, Sonawane VR. Customer churn prediction in the telecom industry using machine learning algorithms. Computer Integrated Manufacturing Systems. 2023; 29(4):1-11.
[Google Scholar]
[17]Kumar S, Logofatu D. Comparative study on customer churn prediction by using machine learning techniques. In Asian conference on intelligent information and database systems 2023 (pp. 339-51). Cham: Springer Nature Switzerland.
[Crossref] [Google Scholar]
[18]Awasthi S. Customer churn prediction on E-commerce data using stacking classifier. Authorea Preprints. 2023:1-10.
[Google Scholar]
[19]Liu Y, Shengdong M, Jijian G, Nedjah N. Intelligent prediction of customer churn with a fused attentional deep learning model. Mathematics. 2022; 10(24):1-16.
[Crossref] [Google Scholar]
[20]Jahan I, Sanam TF. An improved machine learning based customer churn prediction for insight and recommendation in E-commerce. In 25th international conference on computer and information technology 2022 (pp. 1-6). IEEE.
[Crossref] [Google Scholar]
[21]Swetha P, Dayananda RB. A customer churn prediction model in telecom industry using improved_XGBoost. International Journal of Cloud Computing. 2023; 12(2-4):277-94.
[Crossref] [Google Scholar]
[22]Pandithurai O, Ahmed HH, Sriman B, Seetha R. Telecom customer churn prediction using supervised machine learning techniques. In international conference on advances in computing, communication and applied informatics 2023 (pp. 1-7). IEEE.
[Crossref] [Google Scholar]
[23]Angelina JJ, Subhashini SJ, Baba SH, Reddy PD, Reddy PS, Khan KS. A machine learning model for customer churn prediction using CatBoost classifier. In 7th international conference on intelligent computing and control systems 2023 (pp. 166-72). IEEE.
[Crossref] [Google Scholar]
[24]Galal M, Rady S, Aref M. Enhancing customer churn prediction in digital banking using ensemble modeling. In 4th novel intelligent and leading emerging sciences conference 2022 (pp. 21-5). IEEE.
[Crossref] [Google Scholar]
[25]Agarwal V, Taware S, Yadav SA, Gangodkar D, Rao AL, Srivastav VK. Customer-churn prediction using machine learning. In 2nd international conference on technological advancements in computational sciences 2022 (pp. 893-9). IEEE.
[Crossref] [Google Scholar]
[26]Tsai TY, Lin CT, Prasad M. An intelligent customer churn prediction and response framework. In 14th international conference on intelligent systems and knowledge engineering 2019 (pp. 928-35). IEEE.
[Crossref] [Google Scholar]
[27]Günesen SN, Şen N, Yıldırım N, Kaya T. Customer churn prediction in FMCG sector using machine learning applications. In IFIP international workshop on artificial intelligence for knowledge management 2021 (pp. 82-103). Cham: Springer International Publishing.
[Crossref] [Google Scholar]
[28]Siva SR, Prabha D. Optimizıng naive bayes probability estimation in customer analysis using hybrid variable selection. In computer networks and inventive communication technologies: proceedings of third ICCNCT 2021 (pp. 595-612). Springer Singapore.
[Crossref] [Google Scholar]
[29]Subramanian RS, Prabha D, Aswini J, Maheswari B, Anita M. Alleviating NB conditional independence using multi-stage variable selection (MSVS): banking customer dataset application. In journal of physics: conference series 2021 (pp. 1-10). IOP Publishing.
[Crossref] [Google Scholar]
[30]Subramanian RS, Prabha D, Maheswari B, Aswini J. Customer analysis using machine learning algorithms: a case study using banking consumer dataset. Recent Trends in Intensive Computing. 2021; 689-94.
[Google Scholar]
[31]Subramanian RS, Prabha D. Ensemble variable selection for naive bayes to improve customer behaviour analysis. Computer Systems Science & Engineering. 2022; 41(1):339-55.
[Crossref] [Google Scholar]
[32]Al DE. Comparison between XGBoost, LightGBM and CatBoost using a home credit dataset. International Journal of Computer and Information Engineering. 2019; 13(1):6-10.
[Google Scholar]
[33]Maheswari B, Bushra SN, Nirmala G, Anita M, Smys S, Kamel KA, et al. Enhancing customer prediction using machine learning with feature selection approaches. Inventive Computation and Information Technologies. Lecture Notes in Networks and Systems. 2023; 563.
[Google Scholar]
[34]Prabha D, Subramanian RS, Balakrishnan S, Karpagam M. Performance evaluation of naive bayes classifier with and without filter based feature selection. International Journal of Innovative Technology and Exploring Engineering. 2019; 8(10):2154-8.
[Google Scholar]