📅 27 June 2024
DOI: 10.62411/jcta.10562

Effects of Data Resampling on Predicting Customer Churn via a Comparative Tree-based Random Forest and XGBoost

Journal of Computing Theories and Applications
Universitas Dian Nuswantoro

📄 Abstract

Customer attrition has become the focus of many businesses today – since the online market space has continued to proffer customers, various choices and alternatives to goods, services, and products for their monies. Businesses must seek to improve value, meet customers' teething demands/needs, enhance their strategies toward customer retention, and better monetize. The study compares the effects of data resampling schemes on predicting customer churn for both Random Forest (RF) and XGBoost ensembles. Data resampling schemes used include: (a) default mode, (b) random-under-sampling RUS, (c) synthetic minority oversampling technique (SMOTE), and (d) SMOTE-edited nearest neighbor (SMOTEEN). Both tree-based ensembles were constructed and trained to assess how well they performed with the chi-square feature selection mode. The result shows that RF achieved F1 0.9898, Accuracy 0.9973, Precision 0.9457, and Recall 0.9698 for the default, RUS, SMOTE, and SMOTEEN resampling, respectively. Xgboost outperformed Random Forest with F1 0.9945, Accuracy 0.9984, Precision 0.9616, and Recall 0.9890 for the default, RUS, SMOTE, and SMOTEEN, respectively. Studies support that the use of SMOTEEN resampling outperforms other schemes; while, it attributed XGBoost enhanced performance to hyper-parameter tuning of its decision trees. Retention strategies of recency-frequency-monetization were used and have been found to curb churn and improve monetization policies that will place business managers ahead of the curve of churning by customers.

🔖 Keywords

#Customer attrition; Churn; Imbalanced dataset; Random Forest; XGBoost; SMOTE; Random-Under-Sampling; SMOTEEN

ℹ️ Informasi Publikasi

Tanggal Publikasi
27 June 2024
Volume / Nomor / Tahun
Volume 2, Nomor 1, Tahun 2024

📝 HOW TO CITE

Ako, Rita Erhovwo; Aghware, Fidelis Obukohwo; Okpor, Margaret Dumebi; Akazue, Maureen Ifeanyi; Yoro, Rume Elizabeth; Ojugo, Arnold Adimabua; Setiadi, De Rosal Ignatius Moses; Odiakaose, Chris Chukwufunaya; Abere, Reuben Akporube; Emordi, Frances Uche; Geteloma, Victor Ochuko; Ejeh, Patrick Ogholuwarami, "Effects of Data Resampling on Predicting Customer Churn via a Comparative Tree-based Random Forest and XGBoost," Journal of Computing Theories and Applications, vol. 2, no. 1, Jun. 2024.

ACM
ACS
APA
ABNT
Chicago
Harvard
IEEE
MLA
Turabian
Vancouver

🔗 Artikel Terkait dari Jurnal yang Sama

📊 Statistik Sitasi Jurnal