Implementasi Algoritma Random Forest untuk Prediksi Customer Churn pada Perusahaan Telekomunikasi

Authors

  • Muhammad Junaidi Institut Teknologi dan Bisnis Indonesia
  • Roberto Kaban Institut Teknologi dan Bisnis Indonesia

DOI:

https://doi.org/10.61124/sinta.v3i3.320

Keywords:

Customer Churn, Random Forest, Machine Learning, SMOTE, Telekomunikasi

Abstract

Customer churn merupakan permasalahan krusial dalam industri telekomunikasi yang berdampak langsung pada penurunan pendapatan perusahaan. Tantangan utama dalam prediksinya adalah ketidakseimbangan kelas, di mana jumlah pelanggan non-churn jauh lebih besar dibandingkan pelanggan churn. Penelitian ini mengimplementasikan algoritma Random Forest dikombinasikan dengan Synthetic Minority Oversampling Technique (SMOTE) untuk mengatasi permasalahan tersebut. Dataset yang digunakan adalah Telco Customer Churn dari Kaggle, terdiri dari 7.043 data pelanggan dengan 21 atribut. Pra-pemrosesan meliputi pembersihan data dan label encoding, kemudian dataset dibagi dengan rasio 80:20, dan SMOTE diterapkan eksklusif pada data pelatihan. Hasil pengujian menunjukkan model mencapai accuracy 77,15%, precision 55,65%, recall 68,45%, dan F1-score 61,39%. Analisis feature importance mengungkapkan bahwa Contract, OnlineSecurity, dan TechSupport merupakan faktor paling dominan yang memengaruhi churn. Integrasi Random Forest dan SMOTE terbukti menghasilkan model prediksi yang andal sekaligus memberikan wawasan strategis bagi perusahaan dalam merancang program retensi pelanggan yang lebih efektif.

References

Achmad Mauludi Asror and I Kadek Dwi Nuryana, “Prediction and Analysis of Customer Churn at Telkomsel Using Machine Learning Approach,” J. Emerg. Inf. Syst. Bus. Intell. JEISBI, vol. 6, no. 2, Jul. 2025, doi: 10.26740/jeisbi.v6i2.65825.

Y. Zhou, W. Chen, X. Sun, and D. Yang, “Early warning of telecom enterprise customer churn based on ensemble learning,” PLOS ONE, vol. 18, no. 10, p. e0292466, Oct. 2023, doi: 10.1371/journal.pone.0292466.

L. Breiman, “Random Forests,” Mach. Learn., vol. 45, no. 1, pp. 5–32, Oct. 2001, doi: 10.1023/A:1010933404324.

I. Ullah, B. Raza, A. K. Malik, M. Imran, S. U. Islam, and S. W. Kim, “A Churn Prediction Model Using Random Forest: Analysis of Machine Learning Techniques for Churn Prediction and Factor Identification in Telecom Sector,” IEEE Access, vol. 7, pp. 60134–60149, 2019, doi: 10.1109/ACCESS.2019.2914999.

R. T. Prasetio, Y. Ramdhani, V. H. C. Putra, and P. Pratiwi, “Algoritma Random Forest Sebagai Uplift Modeling pada Prediksi Retensi Pelanggan Layanan Telekomunikasi,” J. Infortech, vol. 7, no. 2, pp. 84–88, Dec. 2025, doi: 10.31294/infortech.v7i2.11577.

M. M. S. Nurhidayat and Dyah Anggraini, “Analysis and Classification of Customer Churn Using Machine Learning Models,” J. RESTI Rekayasa Sist. Dan Teknol. Inf., vol. 7, no. 6, pp. 1253–1259, Nov. 2023, doi: 10.29207/resti.v7i6.4933.

L. N. Wakhidah, A. K. Zyen, and B. B. Wahono, “Evaluation of Telecommunication Customer Churn Classification with SMOTE Using Random Forest and XGBoost Algorithms,” J. Appl. Inform. Comput., vol. 9, no. 1, pp. 89–95, Jan. 2025, doi: 10.30871/jaic.v9i1.8740.

V. Chang, K. Hall, Q. Xu, F. Amao, M. Ganatra, and V. Benson, “Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models,” Algorithms, vol. 17, no. 6, p. 231, May 2024, doi: 10.3390/a17060231.

M. Z. Alotaibi and M. A. Haq, “Customer Churn Prediction for Telecommunication Companies using Machine Learning and Ensemble Methods,” Eng. Technol. Appl. Sci. Res., vol. 14, no. 3, pp. 14572–14578, Jun. 2024, doi: 10.48084/etasr.7480.

F. Bolikulov, R. Nasimov, A. Rashidov, F. Akhmedov, and Y.-I. Cho, “Effective Methods of Categorical Data Encoding for Artificial Intelligence Algorithms,” Mathematics, vol. 12, no. 16, p. 2553, Aug. 2024, doi: 10.3390/math12162553.

M. Mujahid et al., “Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering,” J. Big Data, vol. 11, no. 1, p. 87, Jun. 2024, doi: 10.1186/s40537-024-00943-4.

S. K. Wagh, A. A. Andhale, K. S. Wagh, J. R. Pansare, S. P. Ambadekar, and S. H. Gawande, “Customer churn prediction in telecom sector using machine learning techniques,” Results Control Optim., vol. 14, p. 100342, Mar. 2024, doi: 10.1016/j.rico.2023.100342.

A. Tharwat, “Classification assessment methods,” Appl. Comput. Inform., vol. 17, no. 1, pp. 168–192, Jan. 2021, doi: 10.1016/j.aci.2018.08.003.

L. Oriana, A. D. S. Wati, A. P. Ramahdani, N. N. Safira, and E. Ismanto, “Peningkatan Kinerja Model Random Forest untuk Deteksi Kecurangan Kartu Kredit Menggunakan RandomizedSearchCV,” vol. 15, no. 2.

S. Ouf, K. T. Mahmoud, and M. A. Abdel-Fattah, “A proposed hybrid framework to improve the accuracy of customer churn prediction in telecom industry,” J. Big Data, vol. 11, no. 1, p. 70, May 2024, doi: 10.1186/s40537-024-00922-9.

A. S. Thorat, “A Random Forest Churn Prediction Model: An Investigation of Machine Learning Techniques for Churn Prediction and Factor Identification in the Telecommunications Industry,” vol. 71, no. 4, 2022.

A. K. Ahmad, A. Jafar, and K. Aljoumaa, “Customer churn prediction in telecom using machine learning in big data platform,” J. Big Data, vol. 6, no. 1, p. 28, Dec. 2019, doi: 10.1186/s40537-019-0191-6.

Published

06/28/2026

How to Cite

Junaidi, M., & Roberto Kaban. (2026). Implementasi Algoritma Random Forest untuk Prediksi Customer Churn pada Perusahaan Telekomunikasi. Jurnal SINTA: Sistem Informasi Dan Teknologi Komputasi, 3(3). https://doi.org/10.61124/sinta.v3i3.320