Main Article Content

Abstract

Bad credit card is a problem of inability of credit card users to pay credit card bills that can cause losses to both parties concerned. In order to avoid losses caused by bad credit cards, the provider must conduct a careful analysis of prospective or old customers using credit cards. This study aims to classify bad credit card customers using machine learning techniques, namely classification techniques. One of the classification techniques used is the XGBoost method which is useful for regression analysis and classification based on the Gradient Boosting Decision Tree (GBDT), the XGBoost method has several hyperparameters that can be configured to improve the performance of the model. Hyperparameter tuning method used is grid search cross validation which is then validated using 10-Fold Cross Validation. XGBoost hyperparameters configured include n_estimators, max_depth, subsample, gamma, colsample_bylevel, min_child_weight and learning_rate. Based on the results of this study proves that the use of algorithms with hyperparameter tuning can improve the performance of eXtreme Gradient Boosting algorithm in the process of classification of credit card customers with an accuracy of 80.039%, precision of 81.338% and a recall value of 96.854%.


 


Keywords: XGBoost, classification, Accuracy, Precision, Recall

Keywords

Keywords: XGBoost, classification, Accuracy, Precision, Recall

Article Details

References

  1. [1] Chen, T., & Guestrin, C. (2016). Xgboost: A Scalable Tree Boosting System. In Proceedings Of The 22nd Acm Sigkdd International Conference On Knowledge Discovery And Data Mining (Pp. 785-794).
  2. [2] Cherif, Iyad Lahsen, And Abdesselem Kortebi. 2019. “On Using Extreme Gradient Boosting (Xgboost) Machine Learning Algorithm For Home Network Traffic Classification.” IFIP Wireless Days 2019-April: 1–6.
  3. [3] Gorunescu, F. (2011) Data Mining Concepts, Models And Techniques.
  4. [4] Hanif, I. (2020) ‘Implementing Extreme Gradient Boosting (Xgboost) Classifier To Improve Customer Churn Prediction’.
  5. [5] Kristianti, D. S. (2014) ‘Kartu Kredit Syariah dan Perilaku Konsumtif Masyarakat’, AHKAM : Jurnal Ilmu Syariah, 14(2), Pp. 287–296.
  6. [6] Li, S. And Zhang, X. (2020) ‘Research On Orthopedic Auxiliary Classification and Prediction Model Based On Xgboost Algorithm’, Neural Computing And Applications, 32(7), Pp. 1971–1979.
  7. [7] Mujilahwati, S. Dkk (2021) ‘Optimasi Hyperparameter Tensorflow Dengan Menggunakan Optuna Di Python : Study Kasus Klasifikasi Dokumen Abstrak Skripsi’, 5, Pp. 1084–1089. Doi: 10.30865/Mib.V5i3.3090.
  8. Nokeri, T. C. (2021) Data Science Revealed, Data Science Revealed. Doi: 10.1007/978-1-4842-6870-4.
  9. [8] Muslim, Ichwanul, Karo Karo, Fakultas Informatika, and Universitas Telkom. 2020. “Implementasi Metode Xgboost dan Feature Importance Untuk Klasifikasi Pada Kebakaran Hutan Dan Lahan.” 1(1): 10–16.
  10. [9] Nokeri, Tshepo Chris. 2021. Data Science Revealed Data Science Revealed.
  11. [10] Nyoman, Ngakan Et Al. 2020. “Prediksi Kecelakaan Lalu Lintas Di Bali Dengan Xgboost Pada Python.” 8(3): 188–96.
  12. [11] Pavan, R. Et Al. (2021) ‘Bayesian Optimization and Gradient Boosting To Detect Phishing Websites’, 2021 55th Annual Conference On Information Sciences And Systems, CISS 2021, Pp. 2–6.
  13. [12] Pritasari, Mardhiyah Putri Ayu Siregar Riki Ruli A Palupiningsih. 2020. “Klasifikasi Untuk Memprediksi Pembayaran Kartu Kredit Macet.” Jurnal Teknologia 3(1): 91–101.
  14. [13] Ramadhan, M. M. Et Al. (2017) ‘Parameter Tuning In Random Forest Based On Grid Search Method For Gender Classification Based On Voice Frequency’, Destech Transactions On Computer Science And Engineering, (Cece), Pp. 625–629.
  15. [14] Siringoringo, Rimbun, Resianta Perangin-Angin, and Mufria J Purba. 2021. “Segmentasi dan Peramalan Pasar Retail Menggunakan Xgboost dan Principal Component Analysis.” 5(1): 42–47.
  16. [15] Sunata, Haliem Azrullah, Firman Jodi Rianto, Y. (2020) ‘Komparasi Tujuh Algoritma Identifikasi Fraud ATM Pada PT. Bank Central Asia Tbk 1,2,3’, 7(3), Pp. 441–450.
  17. [16] Tama, B. A. Et Al. (2020) ‘An Enhanced Anomaly Detection In Web Traffic Using A Stack Of Classifier Ensemble’, IEEE Access, 8, Pp. 24120–24134.
  18. [17] Wijayanti, Reny. 2018. “Analisa Klasifikasi Kartu Kredit Menggunakan Algoritma Naive Bayes 1,2.” Jurnal Mantik Penusa 10(2): 80–86.
  19. [18] Zhang, D. And Gong, Y. (2020) ‘The Comparison Of Lightgbm And Xgboost Coupling Factor Analysis And Prediagnosis Of Acute Liver Failure’, IEEE Access, 8.
  20. [19] Zhou, Z.-H. (2012) Ensemble Methods Foundations And Algorithms, SEAISI Quarterly (South East Asia Iron And Steel Institute).