Model Prediksi Pelunasan Haji Berbasis XGBoost Dengan Interpretasi Shap

Studi Prediksi Pelunasan Haji dengan XGBoost dan SHAP di Provinsi Jambi

Authors

  • Yan Apriadi Universitas Dinamika Bangsa
  • Dodo Zaenal Abidin Universitas Dinamika Bangsa
  • Jasmir Jasmir Universitas Dinamika Bangsa

DOI:

https://doi.org/10.61132/prosemnasproit.v2i2.178

Keywords:

Haji, interpretabilitas model, ketidakseimbangan kelas, pembelajaran mesin, prediksi pelunasan, SHAP, XGBoost

Abstract

This study develops an interpretable machine learning model to predict the settlement status of Hajj fees in Jambi Province, Indonesia. Utilizing the XGBoost algorithm on a dataset of 4,332 prospective pilgrims from 2025, the research addresses the critical challenge of class imbalance where only 28.5% of samples are labeled "Unsettled". The baseline XGBoost model achieved a ROC-AUC of 0.7778, with a recall of 0.3482 for the minority class. SHAP (SHapley Additive exPlanations) analysis was employed to interpret model predictions, revealing that financial features specifically NILAI_VA (Virtual Account Value), JML_SETORAN (Deposit Amount), and JML_PELUNASAN (Settlement Amount) are the most significant factors influencing repayment risk, with negative SHAP values indicating increased default probability. The findings demonstrate that an interpretable XGBoost framework can provide both predictive accuracy and actionable insights for policymakers, enabling targeted interventions such as flexible payment schemes and enhanced financial monitoring for high-risk pilgrims..

References

Amjad, M., Ahmad, I., Ahmad, M., Wróblewski, P., Kamiński, P., & Amjad, U. (2022). Prediction of Pile Bearing Capacity Using XGBoost Algorithm: Modeling and Performance Evaluation. Applied Sciences, 12(4), 2126. https://doi.org/10.3390/app12042126

Dwivedi, R., Dave, D., Naik, H., Singhal, S., Omer, R., Patel, P., Qian, B., Wen, Z., Shah, T., Morgan, G., & Ranjan, R. (2023). Explainable AI (XAI): Core Ideas, Techniques, and Solutions. ACM Computing Surveys, 55(9), 1–33. https://doi.org/10.1145/3561048

Kumar, A., Sharma, S., & Mahdavi, M. (2021). Machine Learning (ML) Technologies for Digital Credit Scoring in Rural Finance: A Literature Review. Risks, 9(11), 192. https://doi.org/10.3390/risks9110192

Li, W., Yin, Y., Quan, X., & Zhang, H. (2019). Gene Expression Value Prediction Based on XGBoost Algorithm. Frontiers in Genetics, 10, 1077. https://doi.org/10.3389/fgene.2019.01077

Ponce‐Bobadilla, A. V., Schmitt, V., Maier, C. S., Mensing, S., & Stodtmann, S. (2024). Practical guide to SHAP analysis: Explaining supervised machine learning model predictions in drug development. Clinical and Translational Science, 17(11), e70056. https://doi.org/10.1111/cts.70056

Zhang, X., Zhang, T., Hou, L., Liu, X., Guo, Z., Tian, Y., & Liu, Y. (2025). Data-Driven Loan Default Prediction: A Machine Learning Approach for Enhancing Business Process Management. Systems, 13(7), 581. https://doi.org/10.3390/systems13070581

Zhu, X., Chu, Q., Song, X., Hu, P., & Peng, L. (2023). Explainable prediction of loan default based on machine learning models. Data Science and Management, 6(3), 123–133. https://doi.org/10.1016/j.dsm.2023.04.003

Downloads

Published

2025-12-30

How to Cite

Yan Apriadi, Dodo Zaenal Abidin, & Jasmir Jasmir. (2025). Model Prediksi Pelunasan Haji Berbasis XGBoost Dengan Interpretasi Shap: Studi Prediksi Pelunasan Haji dengan XGBoost dan SHAP di Provinsi Jambi. Prosiding Seminar Nasional Ilmu Teknik, 2(2), 1289–1294. https://doi.org/10.61132/prosemnasproit.v2i2.178