SciRepID - Machine Learning Implementation for E-commerce Delivery Delay Prediction Using XGBoost Algorithm


Machine Learning Implementation for E-commerce Delivery Delay Prediction Using XGBoost Algorithm

International Journal of Engineering and Applied Science
International Forum of Researchers and Lecturers (IFREL)

📄 Abstract

Delivery delays pose a major challenge in the e-commerce industry, often leading to decreased customer satisfaction and negatively impacting business operations. In this study, the XGBoost (Extreme Gradient Boosting) algorithm is applied to predict delivery delays based on a dataset containing 96,476 records. These records include various features relevant to the delivery process, such as shipping distance, carrier performance, and order characteristics. The model achieves a high overall accuracy of 93.24%, indicating strong general performance. In particular, XGBoost demonstrates excellent results in predicting on-time deliveries, achieving a precision of 93% and a recall of 100%. However, the model struggles to correctly identify delayed deliveries. The recall for delayed deliveries is 0%, and the F1-score is extremely low at 0.01. This significant discrepancy reveals a critical limitation in the model's performance — the inability to detect minority class cases (delayed deliveries) due to class imbalance within the dataset. The results highlight the importance of addressing data imbalance in predictive modeling for delivery outcomes. When the dataset is dominated by on-time delivery records, the model tends to be biased toward that class, failing to learn the patterns associated with delays. To improve performance, the study recommends integrating class balancing techniques such as SMOTE (Synthetic Minority Oversampling Technique) to generate synthetic samples of the minority class. Additionally, the use of alternative evaluation metrics beyond accuracy — such as precision, recall, and F1-score for each class — is suggested to provide a more comprehensive understanding of model effectiveness. Overall, the study provides valuable insights into the complexities of predicting delivery delays and outlines practical strategies for enhancing future models in e-commerce logistics analytics.

🔖 Keywords

#Delay Prediction; Delivery; E-commerce; Machine Learning; XGBoost

ℹ️ Informasi Publikasi

Tanggal Publikasi
31 July 2025
Volume / Nomor / Tahun
Volume 2, Nomor 3, Tahun 2025

📝 HOW TO CITE

Stevanus Putra Lesmana; Dina Hermawati; Maulina Mukaromah; Iqbal Ahmad Bukhari; Norma Puspitasari, "Machine Learning Implementation for E-commerce Delivery Delay Prediction Using XGBoost Algorithm," International Journal of Engineering and Applied Science, vol. 2, no. 3, Jul. 2025.

ACM
ACS
APA
ABNT
Chicago
Harvard
IEEE
MLA
Turabian
Vancouver

🔗 Artikel Terkait dari Jurnal yang Sama

📊 Statistik Sitasi Jurnal

Tren Sitasi per Tahun