6285641688335, 628551515511 info@scirepid.com

 
J. Comput. Theor. Appl. - Journal of Computing Theories and Applications - Vol. 1 Issue. 3 (2024)

Dataset Analysis and Feature Characteristics to Predict Rice Production based on eXtreme Gradient Boosting

Ella Budi Wijayanti, De Rosal Ignatius Moses Setiadi, Bimo Haryo Setyoko,



Abstract

Rice plays a vital role as the main food source for almost half of the global population, contributing more than 21% of the total calories humans need. Production predictions are important for determining import-export policies. This research proposes the XGBoost method to predict rice harvests globally using FAO and World Bank datasets. Feature analysis, removal of duplicate data, and parameter tuning were carried out to support the performance of the XGBoost method. The results showed excellent performance based on which reached 0.99. Evaluation of model performance using metrics such as MSE, and MAE measured by k-fold validation show that XGBoost has a high ability to predict crop yields accurately compared to other regression methods such as Random Forest (RF), Gradient Boost (GB), Bagging Regressor (BR) and K-Nearest Neighbor (KNN). Apart from that, an ablation study was also carried out by comparing the performance of each model with various features and state-of-the-art. The results prove the superiority of the proposed XGBoost method. Where results are consistent, and performance is better, this model can effectively support agricultural sustainability, especially rice production.







DOI :


Sitasi :

0

PISSN :

EISSN :

3024-9104

Date.Create Crossref:

21-Feb-2024

Date.Issue :

21-Feb-2024

Date.Publish :

21-Feb-2024

Date.PublishOnline :

21-Feb-2024



PDF File :

Resource :

Open

License :

https://creativecommons.org/licenses/by/4.0