6285641688335, 628551515511 info@scirepid.com

 
sji - Scientific Journal of Informatics - Vol. 11 Issue. 2 (2024)

Identifying Relevant Messages from Citizens in a Social Media Platform for Natural Disasters in Indonesia Using Histogram Gradient Boosting and Self-Training Classifier

Ariana Yunita, Zhafran Ramadhan, Angelia Regina Dwi Kartika, Ade Irawan,



Abstract

Purpose: This research aims to develop a classification model using histogram-based gradient boosting to identify relevant contextual tweets about disasters. This model can then be used for subsequent data cleaning stages.
Methods: This study uses a semi-supervised approach to develop a classification model using histogram-based gradient boosting. The model is trained to identify and remove irrelevant tweets that are related to disasters and gathered from Twitter. Optimization techniques, such as the AdaBoost classifier, calibrated classifier, and self-training classifier, are used to enhance the model's performance. The goal is to accurately recognize and categorize relevant tweets for additional data analysis and decision-making.
Result: The classification model that has been developed has achieved a high F1-score of 93.07%, which indicates its effectiveness in filtering disaster-related tweets that are relevant. This highlights the potential of the model to enable more precise aid distribution and faster decision-making in disaster response efforts. The successful implementation of the model also demonstrates its usefulness in utilizing social media data to enhance disaster management practices.
Novelty: This research contributes to the analysis of social media through machine learning algorithms. By utilizing social media, specifically Twitter, as a valuable resource for disaster response efforts, this study tackles challenges related to data collection and analysis in disaster management. The classification of relevant tweets into different types of natural disasters offers opportunities to enhance stakeholder decision-making processes in disaster scenarios.







DOI :


Sitasi :

0

PISSN :

2460-0040

EISSN :

2407-7658

Date.Create Crossref:

13-Feb-2025

Date.Issue :

16-May-2024

Date.Publish :

16-May-2024

Date.PublishOnline :

16-May-2024



PDF File :

Resource :

Open

License :