(Muhammad Fernanda Naufal Fathoni, Eva Yulia Puspaningrum, Andreas Nugroho Sihananto)
- Volume: 2,
Issue: 3,
Sitasi : 0
Abstrak:
Rohingya in Indonesia has become trending conversation on social media. Sentiment analysis can get public responds. Big data makes the problem time efficiency labeling process, therefore the lexicon dictionary is needed for the labeling process. Data is growing and circulating very rapidly so it takes a fast and efficient time. Although it is fast and makes it easier to solve problems, it is still necessary to question the accuracy produced when using the lexicon labeling. A comparison of the labeling process between the InSet lexicon and the VADER lexicon was conducted to determine the accuracy of the labeling. It was done by combining lexicon with machine learning method of support vector machine and TF-IDF weighting and accuracy result calculated using confusion marix. Data from social media X as many as 9117 lines and labeled with InSet lexicon result 5241 negative sentiments, 1369 positive, and 521 neutral. Then the labeling results with VADER produced 2749 positive, 2523 negative, and 1881 neutral. After labeled, processed SVM and calculated accuracy with results of InSet lexicon accuracy having an average of 85.8% while the VADER SVM lexicon has an average of 82.65%.