SciRepID - Scientific Publication Search

Analisis sentimen pengguna tiktok terhadap program makan bergizi gratis menggunakan metode naive bayes

Rifna, Iza; Nurdin, Nurdin

IT-Explore: Jurnal Penerapan Teknologi Informasi dan Komunikasi • 2026 • Fakultas Teknologi Informasi, Universitas Kristen Satya Wacana

The Free Nutritional Meal Program (MBG) is a government policy that is widely discussed by the public on social media, especially TikTok. The comments that have emerged reveal differing public opinions about the program, so an analysis is needed to determine the overall tendency of public sentiment. This study aims to analyze TikTok user sentiment towards the Free Nutritional Meal Program using the Naive Bayes method. The research proceeds in several steps: collecting TikTok comment data, preprocessing the text, labeling each comment as positive, negative, or neutral, transforming features with TF-IDF, and classifying with the Naive Bayes algorithm. Based on an analysis of 500 comments, positive sentiment dominates public opinion at 42% (210 comments), followed by negative sentiment at 36% (180 comments) and neutral sentiment at 22% (110 comments). Testing the Naive Bayes classification model yields an accuracy of 86%, precision of 84%, recall of 85%, and F1-score of 84%. The study concludes that the Naive Bayes method is an effective approach in social media sentiment analysis for mapping public responses to government policies.

https://doi.org/10.24246/itexplore.v5i2.2026.pp241-251

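The TF-IDF and Naive Bayes steps named in the abstract above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the authors' implementation: the toy comments, their labels, the smoothed IDF formula, and the function names `tfidf_vectors`, `train_nb`, and `predict` are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict

def tfidf_vectors(docs):
    """Return one {term: tf-idf weight} dict per tokenized document, plus the IDF table."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumed variant): a term present in every document keeps weight 1.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf

def train_nb(vectors, labels, alpha=1.0):
    """Fit multinomial Naive Bayes on TF-IDF weights with Laplace smoothing."""
    vocab = {t for vec in vectors for t in vec}
    class_docs = Counter(labels)
    weight = defaultdict(Counter)  # class -> term -> summed tf-idf weight
    for vec, label in zip(vectors, labels):
        weight[label].update(vec)
    model = {}
    for label, n in class_docs.items():
        total = sum(weight[label].values()) + alpha * len(vocab)
        log_like = {t: math.log((weight[label][t] + alpha) / total) for t in vocab}
        model[label] = (math.log(n / len(labels)), log_like)
    return model

def predict(model, idf, tokens):
    """Pick the class with the highest log-posterior; unseen terms are ignored."""
    vec = {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}
    def score(label):
        log_prior, log_like = model[label]
        return log_prior + sum(w * log_like[t] for t, w in vec.items())
    return max(model, key=score)

# Made-up, already-tokenized comments (hypothetical data, not from the study).
docs = [["program", "bagus", "membantu"], ["makan", "gratis", "bagus"],
        ["program", "buruk", "boros"], ["anggaran", "boros", "buruk"],
        ["program", "mulai", "senin"], ["jadwal", "mulai", "senin"]]
labels = ["positive", "positive", "negative", "negative", "neutral", "neutral"]

vectors, idf = tfidf_vectors(docs)
model = train_nb(vectors, labels)
print(predict(model, idf, ["bagus", "membantu"]))
```

In practice the preprocessing step (case folding, stopword removal, stemming) would run before tokenized comments reach `tfidf_vectors`.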

Analisis Sentimen Komentar Netizen Terhadap 17+8 Tuntutan Rakyat Pada X Menggunakan Naive Bayes Classifier

Halawa, Fransisco Lucky; Heriansyah, Rudi; Permatasari, Indah

Teknik: Jurnal Ilmu Teknik dan Informatika • 2026 • LPPM Sekolah Tinggi Ilmu Ekonomi - Studi Ekonomi Modern

This study analyzes netizen sentiment toward the 17+8 People's Demands circulating on the digital platform X from August 18 through October 31, 2025. A total of 1,837 comments were obtained through scraping. The classification research stages include data preprocessing, lexicon-based sentiment weighting, and feature extraction using TF-IDF. 80% of the data was used for training and the remaining 20% for validation. The findings reveal that the majority of comments, 81.14%, contained negative sentiment, while the remaining 18.86% were positive. The outcomes demonstrate that community reactions toward the 17+8 People's Demands were dominated by unsupportive views. From a theoretical standpoint, this work enriches knowledge of public opinion classification on political issues through a computational approach, while also serving as a reference for future research focused on improving the accuracy of sentiment analysis related to political dynamics and the behavior of state institutions.

https://doi.org/10.51903/teknik.v6i1.1228

Open Access
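The 80/20 split mentioned in the abstract above can be illustrated with a seeded shuffle. This is only a sketch: the abstract does not say how the split was drawn, so the random shuffle, the seed, the rounding of the test-set size, and the helper name `train_test_split` are assumptions.

```python
import random

def train_test_split(items, test_fraction=0.2, seed=42):
    """Shuffle a copy of the items and hold out a fraction for validation."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

# Placeholder comments standing in for the 1,837 scraped ones.
comments = [f"comment {i}" for i in range(1837)]
train, test = train_test_split(comments)
print(len(train), len(test))  # 1470 367
```

Because 20% of 1,837 is not a whole number, the exact sizes depend on the rounding convention; the counts used in the study are not stated.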

Analisis Tingkat Kepuasan Pengguna QRIS Berdasarkan Pengalaman Dan Persepsi Pengguna twitter/ X Menggunakan Naive Baiyes

Veri Arinal; Satria Wira Yudha; Muhammad Joko Umbaran Kharis Bahrudin; Dessyanti Ryantina

International Journal of Information Engineering and Science • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

QRIS (Quick Response Code Indonesian Standard) has become a widely used national digital payment standard. User satisfaction with this service needs to be monitored continuously to ensure its sustainability. This study aims to predict the level of QRIS user satisfaction based on their experiences and perceptions expressed organically on the Twitter social media platform. The method used is sentiment analysis with the Naive Bayes classification algorithm implemented using RapidMiner software. The research data was obtained from Twitter user comments collected through web scraping techniques. The text data then went through a preprocessing stage that included cleansing, stopword filtering, stemming, and tokenizing to be prepared as features ready to be processed by the model. The data was divided into training (80%) and testing (20%) subsets for model training and validation. The results showed that the Naive Bayes model was able to predict user satisfaction sentiment with an accuracy of 80.99%. These findings indicate that the model is highly accurate in identifying satisfied comments and sufficiently sensitive in detecting dissatisfaction. This study concludes that sentiment analysis of Twitter UGC data using Naive Bayes is an effective and efficient approach for predicting QRIS user satisfaction in real time. The practical implication of this study is to provide an automatic feedback system for service providers to monitor public sentiment and take targeted corrective actions.

https://doi.org/10.62951/ijies.v2i4.53

Open Access

Analisis Sentimen Tren 'Kabur Aja Dulu' pada Sosial Media X sebagai Dasar Perancangan Sistem Pemantauan Sentimen Publik Menggunakan Naive Bayes dan SVM

Sutisna Sutisna; Tri Wahyudi; Dwi Swasono Rachmad; Fachrur Rozi

International Journal of Information Engineering and Science • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Social media X (Twitter) has become the main platform for the Indonesian public to express opinions, including on the trend of 'kabur aja dulu' (let's just run away for a bit). This research aims to classify the sentiments of the public using the Naïve Bayes and Support Vector Machine (SVM) methods, and to compare the accuracy of both in sentiment analysis. Data was collected via the Twitter API with the hashtag #kaburajadulu, resulting in 2,067 tweets, which, after the cleansing process and manual labeling, left 385 data points. The analysis process followed the CRISP-DM stages, which include business understanding, data understanding, data preparation, modeling, evaluation, and deployment. Model evaluation was conducted using a confusion matrix with accuracy, precision, and recall metrics. The classification results show that 82% of tweets have a positive sentiment and 18% negative. The Naïve Bayes algorithm achieved an accuracy of 86.49%, slightly lower than SVM, which reached 88.05%. In conclusion, Support Vector Machine is more effective in sentiment classification on public opinion data. This research contributes to the digital mapping of public opinion and recommends the development of automatic labeling methods as well as the exploration of advanced algorithms in the future.

https://doi.org/10.62951/ijies.v2i3.79

Open Access

Implementasi model Naive Bayes multikategori untuk analisis sentimen produk Wardah di E-commerce Shoppe

Mesra Betty Yel; Sopan Adrianto; Rasiban Rasiban; Eva Widiyanti

International Journal of Information Engineering and Science • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

The growth of information technology has driven changes in consumer behavior, one of which is through e-commerce platforms such as Shopee. This phenomenon has generated a large number of customer reviews, including those for local cosmetic products such as Wardah. These reviews serve as an important source of information for understanding customer perceptions and satisfaction levels. However, manual analysis of large and linguistically diverse datasets is inefficient and potentially subjective. This study aims to implement the multi-category Naive Bayes algorithm to classify the sentiment of Wardah product reviews on Shopee into three categories: positive, negative, and neutral. The data were collected using a web scraping technique and processed through a series of preprocessing stages including case folding, tokenization, stopword removal, stemming, and text cleaning. Subsequently, term weighting was performed using the TF-IDF method prior to classification. Model performance was evaluated using a confusion matrix as well as accuracy, precision, and recall metrics. The results indicate that the multi-category Naive Bayes algorithm achieved an accuracy of 86.00%, a precision of 86.63%, and a recall of 98.24%. This approach can assist business practitioners in objectively understanding customer opinions and support decision-making in business strategy and product development.

https://doi.org/10.62951/ijies.v2i2.6

Open Access
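The evaluation described in the abstract above (accuracy, precision, and recall read off a confusion matrix) can be sketched as follows for a three-class problem. The matrix values are invented for illustration and do not reproduce the reported 86.00% / 86.63% / 98.24%; the function name `per_class_metrics` is likewise an assumption.

```python
def per_class_metrics(matrix, labels):
    """matrix[i][j] = number of items with true class i predicted as class j."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(labels)))
    report = {}
    for k, label in enumerate(labels):
        tp = matrix[k][k]
        predicted_k = sum(row[k] for row in matrix)  # column sum
        actual_k = sum(matrix[k])                    # row sum
        precision = tp / predicted_k if predicted_k else 0.0
        recall = tp / actual_k if actual_k else 0.0
        report[label] = (precision, recall)
    return correct / total, report

labels = ["positive", "negative", "neutral"]
# Hypothetical confusion matrix (rows: true class, columns: predicted class).
matrix = [[8, 1, 1],
          [2, 6, 2],
          [0, 1, 4]]

accuracy, report = per_class_metrics(matrix, labels)
print(accuracy)
for label, (precision, recall) in report.items():
    print(label, round(precision, 2), round(recall, 2))
```

Whether a paper reports per-class, macro-averaged, or weighted figures changes the headline numbers, which is worth checking when comparing results across the entries in this listing.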

Analisis Sentimen Terhadap Kinerja Hukum Di Indonesia Berdasarkan Tanggapan Komentar Twitter Menggunakan Algoritma Naive Bayes

Rasiban Rasiban; Dadang Iskandar Mulyana; Muhammad Joko Umbaran Kharis Bahrudin; Nicola Marthy

International Journal of Information Engineering and Science • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

The development of social media, especially Twitter, has become one of the main means for people to express opinions and criticism on various issues, including the performance of law in Indonesia. This study aims to analyze public sentiment towards the performance of law based on Twitter user comments using the Naïve Bayes algorithm. The research data consists of 1,004 comments collected from several videos related to legal topics. The analysis process includes the stages of data crawling, preprocessing (text cleaning, normalization, and tokenization), labeling sentiment into positive, negative, and neutral, and testing the Naïve Bayes model. The results show that the Naïve Bayes algorithm is able to classify sentiment with an accuracy level of 93.73%. The distribution of sentiment from 1,004 comments shows that the majority of public opinion is (negative/positive/neutral), which indicates that public perception of the performance of law is still (critical/positive). These findings are expected to be input for related parties to understand public opinion and improve the quality of legal performance in

https://doi.org/10.62951/ijies.v2i2.84

Open Access

ANALISIS SENTIMEN TRENDING TOPIK #INDONESIAGELAP DI MEDIA SOSIAL X MENGGUNAKAN ALGORITMA NAIVE BAYES BERBASIS PARTICLE SWARM OPTIMIZATION

Untung Surapati; Veri Arinal; Tri Wahyudi; Ahmad Fauzan

International Journal of Applied Mathematics and Computing • 2026 • Asosiasi Riset Ilmu Matematika dan Sains Indonesia

The rise of social media has created a digital public sphere that enables users to express their opinions on social and political issues openly and in real-time. One of the most discussed topics on social media platform X is the trending hashtag #IndonesiaGelap, which reflects public concern and criticism regarding various governmental and societal conditions. This study aims to conduct sentiment analysis on tweets containing the hashtag to determine the overall sentiment trend among users. The method employed in this research is the Naive Bayes classification algorithm, known for its simplicity and effectiveness in text classification. To enhance the model’s performance, Particle Swarm Optimization (PSO) is applied to optimize feature selection and parameter tuning. The dataset consists of public tweets collected via the Twitter API, followed by preprocessing, feature extraction using TF-IDF, and sentiment classification into three categories: positive, negative, and neutral. The results indicate that the integration of PSO significantly improves the classification accuracy of the Naive Bayes model compared to the baseline. The majority of tweets related to #IndonesiaGelap exhibit a negative sentiment, indicating widespread public dissatisfaction and criticism. This research is expected to contribute to a better understanding of public perception and serve as valuable input for stakeholders in addressing social issues in the digital age.

https://doi.org/10.62951/ijamc.v2i2.127

Open Access

Prediksi Nilai Penjualan Pulsa dengan algoritma naive bayes pada sujase cell jakarta

Veri Arinal; Untung Surapati; Sugiyono Sugiyono; Dita Safira

International Journal of Applied Mathematics and Computing • 2026 • Asosiasi Riset Ilmu Matematika dan Sains Indonesia

-

https://doi.org/10.62951/ijamc.v1i3.110

Open Access

Analisis Sentimen Pengguna YouTube terhadap Game Mobile menggunakan Metode Naïve Bayes

Aura Rahayu Aksa Radiana; Fathoni Mahardika; Dani Indra Junaedi

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This study aims to develop a sentiment classification method for YouTube user comments related to the game Love and Deepspace using the Naïve Bayes algorithm, focusing on improving the text data processing and understanding user perceptions. Comment data were collected through scraping from YouTube videos, followed by preprocessing including text cleaning, normalization, stopword removal, stemming, and translation into English. Initial labeling was conducted using TextBlob, then the data were randomly sampled for training the Naïve Bayes model. Evaluation involved comparing sentiment distributions and visualization using Word Cloud and bar charts. The Naïve Bayes model achieved an accuracy of 77.36% in sentiment classification. The sentiment distribution shows differences between TextBlob (positive: 1,011, neutral: 1,312, negative: 575) and Naïve Bayes (positive: 901, neutral: 1,627, negative: 370), with Naïve Bayes being more conservative. The Word Cloud visualization identifies dominant words such as "bang," "game," and "main," while the bar chart shows the largest proportion of neutral sentiment. Naïve Bayes is effective for sentiment classification on informal comment data, with significant differences from rule-based methods like TextBlob. This research contributes to the development of text data processing techniques and user perception analysis, as well as opening up optimization opportunities with other algorithms like SVM for better accuracy.

https://doi.org/10.61132/merkurius.v4i3.1602

Open Access

Analisis Sentimen Pelanggan UMKM di Media Sosial Menggunakan Naive Bayes untuk Menilai Persepsi Kualitas Produk

Ayu Astuti Siregar; Al-Khowarizmi

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika • 2026 • Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Social media has evolved into a significant platform where consumers freely express their opinions, experiences, and levels of satisfaction regarding various products, including those offered by Micro, Small, and Medium Enterprises (MSMEs). The comments and reviews shared by customers on these platforms contain diverse sentiments that can serve as valuable indicators of how consumers perceive product quality. Understanding these sentiments is crucial for MSME owners, as it allows them to evaluate their products and adapt to market expectations more effectively. This study aims to analyze customer sentiment toward MSME products on social media by utilizing the Naïve Bayes algorithm, a widely used classification method in text mining. The data used in this research consist of customer comments collected from various social media platforms. The research process involves several stages, including data collection, manual labeling of sentiments, text preprocessing (such as tokenization, case folding, and stopword removal), and splitting the dataset into training and testing subsets. Subsequently, the classification process is carried out using the Naïve Bayes algorithm to categorize sentiments into positive, negative, and neutral classes. The results of this study demonstrate that the Naïve Bayes method is effective in classifying customer sentiments with a satisfactory level of accuracy. These findings provide a comprehensive overview of consumer perceptions regarding the quality of MSME products. Furthermore, this research is expected to assist MSME business owners in understanding customer feedback more systematically and using it as a basis for improving product quality and enhancing customer satisfaction in a competitive digital marketplace.

https://doi.org/10.61132/merkurius.v4i3.1601

Open Access

Quantifying the Impact of Text Preprocessing on IndoBERT Fine-Tuning for Indonesian Informal Culinary Sentiment Analysis

Budianoor, Rahmat; Saputro, Setyo Wahyu; Abadi, Friska; Nugroho, Radityo Adi; Farmadi, Andi

Journal of Computing Theories and Applications • 2026 • Universitas Dian Nuswantoro

Indonesian culinary comments on social media platforms such as Instagram are characterized by informal spelling, regional language mixing, slang expressions, and emojis, posing substantial challenges for automated sentiment classification. While IndoBERT has demonstrated strong performance across Indonesian natural language processing tasks, the contribution of individual preprocessing components to fine-tuning performance on informal text remains underexplored, particularly in the culinary domain. This study addresses this gap by conducting a systematic preprocessing ablation study on IndoBERT-Base fine-tuning for Indonesian culinary sentiment classification, accompanied by a comparative evaluation against Naive Bayes with TF-IDF, SVM with TF-IDF, and BiLSTM as representative baselines. A dataset of 3,500 manually labeled Instagram culinary comments across three sentiment classes was used, with a stratified 80/10/10 split. Six preprocessing variants were evaluated under identical experimental conditions to isolate the contribution of each component. The results show that slang normalization is the most impactful single preprocessing step, yielding a macro F1-score gain of +0.0609 over the no-preprocessing baseline, while the full pipeline achieves an accuracy of 0.8800 and a macro F1-score of 0.8465. IndoBERT-Base with the full pipeline outperforms all baselines across all evaluation metrics. Per-class analysis reveals that the negative class achieves the lowest F1-score of 0.7600, with sarcastic expressions and Banjar regional vocabulary identified as primary sources of misclassification. These findings indicate that preprocessing decisions have a measurable and non-uniform effect on IndoBERT fine-tuning performance. In this study, slang normalization provides the most substantial individual contribution in bridging the vocabulary gap between informal user-generated text and the model’s pre-training distribution.

https://doi.org/10.62411/jcta.15980

Open Access

Enhancing Software Defect Prediction through Hybrid Multi-Filter Feature Selection and Imbalance Handling

Maulana, Muhammad Khalid; Saputro, Setyo Wahyu; Faisal, Mohammad Reza; Nugroho, Radityo Adi; Ramadhan, As’ary

Journal of Computing Theories and Applications • 2026 • Universitas Dian Nuswantoro

Software Defect Prediction (SDP) aims to identify defective modules early in the software development lifecycle to improve software quality and reduce maintenance costs. However, SDP datasets commonly suffer from high dimensionality, feature redundancy, and class imbalance, which can degrade model performance and stability. This study proposes a hybrid feature selection framework to address these challenges and enhance prediction performance. The proposed approach integrates Combined Correlation and Mutual Information (CONMI), which combines the Pearson Correlation Coefficient (PCC) and Mutual Information (MI) to capture both linear and nonlinear feature relevance. The selected features are further refined through Top-K selection, correlation-based filtering to reduce multicollinearity, and Backward Elimination (BE) to obtain an optimal feature subset. To address class imbalance, SMOTE-Tomek is applied by combining over-sampling and data cleaning techniques. Experiments are conducted on twelve NASA MDP datasets using Logistic Regression (LR) and Naïve Bayes (NB) classifiers. The results show that the proposed framework consistently achieves the best performance, with Logistic Regression combined with SMOTE-Tomek obtaining the highest average AUC of 0.7923 ± 0.0714, while NB achieves 0.7554 ± 0.0580. Statistical analysis using a paired t-test indicates that the proposed method significantly outperforms MI+SMOTE-Tomek and BE+SMOTE-Tomek for Logistic Regression, whereas no significant differences are observed for NB. In addition to improving overall classification performance (AUC), the proposed approach also enhances minority class detection, as reflected in improved Recall and F1-score. Overall, the proposed hybrid framework provides an effective and reliable solution for software defect prediction, particularly for high-dimensional and imbalanced datasets.

https://doi.org/10.62411/jcta.15943

Open Access

Analisis sentimen masyarakat terhadap putusan Mahkamah Konstitusi tentang batasan usia calon Presiden dan Wakil Presiden di media sosial Twitter

Noviolen Jehovan Dieksa; Pakereng, Ineke

IT-Explore: Jurnal Penerapan Teknologi Informasi dan Komunikasi • 2026 • Fakultas Teknologi Informasi, Universitas Kristen Satya Wacana

This study evaluates public sentiment toward Constitutional Court Decision No. 90/PUU-XXI/2023 regarding the age limit for presidential and vice-presidential candidates, a controversial issue closely related to Indonesia’s democratic dynamics. Understanding public opinion on Twitter, as a major platform for political expression, is essential for informing electoral policy formulation. Data were collected using Tweet Harvest through Google Colab and analyzed using the Naïve Bayes algorithm as the primary sentiment classification method, with RapidMiner employed to support and streamline the analytical process. The analysis process included data cleaning, text normalization, stopword removal, manual labeling of 80 tweets as training data, and automatic sentiment classification to identify positive and negative sentiments. From a total of 151 analyzed tweets, 84 (55.63%) were classified as negative and 67 (44.37%) as positive, with the model achieving an accuracy of 66.67%. These findings suggest a tendency toward public opposition to the decision, reflecting dissatisfaction among Twitter users. The study demonstrates that Naïve Bayes is reasonably effective for sentiment classification with limited datasets and provides insights for policymakers in understanding public responses to election-related regulations.

https://doi.org/10.24246/itexplore.v5i1.2026.pp1-10

Open Access
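The sentiment shares quoted in the abstract above follow directly from the tweet counts, as this short check shows. Only the counts 84 and 67 come from the abstract; the variable names are illustrative.

```python
# Tweet counts as reported in the abstract.
counts = {"negative": 84, "positive": 67}
total = sum(counts.values())

# Share of each class as a percentage, rounded to two decimals.
shares = {label: round(100 * n / total, 2) for label, n in counts.items()}
print(total, shares)
```

The counts sum to the stated 151 tweets, and the rounded shares match the reported 55.63% and 44.37%.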

Analisis Sentimen Publik pada TikTok terhadap Rencana Penerapan Sistem Balik Nama Ponsel Bekas menggunakan Naive Bayes dan Support Vector Machine

Afif Lustyo Muji; Aziz Musthofa; Dihin Muriyatmoko

Prosiding Seminar Nasional Ilmu Teknik • 2026 • Asosiasi Riset Ilmu Teknik Indonesia

Since the announcement of the policy plan for a name transfer system in the sale of used mobile phones, the issue has attracted widespread public attention and discussion. People have expressed their opinions on social media platforms, particularly TikTok. This study aims to classify the sentiment of TikTok users using Naive Bayes and Support Vector Machine (SVM) algorithms. The data were collected through a comment scraping technique on related content. The research stages include text preprocessing, sentiment labeling into positive, negative, and neutral categories, and feature extraction using TF-IDF. The classification process employs Naive Bayes and Support Vector Machine algorithms, which are then evaluated based on accuracy, precision, recall, and F1-score. The results of this study indicate that both methods are capable of classifying sentiment effectively. However, the Support Vector Machine method is superior to the Naive Bayes method with an accuracy rate of 99.57% compared to 94.30%. This study is expected to help the government understand public responses to the planned policy of the used mobile phone name transfer system.

https://doi.org/10.61132/prosemnasproit.v2i2.198

Open Access

Model Machine Learning untuk Klasifikasi Loyalitas Pelanggan Menggunakan Random Forest

Tengku Syahvina Rival Dini; Rani Chantika; Pebi Mina Husania; Puji Sri Alhirani

Prosiding Seminar Nasional Ilmu Teknik • 2026 • Asosiasi Riset Ilmu Teknik Indonesia

This research develops a machine learning model to classify customer loyalty using the Random Forest algorithm. Customer churn is a critical issue that reduces revenue and increases acquisition costs. A dataset of 50,000 customers from global e-commerce and subscription platforms was processed through data cleaning, imputation, outlier handling, and class balancing with SMOTE. The Random Forest model was built as a baseline and optimized with hyperparameter tuning. Evaluation using accuracy, precision, recall, and F1-score shows that the optimized model achieved 90.81% accuracy and 83.87% F1-score, outperforming previous Naïve Bayes approaches. Feature importance analysis highlights customer service interactions, lifetime value, and demographic factors as key predictors of churn. These findings demonstrate Random Forest’s effectiveness in churn prediction and provide practical insights for customer retention strategies.

https://doi.org/10.61132/prosemnasproit.v2i2.202

Open Access

Sentiment Analysis of YouTube Comments on Indonesia’s Performance in 2026 World Cup Qualifiers Using Naive Bayes

Dihin Muriyatmoko; Aziz Musthafa; Yusuf Al Banna

Prosiding Seminar Nasional Ilmu Teknik • 2026 • Asosiasi Riset Ilmu Teknik Indonesia

Sentiment analysis on social media is widely used to represent public perceptions of sports performance, particularly in international competitions. This study aims to analyze the sentiment of YouTube user comments regarding the performance of the Indonesian National Football Team during the FIFA World Cup 2026 Asian Qualifiers. The data were collected from user comments on videos related to the matches and analyzed using a machine learning–based sentiment analysis approach. Sentiment classification was performed using the Naive Bayes algorithm. The results indicate that the proposed approach is able to effectively identify public sentiment toward the national team’s performance during the qualification matches. The findings of this study are expected to provide insights into public perceptions and contribute to sentiment analysis research in the field of sports.

https://doi.org/10.61132/prosemnasproit.v2i2.196

Open Access

Analisis Performansi Pendekatan Machine Learning Pada Deteksi Penyakit Daun Tanaman Kopi

Purnomo, Rosyana Fitria; Yodhi Yuniarthe; Hilda Dwi Yunita; Fatimah Fahurian +1 more

Jurnal Elektronika dan Komputer • 2026 • STEKOM PRESS

Detection and identification of plant diseases is critical to the success and efficiency of agricultural production. Plant disease outbreaks are becoming more frequent throughout the world, and the presence of these diseases in cultivated plants has a significant impact on productivity. Researchers are therefore focusing on developing effective and reliable plant disease detection methods, so that farmers can take advantage of early detection to minimize future losses. This article discusses machine learning approaches, namely decision trees, K-nearest neighbors, naive Bayes, support vector machines (SVM), and random forests, for detecting coffee leaf diseases using leaf images. These classifiers were studied and compared to determine the most suitable plant disease prediction model with the highest accuracy. Compared with the other classification algorithms, the SVM algorithm achieves the highest accuracy of 99.75%. The trained models can be used by farmers to quickly identify and classify new diseases in images as a prevention strategy.

https://doi.org/10.51903/elkom.v18i2.3302

Open Access

Analisis Sentimen X: Kegagalan Timnas ke Piala Dunia 2026 dengan Naive Bayes

Aditya Abdulloh Masykur; Rino Raihan Gumilang; Harun Al Rosyid

Jurnal Elektronika dan Komputer • 2026 • STEKOM PRESS

The performance of the Indonesian National Team (Timnas) in the 2026 World Cup qualifications has triggered massive and diverse responses on social media, particularly on platform X. This study aims to identify and classify public sentiment regarding Timnas Indonesia's performance into positive, negative, and neutral categories using a data mining approach. Text data was processed through pre-processing stages, term weighting using TF-IDF, and the application of the Synthetic Minority Over-sampling Technique (SMOTE) to address significant class distribution imbalance. The classification algorithm employed was Multinomial Naïve Bayes. Model performance evaluation was conducted by comparing two training-testing data split scenarios: 90:10 and 80:20 ratios. The results indicate that public opinion is dominated by negative sentiment at 73.2%, reflecting public disappointment. In terms of model performance, the 90:10 ratio scenario yielded the best accuracy of 80%, outperforming the 80:20 ratio which recorded an accuracy of 75%. These findings demonstrate that combining Multinomial Naïve Bayes with the SMOTE technique is effective in handling imbalanced text data and is capable of accurately mapping public perception.

https://doi.org/10.51903/elkom.v18i2.3371

Open Access
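The core idea behind SMOTE, which the abstract above applies to imbalanced sentiment classes, is to create synthetic minority samples by interpolating between a minority point and one of its nearest minority neighbors. The sketch below shows that idea on made-up two-dimensional points. It is not the study's implementation, which is not described; the function name `smote`, the value of `k`, and the data are assumptions.

```python
import math
import random

def smote(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbors."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbors of the base point (excluding the point itself).
        neighbors = sorted((p for p in minority if p is not base),
                           key=lambda p: math.dist(base, p))[:k]
        neighbor = rng.choice(neighbors)
        gap = rng.random()  # position along the segment from base to neighbor
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic

# Hypothetical minority-class feature vectors (for example, two TF-IDF dimensions).
minority = [[0.0, 1.0], [0.2, 0.9], [0.1, 1.2], [0.3, 1.1]]
new_points = smote(minority, n_new=6)
print(len(new_points))
```

A common precaution is to oversample only the training portion after the split, so that synthetic points derived from test items do not leak into training; the abstract does not state which order was used.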

Penerapan Metode Decision Tree dan Naïve Bayes pada Kasus Kriminalitas di Lampung

Khadafi, Muhammad; Yudhistira, Aditia

Dinamik • 2026 • Universitas Stikubank

Crime, an unlawful act that contradicts ethics and norms, has become a primary concern for the police in Lampung province. This presents a challenge for the police institution in predicting high crime rates. However, many crimes have not yet become the main focus of problem-solving at the Lampung Regional Police. This research aims to identify the types of criminal acts with the highest recorded incidence in a crime dataset by performing classification using the Naïve Bayes algorithm. The data was obtained from investigators at the Directorate of General Criminal Investigation of the Lampung Regional Police, with a total of 12,034 JTP (Total Criminal Acts) and 7,518 PTP (Crime Resolution) data points for each type of crime, distributed across the Regional Police, City Police, and District Police throughout Lampung province. The classification process using the Naïve Bayes algorithm reveals the relationship between the work unit (Satker) and the type of crime handled, thereby identifying crime patterns based on the location where they are handled. The results of the research, which involved converting numerical data into binomial (binary) form using the "Numerical to Binominal" feature in RapidMiner, show that the analysis and modeling process, especially in algorithms like Naïve Bayes or decision trees, is more effective when using data in a binary format. Thus, the initial dataset can be visualized in the form of a word cloud, with the size of the text varying according to the level of each high-incidence crime; the larger the text, the more frequently or significantly the crime occurred or was reported. The application of this method can help in identifying patterns, dominant trends, and areas of focus for more targeted law enforcement efforts or crime prevention policies.

https://doi.org/10.35315/dinamik.v31i1.10411

Open Access

Penerapan Algoritma Naïve Bayes untuk Prediksi Minat Siswa Melanjutkan Studi ke Perguruan Tinggi

Al-Kasidmi, Afif; Megawaty, Dyah Ayu

Dinamik • 2026 • Universitas Stikubank

This study aims to analyze the factors that influence students' interest in continuing their education to college using a machine learning approach. Data was collected through an online questionnaire completed by 727 students between July 27 and August 22, 2025, covering 23 variables consisting of respondent identity (gender, grade level, major) as well as internal and external factors such as parental support, learning motivation, and preferred type of college. The data preparation stage was carried out through column cleaning, deletion of empty data, encoding of categorical variables, and division of the dataset into 80% training data and 20% test data. The Naive Bayes algorithm of the CategoricalNB type was used because it was suitable for the categorical nature of the data. The evaluation results showed that the model was able to predict student interest with 96% accuracy. For the class of students interested in continuing their studies, the precision, recall, and F1-score values were above 0.95, while the performance in the class of students who were not interested was slightly lower due to the smaller amount of data. These findings show that Naive Bayes is proven to be effective and reliable in classifying students' interest in continuing their studies and can be the basis for decision-making in designing more targeted educational strategies.

https://doi.org/10.35315/dinamik.v31i1.10416

Open Access
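The categorical variant of Naive Bayes mentioned in the abstract above estimates, for each class, how often each category of each feature occurs, then multiplies those smoothed frequencies with the class prior. The name CategoricalNB matches a class in scikit-learn, though the abstract does not name the library. The standard-library sketch below illustrates the technique on invented questionnaire rows; the feature values, labels, and function names are assumptions.

```python
import math
from collections import Counter, defaultdict

def fit_categorical_nb(rows, labels, alpha=1.0):
    """Count category frequencies per class and feature (Laplace smoothing applied later)."""
    n_features = len(rows[0])
    categories = [{row[j] for row in rows} for j in range(n_features)]
    class_count = Counter(labels)
    counts = defaultdict(Counter)  # (class, feature index) -> category -> count
    for row, label in zip(rows, labels):
        for j, value in enumerate(row):
            counts[(label, j)][value] += 1
    return categories, class_count, counts, alpha

def predict_categorical_nb(model, row):
    """Return the class with the highest smoothed log-posterior for one row."""
    categories, class_count, counts, alpha = model
    n = sum(class_count.values())
    def log_posterior(label):
        score = math.log(class_count[label] / n)
        for j, value in enumerate(row):
            num = counts[(label, j)][value] + alpha
            den = class_count[label] + alpha * len(categories[j])
            score += math.log(num / den)
        return score
    return max(class_count, key=log_posterior)

# Hypothetical rows: (parental support, learning motivation).
rows = [("high", "high"), ("high", "medium"), ("medium", "high"),
        ("low", "low"), ("low", "medium"), ("high", "high")]
labels = ["interested", "interested", "interested",
          "not_interested", "not_interested", "interested"]

model = fit_categorical_nb(rows, labels)
print(predict_categorical_nb(model, ("high", "high")))
print(predict_categorical_nb(model, ("low", "low")))
```

With few examples in the smaller class, the smoothed estimates for that class rest on very little evidence, which is consistent with the lower performance the abstract reports for students who were not interested.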