SciRepID - Scientific Publication Search

Publication Search

41,336 articles from 397 journals · 1,447 citations tracked

Showing 1-20 of 25

Analytics

Rasiban Rasiban; Dadang Iskandar Mulyana; Muhammad Joko Umbaran Kharis Bahrudin; Nicola Marthy

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

The development of social media, especially TWITTER, has become one of the main means for people to express opinions and criticism on various issues, including the performance of law in Indonesia. This study aims to analyze public sentiment towards the performance of law based on TWITTER user comments using the Naïve Bayes algorithm. The research data consists of 1004 comments collected from several videos related to legal topics. The analysis process includes the stages of data crawling, pre- processing (text cleaning, normalization, and tokenization), labeling sentiment into positive, negative, and neutral, and testing the Naïve Bayes model. The results show that the Naïve Bayes algorithm is able to classify sentiment with an accuracy level of 93.73%. The distribution of sentiment from 1004 comments shows that the majority of public opinion is (negative/positive/neutral), which indicates that public perception of the performance of law is still (critical/positive). These findings are expected to be input for related parties to understand public opinion and improve the quality of legal performance in

Mesra Betty Yel; Sopan Adrianto; Rasiban Rasiban; Eva Widiyanti

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

The growth of information technology has driven changes in consumer behavior, one of which is through e-commerce platforms such as Shopee. This phenomenon has generated a large number of customer reviews, including those for local cosmetic products such as Wardah. These reviews serve as an important source of information for understanding customer perceptions and satisfaction levels. However, manual analysis of large and linguistically diverse datasets is inefficient and potentially subjective. This study aims to implement the multi-category Naive Bayes algorithm to classify the sentiment of Wardah product reviews on Shopee into three categories: positive, negative, and neutral. The data were collected using a web scraping technique and processed through a series of preprocessing stages including case folding, tokenization, stopword removal, stemming, and text cleaning. Subsequently, term weighting was performed using the TF-IDF method prior to classification. Model performance was evaluated using a confusion matrix as well as accuracy, precision, and recall metrics. The results indicate that the multi-category Naive Bayes algorithm achieved an accuracy of 86.00%, a precision of 86.63%, and a recall of 98.24%. This approach can assist business practitioners in objectively understanding customer opinions and support decision-making in business strategy and product development.

Untung Surapati; Veri Arinal; Tri Wahyudi; Ahmad Fauzan

International Journal of Applied Mathematics and Computing 2026 Asosiasi Riset Ilmu Matematika dan Sains Indonesia

The rise of social media has created a digital public sphere that enables users to express their opinions on social and political issues openly and in real-time. One of the most discussed topics on social media platform X is the trending hashtag #IndonesiaGelap, which reflects public concern and criticism regarding various governmental and societal conditions. This study aims to conduct sentiment analysis on tweets containing the hashtag to determine the overall sentiment trend among users. The method employed in this research is the Naive Bayes classification algorithm, known for its simplicity and effectiveness in text classification. To enhance the model’s performance, Particle Swarm Optimization (PSO) is applied to optimize feature selection and parameter tuning. The dataset consists of public tweets collected via the Twitter API, followed by preprocessing, feature extraction using TF-IDF, and sentiment classification into three categories: positive, negative, and neutral. The results indicate that the integration of PSO significantly improves the classification accuracy of the Naive Bayes model compared to the baseline. The majority of tweets related to #IndonesiaGelap exhibit a negative sentiment, indicating widespread public dissatisfaction and criticism. This research is expected to contribute to a better understanding of public perception and serve as valuable input for stakeholders in addressing social issues in the digital age.

M Daffa Adrian; Pareza Alam Jusia; Rudolf Sinaga; Azzahra Raihana Adriansyah; Mutammimah Mutammimah

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Diabetes Mellitus is a group of metabolic diseases characterized by hyperglycemia resulting from defects in insulin secretion, insulin action or both. Hyperglycemia is a medical condition in the form of an increase in glucose levels beyond normal limits which is a characteristic of several diseases, especially Diabetes Mellitus, in addition to various other conditions. Diabetes Mellitus is currently a global health threat. Classification is one of the techniques of data mining that can be used to help predict the results of the classification of types of diabetes using the naïve Bayes algorithm. Testing was carried out using 5 evaluation models including rapid miner with 3 options, namely use training set, 5 Fold Cross-Validation, 10 Fold Cross-Validation, and 2 other evaluation models, namely Microsoft Excel and Python. Testing data regarding Diabetes Mellitus has high accuracy in the excel evaluation model, which is 89.00% compared to other evaluation models. Meanwhile, the lowest accuracy is the Python evaluation model which obtains an accuracy of 86.36%. The Naïve Bayes algorithm can be said to be one of the most effective algorithms, both in terms of calculations and the final results, where the test can be used as a basis for diabetes mellitus considering the accuracy results are above 85%.

Anggi Saputra; Setiawan Assegaff; Benni Purnama

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study analyzes creditworthiness assessment and predicts non-performing loan (NPL) risk using the Naïve Bayes algorithm at BPR Ukabima Lestari, Jambi Branch. A quantitative data mining approach with probabilistic classification is applied. The dataset includes borrower attributes such as age, occupation, income, loan amount, tenor, collateral, and repayment history. Research stages comprise data preprocessing, model development, and performance evaluation using accuracy, precision, recall, and F1-score implemented in RapidMiner. The results indicate that the Naïve Bayes model achieves 99.58% accuracy, demonstrating strong capability to predict potential problem loans accurately and efficiently, supporting data-driven credit decisions and strengthening credit risk management in microbanking institutions.

Nur Aufa, Lia; Nurhadi Nurhadi; Yulia Arvita

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study aims to classify customer payment methods at 17 Coffee & Eatery using machine learning algorithms, namely Naïve Bayes and Support Vector Machine (SVM). The increasing use of digital and non-cash payments has generated large volumes of transaction data that are rarely analyzed optimally, even though such data contain valuable information for business decision making. This research used secondary transaction data collected from January to March 2025, consisting of 10,147 transaction records. The dataset included several attributes such as order time, payment time, transaction type, total sales, number of items, and payment method. Data preprocessing was performed through data cleaning, feature engineering, normalization, and label encoding before being divided into training and testing sets with an 80:20 ratio. The Naïve Bayes and SVM models were then trained and evaluated using accuracy, precision, recall, F1-score, and ROC–AUC metrics. The results show that both algorithms were able to classify payment methods effectively, but SVM achieved higher accuracy and more stable performance than Naïve Bayes. These findings indicate that SVM is more suitable for handling complex and heterogeneous transaction patterns. The implementation of machine learning for transaction classification can support more efficient financial management and data-driven decision making for small and medium enterprises in the culinary sector.

Caterina Paras Dewi; Jasmir Jasmir; Willy Riyadi; Alya Rafina

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Chronic Kidney Disease (CKD) is a heterogeneous disorder that gradually affects the structure and function of the kidneys, is difficult to recover, and causes the body to be unable to maintain metabolism and fail to maintain fluid and electrolyte balance, leading to increased urea levels. Chronic kidney disease data was obtained from Kaggle, in this study a comparison was made between two classification algorithms, namely Naïve Bayes Classifier (NBC) and Random Forest because it is not yet known what algorithm is best in classifying chronic kidney disease (CKD). Both algorithms are evaluated based on performance metrics such as accuracy, precision, recall, and confusion matrix. The results of the evaluation showed that in a dataset of 400 samples, the performance  of the Naïve Bayes Classifier (NBC) algorithm obtained an accuracy of 94%, while Random Forest had an accuracy of 93%. Then in the small dataset (158 data), Random Forest got a better accuracy score with 87% compared to the Naïve Bayes Classifier (NBC) of 78%. Based on the results of the evaluation, Random Forest has a more stable performance on small datasets, while Naïve Bayes Classifier (NBC) provides higher performance on larger datasets in the context of chronic kidney disease classification.

Srikandi Alifya; Jasmir Jasmir; Elvi yanti

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The growth of e-commerce in Indonesia has led to an increase in product reviews, including for beauty products on Tokopedia and Shopee. These reviews serve as important sources of information to assess consumer satisfaction; however, manually analyzing thousands of reviews daily is impractical. This study applies Natural Language Processing (NLP) with Naive Bayes, C4.5, XGBoost algorithms to classify sentiment in Indonesian-language reviews. The dataset used consists of 76,256 reviews labeled as positive, negative, and neutral. The research stages include text preprocessing, feature representation using BoW and TF-IDF, data balancing through SMOTE, and model performance evaluation based on accuracy, precision, and recall. Differences in results among the algorithms were analyzed using ANOVA. The results show that Naive Bayes achieved the highest accuracy at 67.71%, followed by XGBoost at 65.91%, and C4.5 at 58.39%, with Naive Bayes performing best in identifying positive and negative sentiments, while XGBoost and C4.5 handled more complex data patterns effectively. These findings provide guidance for sentiment analysis in Indonesian and support businesses in obtaining automated insights from customer reviews to improve product quality and services.

Nanda Mediya Sari; Jasmir Jasmir; Elvi Yanti

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Sentiment analysis is a technique in Natural Language Processing (NLP) used to identify user opinion tendencies based on textual reviews. This study analyzer user reviews of the Maxim application on the Google Play Store and compares three Machine Learning algoritmhs-Naïve Bayes, Support Vector Machine (SVM), and CatBoost-in classifying sentiment. The research stages include data collection, text preprocessing, feature extraction using TF-IDF and Chi-Square, class balancing using SMOTE, and performance evaluation through Accuracy, Precision, Recall, and F1-Score. ANOVA is used to examine the influence of feature selection on model performance. The results show that each model exhibits different performance level across the tested feature combinations. The CatBoost achieved the highest accuracy of 99,26% and demonstrating the most stable performance. Meanwhile, the Naïve Bayes and SVM models experienced performance decreases experiments, especially after applying SMOTE. These findings indicate that the choise of algorithm, feature extraction method, and class balancing technique significantly affects classification outcomes. Overall, CatBoost is identified as the best-performing model, providing more consistenst classification result in accordance with the characteristics of the user reviews.

Bintang Dwi Atmaja; Yani Maulita; Novriyenni Novriyenni

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Traffic violations are one of the serious problems frequently occurring in various regions, including Binjai City. Various types of violations, such as disobeying road signs and markings, incomplete vehicle documents, and violations that threaten the safety of drivers and other road users, continue to increase despite preventive and repressive efforts carried out by the authorities. This condition indicates that handling traffic violations cannot rely solely on field enforcement but also requires the support of technology capable of analyzing data more comprehensively. This study aims to predict the level of traffic violations by applying the Naïve Bayes method through data mining techniques. The dataset used consists of traffic violation records in 2023 from the Binjai City Police Department, with the main variables including violations of traffic signs and markings, document completeness, and safety-related violations. The Naïve Bayes method was selected because of its ability to perform classification with good accuracy, simplicity, and efficiency in processing large amounts of data. The implementation of this research is realized by developing a web-based application using Visual Studio Code as the development environment and MySQL as the database system. The results of this study are expected to provide structured information regarding traffic violation patterns, support authorities in making more effective decisions, and serve as an alternative solution in the prevention and handling of traffic violations in Binjai City.

Muhammad Azlan; Elvi Rahmi

Neptunus: Jurnal Ilmu Komputer Dan Teknologi Informasi 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This study aims to analyze the sentiment of customer reviews of the Grand Jatra Hotel Pekanbaru on the Google Review platform using the Naïve Bayes algorithm. Social media and online review platforms are increasingly becoming the primary source of information for potential customers in making purchasing decisions, particularly in the hospitality sector. Therefore, sentiment analysis of customer reviews is crucial for understanding consumer perceptions and providing strategic input for hotels in improving service quality. The research data was collected using web scraping techniques to obtain publicly available customer reviews. The obtained data was then processed through text preprocessing stages including case folding, tokenizing, normalization, stopword removal, and stemming. The Term Frequency-Inverse Document Frequency (TF-IDF) method was then used to weight each word, so that more relevant words have a greater influence in the classification process. The sentiment classification process was carried out into two main categories, namely positive and negative. The Naïve Bayes model was trained using training data and then tested with test data to measure the algorithm's performance in classifying sentiment. The evaluation results show that the model built is able to achieve an accuracy level of 98%, with a precision value of 97% and a recall of 100% in the positive class, and 92% in the negative class. These findings confirm that the Naïve Bayes algorithm can be effectively used in analyzing customer sentiment towards hotel services and facilities. Practically, the results of this study are expected to provide insight for the management of Grand Jatra Hotel Pekanbaru in understanding customer perceptions, identifying service strengths and weaknesses, and formulating more targeted marketing strategies. In addition, this study can also be a reference for the development of similar studies in the hotel industry and other service sectors.

Eka Wulansari Fidayanthie; Asep Sayfulloh; Mardiana Rafa Alzena; Nilam Kurnia Sari

Saturnus: Jurnal Teknologi dan Sistem Informasi 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Lungs are vital organs in the human respiratory system, responsible for fulfilling the body's oxygen needs. If the lungs experience health problems, it can have adverse effects on the human respiratory system. Common causes of lung diseases are usually due to inhaling air contaminated by dust, smoke, viruses, and bacteria. This study aims to compare the performance of two classification algorithms, namely Random Forest and Naive Bayes, in predicting lung diseases. The data used was obtained from the Kaggle website and processed using RapidMiner software. The attributes involved include smoking habits, pre-existing conditions, staying up late, exercise activities, age, and outcomes. Based on the test results, the Random Forest algorithm demonstrated the best performance with an accuracy of 93%, while the Naive Bayes algorithm achieved an accuracy of 87%. These findings indicate that the Random Forest algorithm outperforms the Naive Bayes algorithm in terms of lung disease prediction accuracy.

Arya Erlangga; Yani Parti Astuti; Etika Kartikadarma; Sindhu Rakasiwi; Egia Rosi Subhiyakto

Switch : Jurnal Sains dan Teknologi Informasi 2025 Asosiasi Profesi Telekomunikasi Dan Informatika Indonesia

Football is a popular sport in the world and is enjoyed by people of all ages. The Indonesia U-16 national team played in the ASEAN CUP 2024 event in this field. Twitter users gave their support through #timnasday during the event. This provided many forms of support for the Indonesian national team which made it difficult to identify positive, neutral, and negative sentiments. This requires the use of lexicon-based textblob to perform automatic labeling. In the labeling results using textblob from a total of 1138 user tweet data resulted in positive sentiment values of 50.9% or 579 positive data, neutral 33.7% or 384 neutral data, and negative 15.4% or 175 negative data. In the test results using one of the machine learning from the naïve bayes classifier, namely gaussian naïve bayes with the division of test data and training data of 0.3 and 0.7, the accuracy value is 98.53%

Elsa Damayanti; Barry Ceasar Octariadi; Rachmat Wahid Saleh Insani

Jurnal Riset Rumpun Ilmu Teknik 2025 Pusat riset dan Inovasi Nasional

Oil palm is a key commodity supporting Indonesia’s economy through exports and employment. The industry’s success depends heavily on the selection of superior seedlings, which determine productivity, crop quality, and resistance to pests and diseases. Manual selection, however, often leads to subjectivity and inconsistency due to limited human resources and genetic variation. To address this, the study applies the Naïve Bayes algorithm for classifying oil palm seedlings based on seven variables: height, stem diameter, number of leaves, leaf color, disease resistance, root growth, and fruit yield. Using an explanatory quantitative method, the study follows seven stages: identifying problems, literature review, collecting 1,000 data entries from PT Intitama Berlian Perkebunan, data pre-processing, system modeling (UML), algorithm implementation, and evaluation using a confusion matrix and black box testing. Data was split into 80% training and 20% testing. The Naïve Bayes-based classification achieved 95% accuracy and perfect recall (1.00) for the superior seedling class. However, its performance on the minority class (non-superior seedlings) was weaker due to dataset imbalance. Black box testing verified all system functions worked correctly, enabling effective and efficient use by administrators. The study concludes that Naïve Bayes improves objectivity, efficiency, and accuracy in seedling selection. Nonetheless, attention is needed on data balancing and optimization to maintain consistent performance across classes. This system shows strong potential as a decision-support tool in plantations and promotes digital transformation in agricultural processes.

Dini Oktaviani; Syarifah Putri Agustini Alkadri; Sucipto Sucipto

Jurnal Riset Rumpun Ilmu Teknik 2025 Pusat riset dan Inovasi Nasional

This research is motivated by the importance of improving the quality of passport making services at the Pontianak City Immigration Office which still faces obstacles such as complicated procedures, limited quotas, lack of officer direction, and mismatches in passport collection schedules that cause public dissatisfaction. This research aims to classify the level of satisfaction of passport making services using the Naïve Bayes algorithm, measure classification accuracy, and develop a website-based system that helps evaluate and improve service quality effectively and efficiently. The method used is a quantitative approach with data collection through questionnaires, interviews, and direct observation of 205 respondents, then the data is processed using the Naïve Bayes algorithm which assumes independence between variables to classify satisfaction levels based on variables such as officer friendliness, officer ability, ease of procedure, and timeliness of service. The main findings show that the Naïve Bayes algorithm is able to classify satisfaction levels with 73% accuracy, 76% precision, 70% recall, and 73% F1-score, signaling the effectiveness of this method in identifying community satisfaction patterns. However, the results also indicate the need for improvement in user interface aspects and system responsiveness so that the system can be widely accepted and provide optimal benefits. The implication of this research is that the application of Naïve Bayes-based data mining methods can be an effective tool in evaluation and decision-making to improve the quality of public services, especially in the field of passport making, and encourage the development of interactive and empirical data-based public service information systems.

Theresia Clarita Neba; Anastasia Mude; Krisantus Thomas Rada

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This research aims to address the challenges in sales data management and limited market reach faced by the Inegena Village-Owned Enterprise (BUMDes) in North Bajawa District, Ngada Regency, East Nusa Tenggara. The BUMDes produces and sells candlenut oil, a superior local product, but currently uses a manual sales and recording system (B2B and B2C), which leads to fluctuating demand, difficulties in sales data analysis, and decision-making that lacks valid data. To address these issues, a web-based e-commerce system was implemented. This system was designed using Agile methods, involving planning, implementation, software testing (black box testing), documentation, deployment, and maintenance. Furthermore, the Naïve Bayes algorithm was applied to visualize sales data and support better decision-making by classifying best-selling products, popular payment methods, and sales levels. The results of this research are expected to assist Inegena BUMDes in improving sales efficiency, expanding the market reach of candlenut oil products nationally. This system uses supporting software such as Xampp, PHP, and MySQL.

Lifa Sholiah; Ito Setiawan; Abdillah Teguh Permana; Iqbal Yusuf Azhari; Wakhid Sayudha Rendra Graha Alrashid

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

KPRI KOKARNABA Baturraden faces challenges in managing increasingly complex sales data, particularly in identifying the most in-demand products to maximize profit. This study aims to analyze sales patterns using the Naïve Bayes algorithm as a probability-based classification method. The collected sales data were analyzed to identify categories of best-selling and less popular products within the cooperative. The results indicate that the Naïve Bayes algorithm has an accuracy rate of 77.56% in predicting product categories. This research is expected to assist the cooperative in optimizing stock management and improving member satisfaction.

Nurfalah Nurfalah; Rouli Doharma Ms

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Social assistance is an important aspect of government and non-government programs that can help on a large scale for the community so that the impact is to lighten life in the short term, but social assistance has several criteria such as income, social conditions, family status and the impact of the economic situation. . Knowing the criteria for social assistance is done by applying data mining to social assistance using the Naive Bayes algorithm procedure which produces accuracy calculations from 100 testing data, obtained good values, namely accuracy of 95.00%, precision of 92.31%, and recall of 97.95%.

Hafidz Syauqie; Augie Sugiarto Nunka; Mu. Aldi Rahmad Fahrozi

Merkurius : Jurnal Riset Sistem Informasi dan Teknik Informatika 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This research use the Naive Bayes algorithm to classification of user reviews of the Sky Childern Of The Light application from the Google Play Store. The Sky Childern Of The Light application is a popular online game, because it offers a unique and immersive playing experience. This method was chosen because of its simplicity, speed, ease of interpretation, and suitability for high-dimensional data. The advantages of Naive Bayes are the accuracy and efficiency of calculations, fast results and presentation. The data collected was 1500 data with a classification ratio of 8:2 with an accuracy value of 87% using the Naïve Bayes algorithm. This method is very good at analyzing the sentiment of the Sky Children Of The Light application.