SciRepID - Scientific Publication Search

Publication Search

41,520 articles from 397 journals · 1,447 citations tracked

Showing 1-20 of 25

Analytics

Untung Surapati; Dadang Iskandar Mulyana; Dedi Gunawan; Anggit Purnama

International Journal of Applied Mathematics and Computing 2026 Asosiasi Riset Ilmu Matematika dan Sains Indonesia

Early detection of a potential heart attack is a crucial step in preventing sudden death from heart disease. This research aims to develop an Internet of Things (IoT)-based health monitoring system capable of measuring vital body data in real time and predicting the likelihood of a heart attack from CSV data obtained from sensors, integrated through RapidMiner as learning data using a machine learning algorithm, the Support Vector Machine (SVM). The system was built using an ESP32 microcontroller connected to a MAX30102 sensor to measure heart rate and finger oxygen levels (SpO₂), as well as a DHT22 sensor to measure temperature and humidity. The resulting data is sent to the Blynk application to display real-time data according to its parameters. The initial prediction logic was developed using a rule-based method based on medical thresholds for four vital parameters. The data was then used to train an SVM model as a classification system to detect potential heart attacks. Test results showed that the system can identify abnormal conditions with a good level of accuracy and provide early warnings based on changes in vital parameters in real time. This system is expected to be an initial solution for personal health monitoring, especially for individuals at risk of heart disease. It can be further developed with cloud integration and automatic notifications to users' devices.

Veri Arinal; Satria Wira Yudha; Muhammad Joko Umbaran Kharis Bahrudin; Dessyanti Ryantina

International Journal of Information Engineering and Science 2026 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

QRIS (Quick Response Code Indonesian Standard) has become a widely used national digital payment standard. User satisfaction with this service needs to be monitored continuously to ensure its sustainability. This study aims to predict the level of QRIS user satisfaction based on their experiences and perceptions expressed organically on the Twitter social media platform. The method used is sentiment analysis with the Naive Bayes classification algorithm implemented using RapidMiner software. The research data was obtained from Twitter user comments collected through web scraping techniques. The text data then went through a preprocessing stage that included cleansing, stopword filtering, stemming, and tokenizing to be prepared as features ready to be processed by the model. The data was divided into training (80%) and testing (20%) subsets for model training and validation. The results showed that the Naive Bayes model was able to predict user satisfaction sentiment with an accuracy of 80.99%. These findings indicate that the model is highly accurate in identifying satisfied comments and sufficiently sensitive in detecting dissatisfaction. This study concludes that sentiment analysis of Twitter UGC data using Naive Bayes is an effective and efficient approach for predicting QRIS user satisfaction in real time. The practical implication of this study is to provide an automatic feedback system for service providers to monitor public sentiment and take targeted corrective actions.

Elsa Syahriza Putri; Andri Triyono; Kartika Imam Santoso

Router : Jurnal Teknik Informatika dan Terapan 2026 Asosiasi Profesi Telekomunikasi dan Informatika Indonesia

Dengue fever is a disease commonly found in tropical and subtropical regions. This disease can cause severe symptoms, such as very high fever, accompanied by nausea, vomiting, headache, abdominal pain, and leukopenia (decrease in white blood cells). This infectious disease, known as dengue hemorrhagic fever (DHF), is a viral infection transmitted by the Aedes Aegyppti mosquito. This study aims to classify dengue-prone areas using the K-Means Algorithm, and to classify the factors that cause dengue in Purwodadi District, Grobogan Regency. The clustering results using the K-Means algorithm with Rapidminer tool from 266 data produced 3 clusters: cluster 0 (blue) with 138 patients dominated by Kuripan, Purwodadi, Ngambak villages, cluster 1 (green) with 31 patients in Ngraji, Nambuhan, Cingkrong villages, and cluster 2 (orange) with 97 patients in Danyang, Kalongan, Pulorejo villages. This study is expected to provide additional information for stakeholders in controlling dengue cases and increase awareness of the importance of environmental cleanliness as a preventive measure.

Anggi Saputra; Setiawan Assegaff; Benni Purnama

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study analyzes creditworthiness assessment and predicts non-performing loan (NPL) risk using the Naïve Bayes algorithm at BPR Ukabima Lestari, Jambi Branch. A quantitative data mining approach with probabilistic classification is applied. The dataset includes borrower attributes such as age, occupation, income, loan amount, tenor, collateral, and repayment history. Research stages comprise data preprocessing, model development, and performance evaluation using accuracy, precision, recall, and F1-score implemented in RapidMiner. The results indicate that the Naïve Bayes model achieves 99.58% accuracy, demonstrating strong capability to predict potential problem loans accurately and efficiently, supporting data-driven credit decisions and strengthening credit risk management in microbanking institutions.

Melda Septriani; Pareza Alam Jusia; Rudolf Sinaga; Shinta Renova Putri; Firyal Najla 'Afifah

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Diabetes Mellitus is a disease caused by the failure of the pancreas organ in producing the hormone insulin in excess causing increased blood sugar levels and resulting in a lack of insulin. This study discusses the application of the k-means clustering method to determine risk factors for diabetes mellitus. By using the clustering method, data will be grouped into several clusters or groups which in this study compare by applying several data mining tools such as RapidMiner, SPSS, WEKA, and Python. From the results of the comparison carried out resulted in 5 calculations, namely the manual calculation of cluster 1 with a ratio value of 73% being the first priority, calculations using RapidMiner resulting in cluster 3 with a ratio value of 58% being the first priority, calculations using SPSS cluster 2 with a ratio value of 34% being the first priority, and calculations using Python produce cluster 1 with a ratio value of 55% being the first priority.

Dina Amalia Putri; Naza Sefti Prianita; Elkin Rilvani

Jupiter: Publikasi Ilmu Keteknikan Industri, Teknik Elektro dan Informatika 2025 Asosiasi Riset Ilmu Teknik Indonesia

The issue of determining the number of students' graduation times is one of the important indicators in transmitting the quality and effectiveness of the higher education process in universities. The rate of on-time graduation not only impacts accredited institutions, but also becomes a concern for campus management in designing learning strategies and academic guidance. This study aims to apply and compare two classification algorithms in data mining, namely C4.5 and K-Nearest Neighbor KNN, in predicting the accuracy of students' graduation times. Predictions are made based on academic attributes such as Grade Point Average GPA, number of credits that have been achieved, and Semester Grade Point Average IPS as input variables. The method used in this study is Knowledge Discovery in Database KDD which includes data selection, preprocessing, transformation, data mining, and evaluation of results. The study was conducted using the RapidMiner tool, with a dataset of 279 Informatics Study Program students from the 2015 to 2019 intake. The data was classified into two categories: "graduated on time" and "not graduated on time". The test results showed that the KNN algorithm provided better performance compared to C4.5. KNN produced an accuracy of 76.08%, with a precision of 73.11% and a recall of 41.92%. Meanwhile, the C4.5 algorithm produced an accuracy of 73.49%, with a precision of 64.62% and a recall of 41.89%. This difference in accuracy indicates that KNN is more effective in capturing patterns in the data and providing more accurate predictions in this context. Thus, the KNN algorithm can be considered a more optimal method to assist universities in predicting potential student admissions in a timely manner, thus enabling early intervention for students at risk of late graduation. This research also contributes to the development of data mining-based academic decision support systems in higher education.

Muhamad Arief Firdaus; Fadli Rahman Latarissa; Yanuar Dzaky; Hidayanti Murtina; Fadli Rahman Latarissa +2 more

Jurnal Elektronika dan Komputer 2025 STEKOM PRESS

Peningkatan transaksi dalam platform e-commerce seperti Shopee menuntut adanya sistem prediksi status pesanan yang akurat, guna mengoptimalkan pelayanan dan mengurangi pembatalan maupun keterlambatan pengiriman. Penelitian ini bertujuan membangun model klasifikasi status pesanan (selesai atau batal) pada toko Stuftech.Id menggunakan algoritma C4.5. Data yang digunakan merupakan transaksi pesanan mencakup metode pembayaran, kategori wilayah pengiriman, dan ongkos kirim. Proses klasifikasi dilakukan menggunakan RapidMiner dengan tahapan preprocessing, pembangunan decision tree, dan evaluasi model. Hasil analisis menunjukkan bahwa atribut “Kategori Pulau” memiliki nilai gain tertinggi sehingga dipilih sebagai node akar. Model yang dibentuk menghasilkan akurasi sebesar 86%, dengan recall 100% untuk pesanan selesai namun hanya 6,67% untuk pesanan batal. Temuan ini mengindikasikan bahwa algoritma C4.5 efektif dalam memprediksi pesanan yang berhasil, namun perlu peningkatan dalam mendeteksi potensi pembatalan. Implementasi model ini dapat membantu pelaku usaha dalam mengambil keputusan operasional secara proaktif.

Eka Wulansari Fidayanthie; Asep Sayfulloh; Mardiana Rafa Alzena; Nilam Kurnia Sari

Saturnus: Jurnal Teknologi dan Sistem Informasi 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Lungs are vital organs in the human respiratory system, responsible for fulfilling the body's oxygen needs. If the lungs experience health problems, it can have adverse effects on the human respiratory system. Common causes of lung diseases are usually due to inhaling air contaminated by dust, smoke, viruses, and bacteria. This study aims to compare the performance of two classification algorithms, namely Random Forest and Naive Bayes, in predicting lung diseases. The data used was obtained from the Kaggle website and processed using RapidMiner software. The attributes involved include smoking habits, pre-existing conditions, staying up late, exercise activities, age, and outcomes. Based on the test results, the Random Forest algorithm demonstrated the best performance with an accuracy of 93%, while the Naive Bayes algorithm achieved an accuracy of 87%. These findings indicate that the Random Forest algorithm outperforms the Naive Bayes algorithm in terms of lung disease prediction accuracy.

Yayang Tika Robiatush Sholiha; Lubna Asjad Muhda Nabilah; Imron Imron

Saturnus: Jurnal Teknologi dan Sistem Informasi 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

This study aims to evaluate user sentiment toward the Liputan6.com application available on the Google Play Store. In the digital era, user reviews serve as a significant indicator in assessing the quality of an application. However, the inconsistency between rating scores and review content renders manual analysis less objective. To address this issue, a machine learning approach was adopted by comparing two algorithms, namely Support Vector Machine (SVM) and Naïve Bayes (NB). A total of 2,500 reviews were collected through a web scraping process and automatically labeled based on the rating (positive if ≥ 3, negative if < 3). The data preprocessing stages included cleaning, case folding, tokenizing, stopword removal, and token filtering. Subsequently, word weighting was carried out using the TF-IDF method, followed by classification using 10-Fold Cross Validation in RapidMiner. The evaluation results indicate that, in the positive class, NB demonstrated superior precision (89.47%), whereas SVM achieved higher recall (98.94%) and F1-score (90.96%). In the negative class, SVM performed better in terms of precision (66.15%), while NB attained higher recall (65.65%) and F1-score (36.34%). Further evaluation based on AUC and accuracy positioned SVM in the good category (AUC 0.842; accuracy 83.82%), while NB was categorized as fail (AUC 0.505; accuracy 60.87%). Overall, SVM is considered to be more effective than NB.

Yuma Akbar; Kiki Setiawan; Muhammad Joko Umbaran Kharis Bahrudin; Intan Purwasih

International Journal of Electrical Engineering, Mathematics and Computer Science 2024 Asosiasi Riset Teknik Elektro dan Infomatika Indonesia

In today's world of retail and technology, competition is fiercely competitive. With the development of retail businesses increasing in number and mushrooming in a region, consumer needs are increasing, and retail business players are competing to develop their businesses by utilizing existing technology. Daily sales transaction data continues to increase, causing a lot of storage. Toko Ira has more than 228 sales transaction data records from 2023 to 2024 that have not been used. Data requires a lot of storage space. Additionally, the data has not been used in an effective way. Based on this problem, this research aims to use data mining to classify sales transaction data to determine which items are selling best. This research is a case study with a qualitative approach. This research was conducted with the Naive Bayes method and Rapidminer was used. The results of the sales transaction data classification research are the division of products into best-selling and non-selling categories. The results of this research show that the K-Nearest Neighbors (KNN) algorithm with a 50:50 data division is more effective in predicting and classifying sales of best-selling and non-selling products in IRA stores. The results show that the Naive Bayes algorithm has an accuracy of 89.91%, while the K-Nearest Neighbors (KNN) algorithm has an accuracy of 60.09%.

Muksan Junaidi

Jupiter: Publikasi Ilmu Keteknikan Industri, Teknik Elektro dan Informatika 2024 Asosiasi Riset Ilmu Teknik Indonesia

The determination of outstanding students is based on different criteria, depending on the type of achievement that is to be measured. At SMK Migas Cepu, this assessment is typically based on the highest academic score from the class promotion exam. However, this method is considered less accurate and problematic in terms of grouping students. To address this issue, a clustering method using the K-Means algorithm can be applied. The purpose of this research is to build a K-Means model to determine outstanding students. The data used in this study comes from the report card ledger of class XI Machine A and B for the year 2022, which includes 71 students at SMK Migas Cepu. The RapidMiner tool was used to build the K-Means model and cluster the data based on student characteristics. The first test conducted using Excel resulted in two clusters: 35 outstanding students and 36 non-outstanding students. Meanwhile, the second test using the RapidMiner model produced two clusters with a distribution of 26 outstanding students and 45 non-outstanding students.

Theodorus Ikhtiar Hulu

Router : Jurnal Teknik Informatika dan Terapan 2024 Asosiasi Profesi Telekomunikasi dan Informatika Indonesia

Human resources are a company's most dominant asset, because they can play an important role in the company's business development. An outsourcing company is a legal entity and is obliged to comply with business licenses issued by the Central Government. The eligibility process for new employees is supported based on the level of ability and competency determined by the company. The existence of difficulties for companies in determining the eligibility of new employees which makes the reason for the ineffectiveness of the processes carried out by the company at this time, is used as a goal for the authors for the purposes of a study. By using the classification method in a data mining with the C4.5 algorithm (Decision Tree) and a RapidMiner application as a tool in the analysis process carried out to find a factor supporting the process of a new employee eligibility. With the data of 960 applicants used as a sample, this data was taken from 2021-2022. From the data divided into several attributes used, the highest Gain value obtained from these attributes through the results of Test 2 of 0.417152421 which will be used as the root in the process of determining employee eligibility and has the highest accuracy value of 98.44%.

Viktor Loja; Gergorius Kopong Pati; Agustin Purnami Setiawi

Neptunus: Jurnal Ilmu Komputer Dan Teknologi Informasi 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

It has been demonstrated that using computers greatly improves our ability to perform our duties. Information services are vital because, while employee performance may still be predicted manually, the process takes a long time. Data mining technologies, on the other hand, make it easier to anticipate employee success for loyal employees. Employee performance evaluation criteria are necessary in order to increase the accuracy of the assessment results, as Toko Merpati Simpang's employee performance assessments cannot be conducted carelessly. Employee performance has to be analyzed and categorized because up until now, manual employee performance evaluations have only used subjective criteria. The C4.5 Algorithm data mining approach is used in this evaluation of employee performance. The degree of accuracy will be ascertained by comparing these two approaches. Positive and negative emotions are the two categories of sentiment. The aim of this study is to ascertain the degree of accuracy of the comparison between the two tested techniques and to offer information on the quality of one of Toko Merpati Simpang's employee performance assessments using visitor sentiment. The test results will be evaluated using the Rapidminer tool to demonstrate the degree of accuracy for both testing approaches.   Keywords: , 

Farida Hanum; Yani Maulita; I Gusti Prahmana

Bridge : Jurnal Publikasi Sistem Informasi dan Telekomunikasi 2024 Asosiasi Profesi Telekomunikasi Dan Informatika Indonesia

The Merdeka Belajar Kampus Merdeka (MBKM) program provides students the opportunity to study for one semester outside of their major, aiming to develop the soft and hard skills required in the workforce. One key component of this program is internships or practical work, which gives students hands-on experience in the professional world and the chance to build professional networks. This research uses the K-Nearest Neighbor (K-NN) method to predict the impact of MBKM activities on undergraduate students at STMIK Kaputama. Using the RapidMiner application, student data was tested to obtain the accuracy of predicting students' engagement in the MBKM program in the future. The test results show that the K-NN model has an accuracy of 75.34%, indicating that the model is fairly good at predicting the impact of the MBKM program on students.    

Dhea Alfiya Ningsih; Relita Buaton; Anton Sihombing

Saturnus: Jurnal Teknologi dan Sistem Informasi 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Stunting is a growth and development disorder in children caused by chronic malnutrition over a long period of time, especially in the first 1,000 days of life, namely from pregnancy to the first 2 years of life. There are more than 149 million (22%) toddlers worldwide who are stunted, of which 6.3 million are Indonesian toddlers. Based on data from the Ministry of Health, the stunting rate in Indonesia in 2023 was recorded at 21.5 percent, only down 0.1 percent from the previous year which amounted to 21.6 percent. Predicting the number of stunted toddlers is very important and necessary to know the stunting rate in Langkat Regency in 2024, and the prediction results can help health workers in handling and preventing the spread of stunting. The method applied to this prediction system is Multiple Linear Regression where this analysis determines whether each independent variable is positively or negatively related, the direction of the relationship between variables, and estimates the value of the dependent variable will increase or decrease.  The prediction system is carried out using the RapidMiner application because this application is very appropriate to produce information output in the form of prediction results for the coming year. The prediction results obtained are an increase and decrease in 2024 in each sub-district and there are sub-districts that do not experience an increase and decrease. The sub-district with the highest number was Secanggang with approximately 177 people, and the sub-district with the lowest number of stunted children was West Berandan with approximately 55 people. Then Stabat sub-district became the sub-district that experienced the most increase in the number of stunting, which was around 15 people, and the sub-district that experienced the most decrease was Kuala sub-district with a total of approximately 23 people. From the overall results it can be calculated that the number of stunting in all districts in Langkat Regency amounted to approximately 2453 people in 2024. And testing the error rate of prediction results using RMSE in the RapidMiner application of 7.63%, where the level of accuracy in the prediction of child stunting in Langkat Regency is 92.46%.

Dini Anjani; Novriyenni Novriyenni; Zira Fatmaira

Saturnus: Jurnal Teknologi dan Sistem Informasi 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Soft skills are non-technical abilities that make a person able to interact and work effectively with others. This study aims to analyze the relationship between student activities of Internships and Certified Independent Study (MSIB) on improving student soft skills using the Apriori method in data mining analysis. this research uses RapidMiner analysis tools to analyze data collected from a total of 539 student data from all over Indonesia, the best association rule has been formed (best rule) which provides information about improving the soft skills of MSIB students. Tests were conducted by determining the minimum support value of 3% (0.03) and the minimum confidence of 30% (0.3). and resulted in 106 association rules. Based on the results of the analysis, it was found that the best rule of 2 itemsets has a support of 39% and a confidence of 67%, the best rule of 3 itemsets has a support of 13% and a confidence of 81%, the best rule of 4 itemsets has a support of 6% and a confidence of 82%, and the best rule of 5 itemsets has a support of 3% and a confidence of 100%.  After analyzing data using the Apriori method and RapidMiner application on 539 MSIB student soft skills data, it was found that there was a significant relationship between MBKM activities followed by students and the improvement of their soft skills and these findings also show that the less frequent value is set, the more data can be processed, as well as the minimum support value and confidence value, where the smaller the value determined, the more association results will be issued.

Andi Diah Kuswanto; Said Imam Puro; Jodi Hariyan; Ridho Rafliansyah; Muhammad Rival Aziz +1 more

Repeater : Publikasi Teknik Informatika dan Jaringan 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

In the era of rapid digitalization, understanding consumer behavior through data is becoming increasingly important for retail businesses. Shopping trends, such as those contained in this study, provide in-depth insights into various aspects of consumer behavior, from demographics to purchasing preferences and patterns of discount usage. This data is invaluable in formulating effective marketing strategies, improving customer experience, and optimizing business operations. The data used in this study included a variety of relevant variables, such as age, gender, location, product categories purchased, number of purchases, payment methods, and frequency of purchases. This information allows for a comprehensive analysis of how these factors affect consumer spending decisions. For example, analytics can reveal seasonal trends in purchases, product color and size preferences, and the impact of discounts and promo codes on sales volume. In addition, this dataset also reflects the changes in consumer behavior that have occurred over the past few years. Quantitative methodology is a research approach used to collect and analyze numerical data to understand patterns, relationships, and events in a given population. Data is collected from various sources such as online sales transactions, consumer surveys, Naive Bayesian algorithms are applied to the dataset that has been processed. The data was divided into two sets: training (80%) and testing (20%).    

Andi Diah Kuswanto; Azumardi Nabil Fadhila; Paulus Tri Setiawan; Muhammad Kevin Setiawan; Dody Renal Syahputra

Repeater : Publikasi Teknik Informatika dan Jaringan 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

Unemployment is a persistent problem in the labor market, thus hampering economic development and national prosperity. Indonesia, including its capital Jakarta, continues to face significant levels of unemployment compared to neighboring countries. This research focuses on analyzing the structure of unemployment in Jakarta using K-Means Clustering to categorize unemployment data based on age groups (2020-2022) sourced from the Central Statistics Agency. Analysis carried out via RapidMiner revealed three clusters:-Cluster 0: Age 30-60 years and above, Cluster 1: Age 20-24 years, Cluster 2: Age 15-19 and 25-29 years. The findings show that the 20-24 year age group has the highest unemployment rate (399,167 people), while the 30-60 year and above age group shows the lowest unemployment rate (75,560 people). This clustering approach provides insight into the distribution of unemployment by different age demographics in Jakarta, highlighting areas where targeted interventions may be needed to effectively address this socio-economic challenge

Zena Lusi; Ayu Eka Saputri; Tri Basuki Kurniawan

Neptunus: Jurnal Ilmu Komputer Dan Teknologi Informasi 2024 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

The use of social media is already powerful and difficult to avoid. Social media users are not only limited to the general public, but also public figures and even economic actors who use social media as a means of marketing. In every post from the account owner, there will always be followers who can give likes and comments. Unfortunately, not all comments are related to the uploaded post. One of the most annoying comments is spam comments. Spam comments are comments that are not clear and contain about business (promos / selling), links or various other things that are promoting something. Using the Naive Bayes algorithm, this study wants to identify spam comments, especially on Instagram social media. Where the data is retrieved using the tools provided by Google. Which is then processed with the Rapidminer application to get the Naive Bayes calculation results.

Fajar Amalia Putri; Relita Buaton; Selfira Selfira

Switch : Jurnal Sains dan Teknologi Informasi 2024 Asosiasi Profesi Telekomunikasi Dan Informatika Indonesia

A blood donor is someone who wants to donate their own blood to people in need without any element of coercion from anyone. Predicting the number of blood donors is very important and necessary to find out the number of blood donors in Langkat Regency in 2023-2024, and the prediction results can help PMI Langkat Regency in increasing the number of blood donors. The method applied in this prediction system is Linear Regression, where this analysis determines whether or not each variable is in accordance with the prediction results being tested and estimates that the value of the variable will increase or decrease each month. The prediction system is carried out using the RapidMiner application because this application is very appropriate for producing information output in the form of prediction results for the coming year. The prediction results obtained by testing using the Linear Regression method show increases and decreases every month. There are 11 months where there has been an increase and decrease in the predicted results and are in accordance with the data in 2023, then there is 1 month which has decreased in the predicted results and does not match the data in 2023. From the overall data results, it can be calculated the number of blood donors in Langkat Regency in 2023 and every month. Measuring the error level of prediction results using RMSE, the resulting accuracy level was 83.574%.