SciRepID - Scientific Publication Search

Publication Search

41,520 articles from 397 journals · 1,447 citations tracked

Kaslin Yulianty; Abidin, Dodo Zaenal; Devitra, Joni

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Private vehicles are a frequently used mode of transportation because they are considered more practical. However, using private vehicles carries several risks, including traffic accidents caused by drivers losing focus on the road because of other activities, such as making calls on smartphones, drinking, or operating the radio. Approximately 90% of accidents are caused by human error. Convolutional Neural Network (CNN) is a type of neural network commonly used on image data. CNN is often used for image classification due to its high performance and accuracy. Therefore, this study aims to analyze the performance of CNN for the classification of distracted driving activities. The results show that the CNN model is able to effectively classify images of distracted driving activities, with an accuracy of approximately 99% across all datasets and across all input image size variations. Furthermore, the results of this study also show that differences in right-hand and left-hand drive datasets do not significantly affect model accuracy. Variations in input image size also do not significantly affect model accuracy, but do affect the training duration.

Simon Simarmata; Panser Karo-Karo; Budi Artono; Muhammad Akbar Hariyono; Ardy Wicaksono +1 more

Background: The increasing complexity of industrial production systems requires machine condition monitoring solutions that are capable of operating in real time with high accuracy and responsiveness to support predictive maintenance strategies. Conventional cloud based monitoring systems often experience limitations such as high latency and dependence on stable network connectivity, which can delay decision making processes in critical industrial operations. Objective: This study aims to design and evaluate an Industrial Internet of Things (IIoT) architecture based on edge computing to improve the efficiency of industrial sensor data processing and accelerate anomaly detection in industrial machines. Method: The research adopts an experimental approach by designing a system architecture consisting of a sensor layer, edge computing layer, and cloud layer. Industrial sensors, including vibration, temperature, and current sensors, continuously collect machine operational data, which are then processed locally at the edge node using a machine learning based anomaly detection algorithm. System testing is conducted in a simulated manufacturing environment to evaluate performance based on latency, reliability, and detection accuracy. Results: The results indicate that edge based data processing significantly reduces latency compared with cloud-based processing and enables faster responses to machine condition changes. Additionally, the implemented anomaly detection algorithm achieves high accuracy in identifying abnormal sensor data patterns.

Siska Nar; Ahmad Nugroho; Ahmad Subhan Yazid; Helmi Wibowo; Alyauma Hajjah

Background: The development of industrial technology in the Industry 4.0 era has encouraged the implementation of intelligent monitoring systems to improve machine reliability and operational efficiency. However, machine fault diagnosis systems based on artificial intelligence often face limitations in terms of interpretability because the models used are complex and difficult to explain. Objective: This study aims to develop a deep learning-based industrial machine fault diagnosis system integrated with an Explainable Artificial Intelligence (XAI) approach to improve diagnostic accuracy while providing interpretable insights for users. Method: The research method involves collecting data from industrial machine sensors consisting of vibration signals, temperature measurements, and acoustic signals, followed by data preprocessing and feature extraction processes. The processed data are then used to train a deep learning-based diagnostic model, after which explainability methods such as SHAP or LIME are applied to analyze the contribution of each feature to the model’s prediction results. Model performance is evaluated using accuracy, precision, recall, and F1-score metrics. Results: The results indicate that the proposed deep learning model achieves better performance compared to conventional machine learning methods such as Support Vector Machine and Random Forest. Furthermore, the explainability analysis reveals that vibration amplitude, increases in machine component temperature, and anomalies in acoustic signals are the main factors influencing machine fault detection. Therefore, the proposed system not only improves the accuracy of machine fault diagnosis but also provides transparency in the decision-making process, thereby supporting the implementation of predictive maintenance in smart manufacturing environments.

Muh Fadli Faisal Rasyid

Proceeding of the International Conference on Law and Human Rights 2025 Asosiasi Peneliti dan Pengajar Ilmu Hukum Indonesia

The integration of artificial intelligence (AI) in forensic investigation has significantly transformed the analysis and authentication of digital evidence. This paper explores the role of AI technologies, specifically machine learning and deep learning algorithms, in examining digital evidence from various sources, including computers, mobile devices, and network systems. We provide an in-depth analysis of current AI-based forensic tools, their efficiency in evidence authentication, and the challenges they face regarding legal admissibility. Our findings indicate that AI-powered forensic systems can detect digital evidence tampering with 94.7% accuracy, drastically reducing analysis time from weeks to hours. However, challenges remain, particularly in areas such as algorithmic transparency, bias prevention, and ensuring the integrity of the chain of custody. This research offers a framework for incorporating AI in forensic laboratories, while also addressing crucial legal and ethical concerns to ensure the admissibility of AI-analyzed evidence in court. These considerations are essential for the widespread acceptance and use of AI in forensic investigations.

Yan Apriadi; Dodo Zaenal Abidin; Jasmir Jasmir

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study develops an interpretable machine learning model to predict the settlement status of Hajj fees in Jambi Province, Indonesia. Utilizing the XGBoost algorithm on a dataset of 4,332 prospective pilgrims from 2025, the research addresses the critical challenge of class imbalance, where only 28.5% of samples are labeled "Unsettled". The baseline XGBoost model achieved a ROC-AUC of 0.7778, with a recall of 0.3482 for the minority class. SHAP (SHapley Additive exPlanations) analysis was employed to interpret model predictions, revealing that financial features, specifically NILAI_VA (Virtual Account Value), JML_SETORAN (Deposit Amount), and JML_PELUNASAN (Settlement Amount), are the most significant factors influencing repayment risk, with negative SHAP values indicating increased default probability. The findings demonstrate that an interpretable XGBoost framework can provide both predictive accuracy and actionable insights for policymakers, enabling targeted interventions such as flexible payment schemes and enhanced financial monitoring for high-risk pilgrims.

Denia Igesti Nur Mellyati; Kurniabudi Kurniabudi; Jasmir Jasmir

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Student dropout remains a significant challenge for higher education institutions as it impacts academic quality, educational management efficiency, and students' success in completing their studies. Therefore, an approach that can identify students at risk of dropping out is necessary so that timely academic interventions can be made. This study aims to develop a dropout detection model using an Artificial Neural Network (ANN). The data used come from a publicly available higher education dataset, ensuring research reproducibility. Data preprocessing steps were carried out to improve data quality before modeling, and the Synthetic Minority Over-Sampling Technique combined with Edited Nearest Neighbors (SMOTE-ENN) was applied to address class imbalance issues. The ANN model's performance was evaluated using accuracy, precision, recall, F1-score, and area under the ROC curve (ROC-AUC). The test results show that the ANN model can provide excellent predictive performance in detecting at-risk students. The application of SMOTE-ENN also proved to enhance the model’s sensitivity toward the minority class, as indicated by improvements in recall and F1-score. These findings indicate that the developed ANN model has the potential to be used as a student dropout detection system to support data-driven decision-making and strategy development within higher education institutions.

Fransiskus Dapot Sihaloho; Jasmir Jasmir; Gunardi Gunardi

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The rapid growth of e-commerce platforms in Indonesia, particularly Tokopedia, has resulted in a large volume of consumer reviews containing valuable information regarding customer perceptions and satisfaction. However, manual analysis of such reviews is inefficient and prone to subjectivity, necessitating an automated approach based on machine learning. This study aims to classify the sentiment of sports product reviews on Tokopedia into positive, negative, and neutral categories by applying Logistic Regression, Support Vector Machine (SVM), and Random Forest using the Term Frequency–Inverse Document Frequency (TF-IDF) approach. The data were collected through web scraping of Indonesian-language sports product reviews and processed through several preprocessing stages, including data cleaning, case folding, tokenization, stopword removal, and stemming. Feature representation was performed using TF-IDF to transform textual data into numerical vectors, after which the dataset was divided into training and testing sets with an 80:20 ratio. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. The results indicate that the application of TF-IDF significantly improves the performance of all models, with SVM consistently achieving the most optimal performance compared to Logistic Regression and Random Forest. These findings demonstrate that classical machine learning algorithms combined with TF-IDF remain highly effective for sentiment analysis of Indonesian-language text. The implications of this study are expected to assist sellers in understanding customer opinions, support consumers in making informed purchasing decisions, and serve as a foundation for the development of sentiment analysis and recommendation systems on e-commerce platforms.
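The TF-IDF step described in this abstract can be illustrated with a minimal, standard-library sketch. It is not the authors' implementation: the function name `tfidf`, the toy reviews, and the unsmoothed weighting (tf = count / document length, idf = ln(N / df)) are assumptions chosen for clarity, and libraries such as scikit-learn apply smoothed variants.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors

# Hypothetical reviews, already case-folded, tokenized, and stemmed.
reviews = [
    ["sepatu", "bagus", "nyaman"],
    ["sepatu", "rusak"],
    ["pengiriman", "cepat", "bagus"],
]
vectors = tfidf(reviews)
print(vectors[1])
```

Terms that appear in fewer documents (here "rusak") receive a higher weight than terms shared across reviews (here "sepatu"), which is the property that makes the resulting vectors useful as classifier input.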

Riza Pahlevi; Wilujeng Niar Raharjanto; Lies Aryani; Roby Setiawan

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Jambi Province is one of the largest natural rubber producing regions in Indonesia; however, rubber factories under GAPKINDO Jambi still face productivity issues, particularly the gap between production capacity and actual output, and productivity assessment that is still conducted manually by GAPKINDO Jambi. This study employs Decision Tree, Random Forest, KNN, and SVM algorithms within a structured pipeline involving preprocessing, feature selection, standardization, data balancing using SMOTE, and hyperparameter tuning. The proposed solution applies productivity level classification both individually and through paired combinations (ensemble voting). The results show that the Decision Tree + Random Forest model achieves the best performance with an accuracy of 0.84 and an F1-score of 0.83, confirming the effectiveness of ensemble methods in supporting productivity improvement decisions.
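The paired ensemble voting mentioned in this abstract can be sketched as follows. The abstract does not say whether hard or soft voting was used; this sketch assumes soft voting (averaging class probabilities), and the function name `soft_vote`, the class order, and the probability values are all hypothetical.

```python
def soft_vote(probas_a, probas_b):
    """Average two models' class probabilities and pick the top class per sample."""
    labels = []
    for pa, pb in zip(probas_a, probas_b):
        avg = [(a + b) / 2 for a, b in zip(pa, pb)]
        # Index of the class with the highest averaged probability.
        labels.append(max(range(len(avg)), key=avg.__getitem__))
    return labels

# Hypothetical probabilities for three samples over classes [low, medium, high].
tree_probas = [[0.7, 0.2, 0.1], [0.1, 0.5, 0.4], [0.2, 0.3, 0.5]]
forest_probas = [[0.6, 0.3, 0.1], [0.2, 0.3, 0.5], [0.1, 0.2, 0.7]]
voted = soft_vote(tree_probas, forest_probas)
print(voted)  # [0, 2, 2]
```

On the second sample the two models disagree (medium versus high), and the averaged probabilities settle the tie in favor of the more confident model.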

Ary Ardiansyah; Pareza Alam Jusia; Rudolf Sinaga; Clarisa Putri Valentina; Pardede, Nadia

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

The Ministry of Social Affairs has introduced an application that lets the public check social assistance recipients: the social assistance check application. User reviews can indicate whether the application benefits the community, but they must first be processed with sentiment analysis, which in turn relies on machine learning. Naïve Bayes is one such machine learning method. The purpose of this research is to implement the Naïve Bayes method for sentiment analysis and to determine, from the results, whether the social assistance check application is beneficial to society. Two sentiment categories are used: positive and negative. The data were collected by crawling with the Google Play Scraper library, which yielded 4,000 reviews. Labeling the data with Textblob produced 987 negative and 628 positive reviews. The Naïve Bayes method, in turn, classified 1,181 reviews as negative and 434 as positive. The Naïve Bayes model achieved an accuracy of 0.77 (77%) in analyzing sentiment for social assistance check application reviews.

Nur Aufa, Lia; Nurhadi Nurhadi; Yulia Arvita

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

This study aims to classify customer payment methods at 17 Coffee & Eatery using machine learning algorithms, namely Naïve Bayes and Support Vector Machine (SVM). The increasing use of digital and non-cash payments has generated large volumes of transaction data that are rarely analyzed optimally, even though such data contain valuable information for business decision making. This research used secondary transaction data collected from January to March 2025, consisting of 10,147 transaction records. The dataset included several attributes such as order time, payment time, transaction type, total sales, number of items, and payment method. Data preprocessing was performed through data cleaning, feature engineering, normalization, and label encoding before being divided into training and testing sets with an 80:20 ratio. The Naïve Bayes and SVM models were then trained and evaluated using accuracy, precision, recall, F1-score, and ROC–AUC metrics. The results show that both algorithms were able to classify payment methods effectively, but SVM achieved higher accuracy and more stable performance than Naïve Bayes. These findings indicate that SVM is more suitable for handling complex and heterogeneous transaction patterns. The implementation of machine learning for transaction classification can support more efficient financial management and data-driven decision making for small and medium enterprises in the culinary sector.

Caterina Paras Dewi; Jasmir Jasmir; Willy Riyadi; Alya Rafina

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Chronic Kidney Disease (CKD) is a heterogeneous disorder that gradually affects the structure and function of the kidneys and is difficult to reverse. It leaves the body unable to maintain metabolism or fluid and electrolyte balance, leading to increased urea levels. The chronic kidney disease data were obtained from Kaggle. Because it is not yet known which algorithm classifies CKD best, this study compares two classification algorithms, the Naïve Bayes Classifier (NBC) and Random Forest. Both algorithms are evaluated on performance metrics such as accuracy, precision, recall, and the confusion matrix. On a dataset of 400 samples, NBC obtained an accuracy of 94%, while Random Forest obtained 93%. On the small dataset (158 samples), Random Forest scored better, with 87% accuracy compared with 78% for NBC. Based on these results, Random Forest performs more stably on small datasets, while NBC provides higher performance on larger datasets in the context of chronic kidney disease classification.
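The evaluation metrics named in this abstract all derive from the confusion matrix. The sketch below shows that derivation for a binary task; the function name `binary_metrics` and the label lists are hypothetical and are not taken from the study's data.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Compute confusion-matrix counts and the metrics derived from them."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return {
        "confusion": {"tp": tp, "fp": fp, "fn": fn, "tn": tn},
        "accuracy": (tp + tn) / len(pairs),
        # Guard against division by zero when a class is never predicted or present.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }

# Hypothetical labels: 1 = CKD, 0 = not CKD.
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 0, 1, 0, 1, 1, 0, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

For this toy data the counts are 4 true positives, 1 false positive, 1 false negative, and 2 true negatives, giving an accuracy of 0.75 and precision and recall of 0.8 each.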

Nanda Mediya Sari; Jasmir Jasmir; Elvi Yanti

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Sentiment analysis is a technique in Natural Language Processing (NLP) used to identify user opinion tendencies based on textual reviews. This study analyzes user reviews of the Maxim application on the Google Play Store and compares three machine learning algorithms, namely Naïve Bayes, Support Vector Machine (SVM), and CatBoost, in classifying sentiment. The research stages include data collection, text preprocessing, feature extraction using TF-IDF and Chi-Square, class balancing using SMOTE, and performance evaluation through accuracy, precision, recall, and F1-score. ANOVA is used to examine the influence of feature selection on model performance. The results show that each model exhibits different performance levels across the tested feature combinations. CatBoost achieved the highest accuracy, 99.26%, and demonstrated the most stable performance. Meanwhile, the Naïve Bayes and SVM models experienced performance decreases across experiments, especially after applying SMOTE. These findings indicate that the choice of algorithm, feature extraction method, and class balancing technique significantly affects classification outcomes. Overall, CatBoost is identified as the best-performing model, providing more consistent classification results in accordance with the characteristics of the user reviews.

Eni Rohaini; Gunardi, Gunardi; Nurhayati Nurhayati; Jasmir Jasmir; Zahra Prisdian Tiararosa

Prosiding Seminar Nasional Ilmu Teknik 2025 Asosiasi Riset Ilmu Teknik Indonesia

Imbalanced data remains a significant issue in heart disease classification using machine learning, as it tends to cause models to overestimate the majority class while ignoring minority classes with high clinical value. This can lead to a decrease in accuracy and in the model's ability to detect disease cases. Therefore, this study aims to assess the effectiveness of oversampling techniques, namely Random Oversampling and Synthetic Minority Oversampling Technique (SMOTE), in improving the performance of the K-Nearest Neighbors (KNN), Naive Bayes (NB), and Random Forest (RF) algorithms. The dataset used comes from Kaggle and consists of 918 records with 12 attributes representing patient information related to heart disease prediction. The research stages include data preprocessing, baseline model testing, and re-evaluation using the two oversampling methods. Experimental results show that oversampling can improve the performance of all algorithms. KNN achieved the best results with SMOTE, with an accuracy of 72.98% and an F1-score of 75.39%. In the Naive Bayes algorithm, both oversampling techniques produced relatively stable performance, with the highest F1-score of 73.56% using SMOTE. Meanwhile, Random Forest showed the best performance when combined with Random Oversampling, with an accuracy of 79.19% and an F1-score of 81.51%. These findings confirm that the success of data balancing techniques is strongly influenced by the characteristics of the classification algorithm used, and provide a practical contribution in determining strategies for handling imbalanced data in health research.
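Random Oversampling, one of the two balancing techniques compared in this abstract, can be sketched in a few lines. This is an illustrative version rather than the study's code: the function name `random_oversample`, the seed, and the toy rows are assumptions. SMOTE differs in that it creates new points by interpolating between a minority sample and its nearest minority neighbors instead of duplicating rows.

```python
import random
from collections import Counter

def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority rows until every class matches the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for label, count in counts.items():
        # Candidate rows to duplicate for this class.
        pool = [r for r, l in zip(rows, labels) if l == label]
        for _ in range(target - count):
            out_rows.append(rng.choice(pool))
            out_labels.append(label)
    return out_rows, out_labels

# Hypothetical patient rows: six negatives (0) and two positives (1).
rows = [[i, i * 2] for i in range(8)]
labels = [0, 0, 0, 0, 0, 0, 1, 1]
balanced_rows, balanced_labels = random_oversample(rows, labels)
print(Counter(balanced_labels))
```

In practice the resampling is usually applied to the training split only, so that duplicated rows cannot leak into the test set.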

Kamelia Indah Sari; Fredericho Mego Sundoro

Proceeding of the International Conference on Management, Entrepreneurship, and Business 2025 Asosiasi Riset Ilmu Manajemen Kewirausahaan dan Bisnis Indonesia

Economic forecasting is becoming increasingly important year after year, especially during crises such as the pandemic of COVID-19 and the Russia-Ukraine war. Its development can be seen from the use of basic statistical models to the increasingly widespread use of machine learning technology. Economic forecasting plays an important role in helping to formulate policies and is also a reliable tool for researchers in dealing with uncertainty. Global crises, such as inflationary pressures due to the pandemic and supply chain disruptions from the Russia-Ukraine conflict, have prompted increased research in this field in an effort to anticipate economic shocks and emphasize the urgency of forecasting to prepare strategies for dealing with future uncertainty. This literature review uses the Scopus database with 2561 publications from 2020 to 2025, analyzed using R Studio with a bibliometrix approach (specifically biblioshiny) and VOSviewer to map relevant thematic connections. This analysis shows that economic forecasting is greatly influenced by market uncertainty and geopolitical factors, and at the same time influences public policy formulation and financial stability. Research contributions from Indonesia are still limited, with only 40 documents, thus emphasizing the need to strengthen economic forecasting studies in Indonesia to support monetary policy and national financial stability.

Hilmi Satria Himawan; Verra Rizki Amelia; Anggun Permata Husda; Rahayu Alkam

Jurnal Publikasi Ekonomi dan Akuntansi 2025 Asosiasi Riset Ekonomi dan Akuntansi Indonesia

The interval between 2018 and 2025 represents a defining epoch in financial assurance, characterized by a systemic collision between traditional audit methodologies and the exponential sophistication of fraudulent actors. This research employs a comprehensive library research methodology, utilizing Systematic Literature Review (SLR) to evaluate the evolving landscape of audit and fraud. The study traces the theoretical migration from Cressey’s Fraud Triangle to multidimensional frameworks like the Fraud Pentagon, which emphasizes the roles of arrogance and competence. Through a forensic examination of catastrophic audit failures including Wirecard, FTX, and the emerging risks of crypto-assets, the research identifies recurring patterns of auditor failure in assessing operational risks and internal controls. Furthermore, the report analyzes the dual-edged impact of Artificial Intelligence (AI); while machine learning algorithms offer enhanced detection capabilities, the rise of Generative AI (GenAI) and deepfake technology has empowered perpetrators to execute sophisticated "synthetic reality" frauds. The study critically evaluates regulatory responses, particularly the revision of International Standard on Auditing (ISA) 240, which mandates a more proactive "fraud lens." The findings suggest that the auditing profession faces an existential crisis of relevance, necessitating a fundamental shift toward a forensic mindset supported by advanced technological integration.

Claudia K. Hamsi; I Wayan Sudiarsa; Vinsensia P.K Abu; Sarling C. Dhai; Maria A. Serero

Mars: Jurnal Teknik Mesin, Industri, Elektro Dan Ilmu Komputer 2025 Asosiasi Riset Teknik Elektro dan Informatika Indonesia

The rapid development of digital streaming platforms such as Netflix has generated a large volume of content data with diverse characteristics, thereby requiring effective analytical methods to understand emerging patterns and trends. This study aims to classify Netflix content into two main categories, namely movies and television shows, and to analyze genre trends and content characteristics using a data mining approach with the Naive Bayes algorithm. The dataset used in this study is the Netflix Shows dataset, consisting of 8,809 content entries, with the primary features analyzed including genre, rating, and country of production. The research process begins with data exploration and preprocessing stages, including data cleaning, handling missing values, and transforming categorical features to enable effective model construction. Subsequently, the dataset is divided into training and testing sets to objectively and systematically build and evaluate the Naive Bayes classification model. Model performance is evaluated using accuracy, precision, recall, and F1-score metrics to assess the model’s ability to accurately distinguish between Netflix content types. The experimental results demonstrate that the Naive Bayes algorithm is able to classify Netflix content into Movie and TV Show categories with accuracy, precision, recall, and F1-score values of 100%, respectively. The confusion matrix indicates that no misclassification occurred, suggesting that genre, rating, and country of production features provide a very clear separation between content classes. These findings indicate that the Naive Bayes algorithm can achieve exceptionally high classification performance with optimal evaluation results. The results further reveal distinct differences in characteristics between movies and television shows based on genre and production attributes. Therefore, this study is expected to contribute to the development of content recommendation systems and strategic content management within the streaming industry.

Rachmatika, Rinna; Desyani, Teti; Khoirudin

Journal of Information Technology and Computer Science 2025 International Forum of Researchers and Lecturers

Diseases in primary health services exhibit complex spatial-temporal dynamics due to urbanization and population mobility. Conventional surveillance approaches struggle to capture these patterns adaptively. Machine learning (ML) based on spatio-temporal modeling offers a solution with the ability to detect disease clusters automatically and with high precision. Research Objectives: This research aims to develop a machine learning model to detect disease hotspots from primary service data in Indonesia, with a focus on improving prediction accuracy, interpretability, and relevance of health policies. Methodology: The primary service dataset for 2024 (5,343 entries) was analyzed using three ML models: Gradient Boosting Machine (GBM), Temporal Random Forest (TRF), and Multi-EigenSpot, with spatial (village) and temporal (week, month) features. Performance evaluation includes predictive (AUC, F1-score) and spatial (Moran's I, Spatio-Temporal Correlation Index) metrics. Results: The results showed that Multi-EigenSpot achieved the best performance (AUC=0.91; F1=0.86), with the detection of dominant hotspots in Sungai Asam and Beringin Villages. Moran's I value of 0.63 indicates a strong spatial autocorrelation, while STCI=0.57 indicates moderate temporal stability. Conclusions: ML-based spatio-temporal models are effective in identifying hidden disease patterns and have the potential to be integrated into national digital surveillance systems. This approach supports precision public health by providing a scientific basis for real-time location- and time-based intervention policies.
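Moran's I, the spatial autocorrelation statistic reported in this abstract, can be computed from a vector of values and a spatial weight matrix. The sketch below uses the standard global formula; the function name `morans_i`, the four villages, their case counts, and the adjacency weights are hypothetical and unrelated to the study's data.

```python
def morans_i(values, weights):
    """Global Moran's I for values x_i and spatial weights w_ij.

    I = (n / sum of all w_ij) * sum_ij w_ij (x_i - mean)(x_j - mean)
        / sum_i (x_i - mean)^2
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_total = sum(sum(row) for row in weights)
    cross = sum(
        weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n)
    )
    return (n / w_total) * cross / sum(d * d for d in dev)

# Hypothetical case counts for four villages laid out in a row;
# a weight of 1 marks villages that share a border.
cases = [1, 2, 3, 4]
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
moran = morans_i(cases, adjacency)
print(round(moran, 4))  # 0.3333
```

Positive values indicate that neighboring locations tend to have similar values, which is why a value such as the reported 0.63 is read as clustering rather than random scatter.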

Dwi Hastuti

Jurnal Elektronika dan Komputer 2025 STEKOM PRESS

This paper explores the epistemological dimensions of the digital transformation occurring in traditional game development through the integration of machine learning systems. By examining how knowledge creation, validation, and application have evolved in this domain, we identify fundamental shifts in the epistemological frameworks governing game development practices. The research investigates how machine learning has redefined creative processes, technical implementation, and experiential design while challenging traditional notions of authorship, expertise, and knowledge transmission. Through analysis of industry case studies, technological capabilities, and theoretical frameworks, this paper contributes to understanding how machine learning systems are not merely tools but epistemological agents that fundamentally transform how knowledge is generated, validated, and utilized in game development ecosystems.

Parhusip, Jadiaman; Julian, Ary Sigit; Hidayat, Febrian Nur; Souk, Jeremy Timothy; Fakhri, Naufal +5 more

JURNAL ILMIAH KOMPUTER GRAFIS 2025 UNIVERSITAS STEKOM

This study uses secondary data that went through several preprocessing steps, including handling missing data, standardizing numeric data, and converting categorical data using One-Hot Encoding. Most of the data (80%) was used for training, while the remaining 20% was used for testing, and the model was implemented with the LinearRegression() method from the scikit-learn library. The evaluation results show that the model captured the linear relationship between the independent and dependent variables, obtaining MAE = 0.509, MSE = 0.464, RMSE = 0.681, and R² = 0.627. This indicates that about 62.7 percent of the variation in house prices in the Jabodetabek area can be explained by the model.
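The four regression metrics reported in this abstract are related by simple formulas, which the following standard-library sketch makes explicit. The function name `regression_metrics` and the toy values are hypothetical; only the final check uses figures from the abstract, confirming that the reported RMSE is the square root of the reported MSE.

```python
import math

def regression_metrics(y_true, y_pred):
    """Compute MAE, MSE, RMSE, and R² for paired true and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    mean_true = sum(y_true) / n
    # Total sum of squares: variation of the targets around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1 - (mse * n) / ss_tot
    return {"mae": mae, "mse": mse, "rmse": math.sqrt(mse), "r2": r2}

# Hypothetical standardized prices and model predictions.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.5, 7.0, 8.0]
scores = regression_metrics(y_true, y_pred)
print(scores)

# Consistency check on the abstract's figures: sqrt(0.464) rounds to 0.681.
reported_rmse = round(math.sqrt(0.464), 3)
print(reported_rmse)
```

R² is one minus the ratio of residual to total variation, so an R² of 0.627 corresponds to the statement that about 62.7 percent of the price variation is explained by the model.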

Tiara Ayu Triarta Tambak

Imajinasi : Jurnal Ilmu Pengetahuan, Seni, dan Teknologi 2025 Asosiasi Seni Desain dan Komunikasi Visual Indonesia

This study aims to analyze user sentiment toward the integration of Artificial Intelligence (AI) in online learning platforms, which are increasingly expanding in the digital era. With the growing use of AI technologies in education, such as learning chatbots, material recommendation systems, and automated assessments, it is essential to understand users’ perceptions and reactions to these implementations. The research employs sentiment analysis based on text mining using user review data collected from various online learning platforms. The analysis process includes data preprocessing, sentiment classification using machine learning algorithms, and interpretation of results based on the proportion of positive, negative, and neutral sentiments. The findings indicate that most users express positive sentiments toward AI integration, as it enhances learning efficiency and personalization. However, some users raise concerns regarding data privacy and the lack of human interaction. This study is expected to serve as a reference for educational platform developers to design AI systems that are more adaptive, transparent, and user-centered.