Exploring Machine Learning and Deep Learning Techniques for Occluded Face Recognition: A Comprehensive Survey and Comparative Analysis
(Keny Muhamada, De Rosal Ignatius Moses Setiadi, Usman Sudibyo, Budi Widjajanto, Arnold Adimabua Ojugo)
DOI : 10.62411/faith.2024-30
- Volume: 1,
Issue: 2,
Sitasi : 0 26-Sep-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
Face recognition occluded by occlusions, such as glasses or shadows, remains a challenge in many security and surveillance applications. This study aims to analyze the performance of various machine learning and deep learning techniques in face recognition scenarios with occlusions. We evaluate KNN (standard and FisherFace), CNN, DenseNet, Inception, and FaceNet methods combined with a pre-trained DeepFace model using three public datasets: YALE, Essex Grimace, and Georgia Tech. The results show that KNN maintains the highest accuracy, reaching 100% on two datasets (Essex Grimace and YALE), even in the presence of occlusions. Meanwhile, CNN shows strong performance, with accuracy remaining 100% on YALE, both with and without occlusions, although its performance drops slightly on Essex Grimace (94% with occlusion). DenseNet and Inception show a more significant drop in accuracy when faced with occlusion, with DenseNet dropping from 81% to 72% on Essex Grimace and Inception dropping from 100% to 92% on the same dataset. FaceNet + DeepFace excels on more large dataset (Georgia Tech) with 98% accuracy, but its performance drops dramatically to 53% and 70% on Essex Grimace and YALE with occlusion. These findings indicate that while deep learning methods show high accuracy under ideal conditions, machine learning methods such as KNN are more flexible and robust to occlusion in face recognition.
|
0 |
2024 |
Phishing Website Detection Using Bidirectional Gated Recurrent Unit Model and Feature Selection
(De Rosal Ignatius Moses Setiadi, Suyud Widiono, Achmad Nuruddin Safriandono, Setyo Budi)
DOI : 10.62411/faith.2024-15
- Volume: 1,
Issue: 2,
Sitasi : 0 12-Jul-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
Phishing attacks continue to be a significant threat to internet users, necessitating the development of advanced detection systems. This study explores the efficacy of a Bidirectional Gated Recurrent Unit (BiGRU) model combined with feature selection techniques for detecting phishing websites. The dataset used for this research is sourced from the UCI Machine Learning Repository, specifically the Phishing Websites dataset. This approach involves cleaning and preprocessing the data, then normalizing features and employing feature selection to identify the most relevant attributes for classification. The BiGRU model, known for its ability to capture temporal dependencies in data, is then applied. To ensure robust evaluation, we utilized cross-validation, dividing the data into five folds. The experimental results are highly promising, demonstrating a Mean Accuracy, Mean Precision, Mean Recall, Mean F1 Score, and Mean AUC of 1.0. These results indicate the model's exceptional performance distinguishing between phishing and legitimate websites. This study highlights the potential of combining BiGRU models with feature selection and cross-validation to create highly accurate phishing detection systems, providing a reliable solution to enhance cybersecurity measures.
|
0 |
2024 |
Effects of Data Resampling on Predicting Customer Churn via a Comparative Tree-based Random Forest and XGBoost
(Rita Erhovwo Ako, Fidelis Obukohwo Aghware, Margaret Dumebi Okpor, Maureen Ifeanyi Akazue, Rume Elizabeth Yoro, Arnold Adimabua Ojugo, De Rosal Ignatius Moses Setiadi, Chris Chukwufunaya Odiakaose, Reuben Akporube Abere, Frances Uche Emordi, Victor Ochuko Geteloma, Patrick Ogholuwarami Ejeh)
DOI : 10.62411/jcta.10562
- Volume: 2,
Issue: 1,
Sitasi : 0 27-Jun-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
Customer attrition has become the focus of many businesses today – since the online market space has continued to proffer customers, various choices and alternatives to goods, services, and products for their monies. Businesses must seek to improve value, meet customers' teething demands/needs, enhance their strategies toward customer retention, and better monetize. The study compares the effects of data resampling schemes on predicting customer churn for both Random Forest (RF) and XGBoost ensembles. Data resampling schemes used include: (a) default mode, (b) random-under-sampling RUS, (c) synthetic minority oversampling technique (SMOTE), and (d) SMOTE-edited nearest neighbor (SMOTEEN). Both tree-based ensembles were constructed and trained to assess how well they performed with the chi-square feature selection mode. The result shows that RF achieved F1 0.9898, Accuracy 0.9973, Precision 0.9457, and Recall 0.9698 for the default, RUS, SMOTE, and SMOTEEN resampling, respectively. Xgboost outperformed Random Forest with F1 0.9945, Accuracy 0.9984, Precision 0.9616, and Recall 0.9890 for the default, RUS, SMOTE, and SMOTEEN, respectively. Studies support that the use of SMOTEEN resampling outperforms other schemes; while, it attributed XGBoost enhanced performance to hyper-parameter tuning of its decision trees. Retention strategies of recency-frequency-monetization were used and have been found to curb churn and improve monetization policies that will place business managers ahead of the curve of churning by customers.
|
0 |
2024 |
Analyzing Preprocessing Impact on Machine Learning Classifiers for Cryotherapy and Immunotherapy Dataset
(De Rosal Ignatius Moses Setiadi, Hussain Md Mehedul Islam, Gustina Alfa Trisnapradika, Wise Herowati)
DOI : 10.62411/faith.2024-2
- Volume: 1,
Issue: 1,
Sitasi : 0 01-Jun-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
In the clinical treatment of skin diseases and cancer, cryotherapy and immunotherapy offer effective and minimally invasive alternatives. However, the complexity of patient response demands more sophisticated analytical strategies for accurate outcome prediction. This research focuses on analyzing the effect of preprocessing in various machine learning models on the prediction performance of cryotherapy and immunotherapy. The preprocessing techniques analyzed are advanced feature engineering and Synthetic Minority Over-sampling Technique (SMOTE) and Tomek links as resampling techniques and their combination. Various classifiers, including support vector machine (SVM), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), XGBoost, and Bidirectional Gated Recurrent Unit (BiGRU), were tested. The findings of this study show that preprocessing methods can significantly improve model performance, especially in the XGBoost model. Random Forest also gets the same results as XGBoost, but it can also work better without significant preprocessing. The best results were 0.8889, 0.8889, 0.6000, 0.9037, and 0.8790, respectively, for accuracy, recall, specificity, precision, and f1 on the Immunotherapy dataset, while on the Cryotherapy dataset, respectively, they were 0.8889, 0.8889, 0.6000, 0.9037, and 0.8790. This study confirms the potential of customized preprocessing and machine learning models to provide deep insights into treatment dynamics, ultimately improving the quality of diagnosis.
|
0 |
2024 |
Analyzing Quantum Feature Engineering and Balancing Strategies Effect on Liver Disease Classification
(Achmad Nuruddin Safriandono, De Rosal Ignatius Moses Setiadi, Akhmad Dahlan, Farah Zakiyah Rahmanti, Iwan Setiawan Wibisono, Arnold Adimabua Ojugo)
DOI : 10.62411/faith.2024-12
- Volume: 1,
Issue: 1,
Sitasi : 0 01-Jun-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
This research aims to improve the accuracy of liver disease classification using Quantum Feature Engineering (QFE) and the Synthetic Minority Over-sampling Tech-nique and Tomek Links (SMOTE-Tomek) data balancing technique. Four machine learning models were compared in this research, namely eXtreme Gradient Boosting (XGB), Random Forest (RF), Support Vector Machine (SVM), and Logistic Regression (LR) on the Indian Liver Patient Dataset (ILPD) dataset. QFE is applied to capture correlations and complex patterns in the data, while SMOTE-Tomek is used to address data imbalances. The results showed that QFE significantly improved LR performance in terms of recall and specificity up to 99%, which is very important in medical diagnosis. The combination of QFE and SMOTE-Tomek gives the best results for the XGB method with an accuracy of 81%, recall of 90%, and f1-score of 83%. This study concludes that the use of QFE and data balancing techniques can improve liver disease classification performance in general.
|
0 |
2024 |
Integrating SMOTE-Tomek and Fusion Learning with XGBoost Meta-Learner for Robust Diabetes Recognition
(De Rosal Ignatius Moses Setiadi, Kristiawan Nugroho, Ahmad Rofiqul Muslikh, Syahroni Wahyu Iriananda, Arnold Adimabua Ojugo)
DOI : 10.62411/faith.2024-11
- Volume: 1,
Issue: 1,
Sitasi : 0 23-May-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
This research aims to develop a robust diabetes classification method by integrating the Synthetic Minority Over-sampling Technique (SMOTE)-Tomek technique for data balancing and using a machine learning ensemble led by eXtreme Gradient Boosting (XGB) as a meta-learner. We propose an ensemble model that combines deep learning techniques such as Bidirectional Long Short-Term Memory (BiLSTM) and Bidirectional Gated Recurrent Units (BiGRU) with XGB classifier as the base learner. The data used included the Pima Indians Diabetes and Iraqi Society Diabetes datasets, which were processed by missing value handling, duplication, normalization, and the application of SMOTE-Tomek to resolve data imbalances. XGB, as a meta-learner, successfully improves the model's predictive ability by reducing bias and variance, resulting in more accurate and robust classification. The proposed ensemble model achieves perfect accuracy, precision, recall, specificity, and F1 score of 100% on all tested datasets. This method shows that combining ensemble learning techniques with a rigorous preprocessing approach can significantly improve diabetes classification performance.
|
0 |
2024 |
Comprehensive Exploration of Machine and Deep Learning Classification Methods for Aspect-Based Sentiment Analysis with Latent Dirichlet Allocation Topic Modeling
(De Rosal Ignatius Moses Setiadi, Dhendra Marutho, Noor Ageng Setiyanto)
DOI : 10.62411/faith.2024-3
- Volume: 1,
Issue: 1,
Sitasi : 0 22-May-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
This research explores the effectiveness of machine learning (ML) and deep learning (DL) classification methods in Aspect-Based Sentiment Analysis (ABSA) on product reviews, incorporating Latent Dirichlet Allocation (LDA) for topic modeling. Using the Amazon reviews dataset, this research tests models such as Naive Bayes (NB), Support Vector Machines (SVM), Random Forest (RF), Long Short-Term Memory (LSTM), and Gated Recurrent Units(GRU). Important aspects such as the product's quality, practicality, and reliability are discussed. The results show that the RF and DL models provide competitive performance, with the RF achieving an accuracy of up to 94.50% and an F1 score of 95.45% for the reliability aspect. The study's conclusions emphasize the importance of selecting an appropriate model based on specifications and data requirements for ABSA, as well as recognizing the need to strike a balance between accuracy and computational efficiency.
|
0 |
2024 |
Analyzing InceptionV3 and InceptionResNetV2 with Data Augmentation for Rice Leaf Disease Classification
(Fadel Muhamad Firnando, De Rosal Ignatius Moses Setiadi, Ahmad Rofiqul Muslikh, Syahroni Wahyu Iriananda)
DOI : 10.62411/faith.2024-4
- Volume: 1,
Issue: 1,
Sitasi : 0 21-May-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
This research aims to evaluate and compare the performance of several deep learning architectures, especially InceptionV3 and InceptionResNetV2, with other models, such as EfficientNetB3, ResNet50, and VGG19, in classifying rice leaf diseases. In addition, this research also evaluates the impact of using data augmentation on model performance. Three different datasets were used in this experiment, varying the number of images and class distribution. The results show that InceptionV3 and InceptionResNetV2 consistently perform excellently and accurately on most datasets. Data augmentation has varying effects, providing slight advantages on datasets with lower variation. The findings from this research are that the InceptionV3 model is the best model for classifying rice diseases based on leaf images. The InceptionV3 model produces accuracies of 99.53, 58.94, and 90.00 for datasets 1, 2, and 3, respectively. It is also necessary to be wise in carrying out data augmentation by considering the dataset's characteristics to ensure the resulting model can generalize well.
|
0 |
2024 |
Enhanced Vision Transformer and Transfer Learning Approach to Improve Rice Disease Recognition
(Rahadian Kristiyanto Rachman, De Rosal Ignatius Moses Setiadi, Ajib Susanto, Kristiawan Nugroho, Hussain Md Mehedul Islam)
DOI : 10.62411/jcta.10459
- Volume: 1,
Issue: 4,
Sitasi : 0 26-Apr-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
In the evolving landscape of agricultural technology, recognizing rice diseases through computational models is a critical challenge, predominantly addressed through Convolutional Neural Networks (CNN). However, the localized feature extraction of CNNs often falls short in complex scenarios, necessitating a shift towards models capable of global contextual understanding. Enter the Vision Transformer (ViT), a paradigm-shifting deep learning model that leverages a self-attention mechanism to transcend the limitations of CNNs by capturing image features in a comprehensive global context. This research embarks on an ambitious journey to refine and adapt the ViT Base(B) transfer learning model for the nuanced task of rice disease recognition. Through meticulous reconfiguration, layer augmentation, and hyperparameter tuning, the study tests the model's prowess across both balanced and imbalanced datasets, revealing its remarkable ability to outperform traditional CNN models, including VGG, MobileNet, and EfficientNet. The proposed ViT model not only achieved superior recall (0.9792), precision (0.9815), specificity (0.9938), f1-score (0.9791), and accuracy (0.9792) on challenging datasets but also established a new benchmark in rice disease recognition, underscoring its potential as a transformative tool in the agricultural domain. This work not only showcases the ViT model's superior performance and stability across diverse tasks and datasets but also illuminates its potential to revolutionize rice disease recognition, setting the stage for future explorations in agricultural AI applications.
|
0 |
2024 |
Enhancing Lung Cancer Classification Effectiveness Through Hyperparameter-Tuned Support Vector Machine
(Fita Sheila Gomiasti, Warto Warto, Etika Kartikadarma, Jutono Gondohanindijo, De Rosal Ignatius Moses Setiadi)
DOI : 10.62411/jcta.10106
- Volume: 1,
Issue: 4,
Sitasi : 0 25-Mar-2024
| Abstrak
| PDF File
| Resource
| Last.31-Jul-2025
Abstrak:
This research aims to improve the effectiveness of lung cancer classification performance using Support Vector Machines (SVM) with hyperparameter tuning. Using Radial Basis Function (RBF) kernels in SVM helps deal with non-linear problems. At the same time, hyperparameter tuning is done through Random Grid Search to find the best combination of parameters. Where the best parameter settings are C = 10, Gamma = 10, Probability = True. Test results show that the tuned SVM improves accuracy, precision, specificity, and F1 score significantly. However, there was a slight decrease in recall, namely 0.02. Even though recall is one of the most important measuring tools in disease classification, especially in imbalanced datasets, specificity also plays a vital role in avoiding misidentifying negative cases. Without hyperparameter tuning, the specificity results are so poor that considering both becomes very important. Overall, the best performance obtained by the proposed method is 0.99 for accuracy, 1.00 for precision, 0.98 for recall, 0.99 for f1-score, and 1.00 for specificity. This research confirms the potential of tuned SVMs in addressing complex data classification challenges and offers important insights for medical diagnostic applications.
|
0 |
2024 |