SCIRAPID - Android Malware Detection Using Machine Learning with SMOTE-Tomek Data Balancing

Journal of Computing Theories and Applications

Universitas Dian Nuswantoro

📄 Abstract

This study presents a comparative analysis of four traditional machine learning algorithms (Decision Tree, Random Forest, K-Nearest Neighbors, and Support Vector Machine) for Android malware detection, using the preprocessed TUANDROMD dataset of 4,465 instances and 241 features that represent both static and dynamic application characteristics. Motivated by the limitations of conventional signature-based and hybrid detection methods, especially in handling imbalanced datasets and detecting emerging malware variants, the study employed SMOTE to balance the training data and support fair model evaluation. The dataset was split into 80% training and 20% testing subsets, and the models were assessed on accuracy, precision, recall, F1-score, and ROC AUC. Random Forest outperformed the other classifiers, achieving an accuracy of 0.993, precision of 0.992, recall of 1.000, F1-score of 0.996, and a near-perfect ROC AUC of 0.9998, surpassing state-of-the-art approaches. These results affirm the predictive capability, consistency, and robustness of Random Forest for Android malware detection. The study concludes that base models, when combined with class-balancing techniques, provide reliable and efficient malware detection on imbalanced datasets. For future research, the study recommends exploring hybrid or ensemble frameworks that integrate Random Forest with deep learning architectures or meta-heuristic optimization techniques to further improve detection accuracy, adaptability, and resilience against rapidly evolving Android malware threats.
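The balancing step and the reported metrics can be illustrated with a small sketch. This is not the authors' implementation: it is a minimal, standard-library-only approximation of SMOTE-style oversampling (interpolating between a minority sample and one of its nearest minority neighbours), and it omits the Tomek-link cleaning named in the title. The function names, the toy data, and the choice of `k` are all assumptions made for illustration. The second part checks that the reported F1-score is consistent with the reported precision and recall.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Return n_new synthetic rows interpolated between minority neighbours.

    Hypothetical helper for illustration; assumes at least two minority rows.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base row (excluding itself)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Toy data (hypothetical, not TUANDROMD): 8 majority rows, 3 minority rows.
majority = [[float(i), float(i % 3)] for i in range(8)]
minority = [[10.0, 10.0], [11.0, 10.5], [10.5, 11.5]]

# Generate enough synthetic minority rows to match the majority class size.
new_rows = smote_oversample(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_rows

# Consistency check on the Random Forest figures quoted in the abstract.
reported_f1 = f1_score(0.992, 1.000)
print(len(balanced_minority), round(reported_f1, 3))
```

In practice, oversampling of this kind is usually applied to the training split only, so that the 20% test split keeps the original class distribution; a maintained library implementation would normally be preferred over a hand-written one like this.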

🔖 Keywords

Android malware detection; Cybersecurity; Imbalanced dataset; Intrusion detection; Machine learning; Malicious detection; Malware classification; Random Forest

ℹ️ Publication Information

Publication Date

18 January 2026

Volume / Number / Year

Volume 3, Number 3, 2026

📝 HOW TO CITE

Masari, Maryam Sufiyanu; Danladi, Maiauduga Abdullahi; Onyinye, Ilori Loretta; Tohomdet, Loreta Katok, "Android Malware Detection Using Machine Learning with SMOTE-Tomek Data Balancing," Journal of Computing Theories and Applications, vol. 3, no. 3, Jan. 2026.

