Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data

Abstract
AImbalanced data remains a significant issue in heart disease classification using machine learning, as it tends to cause models to overestimate the majority class while ignoring minority classes with high clinical value. This can lead to a decrease in accuracy and the model's ability to accurately detect disease cases. Therefore, this study aims to assess the effectiveness of oversampling techniques, namely Random Oversampling and Synthetic Minority Oversampling Technique (SMOTE), in improving the performance of the K-Nearest Neighbors (KNN), Naive Bayes (NB), and Random Forest (RF) algorithms. The dataset used comes from Kaggle and consists of 918 data sets with 12 attributes representing patient information related to heart disease prediction. The research stages include data preprocessing, baseline model testing, and re-evaluation using the two oversampling methods. Experimental results show that oversampling can improve the performance of all algorithms. KNN achieved the best results with SMOTE, with an accuracy of 72.98% and an F1-score of 75.39%. In the Naive Bayes algorithm, both oversampling techniques produced relatively stable performance, with the highest F1-score of 73.56% using SMOTE. Meanwhile, Random Forest showed the most optimal performance when combined with Random Oversampling, with an accuracy of 79.19% and an F1-score of 81.51%. These findings confirm that the success of data balancing techniques is strongly influenced by the characteristics of the classification algorithm used, and provide a practical contribution in determining strategies for handling imbalanced data in health research.
Keywords
How to Cite

Eni Rohaini, et al. (2025). Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data. Prosiding Seminar Nasional Ilmu Teknik, 2(2). https://doi.org/10.61132/prosemnasproit.v2i2.59

Eni Rohaini; Gunardi, Gunardi; Nurhayati Nurhayati; Jasmir Jasmir; Zahra Prisdian Tiararosa, "Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data," Prosiding Seminar Nasional Ilmu Teknik, vol. 2, no. 2, 2025.

Eni Rohaini; Gunardi, Gunardi; Nurhayati Nurhayati; Jasmir Jasmir; Zahra Prisdian Tiararosa. "Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data." Prosiding Seminar Nasional Ilmu Teknik, vol. 2, no. 2, 2025.

Eni Rohaini; Gunardi, Gunardi; Nurhayati Nurhayati; Jasmir Jasmir; Zahra Prisdian Tiararosa. "Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data." Prosiding Seminar Nasional Ilmu Teknik 2, no. 2 (2025).

Eni Rohaini, et al. (2025) 'Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data', Prosiding Seminar Nasional Ilmu Teknik, 2(2). doi: 10.61132/prosemnasproit.v2i2.59.

Eni Rohaini; Gunardi, Gunardi; Nurhayati Nurhayati; Jasmir Jasmir; Zahra Prisdian Tiararosa. Evaluasi Kinerja Machine Learning pada Klasifikasi Penyakit Jantung Menggunakan Teknik Penyeimbangan Data. Prosiding Seminar Nasional Ilmu Teknik. 2025;2(2).

Artikel Terkait
Tren Sitasi Jurnal