Caterina Paras Dewi; Jasmir Jasmir; Willy Riyadi; Alya Rafina
Chronic Kidney Disease (CKD) is a heterogeneous disorder that gradually affects the structure and function of the kidneys, is difficult to recover, and causes the body to be unable to maintain metabolism and fail to maintain fluid and electrolyte balance, leading to increased urea levels. Chronic kidney disease data was obtained from Kaggle, in this study a comparison was made between two classification algorithms, namely Naïve Bayes Classifier (NBC) and Random Forest because it is not yet known what algorithm is best in classifying chronic kidney disease (CKD). Both algorithms are evaluated based on performance metrics such as accuracy, precision, recall, and confusion matrix. The results of the evaluation showed that in a dataset of 400 samples, the performance of the Naïve Bayes Classifier (NBC) algorithm obtained an accuracy of 94%, while Random Forest had an accuracy of 93%. Then in the small dataset (158 data), Random Forest got a better accuracy score with 87% compared to the Naïve Bayes Classifier (NBC) of 78%. Based on the results of the evaluation, Random Forest has a more stable performance on small datasets, while Naïve Bayes Classifier (NBC) provides higher performance on larger datasets in the context of chronic kidney disease classification.