Yuma Akbar; Sopan Adrianto; Rasiban Rasiban; Nadya Khairunnisa
This study discusses a student concentration detection system using Convolutional Neural Network (CNN) with the MobileNetV2 architecture. The dataset was adapted from Classroom Student Behaviors and mapped into four concentration categories: highly focused, focused, less focused, and unfocused. The system was tested with a 720p webcam and produced real-time detection data. The evaluation results show an overall accuracy of 75.85%, with the highest precision achieved in the focused class (0.9859) and the highest recall in the highly focused (0.9739) and unfocused (0.9811) classes. The confusion matrix indicates that the focused class was detected most consistently, while highly focused and unfocused classes were often misclassified as focused, resulting in lower precision. In real-time testing, the system operated at an average of 7 FPS and worked optimally when students faced the camera directly with sufficient lighting, but its performance decreased significantly at face angles greater than 45°. User evaluation shows that 75% of students rated the detection results as accurate/very accurate with an average satisfaction score of 3.6 out of 5, and 75% felt assisted in recognizing their concentration level. From the teachers’ perspective, most stated that the results were consistent with classroom observations, and all expressed willingness to reuse the system.