This study compares the performance of two deep learning models, Convolutional Long Short-Term Memory (ConvLSTM) and Long-term Recurrent Convolutional Network (LRCN), on the task of recognizing human activity in videos. Human activity recognition is an important field in computer vision with many applications, such as security monitoring, human-computer interaction, and social media video analysis. ConvLSTM embeds convolution operations inside the Long Short-Term Memory (LSTM) cell, allowing it to capture spatial and temporal information simultaneously; this makes it well suited to video sequences, which have both spatial and temporal dimensions. LRCN, on the other hand, combines spatial feature extraction by a Convolutional Neural Network (CNN) with temporal sequence modeling by a Recurrent Neural Network (RNN), specifically an LSTM, to learn movement patterns in videos. The study used the UCF50 dataset, which contains 50 activity classes, but the experiments were restricted to five classes to focus the evaluation. The dataset was split into 80% for training and 20% for testing, and each model was trained for up to 50 epochs with early stopping to prevent overfitting. The results show that both models achieved high training performance: ConvLSTM reached a training accuracy of about 98% and a validation accuracy of 90%, while LRCN reached a training accuracy of 99.5% and a validation accuracy of 88%. Although ConvLSTM was more stable on the validation data, further testing on TikTok videos as real-world data showed that LRCN recognized activities with higher confidence, with most predictions scoring above 80%. This difference suggests that while ConvLSTM generalizes better to held-out validation data, LRCN is more robust to the variability of real-world videos.
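
To make the architectural contrast concrete, the following is a minimal Keras sketch of the two models as described above. The clip shape (20 frames of 64×64 RGB), filter counts, and early-stopping patience are illustrative assumptions, not the study's exact configuration; only the five-class output and the 50-epoch / early-stopping setup come from the abstract.

```python
# Minimal sketch of the two architectures compared in the study.
# Sequence length, frame size, and layer widths are assumptions for
# illustration; NUM_CLASSES = 5 follows the study's five-class subset.
import tensorflow as tf
from tensorflow.keras import layers, models

SEQ_LEN, H, W, C = 20, 64, 64, 3   # assumed clip shape (frames, height, width, channels)
NUM_CLASSES = 5                     # five UCF50 classes, per the study

def build_convlstm():
    """ConvLSTM: convolution inside the recurrent cell, so spatial and
    temporal structure are modeled jointly."""
    return models.Sequential([
        layers.Input(shape=(SEQ_LEN, H, W, C)),
        layers.ConvLSTM2D(16, (3, 3), activation="tanh", return_sequences=True),
        layers.MaxPooling3D(pool_size=(1, 2, 2)),   # pool spatially, keep time axis
        layers.ConvLSTM2D(32, (3, 3), activation="tanh", return_sequences=False),
        layers.Flatten(),
        layers.Dense(NUM_CLASSES, activation="softmax"),
    ])

def build_lrcn():
    """LRCN: a CNN extracts per-frame spatial features (via TimeDistributed),
    then an LSTM models the temporal sequence of those features."""
    return models.Sequential([
        layers.Input(shape=(SEQ_LEN, H, W, C)),
        layers.TimeDistributed(layers.Conv2D(16, (3, 3), activation="relu")),
        layers.TimeDistributed(layers.MaxPooling2D((2, 2))),
        layers.TimeDistributed(layers.Conv2D(32, (3, 3), activation="relu")),
        layers.TimeDistributed(layers.MaxPooling2D((2, 2))),
        layers.TimeDistributed(layers.Flatten()),
        layers.LSTM(32),
        layers.Dense(NUM_CLASSES, activation="softmax"),
    ])

# Training setup matching the abstract: up to 50 epochs with early stopping.
early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=10, restore_best_weights=True)

model = build_lrcn()
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(x_train, y_train, epochs=50, validation_split=0.2,
#           callbacks=[early_stop])  # x_train shape: (N, SEQ_LEN, H, W, C)
```

The key structural difference the study exploits is visible here: ConvLSTM fuses convolution into the recurrence itself, while LRCN keeps spatial feature extraction and temporal modeling in separate, sequential stages.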