Yogiek Indra Kurniawan; Krisna Widi Nugraha; Rosyid Ridlo Al-Hakim; Erick Fernando; Rian Ardianto +2 more
Background: The development of modern manufacturing systems requires production scheduling strategies that not only improve productivity but also optimize energy utilization. Multi-machine production systems with job-shop configurations exhibit high complexity due to dynamic interactions between machines, job queues, and varying processing times, making conventional scheduling methods less effective in handling changing operational conditions. Objective: This study aims to develop and evaluate a reinforcement learning based production scheduling approach to improve production efficiency while reducing energy consumption in multi-machine manufacturing systems. Methods: This research employs a job-shop based multi-machine production simulation model as the experimental environment. The scheduling problem is formulated as a Markov Decision Process, enabling the implementation of reinforcement learning algorithms, namely Q-learning and Deep Q-Network, to learn optimal scheduling policies through interaction with the simulation environment. Energy consumption parameters are incorporated into the reward function so that the learning agent can consider energy efficiency in the scheduling decision-making process. System performance is evaluated using three main metrics, namely energy consumption, throughput, and makespan. Results: The experimental results show that the reinforcement learning based scheduling approach achieves better performance compared to conventional scheduling methods, resulting in lower energy consumption, higher job completion rates, and shorter production completion times within the multi-machine manufacturing system.