Abstract: Human-in-the-loop reinforcement learning (HIRL) has emerged as a promising approach to address the challenges of sample efficiency and exploration in complex environments. This paper ...