# Reinforcement learning debugging