#rl training validation