Train and test a reinforcement learning model
Compare standard and hybrid test-time adaptation in Meta-RL