ppo-LunarLander-v2 / ppo-LunarLander-v2
abragin's picture
Baseline agent trained with 3M steps
91d9a91 verified