ppo-LunarLander-v2 / robot_1 /_stable_baselines3_version
jgerbscheid
basic PPO model trained in colab, deep-rl course unit
7f32894
1.5.0