DeepRLCourse2022 / bguan_ppo_lunarlander3 / _stable_baselines3_version
bguan's lunar lander model #3 using PPO trained for 1M timesteps
ee17131
1.5.0