bguan/lunar_lander_v2_ppo_5
Tags: Reinforcement Learning, stable-baselines3, LunarLander-v2, deep-reinforcement-learning, Eval Results
Files and versions
Branch: main
1 contributor
History: 3 commits
Latest commit: 57e96c5 by bguan, over 2 years ago
Commit message: lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
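The commit message names three training settings: learning rate 0.0005, gamma 0.995, and 1M timesteps. A minimal sketch of how those values might map onto stable-baselines3's PPO follows. The "MlpPolicy" choice, the helper names, and the save path are assumptions for illustration; the page does not state the policy or any other hyperparameter, and the author's actual training script is not shown here.

```python
# Values taken from the commit message.
LEARNING_RATE = 0.0005
GAMMA = 0.995
TOTAL_TIMESTEPS = 1_000_000
ENV_ID = "LunarLander-v2"  # taken from the repository tags


def ppo_kwargs():
    """Keyword arguments for PPO that the commit message specifies."""
    return {"learning_rate": LEARNING_RATE, "gamma": GAMMA}


def effective_horizon(gamma):
    """Rough number of steps over which discounted rewards still count: 1 / (1 - gamma)."""
    return 1.0 / (1.0 - gamma)


def train(save_path="bguan_ppo_lunarlander5"):
    """Hypothetical training helper; "MlpPolicy" is an assumption, not from the source."""
    # Imported lazily so the constants above can be inspected without SB3 installed.
    from stable_baselines3 import PPO

    model = PPO("MlpPolicy", ENV_ID, **ppo_kwargs())
    model.learn(total_timesteps=TOTAL_TIMESTEPS)
    model.save(save_path)  # SB3 writes a .zip archive at this path
    return model
```

With gamma 0.995, effective_horizon(GAMMA) comes out at roughly 200 steps, which is one way to read the choice of a discount factor this close to 1. Depending on the installed Gym or Gymnasium version, the LunarLander-v2 id may require the Box2D extra or may have been superseded by a newer environment version.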
Every entry below was last changed over 2 years ago by the same commit: "lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps".

Name                          Size        Storage   Scan
bguan_ppo_lunarlander5        -           -         -
.gitattributes                1.22 kB     -         Safe
README.md                     677 Bytes   -         Safe
bguan_ppo_lunarlander5.zip    145 kB      LFS       Safe (pickle; no problematic imports detected)
config.json                   15 kB       -         Safe
replay.mp4                    247 kB      LFS       Safe
results.json                  163 Bytes   -         Safe
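The bguan_ppo_lunarlander5.zip entry in the listing is flagged as a pickle file, and the repository carries the stable-baselines3 tag, so the archive is presumably a saved SB3 model. A minimal sketch of fetching and loading it might look like this, assuming the huggingface_sb3 and stable_baselines3 packages are installed and that the archive was written by PPO's save method; the helper name load_model is hypothetical.

```python
# Repository id and file name as they appear on this page.
REPO_ID = "bguan/lunar_lander_v2_ppo_5"
FILENAME = "bguan_ppo_lunarlander5.zip"


def load_model():
    """Hypothetical helper: download the archive from the Hub and load it with SB3."""
    # Imported lazily so the constants above can be used without these packages.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO

    # load_from_hub downloads the file and returns a local path to it.
    checkpoint_path = load_from_hub(repo_id=REPO_ID, filename=FILENAME)
    # Assumption: the archive holds a PPO model, as the commit message suggests.
    return PPO.load(checkpoint_path)
```

Because SB3 archives contain pickled objects, loading one can execute code; the scan result shown in the listing reports no problematic imports, but it is still worth loading only files from sources you trust.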