chirbard/ppo-BipedalWalker-v3

Reinforcement Learning

stable-baselines3

BipedalWalker-v3

deep-reinforcement-learning


chirbard committed on Apr 18, 2024

Commit 9aa1aef (verified) · 1 Parent(s): 99b8bbe

Update README.md

Files changed (1)

README.md +14 -8


@@ -25,13 +25,19 @@ model-index:
 This is a trained model of a **PPO** agent playing **BipedalWalker-v3**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
-## Usage (with Stable-baselines3)
-TODO: Add your code
 ```python
-from stable_baselines3 import ...
-from huggingface_sb3 import load_from_hub
-...
 ```

+## Hyperparameters
 ```python
+model = PPO(
+    policy = 'MlpPolicy',
+    env = env,
+    n_steps = 1024,
+    batch_size = 64,
+    n_epochs = 4,
+    gamma = 0.99,
+    gae_lambda = 0.98,
+    ent_coef = 0.01,
+    verbose=1)
 ```
+## Train Time
+Trained for 3 000 000 timesteps. Training took 1 hour and 8 minutes on Nvidia RTX A2000 Laptop.
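
The added hyperparameter block references `PPO` and `env` without defining them, so the following is a minimal sketch of how the run might be reproduced, together with the rollout and gradient-step counts those settings imply. Several things here are assumptions rather than facts from the README: the number of parallel environments (`n_envs`, defaulted to 1), the use of `make_vec_env` to build the environment, the save path, and the helper names `training_budget` and `train`. The throughput figure is simply 3 000 000 timesteps divided by the reported 1 hour 8 minutes.

```python
"""Sketch: reproduce the training run described in the README diff.

Assumptions (not stated in the README): the number of environments,
the save path, and that gymnasium[box2d] and stable-baselines3 are installed.
"""

# Hyperparameters copied from the README; `env` is supplied at construction time.
HYPERPARAMS = dict(
    policy="MlpPolicy",
    n_steps=1024,
    batch_size=64,
    n_epochs=4,
    gamma=0.99,
    gae_lambda=0.98,
    ent_coef=0.01,
    verbose=1,
)
TOTAL_TIMESTEPS = 3_000_000  # from the "Train Time" section
# Reported wall-clock time: 1 hour 8 minutes = 68 * 60 seconds.
STEPS_PER_SECOND = TOTAL_TIMESTEPS / (68 * 60)


def training_budget(n_envs: int = 1) -> dict:
    """Derive rollout and gradient-step counts implied by the settings.

    `n_envs` is an assumption; the README does not say how many
    environments were run in parallel.
    """
    rollout_size = HYPERPARAMS["n_steps"] * n_envs  # transitions per rollout
    minibatches = rollout_size // HYPERPARAMS["batch_size"]
    # Rollouts are collected until total_timesteps is reached, so round up.
    rollouts = -(-TOTAL_TIMESTEPS // rollout_size)
    return {
        "rollout_size": rollout_size,
        "rollouts": rollouts,
        "gradient_steps": rollouts * minibatches * HYPERPARAMS["n_epochs"],
    }


def train(n_envs: int = 1, save_path: str = "ppo-BipedalWalker-v3"):
    """Train and save a PPO agent (hypothetical helper, not from the README)."""
    # Imported lazily so the arithmetic above works without the RL stack.
    from stable_baselines3 import PPO
    from stable_baselines3.common.env_util import make_vec_env

    env = make_vec_env("BipedalWalker-v3", n_envs=n_envs)
    model = PPO(env=env, **HYPERPARAMS)
    model.learn(total_timesteps=TOTAL_TIMESTEPS)
    model.save(save_path)
    return model


if __name__ == "__main__":
    print(training_budget())
    print(f"{STEPS_PER_SECOND:.0f} timesteps per second")
```

Calling `train()` starts the full run; `training_budget()` only does the arithmetic and needs no RL dependencies. If the original run used several environments, pass the same `n_envs` to both functions so the counts match.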