PPO-LunarLander-v2 / results.json
Update of hyperparameters PPO
cbecc85
164 Bytes
{"mean_reward": -8.599161120748613, "std_reward": 83.01900113693753, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-19T13:46:40.338471"}