DQN-LunarLander-v2 / results.json
ThoDum's picture
Using DQN achitecture with 1e6 total_timesteps to improve results
4e92e58
raw
history blame contribute delete
166 Bytes
{"mean_reward": -123.02195335792821, "std_reward": 62.233437684796236, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-16T16:26:52.909848"}