MLP agent that lands a rocket on the moon, trained by deep reinforcement learning 945f43c verified ChihoonLee3 committed on Jan 28