OpenAI Lunar Lander – Solving with Vanilla DQN (aka Reinforcement Learning with Experience Replay)
In order to understand what is all the buzz about DeepMind’s reinforcement learning papers, I decided to implement Deep Reinforcement Learning with: Double Q-learning Experience Replay The neural network was then trained on the OpenAI Lunar Lander environment. I did my best to implement the above in TensorFlow with just the paper published by DeepMind …