Trying to Make Deep RL Work on a Small Gridworld
This project started as a practical follow-up to an introductory deep reinforcement learning course. The goal was simple to state and harder to make work: train deep RL agents to solve a small stochastic pickup-and-delivery task better than a hand-written greedy baseline. The code is available on GitHub: github.com/jclotten/deep-rl-gridworld-benchmark. Why This Project Small gridworlds are often used to explain reinforcement learning because the rules are easy to understand. That does not mean they are automatically easy for deep RL. ...