In this project, you will implement value iteration and Q-learning. You will test your agents first on Gridworld (from class), then apply them to a simulated robot controller (Crawler) and Pacman.
You can run the code with: python2 gridworld.py -k 20 -a q -l 1