For some coursework I am developing a maze generator that uses Q-Learning to solve it. I have used Sentdex\'s tutorial: https://youtu.be/G92TF4xYQcU, to implement the actual