Been solving a minimal likelihood problem in order to learn a reward function for an inverse reinforcement learning model. I've built it in NumPy and managed to minimise th