I\'m confused about why the training output is different from evaluating. I printed out the parameters after training and during evaluating to make sure they were the same.