I would like to include the gradient information of the output of the neural network with respect to the input, in the loss function of the model, so I can use it to guide t