I am trying to plot how the first layer's kernels change as the number of training batches increases. However, I get the same plot after training on 10 batches as after training on 30000 batches.