I am playing some demos about recurrent neural network.
I noticed that the scale of my data in each column differs a lot. So I am considering to do some preprocess work
I found this https://arxiv.org/abs/1510.01378 If you normalize it may improve convergence so you will get lower training times.