I used word2vec in deeplearning4j to train word vectors, but those vectors are unstable

一世执手 submitted on 2019-12-06 22:57:32

You have set the following line:

.minLearningRate(1e-2) // learning rate decays wrt # words. floor learning

But that is an extremely high value for the minimum learning rate. With a floor this high, the model never 'settles' into a stable state; instead, a few updates significantly change the learned representation. That is not a problem during the first few updates, but it is bad for convergence.

Solution: Allow the learning rate to decay. You can leave this line out completely or, if you must set it, use a more appropriate value, such as 1e-15.
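
To see why the floor matters, here is a minimal sketch of a linearly decaying learning rate that is clamped at a minimum. It is a simplified model in plain Java, not deeplearning4j's actual implementation; the class name, the method name, and the starting rate of 0.025 are assumptions made for illustration. Under this model, a floor of 1e-2 is reached once 60% of the training is done, so the rate stops shrinking for the remaining 40%, while a tiny floor lets it keep decaying until the end.

```java
public class LearningRateDecay {

    // Hypothetical helper: linear decay from startRate toward zero,
    // clamped so the result never drops below minRate.
    // progress is the fraction of training completed, in [0, 1].
    static double decayedRate(double startRate, double minRate, double progress) {
        return Math.max(minRate, startRate * (1.0 - progress));
    }

    public static void main(String[] args) {
        double startRate = 0.025; // assumed starting learning rate

        // Compare the floor from the question (1e-2) with a negligible one (1e-15).
        for (int step = 0; step <= 10; step++) {
            double progress = step / 10.0;
            System.out.printf("progress=%.1f  floor=1e-2: %.6f  floor=1e-15: %.6f%n",
                    progress,
                    decayedRate(startRate, 1e-2, progress),
                    decayedRate(startRate, 1e-15, progress));
        }
    }
}
```

Running main prints both schedules side by side, which makes it easy to see the point at which the high floor takes over from the decay.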
