Deep-Learning Nan loss reasons

后端 未结 9 2178
执念已碎
执念已碎 2020-11-28 02:12

Perhaps too general a question, but can anyone explain what would cause a Convolutional Neural Network to diverge?

Specifics:

I am using Tensorflow\'s iris_tra

9条回答
  •  暗喜
    暗喜 (楼主)
    2020-11-28 02:34

    I'd like to plug in some (shallow) reasons I have experienced as follows:

    1. we may have updated our dictionary(for NLP tasks) but the model and the prepared data used a different one.
    2. we may have reprocessed our data(binary tf_record) but we loaded the old model. The reprocessed data may conflict with the previous one.
    3. we may should train the model from scratch but we forgot to delete the checkpoints and the model loaded the latest parameters automatically.

    Hope that helps.

提交回复
热议问题