The standard is float32, but I'm wondering under what conditions it's OK to use float16?
I've compared running the same convnet with both datatypes and haven't noticed any difference.
float16 training is tricky: your model might not converge when using plain float16. That said, float16 does save memory, and it is also faster on the latest Volta GPUs. Nvidia recommends "Mixed Precision Training" in its recent documentation and paper.
To make good use of float16, you need to choose the loss_scale manually and carefully. If loss_scale is too large, you may get NaNs and Infs; if it is too small, the model might not converge. Unfortunately, there is no single loss_scale that works for all models, so you have to tune it for your specific model.
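The problem loss scaling solves can be shown without any framework at all. Here is a minimal numpy sketch (not TensorFlow API; the variable names are mine): gradients smaller than float16's tiniest subnormal (about 6e-8) flush to zero, but multiplying the loss, and hence every gradient, by loss_scale keeps them representable, and dividing by loss_scale afterwards in float32 recovers the true value.

```python
import numpy as np

true_grad = 1e-8       # a "true" gradient, too small for float16
loss_scale = 1024.0    # hypothetical scale factor, tuned per model

# Without scaling: the gradient underflows to exactly 0.0 in float16.
naive = np.float16(true_grad)

# With scaling: 1e-8 * 1024 = 1.024e-5 is representable in float16.
scaled = np.float16(true_grad * loss_scale)

# Unscale in float32 to recover (approximately) the true gradient.
recovered = np.float32(scaled) / loss_scale

print(naive)      # 0.0  -- the update is silently lost
print(recovered)  # close to 1e-8
```

This is also why a too-large loss_scale produces Infs: the scaled gradients can overflow float16's maximum of about 65504 instead of underflowing.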
If you just want to reduce memory usage, you could also try tf.to_bfloat16, which might converge better.
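The reason bfloat16 tends to converge more easily is that it keeps float32's exponent range and only gives up mantissa precision. Since numpy has no native bfloat16, here is a rough emulation (truncating the low 16 mantissa bits of a float32, i.e. rounding toward zero) to illustrate the range difference; the helper name is mine:

```python
import numpy as np

def to_bfloat16(x):
    """Truncate float32 values to bfloat16 precision (round toward zero)."""
    bits = np.asarray(x, dtype=np.float32).view(np.uint32)
    return (bits & np.uint32(0xFFFF0000)).view(np.float32)

big = np.float32(1e30)
print(np.float16(big))   # inf -- exceeds float16's max of ~65504
print(to_bfloat16(big))  # finite, within ~0.4% of 1e30
```

So with bfloat16 there is no overflow to worry about and typically no need for loss scaling, at the cost of fewer bits of precision per value.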