Why tf.GradientTape() has less GPU memory usage when watch model variables manually?

前端 未结 0 1956
轮回少年
轮回少年 2021-01-01 03:20

So when I use tf.GradientTape() to automatically monitor the trainable variables in a resnet model, the computer threw an out of memory error. Below is the code:



        
相关标签:
回答
  • 消灭零回复
提交回复
热议问题