When I run sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys}) I get InternalError: Blas SGEMM launch failed. Here is the full error and st
I got this error when running distributed TensorFlow. Did you check whether any of the workers were reporting CUDA_OUT_OF_MEMORY errors? If so, it may have to do with where you place your weight and bias variables. For example:
# Pin the variables to the parameter server's CPU instead of a worker GPU.
with tf.device("/job:paramserver/task:0/cpu:0"):
    W = weight_variable([input_units, num_hidden_units])
    b = bias_variable([num_hidden_units])
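
If you are not sure whether any worker ran out of GPU memory, a quick way to check is to scan the worker logs for the out-of-memory marker. Here is a minimal sketch, assuming each worker writes its output to a plain text log file. The function name find_oom_lines and the log file names are hypothetical, and the substring match is deliberately loose because the exact spelling of the error can differ between versions.

```python
from pathlib import Path

# Loose substring match (an assumption): covers spellings such as
# "CUDA_OUT_OF_MEMORY" and "CUDA_ERROR_OUT_OF_MEMORY".
OOM_MARKER = "OUT_OF_MEMORY"


def find_oom_lines(log_paths):
    """Return {path: [matching lines]} for every log that mentions an OOM error."""
    hits = {}
    for path in log_paths:
        # errors="replace" keeps the scan going if a log has odd bytes in it.
        text = Path(path).read_text(errors="replace")
        matches = [line for line in text.splitlines() if OOM_MARKER in line]
        if matches:
            hits[str(path)] = matches
    return hits


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file names): python check_oom.py worker0.log worker1.log
    for path, lines in find_oom_lines(sys.argv[1:]).items():
        print(f"{path}: {len(lines)} out-of-memory line(s)")
        print("  first:", lines[0])
```

If any worker shows up here, that would support the idea that variable placement is the problem rather than the matrix multiply itself.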