I am trying to implement a U-Net in Keras with a TensorFlow backend for an image segmentation task. I have images of size (128, 96) as input to the network, together with mask images.
In my experience with U-Nets for segmentation, the model does tend to behave this way.
I also use the "train on just one image" trick to confirm that the network can converge at all; once it does, adding the other images works fine.
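The single-image check can be sketched framework-agnostically. Below is a minimal numpy version of the idea: fit a tiny logistic model on exactly one sample and verify the loss can be driven near zero (the model, learning rate, and sample here are illustrative assumptions, not the U-Net itself):

```python
import numpy as np

# One "image" flattened to a feature vector, with a single binary target,
# standing in for one image/mask pair.
rng = np.random.default_rng(0)
x = rng.normal(size=8)   # the one training sample
y = 1.0                  # its target

w = np.zeros(8)          # weights of a tiny logistic model
lr = 0.5                 # learning rate (illustrative)

for step in range(500):
    p = 1.0 / (1.0 + np.exp(-w @ x))  # sigmoid prediction
    grad = (p - y) * x                # gradient of binary cross-entropy
    w -= lr * grad                    # plain gradient descent

# If the training loop is wired correctly, a single sample should be
# fit almost perfectly; if even this fails, look for a pipeline bug.
print(p)
```

If the real network cannot overfit one image, the problem is usually in the data pipeline or the loss, not in the amount of data.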
But I had to try many times, and the only time it converged quickly was when I used:
But I wasn't using "relu" anywhere... perhaps that influences the convergence speed a little? Thinking about "relu": it outputs zero for every negative input, and its gradient is exactly zero in that whole region. Maybe having lots of "relu" activations creates many "flat" areas with no gradient signal, so units stuck in the negative regime stop learning? (Must think about it more to confirm.)
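The "flat area" intuition above is easy to check numerically: the ReLU derivative is 1 for positive inputs and 0 elsewhere, so any unit whose pre-activation is negative contributes no gradient. A small numpy sketch (the negatively biased distribution is an assumption, chosen to exaggerate the effect):

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def relu_grad(z):
    # Derivative of ReLU: 1 for positive inputs, 0 everywhere else.
    return (z > 0).astype(float)

# Simulate pre-activations of a layer whose units drifted negative.
rng = np.random.default_rng(1)
pre_activations = rng.normal(loc=-1.0, scale=1.0, size=10_000)

dead_fraction = np.mean(relu_grad(pre_activations) == 0.0)
print(f"fraction of units with zero gradient: {dead_fraction:.2f}")
```

With a mean of -1, most units receive zero gradient, which is the "dying ReLU" effect; activations like LeakyReLU or ELU keep a small gradient on the negative side.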
Try a few times (and have the patience to wait many, many epochs) with different weight initializations.
There is also a chance that your learning rate is too large.
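The effect of a too-large learning rate can be seen on even the simplest objective. A toy numpy sketch (the quadratic loss and step sizes are illustrative assumptions): gradient descent on f(w) = w² converges for a small step but oscillates and blows up once the step exceeds the stable range.

```python
def descend(lr, steps=20, w0=1.0):
    # Gradient descent on f(w) = w**2, whose gradient is 2*w.
    w = w0
    for _ in range(steps):
        w -= lr * 2 * w
    return w

print(abs(descend(lr=0.1)))  # small step: |w| shrinks toward 0
print(abs(descend(lr=1.1)))  # too large: |w| grows every step
```

The same thing happens in a deep network, just less visibly: the loss jumps around or climbs instead of decreasing, which is why lowering the learning rate is worth trying early.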
About to_categorical(): have you tried plotting/printing your masks? Do they really look like what you expect?
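A quick way to check the masks is to build the one-hot encoding by hand and verify it round-trips. The numpy expression below is equivalent to what keras.utils.to_categorical does on a class-index mask (the tiny 4x4 mask is a made-up example):

```python
import numpy as np

# A tiny 4x4 "mask" with class indices 0 (background) and 1 (foreground).
mask = np.array([[0, 0, 1, 1],
                 [0, 1, 1, 0],
                 [0, 0, 0, 1],
                 [1, 1, 0, 0]])

num_classes = 2
# Equivalent of keras.utils.to_categorical applied per pixel:
one_hot = np.eye(num_classes)[mask]

print(one_hot.shape)     # (4, 4, 2): one channel per class
print(np.unique(mask))   # [0 1]: only valid class indices

# Sanity check: argmax over the class axis must reproduce the mask.
print(np.array_equal(one_hot.argmax(axis=-1), mask))  # True
```

If np.unique on your real masks shows values other than the expected class indices (e.g. 0 and 255 instead of 0 and 1), to_categorical will silently produce far more channels than you intend.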