Neural Network Always Produces Same/Similar Outputs for Any Input

猫巷女王i 2020-12-13 04:24

I have a problem where I am trying to create a neural network for Tic-Tac-Toe. However, for some reason, training the neural network causes it to produce nearly the same output for any input.

11 answers
  • 2020-12-13 04:42

    I know this comes far too late for the original post, but maybe I can help someone with it, as I faced the same problem.

    For me the problem was that my input data had missing values in important columns, values which were not missing in the training/test data. I replaced these missing values with zeros and voilà, suddenly the results were plausible. So check your data; maybe it is misrepresented.
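    A minimal sketch of what that check can look like, assuming the inputs live in a pandas DataFrame (the file name and the zero fill value are just placeholders):

    ```python
    import pandas as pd

    df = pd.read_csv("inputs.csv")   # hypothetical file holding the network's inputs
    print(df.isna().sum())           # how many values are missing in each column
    df = df.fillna(0)                # replace every missing value with zero
    ```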

  • 2020-12-13 04:47

    I was running into the same problem with my model when the number of layers was large. I was using a learning rate of 0.0001. When I lowered the learning rate to 0.0000001, the problem seemed to be solved. I suspect the optimization gets stuck when the learning rate is too high for a network with that many layers.
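    For reference, a minimal sketch of what lowering the rate looks like, assuming a PyTorch-style optimizer (the architecture below is only a placeholder):

    ```python
    import torch.nn as nn
    import torch.optim as optim

    # Placeholder deep model; the exact layers don't matter for this point.
    model = nn.Sequential(nn.Linear(9, 64), nn.ReLU(),
                          nn.Linear(64, 64), nn.ReLU(),
                          nn.Linear(64, 9))

    # optimizer = optim.Adam(model.parameters(), lr=1e-4)  # original, higher rate
    optimizer = optim.Adam(model.parameters(), lr=1e-7)    # much lower rate
    ```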

  • 2020-12-13 04:49

    I've had similar problems, but was able to solve by changing these:

    • Scale the problem down to a manageable size. I first tried too many inputs with too many hidden-layer units. Once I scaled the problem down, I could see whether the solution to the smaller problem was working. This also helps because, at the smaller scale, the time to compute the weights drops significantly, so I can try many different things without waiting.
    • Make sure you have enough hidden units. This was a major problem for me. I had about 900 inputs connecting to ~10 units in the hidden layer. That was far too small to converge quickly, but training also became very slow when I added more units. Scaling down the number of inputs helped a lot.
    • Change the activation function and its parameters. I was using tanh at first. I tried other functions: sigmoid, normalized sigmoid, Gaussian, etc. I also found that changing the function parameters to make the curves steeper or shallower affected how quickly the network converged.
    • Change the learning algorithm's parameters. Try different learning rates (0.01 to 0.9). Also try different momentum parameters, if your algorithm supports them (0.1 to 0.9); see the sketch after this list.
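    As a rough illustration of the last points, here is a minimal sketch (assuming PyTorch; the 9-input/9-output shape and the random data are placeholders) that sweeps hidden-layer size, activation function, learning rate and momentum:

    ```python
    import torch
    import torch.nn as nn

    def train(hidden, act, lr, momentum, epochs=500):
        # Tiny placeholder problem: 9 inputs / 9 outputs (e.g. a Tic-Tac-Toe board);
        # the random data is only here to make the sketch runnable.
        X, y = torch.rand(200, 9), torch.rand(200, 9)
        model = nn.Sequential(nn.Linear(9, hidden), act, nn.Linear(hidden, 9))
        opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=momentum)
        loss_fn = nn.MSELoss()
        for _ in range(epochs):
            opt.zero_grad()
            loss = loss_fn(model(X), y)
            loss.backward()
            opt.step()
        return loss.item()

    # Sweep a few combinations and compare the final losses.
    for hidden in (10, 50):
        for act in (nn.Tanh(), nn.Sigmoid()):
            for lr in (0.01, 0.1, 0.9):
                for momentum in (0.1, 0.9):
                    print(hidden, act, lr, momentum, train(hidden, act, lr, momentum))
    ```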

    Hope this helps those who find this thread on Google!

  • 2020-12-13 04:54

    It's hard to tell without seeing a code sample, but this can happen to a net because of its number of hidden neurons. As the number of neurons and hidden layers increases, it becomes impossible to train the net on a small set of training data. As long as a net with fewer layers and neurons can solve the problem, it is a mistake to use a larger one. So perhaps your problem can be solved by paying attention to these matters.

  • 2020-12-13 04:56

    I've had similar problems with machine learning algorithms, and when I looked at the code I found random number generators that were not really random. If you do not use a new random seed (such as Unix time, for example; see http://en.wikipedia.org/wiki/Unix_time), then it is possible to get the exact same results over and over again.
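    A minimal sketch of that reseeding, assuming the weights are drawn with Python's built-in random module (the layer sizes are just an example):

    ```python
    import random
    import time

    # Seed with the current Unix time so each run starts from different weights.
    random.seed(int(time.time()))

    # Hypothetical initial weights for a 9-input, 18-hidden-unit layer.
    weights = [random.uniform(-0.5, 0.5) for _ in range(9 * 18)]
    print(weights[:5])
    ```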
