Linear vs nonlinear neural network?

前端未结

关注

 7  1370

余生分开走 2021-01-31 17:26

I\'m new to machine learning and neural networks. I know how to build a nonlinear classification model, but my current problem has a continuous output. I\'ve been searching for

7条回答

渐次进展 (楼主)

2021-01-31 17:43

When it comes to nonlinear regression, this is referring to how the weights affect the output. If a function is not linear with respect to the weights, then your problem is a nonlinear regression problem. So for example, let's look at a Feedforward Neural Network with one hidden layer where the activation functions in the hidden layer are some function $g(z)$ and the output layer has linear activation functions. Given this, the mathematical representation can be:

$y = W_2 g(W_1 x + b_1) + b_2$

where we assume $g(z)$ can operator on scalars and vectors with this notation to make it easy. $W_1$ , $W_2$ , $b_1$ , and $b_2$ are the weight you are aiming to estimate with the regression. If this was linear regression, $g(z)$ would equal z, because that would make y linearly dependent on $W_1$ & $b_1$ . But if $g(z)$ is nonlinear, say like $g(z) = z^2$ , then now y is nonlinearly dependent on the weights $W_2$ .

Now provided you understand all that, I am surprised you haven't seen discussion of the nonlinear case because that's pretty much all people talk about in textbooks and research. The use of things like stochastic gradient descent, Nonlinear Conjugate Gradient, RProp, and other methods are to help find local minima (and hopefully good local minima) for these nonlinear regression problems, even though a global optimum is not typically guaranteed.

0 讨论(0)

查看其它7个回答
发布评论:

提交评论
- 加载中...