Why does the first LSTM in a Keras model have more params than the subsequent one?
问题 I was just looking at the Keras model details from a fairly straightforward sequential model where I have multiple LSTM layers, one after another. I was surprised to see that the first layer always has more params despite having the same definition as the subsequent LSTM layer. The model definition here shows it clearly: Layer (type) Output Shape Param # ================================================================= lstm_1 (LSTM) (None, 400, 5) 380 _________________________________________