Use multiple softmax in transformers output layer and calculate loss

前端 未结 0 1323
佛祖请我去吃肉
佛祖请我去吃肉 2020-12-23 01:34

Can I use multiple softmax in the last output layer in transformers? If so, how can I calculate loss from that. I am working in pytorch.

And I am asking because my da

相关标签:
回答
  • 消灭零回复
提交回复
热议问题