Pytorch transformer forward function masks implementation for decoder forward function

后端 未结 0 1167
甜味超标
甜味超标 2021-01-06 08:16

I am trying to use and learn PyTorch Transformer with DeepMind math dataset. I have tokenized (char not word) sequence that is fed into model. Models forward function is doi

相关标签:
回答
  • 消灭零回复
提交回复
热议问题