How do I mask output in transformer model?

前端 未结 0 817
你的背包
你的背包 2020-12-21 08:16

I am applying the transformer model and I apply padding_mask + look_a_head_mask to the attention layer. But the masks are not propagated to outputs. Is there any way to appl

相关标签:
回答
  • 消灭零回复
提交回复
热议问题