I'm using DistilBERT in an nn.Module class and a PyTorch Dataset from P3 for dataloaders.
The input to the BERT model, outputs = self.bert(**input), is a dictionary of i