I want to build an NER model with custom train data. But after tokenizing I got the data in the following format. Id Text
21 [lymph, s