I have 3k text data where I wanted to categorize them by different themes
Currently, my script separates the data by each word (data_words) however I