In a dfm how is it possible to detect in an ngram the same words i.e.
hello_hello, text_text
and remove them from the dfm?