Counter ngram with tm package in R
问题 I created a script for the frequency of words in a document using the object and a dictionary documentTermMatrix in R. The script works on individual words and not on the compound word es. "foo" "bar" "foo bar" This is the code require(tm) my.docs <- c("foo bar word1 word2") myCorpus <- Corpus(VectorSource(my.docs)) inspect(DocumentTermMatrix(myCorpus,list(dictionary = c("foo","bar","foo bar")))) But the result is Terms Docs bar foo foo bar 1 1 1 0 I would have to find one "foo bar" = 1 How