I'm trying to use the tm package in R to perform some text analysis. I tried the following:
require(tm)
dataSet <- Corpus(DirSource('tmp/'))
dataSet <
I have been running this on a Mac and, to my frustration, I had to identify the offending record (these were tweets) to resolve the problem. Since there is no guarantee that the same record will be the culprit next time, I used the following function
tm_map(yourCorpus, function(x) iconv(x, to='UTF-8-MAC', sub='byte'))
as suggested above.
It worked like a charm.
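For anyone who wants to try this outside of a Mac, here is a rough sketch of the same idea. A few things in it are my own assumptions and not part of the original suggestion: the helper name `fix_encoding`, the two sample tweets, and the fallback to plain `'UTF-8'` when not on macOS (as far as I know, `'UTF-8-MAC'` is only recognised by the macOS iconv). Also, in more recent tm releases (0.6 and later, I believe) a plain function passed to `tm_map` should be wrapped in `content_transformer()`, so the sketch does that.

```r
# Assumption: 'UTF-8-MAC' only exists on macOS, so fall back to 'UTF-8' elsewhere.
target_enc <- if (Sys.info()[["sysname"]] == "Darwin") "UTF-8-MAC" else "UTF-8"

# Hypothetical helper: same iconv() call as above, with the target made portable.
# sub = 'byte' replaces each byte that cannot be converted with its hex code
# in angle brackets, instead of turning the whole string into NA.
fix_encoding <- function(x) iconv(x, to = target_enc, sub = "byte")

# Made-up sample data: the second "tweet" contains a stray 0xE9 byte.
tweets <- c("plain ascii tweet", "caf\xe9 tweet")

# Only run the tm part if the package is installed.
if (requireNamespace("tm", quietly = TRUE)) {
  corpus  <- tm::VCorpus(tm::VectorSource(tweets))
  # content_transformer() lets tm_map apply a character-level function
  # while keeping each element a proper text document.
  cleaned <- tm::tm_map(corpus, tm::content_transformer(fix_encoding))
}
```

With this in place the bad record no longer has to be hunted down by hand: whichever tweet carries the stray bytes gets them replaced, and the rest are left alone.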