I extracted tweets from twitter using the twitteR package and saved them into a text file.
I have carried out the following on the corpus
xx<-tm
To remove the URLs you could try the following:
removeURL <- function(x) gsub("http[[:alnum:]]*", "", x) xx <- tm_map(xx, removeURL)
Possibly you could define similar functions to further transform the text.