I'm new to NLTK. I'm getting this error, and I've searched around for encoding/decoding and specifically the UnicodeDecodeError, but this error seems specific to the NLTK s
Try using the "textclean" module to clean the text before tagging it. Install it from the shell (not the Python prompt):
$ pip install textclean
Python code:
from nltk import pos_tag, word_tokenize
from textclean.textclean import textclean

# Clean the text first, then tokenize and tag it
text = textclean.clean("John's big idea isn't all that bad.")
print(pos_tag(word_tokenize(text)))
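If installing an extra package isn't an option, a rough standard-library stand-in is to decode the input explicitly and reduce it to ASCII before tokenizing. This is only a sketch under assumptions: `to_ascii` and `QUOTE_MAP` are hypothetical names, the input is assumed to be UTF-8, and this does not describe what textclean does internally.

```python
import unicodedata

# Hypothetical helper table: map curly quotes to their ASCII equivalents
# so that contractions such as "isn't" keep their apostrophe.
QUOTE_MAP = {0x2018: "'", 0x2019: "'", 0x201C: '"', 0x201D: '"'}


def to_ascii(raw, encoding="utf-8"):
    """Decode bytes leniently, then drop anything that has no ASCII form."""
    if isinstance(raw, bytes):
        # errors="replace" avoids UnicodeDecodeError on invalid byte sequences
        text = raw.decode(encoding, errors="replace")
    else:
        text = raw
    text = text.translate(QUOTE_MAP)
    # NFKD splits accented letters into a base letter plus a combining mark
    normalized = unicodedata.normalize("NFKD", text)
    # Encoding with "ignore" discards the combining marks and other non-ASCII
    return normalized.encode("ascii", "ignore").decode("ascii")


sample = b"John\xe2\x80\x99s caf\xc3\xa9 idea isn\xe2\x80\x99t all that bad."
cleaned = to_ascii(sample)
print(cleaned)
```

The cleaned string can then be passed to word_tokenize and pos_tag as in the snippet above. Note that this approach throws away characters it cannot map, so it only suits text where losing accents and symbols is acceptable.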