I'm trying to use the python-docx module (pip install python-docx), but it seems very confusing, as in the GitHub repo test sample they are using
python-docx
pip install python-docx
You can try this:
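import docx

def getText(filename):
    # Open the .docx file and collect the text of every paragraph
    doc = docx.Document(filename)
    fullText = []
    for para in doc.paragraphs:
        fullText.append(para.text)
    # Join the paragraphs with newlines into a single string
    return '\n'.join(fullText)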