How can I read and process contents of every cell of a table in a DOCX file?
I am using Python 3.2 on Windows 7 and PyWin32 to access the MS-Word Document.
Jumping in rather late, but thought I'd put this out anyway: now (2015), you can use the pretty neat python-docx library: https://python-docx.readthedocs.org/en/latest/. And then:
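    from docx import Document

    # Pass the path to your .docx file here
    wordDoc = Document('')

    # Walk every table, then every row, then every cell
    for table in wordDoc.tables:
        for row in table.rows:
            for cell in row.cells:
                print(cell.text)  # print() is a function in Python 3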
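
If you want to process the cell contents rather than just print them, one option is to collect the text into nested lists first. This is only a rough sketch: the helper name `extract_tables` is my own (it is not part of python-docx), and it relies solely on the `tables`, `rows`, `cells` and `text` attributes used above.

```python
def extract_tables(document):
    """Return cell text as nested lists: tables -> rows -> cells.

    `document` is assumed to be a python-docx Document, or any object
    that exposes the same .tables / .rows / .cells / .text attributes.
    """
    return [
        [[cell.text.strip() for cell in row.cells] for row in table.rows]
        for table in document.tables
    ]


# Usage with python-docx ("report.docx" is a placeholder path):
#     from docx import Document
#     data = extract_tables(Document("report.docx"))
#     for row in data[0]:   # rows of the first table
#         print(row)
```

Each inner list is one row, so `data[0][2][1]` would be the second cell of the third row of the first table, assuming the document has that many.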