Python: convert Microsoft Office docs to plain text on Linux

庸人自扰 2020-12-06 05:49

Any recommendations on a method to convert .doc, .ppt, and .xls files to plain text on Linux using Python? Really, any method of conversion would be useful. I have already looked at

7 answers
  •  囚心锁ツ
    2020-12-06 06:46

    The usual tool for converting Microsoft Office documents to HTML or other formats was mswordview, which has since been renamed to wvWare.

    If you're looking for a command-line tool, the wvWare maintainers actually recommend using AbiWord to perform the conversion:

    abiword --to=txt document.doc
    

    If you're looking for a library, start on the wvWare overview page. They also maintain a list of libraries and tools which read MS Office documents.
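
Since the question asks for Python, the AbiWord command above can be driven through `subprocess`. The following is only a minimal sketch: the function names `build_command` and `convert_to_text` are made up for illustration, the binary is assumed to be on `PATH` as `abiword`, and the output is assumed to be written next to the input with a `.txt` extension. AbiWord is a word processor, so this probably only helps with .doc files, not .ppt or .xls.

```python
from __future__ import annotations

import shutil
import subprocess
from pathlib import Path


def build_command(source: Path, binary: str = "abiword") -> list[str]:
    """Return the argument list for the AbiWord conversion from the answer."""
    return [binary, "--to=txt", str(source)]


def convert_to_text(source: Path, binary: str = "abiword") -> str:
    """Convert one document with AbiWord and return the resulting text.

    Assumption: AbiWord writes <name>.txt in the same directory as the input.
    """
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary} not found on PATH")
    # check=True raises CalledProcessError if the conversion fails.
    subprocess.run(build_command(source, binary), check=True)
    return source.with_suffix(".txt").read_text(encoding="utf-8", errors="replace")


if __name__ == "__main__":
    import sys

    # Convert every file named on the command line and print its text.
    for name in sys.argv[1:]:
        print(convert_to_text(Path(name)))
```

Keeping the command construction in its own function makes it easy to check the arguments without AbiWord installed, and to swap in a different converter later.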
