Number of pages of a word document with Python

后端 未结 2 1645

Is there a way to get efficiently the number of pages of a word document (.doc, .docx) with Python ?

And for an .odt file ?

I want to use this for a web appl

相关标签:
2条回答
  • 2020-12-19 23:58

    Only for those who search for this blog entry....

    from win32com.client import Dispatch
    #open Word
    word = Dispatch('Word.Application')
    word.Visible = False
    word = word.Documents.Open(doc_path)
    
    #get number of sheets
    word.Repaginate()
    num_of_sheets = word.ComputeStatistics(2)
    
    0 讨论(0)
  • 2020-12-20 00:05

    You can read the value

    <Properties>
    <Pages>CountValue</Pages>
    

    from docProps/app.xml in the docx package or

    <office:document-meta>
        <office:meta>
            <meta:document-statistic meta:page-count="CountValue">
    

    form meta.xml in odt package.

    If these values ​​do not exist (they are optional), you have to make a calculation of the entire document, in fact perform rendering, that much more difficult

    0 讨论(0)
提交回复
热议问题