pyPdf unable to extract text from some pages in my PDF

后端 未结 6 1102
伪装坚强ぢ
伪装坚强ぢ 2021-01-05 13:07

I\'m trying to use pyPdf to extract and print pages from a multipage PDF. Problem is, text is not extracted from some pages. I\'ve put an example file here:

http://w

6条回答
  •  慢半拍i
    慢半拍i (楼主)
    2021-01-05 13:43

    You could also try the pdfminer library (also in python), and see if it's better at extracting the text. For splitting however, you will have to stick with pyPdf as pdfminer doesn't support that.

提交回复
热议问题