How to extract text from a PDF file?

孤城傲影 2020-11-22 14:05

I'm trying to extract the text included in this PDF file using Python.

I'm using the PyPDF2 module, and have the following script:

imp         


        
24 Answers
  •  失恋的感觉
    2020-11-22 14:15

    You can use pdftotext: https://github.com/jalan/pdftotext

    pdftotext preserves the text's layout and indentation, even if the PDF contains tables.
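
A minimal sketch of how this might look, assuming the `pdftotext` package from the repository above is installed (it also needs the poppler libraries on the system). The helper names `join_pages` and `extract_text` are made up for illustration and are not part of the library; only `pdftotext.PDF` comes from the package itself.

```python
def join_pages(pages, separator="\n\n"):
    """Join per-page strings into a single string (hypothetical helper)."""
    return separator.join(pages)


def extract_text(path):
    """Return the text of the PDF at `path`, with pages separated by blank lines.

    Hypothetical wrapper around pdftotext.PDF; `path` is whatever PDF
    you want to read.
    """
    # Imported lazily so this module still loads where pdftotext is not installed.
    import pdftotext

    with open(path, "rb") as f:
        # pdftotext.PDF behaves like a sequence with one string per page.
        pdf = pdftotext.PDF(f)
        return join_pages(pdf)
```

Calling `extract_text("example.pdf")` (the file name is just a placeholder) should give you the whole document as one string. If you only need a single page, you can index the `pdftotext.PDF` object directly instead of joining everything.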
