PDFminer: extract text with its font information

后端 未结 6 1255
伪装坚强ぢ
伪装坚强ぢ 2021-02-08 03:26

I find this question, but it uses command line, and I do not want to call a Python script in command line using subprocess and parse HTML files to get the font information.

6条回答
  •  执笔经年
    2021-02-08 03:47

    Have a look at PDFlib, it can extract font info as you require and has a Python library you can import in your scripts and work with it.

提交回复
热议问题