How to get text extraction from PDF to work?

前端 未结 2 2008
长发绾君心
长发绾君心 2020-12-12 00:53

I need to extract the text from a PDFs in Romanian language. The symbols: ȚțȘșĂăÎîÂâ are not extracted correctly with pdfBox or Snowtide.

Here is a sample file that

2条回答
  •  一生所求
    2020-12-12 01:02

    How about iText: http://itextpdf.com/

    "iText® is an open source library that allows you to create and manipulate PDF documents. It enables developers looking to enhance web- and other applications with dynamic PDF document generation and/or manipulation."

提交回复
热议问题