How to preserve document structure in tesseract

后端 未结 5 1507
南方客
南方客 2020-12-11 00:40

I am using tesseract ocr to extract text from an image. Preserving the structure of the document is very important to me. Currently tesseract does not preserve the structure

5条回答
  •  孤街浪徒
    2020-12-11 01:15

    Tesseract code compresses spaces in output. You will need to change the code to preserve them. See Tesseract - ambiguity in space and tab post.

提交回复
热议问题