How do I segment a document using Tesseract and then output the resulting bounding boxes and labels?

忘了有多久 2020-12-07 10:25

I'm trying to get Tesseract to output a file with labelled bounding boxes that result from page segmentation (pre-OCR). I know it must be capable of doing this 'out of the

6 answers
  •  眼角桃花
    2020-12-07 11:18

    Shortcut

    It is also possible to open hOCR files directly with the PageViewer tool. However, the file extension has to be .xml.
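If you would rather read the boxes programmatically than in a viewer, here is a minimal sketch in Python. It assumes the hOCR file came from something like `tesseract page.png out hocr`, and that each segmented element carries a `class` such as `ocr_carea`, `ocr_par`, `ocr_line` or `ocrx_word` plus a `title` containing `bbox x0 y0 x1 y1`. The names `extract_boxes`, `copy_as_xml` and the `SAMPLE` markup are made up for illustration and are not part of Tesseract or PageViewer.

```python
import re
import shutil
from html.parser import HTMLParser
from pathlib import Path

# hOCR keeps geometry in the title attribute, e.g. title="bbox 36 92 580 122; ..."
BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


class HocrBoxes(HTMLParser):
    """Collect (label, (x0, y0, x1, y1)) for every hOCR element that has a bbox."""

    def __init__(self):
        super().__init__()
        self.boxes = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        label = attrs.get("class") or ""
        match = BBOX_RE.search(attrs.get("title") or "")
        # Only keep elements whose class looks like an hOCR label (ocr_*, ocrx_*).
        if label.startswith("ocr") and match:
            self.boxes.append((label, tuple(int(v) for v in match.groups())))


def extract_boxes(hocr_text):
    """Return all labelled bounding boxes found in an hOCR document string."""
    parser = HocrBoxes()
    parser.feed(hocr_text)
    parser.close()
    return parser.boxes


def copy_as_xml(hocr_path):
    """Copy an hOCR file next to itself with a .xml suffix (hypothetical helper)."""
    target = Path(hocr_path).with_suffix(".xml")
    shutil.copyfile(hocr_path, target)
    return target


# Hand-written hOCR fragment for illustration only, not real Tesseract output.
SAMPLE = """<div class='ocr_page' title='bbox 0 0 800 600; ppageno 0'>
 <div class='ocr_carea' title="bbox 36 92 580 150">
  <p class='ocr_par' title="bbox 36 92 580 150">
   <span class='ocr_line' title="bbox 36 92 580 122; baseline 0 -5">
    <span class='ocrx_word' title='bbox 36 92 96 122; x_wconf 95'>Hello</span>
   </span>
  </p>
 </div>
</div>"""

if __name__ == "__main__":
    for label, box in extract_boxes(SAMPLE):
        print(label, *box)
```

For a real file, `extract_boxes(Path("out.hocr").read_text(encoding="utf-8"))` should give one entry per page, block, paragraph, line and word, and `copy_as_xml("out.hocr")` writes the `.xml` copy that the answer says PageViewer expects.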
