I\'m trying to get Tesseract to output a file with labelled bounding boxes that result from page segmentation (pre OCR). I know it must be capable of doing this \'out of the
Shortcut
It is also possible to open HOCR files directly with the PageViewer tool. The file extension has to be .xml, however.