Getting the bounding box of the recognized words using python-tesseract

前端 未结 7 1136
不思量自难忘°
不思量自难忘° 2020-11-30 19:21

I am using python-tesseract to extract words from an image. This is a python wrapper for tesseract which is an OCR code.

I am using the following code for getting th

7条回答
  •  攒了一身酷
    2020-11-30 20:04

    tesseract.GetBoxText() method returns the exact position of each character in an array.

    Besides, there is a command line option tesseract test.jpg result hocr that will generate a result.html file with each recognized word's coordinates in it. But I'm not sure whether it can be called through python script.

提交回复
热议问题