I am using python-tesseract to extract words from an image. This is a python wrapper for tesseract which is an OCR code.
I am using the following code for getting th
tesseract.GetBoxText()
method returns the exact position of each character in an array.
Besides, there is a command line option tesseract test.jpg result hocr
that will generate a result.html
file with each recognized word's coordinates in it. But I'm not sure whether it can be called through python script.