Trouble recognizing digits in Tesseract - android

前端 未结 3 1384
面向向阳花
面向向阳花 2021-02-04 15:21

I was hoping someone could tell me why it is my Tesseract has trouble recognizing some images with digits, and if there is something i can do about it. Everything is working acc

3条回答
  •  没有蜡笔的小新
    2021-02-04 16:06

    I know of some options that might help you:

    1. Add extra space between image border and text. Tesseract would work awful if text in the image is positioned at the edge.
    2. Duplicate your image. For example, if you're performing OCR on a word 'foobar', clone the image and send 'foobar foobar foobar foobar foobar' to tesseract, results would be better.
    3. Google for font training and image binarization for tesseract.

    Keep in mind, that built-in camera in mobile devices mostly produce low quality images (blured, noised, skewed etc.) OCR itself is a resource comsuming process and if you add a worthy image preprocessing to that, low-end and mid mobile devices (which are likely to have android) could face unexpectedly slow performance or even lack of resources. That's OK for free/study projects, but if you're planning a commercial app - consider using a better SDK.

    Have a look at this question for details: OCR for android

提交回复
热议问题