Tesseract doesn't seem to work with digits

Submitted by 大兔子大兔子 on 2020-01-13 15:46:33

Question


I followed the FAQ to make Tesseract recognize only digits, but the output file still contains a bunch of text, even though my image has nothing but numbers in it.

My command line looks like this:

tesseract --tessdata-dir ./ ./input.jpg ./output/output digits

Any ideas what could be happening?


Answer 1:


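As mentioned in a Tesseract GitHub issue, you can't blacklist or whitelist characters with the Tesseract 4.0 LSTM engine. The digits config relies on a character whitelist, which would explain why it has no effect here. Instead, you should train the LSTM model on the characters you expect in your image.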

Thanks to Shreeshrii, you can try his 'experimental' digits traineddata from here

Please note that Tesseract 4.0 is still in the alpha stage. If you prefer, you can still use the 3.* versions of Tesseract, which support what you need out of the box. The Tesseract v 3.4 tessdata is located here, and the library for Windows can be downloaded from here
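
For the 3.* route, here is a minimal sketch of what a digits-only call might look like. It assumes a 3.x install where the tessedit_char_whitelist variable is honoured and can be set with the -c option. The digits_cmd helper is a hypothetical name used only for illustration, and the paths are the ones from the question. It only prints the command, so you can inspect it before running it yourself.

```shell
#!/bin/sh
# Dry-run helper (hypothetical, not part of Tesseract): prints the command
# instead of executing it.
# Assumption: Tesseract 3.x, where tessedit_char_whitelist is honoured.
digits_cmd() {
    # $1 = input image, $2 = output base name (Tesseract appends .txt)
    # Note: paths containing spaces would need extra quoting.
    printf 'tesseract %s %s -c tessedit_char_whitelist=0123456789\n' "$1" "$2"
}

# Paths taken from the question.
digits_cmd ./input.jpg ./output/output
```

If your traineddata lives in a non-default location, you would presumably still need the --tessdata-dir option from the original command.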



Source: https://stackoverflow.com/questions/48210518/tesseract-doesnt-seem-to-work-with-digits
