Tesseract: How to run tesseract with multiple languages at once

Question

I have to analyze an image that contains both English and Japanese text. When I run tesseract with the default language (eng), some Japanese characters are lost. Conversely, if I run tesseract with Japanese (-l jpn), some English characters are lost (e.g. Email). How can I run a single process that recognizes both English and Japanese characters? Thanks.

Answer 1:

Since tesseract 3.02 it is possible to specify multiple languages for the -l parameter.

-l lang The language to use. If none is specified, English is assumed. Multiple languages may be specified, separated by plus characters. Tesseract uses 3-character ISO 639-2 language codes.

An example:

tesseract myscan.png out -l deu+eng
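
For the English and Japanese case in the question, the same approach might look like the sketch below. The helper join_langs is a hypothetical name introduced here for illustration, not part of tesseract; it only joins language codes with plus characters to form the -l argument. The sketch assumes the eng and jpn language data files are installed and reuses myscan.png from the example above as a placeholder input.

```shell
#!/bin/sh
# Hypothetical helper (not part of tesseract): join language codes with '+'.
# The body runs in a subshell, so the IFS change does not leak to the caller.
join_langs() (
    IFS='+'
    printf '%s\n' "$*"
)

lang_arg=$(join_langs eng jpn)   # yields "eng+jpn"

# Run OCR only when tesseract and the input image are actually present.
# Assumes the eng and jpn traineddata files are installed.
if command -v tesseract >/dev/null 2>&1 && [ -f myscan.png ]; then
    tesseract myscan.png out -l "$lang_arg"
fi
```

Typing tesseract myscan.png out -l eng+jpn directly is equivalent. The first language listed is treated as the primary one, so if results look poor it may be worth trying jpn+eng as well.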

Source: https://stackoverflow.com/questions/24379781/tesseract-how-to-run-tesseract-with-multiple-languages-one-time

Tags

image-processing

ocr

tesseract
