Python OCR Module in Linux?

后端 未结 5 1202
暗喜
暗喜 2020-12-24 00:04

I want to find a easy-to-use OCR python module in linux, I have found pytesser http://code.google.com/p/pytesser/, but it contains a .exe executable file.

I tried ch

5条回答
  •  無奈伤痛
    2020-12-24 00:50

    In addition to Blender's answer, that just executs Tesseract executable, I would like to add that there exist other alternatives for OCR that can also be called as external process.

    ABBYY comand line OCR utility: http://ocr4linux.com/en:start

    It is not free, so worth to consider only if Tesseract accuracy is not good enough for your task, or you need more sophisticated layout analisys or you need to export PDF, Word and other files.

    Update: here's comparison of ABBYY and tesseract accuracy: http://www.splitbrain.org/blog/2010-06/15-linux_ocr_software_comparison

    Disclaimer: I work for ABBYY

提交回复
热议问题