Tesseract does not recognize single characters

夕颜 2020-12-06 10:26

How to reproduce:

  1. Create a new image in Paint (any size)
  2. Add the letter A to the image
  3. Try to recognize it -> Tesseract does not find any letters
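
Step 3 can also be run from a script. The following is only a sketch, assuming the `tesseract` command-line tool is installed and on the PATH; the helper names `tesseract_command` and `recognize` and the file name `letter_a.png` are made up for illustration.

```python
import subprocess


def tesseract_command(image_path, extra_args=()):
    """Build the argument list for a Tesseract CLI run that prints to stdout."""
    return ["tesseract", image_path, "stdout", *extra_args]


def recognize(image_path, extra_args=()):
    """Run Tesseract on image_path and return the recognized text."""
    result = subprocess.run(
        tesseract_command(image_path, extra_args),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Example (placeholder file name for the image from steps 1 and 2):
#   print(repr(recognize("letter_a.png")))
```

With no extra arguments this runs Tesseract with its default settings, which is the case described in the steps above.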
4 answers
  •  醉话见心
    2020-12-06 10:52

    Python code that applies this configuration looks like this:

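    import cv2
    import pytesseract

    # OpenCV loads images as BGR, while pytesseract assumes RGB.
    img = cv2.imread("path to some image")
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)

    # --psm 10 treats the image as a single character; the whitelist
    # restricts the output to ASCII letters and digits.
    config = ("-c tessedit_char_whitelist="
              "abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789"
              " --psm 10"
              " -l eng")  # was "osd", which is orientation/script-detection data
    text = pytesseract.image_to_string(img, config=config)
    print(text)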
    

    The --psm flag sets the page segmentation mode.

    According to the Tesseract documentation, mode 10 means:

    Treat the image as a single character.

    So to recognize a single character, you just need to pass the --psm 10 flag.
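
The configuration string can also be assembled by a small helper. This is a minimal sketch and not part of pytesseract: `single_char_config`, `recognize_char`, and `ALPHANUMERIC` are hypothetical names, and the `eng` language default assumes the English trained data is installed.

```python
import string

# Same characters as the whitelist in the answer: a-z, A-Z, 0-9.
ALPHANUMERIC = string.ascii_lowercase + string.ascii_uppercase + string.digits


def single_char_config(whitelist=ALPHANUMERIC, psm=10):
    """Build a Tesseract config string for single-character recognition."""
    parts = [f"--psm {psm}"]
    if whitelist:
        # Restrict the recognizer's output to the given characters.
        parts.append(f"-c tessedit_char_whitelist={whitelist}")
    return " ".join(parts)


def recognize_char(image, lang="eng"):
    """Recognize one character in image (a file path, PIL image, or array)."""
    # Imported lazily so the config helper works without pytesseract installed.
    import pytesseract

    text = pytesseract.image_to_string(
        image, lang=lang, config=single_char_config()
    )
    return text.strip()
```

Calling `recognize_char("path to some image")` would then run Tesseract in page segmentation mode 10 with the alphanumeric whitelist.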
