Image processing / super light OCR

前端 未结 3 888
孤街浪徒
孤街浪徒 2021-01-01 07:02

I have 55 000 image files (in both JPG and TIFF format) which are pictures from a book.

The structure of each page is this:

some text

3条回答
  •  一整个雨季
    2021-01-01 07:57

    Probably the easiest way to detect your lines is using the Hough transform in OpenCV (which has wrappers for many languages).

    The OpenCV Hough tranform will detect all lines in the image and return their angles and start/stop coordinates. You should only keep the ones whose angles are close to horizontal and of adequate length.

    O'Reilly's Learning OpenCV explains in detail the function's input and output (p.156).

提交回复
热议问题