Transform an image of handwritten notes to text [closed]

痴心易碎 提交于 2020-01-25 08:03:10

问题


I have hundreds of images of handwritten notes. They were written from different people but they are in sequence so you know that for example person1 wrote img1.jpg -> img100.jpg. The style of handwriting varies a lot from person to person but there are parts of the notes which are always fixed, I imagine that could help an algorithm (it helps me!).

I tried tesseract and it failed pretty bad at recognizing the text. I'm thinking since each person has like 100 images is there an algorithm I can train by feeding it a small number of examples, like 5 or less and it can learn from that? Or would it not be enough data? From searching around it seems looks like I need to implement a CNN (e.g. this paper).

My knowledge of ai is limited though, is this something that I could still do using a library and some studying? If so, what should I do going forward?


回答1:


This is called OCR and there has been a progress. Actually, here is an example of how simple it is to parse an image file to text using tesseract:

try:
    from PIL import Image
except ImportError:
    import Image
import pytesseract


def ocr_core(file):
    text = pytesseract.image_to_string(file)
    return text


print(ocr_core('sample.png'))

BUT

I am not very sure that it can recognize different types of handwriting. You can give it a try yourself to find out. If you want to try the python example you need to import tesseract but first things first to install tesseract on your OS and add it to your PATH.




回答2:


There are many OCRs out there and some perform better than others. However, this is a field that has improved a lot recently with the Deep Neural Networks. I would consider using a Cloud provider such as Azure, Google Cloud or Amazon. Your upload the image and they return the metadata.

For instance: https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/

If you don't want to use cloud services for any reason, I would consider using TensorFlow... but some knowledge is required:

Tensorflow model for OCR



来源:https://stackoverflow.com/questions/58395079/transform-an-image-of-handwritten-notes-to-text

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!