ocr

How can I highlight a chunk of text on an image taken by Camera view

北城以北 提交于 2019-12-21 21:55:16
问题 I am working on an OCR project. My first task is to take an image from iPhone camera and then locate a specific word on the image. Below is a sample image taken by my device, now I want to find the position of word "STATIONERY" and highlight it with a yellow rectangle on image. How can I do that? Do I need to first use an OCR SDK for that (like ABBYY) or can I do this otherwise? 回答1: If you need to get the coordinates of a specific word, of course you need to perform OCR first. Keep in mind

How can I detect boxes in an image and pull them out as individual files?

一世执手 提交于 2019-12-21 21:39:23
问题 I need a programmatic way of taking a scanned image (let's assume PNG or any other convenient image format) and breaking it up into many smaller images. The scanned image is a grid, and the boxes of the grid will always be the same size and in the same relative location. Because the image is scanned, they are not necessarily in the same absolute location. In each box is a character, ideally I'd like to save the character as its own image file, without any of the box border. I prefer PHP and

OCR code in android platform [duplicate]

旧巷老猫 提交于 2019-12-21 21:28:58
问题 This question already has answers here : Closed 6 years ago . Possible Duplicate: android OCR? Currently, I have a project in mobile application. in the application, I would like to developed OCR. Anyone know the sourcecode which can be execute for android platform? 回答1: The popular OCR libraries are Aspire and Tesseract. Example of it in practice: Mezzofanti (http://code.google.com/p/mezzofanti/), an open-source Android App using Tesjeract. OCR can be pretty CPU intensive, you might want to

What is a good OCR that can detect handwriting? [closed]

﹥>﹥吖頭↗ 提交于 2019-12-21 21:28:04
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 6 years ago . I need a library that can extract text from handwritten paper once I scan it. Normal Latin text. It can be a free solution or even something I have to pay for, as long as it handles handwriting for block letters (not cursive). 回答1: There isn't an OCR program capable of recognising hand-written text well; it's

Using Tesseract OCR in VC++

随声附和 提交于 2019-12-21 20:58:35
问题 In my project I have to read the numbers from the image(.jpg or .tiff). After googling a lot, I came to know about the open OCR i.e., Tesseract OCR. Am begginer for Tesseract OCR, I read all the documentation of tesseract & how to use it in Visual studio. Bascically am facing some problem in using tesseract... I followed the steps like this: 1) Downloaded & Installed tesseract-ocr-setup-3.02.02.exe from http://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-setup-3.02.02

Can not use ChoiceIterator in tesseract

只愿长相守 提交于 2019-12-21 20:55:45
问题 First of all i want to confirm that i understand choice iterator right. For example if i have a word on an image "scope", choice iterator must give me something like "s" and maybe after Next(), "5". for 3. letter "o" it maybe gives me "0", after Next() "O" and after Next() "o". Do i understand right? Here is all my related code, api.SetImage((uchar*)img->imageData,img->width,img->height,img->depth/8,img->widthStep); api.SetRectangle(0,0,img->width, img->height); int left,top,right,bottom;

Drawing convexHull in openCV2 Python

坚强是说给别人听的谎言 提交于 2019-12-21 20:49:03
问题 So I am trying to draw the convexHull from a contour in python, however when i print the image it is not changing. roi=mask[y:y+h,x:x+w] roi = cv2.fastNlMeansDenoisingColored(roi,None,15,15,7,21) hull = cv2.convexHull(cnt) cv2.drawContours(roi,[hull],0,(147,0,255),2) cv2.imshow(str(i),roi) blank_image[y:y+h,x:x+w] = roi However, the images that show are the exact same if I did not include the code. I looked online, but cannot seem to find the answer. Here is a sample Image: 回答1: I used the

Recognizable numbers using PHP

眉间皱痕 提交于 2019-12-21 20:25:48
问题 I’m trying to extract some numbers ranging from 1-99 from a picture. I’ve tried several OCR methods using PHP, but eventually my script will fail, since the numbers occasionally is rotated 5% to the left or right. This making the picture not being recognizable. I’ve now installed Ocropus http://code.google.com/p/ocropus/ as a test. Unfortunately this is not giving me the correct numbers every time. This leads me to think that my pictures are not optimized enough. Does anyone have some tips

How to use trained data with pytesseract?

时间秒杀一切 提交于 2019-12-21 19:44:16
问题 Using this tool http://trainyourtesseract.com/ I would like to be able to use new fonts with pytesseract. the tool give me a file called *.traineddata Right now I'm using this simple script : try: import Image except ImportError: from PIL import Image import pytesseract as tes results = tes.image_to_string(Image.open('./test.jpg'),boxes=True) file = open('parsing.text','a') file.write(results) print(results) How to I use my traineddata file so I'm able to read new font with the python script

Programmatically divide scanned images into separate images

亡梦爱人 提交于 2019-12-21 19:38:10
问题 In order to improve OCR quality, I need to preprocess my scanned images. Sometimes I need to OCR the image with few pictures (components on the page and they are at different angles - for example, a few paper documents scanned at one time), for example: Is it possible to automatically programmatically divide such images into separate images that will contain every logical document? For example with a tool like ImageMagick or something else? Is there any solutions/technics exists for such