tesseract

OCR for Equations and Formulae on the iOS Platform (Xcode)

孤者浪人 submitted on 2019-12-05 06:05:17
I'm currently developing an application which uses the camera of an iOS device to recognise equations in a photo and then match them to the correct equation in a library or database: basically an equation scanner. For example, you could scan an image of the Uncertainty Principle or the Schrödinger Equation and the iOS device would be able to tell the user its name and give certain feedback. I was wondering how to implement this using Xcode. I was thinking of using an open-source framework such as Tesseract OCR or OpenCV, but I'm not sure how to apply these to equations. Any help would be

Tesseract False Space Recognition

▼魔方 西西 submitted on 2019-12-05 05:36:58
I'm using tesseract to recognize a serial number. This works acceptably, although common problems exist, such as confusing zero with "O", 6 with 5, or M with H. Besides this, tesseract adds spaces to the recognized words where there is no space in the image. The following image is recognized as "HI 3H" . This image results in " FBKHJ 1R1" So tesseract added a space, although there isn't really a space in the image. Is there a way to parametrize the spacing behavior of tesseract? Edit: I'm sorry, I forgot to add that I also have serial numbers which include spaces. So I cannot delete all spaces

Can tesseract be trained for non-font symbols?

喜你入骨 submitted on 2019-12-05 05:25:57
I'm curious about how I may be able to more reliably recognise the value and the suit of playing card images. Here are two examples: There may be some noise in the images, but I have a large dataset of images that I could use for training (roughly 10k pngs, including all values & suits). I can reliably recognise images that I've manually classified, if I have a known exact-match using a hashing method. But since I'm hashing images based on their content, then the slightest noise changes the hash and results in an image being treated as unknown. This is what I'm looking to reliably address with
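One common way to make content hashing tolerant of small noise is a perceptual hash compared by Hamming distance instead of exact equality. The sketch below is only an illustration of that idea (an "average hash"), not the method used in the question. It assumes each card image has already been downscaled to a small grayscale grid of integer intensities; the function names and the `max_distance` threshold are hypothetical.

```python
from typing import Sequence


def average_hash(pixels: Sequence[Sequence[int]]) -> int:
    """Pack one bit per pixel: 1 if the pixel is brighter than the grid's mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for value in flat:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def is_same_image(a: Sequence[Sequence[int]],
                  b: Sequence[Sequence[int]],
                  max_distance: int = 2) -> bool:
    """Treat two grids as the same image if their hashes differ in few bits.

    max_distance is an assumed tolerance and would need tuning on real data.
    """
    return hamming_distance(average_hash(a), average_hash(b)) <= max_distance
```

Because the comparison tolerates a few differing bits, slight pixel noise no longer turns a known card into an unknown one, at the cost of possible false matches if the threshold is set too high.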

How to configure Tesseract in Eclipse for Android development?

ⅰ亾dé卋堺 submitted on 2019-12-05 05:25:17
I have been working on Android for 6 months, so I have a basic idea of Android development. Now I want to develop an OCR Android app using Tesseract. For that I have downloaded android-ndk-r6b, tess-two from Tesseract, and Cygwin for Windows, and installed C/C++ developer in Eclipse. I followed these two links: 1. http://www.itwizard.ro/interfacing-cc-libraries-via-jni-example-tesseract-163.html 2. http://gaut.am/making-an-ocr-android-app-using-tesseract/ but I got stuck partway through. I mean I am not able to configure these things together. Can anyone suggest how to configure tesseract in

Tesseract OCR: is it possible to force a specific pattern?

落花浮王杯 submitted on 2019-12-05 03:16:46
Question: I'm using Tesseract and I want to develop an app that is able to recognize a sequence of characters. I had good results, but not excellent ones. The character sequence I want to read always has a specific pattern, let's say: number number number char char - (e.g.: 123AB) Is there a way to "tell" the OCR engine that the structure is always fixed, in order to improve the recognition results? Thank you in advance. Answer 1: Try the bazaar matching pattern in Tesseract: \d\d\d\c\c Answer 2: You can use the
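Independently of any engine-side pattern, a fixed structure such as "three digits, two letters" can also be enforced after recognition. The sketch below is a minimal post-processing example under stated assumptions: it takes the raw OCR string as input, and the confusion tables (for instance "O" read in place of "0") are hypothetical and would need to be adapted to the errors actually observed.

```python
import re
from typing import Optional

# Expected structure from the question: three digits followed by two letters.
PATTERN = re.compile(r"\d{3}[A-Z]{2}")

# Hypothetical look-alike corrections, applied only where the position
# demands a digit or a letter respectively.
TO_DIGIT = str.maketrans({"O": "0", "I": "1", "S": "5", "B": "8"})
TO_LETTER = str.maketrans({"0": "O", "1": "I", "5": "S", "8": "B"})


def normalize(raw: str) -> Optional[str]:
    """Return the OCR text coerced to the pattern, or None if it cannot match."""
    text = "".join(raw.split()).upper()  # drop whitespace inserted by the OCR
    if len(text) != 5:
        return None
    candidate = text[:3].translate(TO_DIGIT) + text[3:].translate(TO_LETTER)
    return candidate if PATTERN.fullmatch(candidate) else None
```

For example, `normalize("12OA8")` yields `"120AB"`, while a string that cannot be coerced to the pattern yields `None` and can be rejected or re-scanned.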

Tesseract OCR w/ iOS & Swift returns error or gibberish

僤鯓⒐⒋嵵緔 submitted on 2019-12-05 02:19:15
Question: I used this tutorial to get Tesseract OCR working with Swift: http://www.piterwilson.com/blog/2014/10/18/minimal-tesseact-ocr-setup-in-swift/ It works fine if I upload the demo image and call tesseract.image = UIImage(named: "image_sample.jpg"); But if I use my camera code and take a picture of that same image and call tesseract.image = self.image.blackAndWhite(); the result is either gibberish like s I 5E251 :Ec ‘-. —7.//:E*é«§ a g :_{:7 IC‘ J 7 iii—1553‘ : fizzle —‘;-—: ; ~:~./: -:-‘- ‘- :~£

Errors in Tesseract integration in iOS app

不羁的心 submitted on 2019-12-05 02:12:43
Question: I am getting some errors while integrating the Tesseract SDK into my iOS app. The procedure I followed: 1) Dragged "libtesseract_full.a" into Xcode 2) Dragged the "tessdata" folder into Xcode 3) Dragged "baseapi.h" into Xcode Now when I use Tesseract: // init the tesseract engine. tess = new TessBaseAPI(); tess->SimpleInit([dataPath cStringUsingEncoding:NSUTF8StringEncoding], // Path to tessdata-no ending /. "eng", // ISO 639-3 string or NULL. false); I get the errors below (I think

How to import Tesseract into Angular2 (TypeScript)

自作多情 submitted on 2019-12-05 00:45:47
Question: I'm trying to import Tesseract into Angular2 (TypeScript). I can see it saved into the node_modules folder, but when using import { Tesseract } from '@types/tesseract.js'; it says: [ts] Module '"c:/Users/black/Projects/projectCLI/tess/node_modules/@types/tesseract.js/index"' has no exported member 'Tesseract'. In the index.d.ts file there is a namespace called Tesseract. Is there some other way to import this, or are we looking at this the wrong way? I used npm install --save-dev @types

Can `tesseract-ocr` put the result to STDOUT?

烈酒焚心 submitted on 2019-12-04 22:35:13
Using tesseract-ocr #3.02.02. The basic usage of tesseract is tesseract sourc.png result and result.txt is generated. To get the result text, I have to cat this file. Is there any option to dump the result to stdout? Simeon Visser You should upgrade to v3.03, where support for stdout was added. The solution is: tesseract input.jpg stdout But you need at least version 3.03. Source: https://stackoverflow.com/questions/24347819/can-tesseract-ocr-put-the-result-to-stdout
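When the recognized text is needed inside a program instead of a shell, the same `tesseract input.jpg stdout` invocation can be wrapped in a subprocess call. This is a minimal sketch that assumes a `tesseract` binary of version 3.03 or later is on the PATH; the helper names are hypothetical.

```python
import subprocess
from typing import List


def build_command(image_path: str, executable: str = "tesseract") -> List[str]:
    """Build the argv that asks tesseract to write its result to stdout."""
    return [executable, image_path, "stdout"]


def ocr_to_string(image_path: str) -> str:
    """Run tesseract and return the recognized text.

    Raises subprocess.CalledProcessError if tesseract exits with an error.
    """
    result = subprocess.run(
        build_command(image_path),
        capture_output=True,  # collect stdout and stderr instead of printing
        text=True,            # decode the output to str
        check=True,
    )
    return result.stdout
```

Calling `ocr_to_string("input.jpg")` then returns the text directly, with no intermediate result.txt to read back.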

Tesseract user-pattern is not applied

独自空忆成欢 submitted on 2019-12-04 22:24:45
Question: I want to do OCR on this image. It has a predefined format, i.e. the first five are characters, the next four are digits, and the last is a character. When I execute the following command $ tesseract in.png stdout I get the output BDVPD474SQ So I tried user patterns. I created a file (in the directory /usr/share/tesseract-ocr/tessdata/configs ) named bazaar, with the following content: load_system_dawg F load_freq_dawg F user_patterns_suffix user-patterns I also created a file named eng.user