How to speed up tesseract OCR
问题 I'm trying to OCR a lot of documents(I mean in 300k + range a day). At the moment i'm using Tesseract wrapper for .NET and it's all good in quality but the speed is not good enough. The times i get for 20 tasks in parallel scanning of a half page from the same pdf in average are 2,546 second per scan. The code im using: using (var engine = new TesseractEngine(Tessdata, "eng", EngineMode.TesseractOnly)) { Page page; page = engine.Process(image, srcRect); var text = page.GetText(); return Task