Forcing Tesseract to match pattern (four digits in a row)
问题 I'm trying to get Tesseract (using the Tess4J wrapper) to match only a specific pattern. The pattern is four digits in a row, which I think would be \d\d\d\d. Here is a VERY small subset of the image I'm feeding tesseract (the floorplans are restricted, so I'm cautious to post much more of it): http://mike724.com/view/a06771 I'm using the following java code: File imageFile = new File("/<redacted>/file.pdf"); Tesseract instance = Tesseract.getInstance(); instance.setTessVariable("load_system