tesseract OCR have a command line interface, which allow us to recognize text from images with some parameters.
Input argumetns are imagename (path
Tesseract 3.0x supports a "hocr" command option, which produces a HTML-format output file consisting of recognized words and their coordinates. It does not have size/font info, though.