PDF Text Extraction with Coordinates
I would like to extract text from a portion (using coordinates) of PDF using Ghostscript. Can anyone help me out? Kurt Pfeifle Yes, with Ghostscript, you can extract text from PDFs. But no, it is not the best tool for the job. And no, you cannot do it in "portions" (parts of single pages). What you can do: extract the text of a certain range of pages only. First: Ghostscript's txtwrite output device (not so good) gs \ -dBATCH \ -dNOPAUSE \ -sDEVICE=txtwrite \ -dFirstPage=3 \ -dLastPage=5 \ -sOutputFile=- \ /path/to/your/pdf This will output all text contained on pages 3-5 to stdout. If you