Ruby: Reading PDF files

孤独总比滥情好 2020-12-02 06:05

I'm looking for a fast and reliable way to read/parse large PDF files in Ruby (on Linux and OSX).

Until now I've found the rather old and simple PDF-toolkit (a pd

6 answers
  •  南笙 (original poster)
    2020-12-02 06:41

    You might find Docsplit useful:

    Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
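
As a rough illustration of the text-extraction part, here is a minimal sketch. It assumes the `docsplit` gem is installed along with the external tools it shells out to for PDF text (such as `pdftotext`). The `pdf_to_text` helper and its `extractor:` parameter are hypothetical names made up for this example, not part of Docsplit; only `Docsplit.extract_text` with the `:ocr` and `:output` options comes from the library.

```ruby
require "tmpdir"

# Hypothetical helper (not part of Docsplit): return the plain text of one PDF.
# `extractor` defaults to the Docsplit module; it is injectable only so the
# helper can be tried out without the gem installed.
def pdf_to_text(pdf_path, extractor: nil)
  extractor ||= begin
    require "docsplit" # gem install docsplit
    Docsplit
  end

  Dir.mktmpdir do |out_dir|
    # Assumption: without a :pages option, Docsplit writes a single
    # <basename>.txt per document into the :output directory.
    extractor.extract_text([pdf_path], { ocr: false, output: out_dir })
    File.read(File.join(out_dir, "#{File.basename(pdf_path, '.*')}.txt"))
  end
end

# Usage (the path is a placeholder):
#   puts pdf_to_text("path/to/large.pdf")
```

For very large files it may be better to keep the extracted `.txt` on disk and stream it rather than read it all into memory, but that depends on what you do with the text afterwards.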
