Read Pdf with C

雨燕双飞 提交于 2019-12-10 15:38:00

问题


I want to be able to read the content of pdf files. I need to do that with C on Linux.

The closer i can get to this was here but I think Haru can only create pdf and is not able to read them (not 100% sure).

PS: I only need the plain text from pdf


回答1:


Check out libpoppler. I've never used it work extracting text, just querying PDF attributes. It's pretty easy to use.




回答2:


How well do you need to parse them? Just extracting strings should be relatively easy, fully accurate rendering is harder. Take a look at the source for evince or ghostscript?

This is for C++ but might be a good starting point for understanding PDF structure http://www.codeproject.com/KB/cpp/ExtractPDFText.aspx (sorry wrong link before)




回答3:


Another possible, though I've never used it is VersyPDF. It claims to allow you to edit PDFs ... http://versypdf.sybrex-systems-ltd.qarchive.org/



来源:https://stackoverflow.com/questions/772919/read-pdf-with-c

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!