Convert PDF to anything which can be opened by Word

筅森魡賤 提交于 2019-12-01 01:00:51

Use a PDF library, such as iTextSharp to parse the PDF. You will be able to access all text and images from the PDF and convert to whatever representation you want.

There are other solutions (such as installing xpdf and shelling to it - it will convert to html if the right command line arguments are passed in).

I am not sure if Word could open a pdf unless you created the pdf in a word document.

I think the only quick solution to that would be to purchase or find a 3rd party library that does PDF handling, then use it's API to pull out the text you need. The text any any case would be extremely badly formatted at that point i am sure. Also be aware that some pdfs that show text actually have it saved as an image, so there would be no way to get the data out.

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!