问题
How to read contents of specific page from PDF using itextsharp APIs
Can anybody redirect me to the correct direction?
Thanks in advance!
回答1:
The following code only extracts text, if that is what you are looking for.
PdfReader pdfReader = new PdfReader(documentPath);
ITextExtractionStrategy its = new iTextSharp.text.pdf.parser.SimpleTextExtractionStrategy();
//Extract text from the page.
string txt = PdfTextExtractor.GetTextFromPage(pdfReader, page, its);
// Convert the extracted text into a readable string using the right encoding.
extractedText = Encoding.UTF8.GetString(ASCIIEncoding.Convert(Encoding.Default, Encoding.UTF8, Encoding.Default.GetBytes(txt)));
来源:https://stackoverflow.com/questions/3421766/how-to-read-contents-of-specific-page-from-pdf-using-itextsharp-apis