问题
I am using iTextSharp Library in order to extract all the text from a PDF. It works fine but in few PDF files that I receive from a SAP all bold text is returned twice in the extracted Text.
Has anyone faced something like this before?
I am using
PdfTextExtractor.GetTextFromPage(reader, p, new LocationTextExtractionStrategy())
and have implemented a custom TextExtractionStrategy
.
来源:https://stackoverflow.com/questions/21405718/text-extraction-duplicate-bold-text