Check if a PDF file is a scanned one

后端 未结 6 884
傲寒
傲寒 2020-12-09 20:08

What is the best way to programmatically check if a PDF file is a totally scanned one? I do have iText and PDFBox at my disposal. I can check if a pdf file contains text or

6条回答
  •  时光取名叫无心
    2020-12-09 20:37

    Your best bet might be to check to see if it has text and also see if it contains a large pagesized image or lots of tiled images which cover the page. If you also check the metadata this should cover most options.

提交回复
热议问题