How to get the number of pages of a .PDF uploaded by user?

前端 未结 5 381
温柔的废话
温柔的废话 2020-12-06 11:14

I have a file input, and before \"uploading\" i need to calculate the number of pages of that .pdf in JAVASCRIPT (eg. JQuery...)

相关标签:
5条回答
  • 2020-12-06 11:24

    and a pure javascript solution:

    var input = document.getElementById("files");
    var reader = new FileReader();
    reader.readAsBinaryString(input.files[0]);
    reader.onloadend = function(){
        var count = reader.result.match(/\/Type[\s]*\/Page[^s]/g).length;
        console.log('Number of Pages:',count );
    }
    
    0 讨论(0)
  • 2020-12-06 11:26

    As has been stated in the other answers, something like pdf.js is be what you are looking for. I've taken a look at the API and it does include a numPages() function to return the total number of pages. It also seems to count pages for me when viewing the demo page from Mozilla.

    It depends if you are able to use modern browsers and experimental technology for your solution. pdf.js is very impressive, but it is still experimental according to the github page .

    If you are able to count the pages on the server after uploading, then you should look at pdftools or similar.

    Something like pdftools --countpages is what you are looking for

    0 讨论(0)
  • 2020-12-06 11:30

    I think the API has changed a little since Tracker1 posted an answer. I tried Tracker1's code and saw this error:

    Uncaught TypeError: pdfjsLib.getDocument(...).then is not a function
    

    A small change fixes it:

    const pdfjsLib = require('pdfjs-dist');
    ...
    pdfjsLib.getDocument(pdfPath).promise.then(function (doc) {
        var numPages = doc.numPages;
        console.log('# Document Loaded');
        console.log('Number of Pages: ' + numPages);
    }
    
    0 讨论(0)
  • 2020-12-06 11:31

    In case you use pdf.js you may reference an example on github ('.../examples/node/getinfo.js') with following code that prints number of pages in a pdf file.

    const pdfjsLib = require('pdfjs-dist');
    ...
    pdfjsLib.getDocument(pdfPath).then(function (doc) {
        var numPages = doc.numPages;
        console.log('# Document Loaded');
        console.log('Number of Pages: ' + numPages);
    }
    
    0 讨论(0)
  • 2020-12-06 11:31

    You could also use pdf-lib.

    You will need to read the file from the input field and then make use of pdf-lib to get the number of pages. The code would be like this:

    import { PDFDocument } from 'pdf-lib';
    
    ...
    
    const readFile = (file) => {
    
      return new Promise((resolve, reject) => {
    
        const reader = new FileReader();
    
        reader.onload = () => resolve(reader.result);
        reader.onerror = error => reject(error);
    
        reader.readAsArrayBuffer(file);
      });
    }
    
    const async getNumPages = (file) => {
    
      const arrayBuffer = await readFile(file);
    
      const pdf = await PDFDocument.load(arrayBuffer);
    
      return pdf.getPages();
    }
    

    And then just get the number of pages of the attached file with:

    const numPages = await getNumPages(input.files[0]);
    

    being input the variable which stores the reference to the DOM element of the file input.

    0 讨论(0)
提交回复
热议问题