Extracting text data from PDF files

[愿得一人] 2020-12-02 11:24

Is it possible to parse text data from PDF files in R? There does not appear to be a relevant package for such extraction, but has anyone attempted or seen this done in R?

7 answers
  •  醉话见心
    2020-12-02 11:55

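    # Install once, then load the pdftools package
    install.packages("pdftools")
    library(pdftools)
    
    # Download the PDF in binary mode ("wb") so the file is not corrupted on Windows
    download.file("http://www.nfl.com/liveupdate/gamecenter/56901/DEN_Gamebook.pdf",
                  "56901.DEN.Gamebook", mode = "wb")
    
    # pdf_text() returns a character vector with one element per page
    txt <- pdf_text("56901.DEN.Gamebook")
    cat(txt[1])  # print the text of the first page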
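
Because pdf_text() returns one string per page with lines separated by "\n", a common next step is to split the pages into individual lines. The following is only a minimal sketch of that idea in base R. The `pages` vector is a hypothetical stand-in for real pdf_text() output, and the names `page_lines` and `lines_df` are made up for illustration.

```r
# Hypothetical stand-in for what pdf_text() returns: one string per page.
# With a real file you would use: pages <- pdftools::pdf_text("some_file.pdf")
pages <- c("Page one heading\n  first line of text  \n",
           "Page two heading\nsecond page text\n")

# Split each page on newlines and trim leading/trailing whitespace
page_lines <- lapply(strsplit(pages, "\n", fixed = TRUE), trimws)

# Flatten into a data frame that records which page each line came from
lines_df <- data.frame(
  page = rep(seq_along(page_lines), lengths(page_lines)),
  text = unlist(page_lines),
  stringsAsFactors = FALSE
)

# Drop lines that are empty after trimming
lines_df <- lines_df[nzchar(lines_df$text), ]
print(lines_df)
```

From here, regular expressions (for example grepl() or regmatches()) can be applied to `lines_df$text` to pull out the fields of interest. How well this works depends on how the PDF lays out its text.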
    
