The most recent version of ScraperWiki depends on Poppler (or so the GitHub says). Unfortunately, it only specifies how to get it on macOS and Linux, not Windows.
A
MSYS2 has the latest version available for install.
If you don't want to install the whole enviroment (or you wanted some kind of portable version) you could also just download Poppler straight from the repository, but then you'd also have to manually handle dependencies. Namely: libwinpthread
, nspr
, gcc-libs
, nss
, curl
, brotli
, openssl
, libidn2
, libiconv
, gettext
, libunistring
, nghttp2
, libpsl
, libjpeg-turbo
, lcms2
, openjpeg2
, libpng
, zlib
, libtiff
, xz
and zstd
.
To Simply install Poppler on Windows run through the below mentioned steps without touching the environmental varible.
Now go to your Python code where you want to call Poppler for image conversion and use the below mentioned code snippet:
from pdf2image import convert_from_path
pages = convert_from_path('MyPdf.pdf', 500, poppler_path = r'C:\User\Poppler\poppler-0.68.0_x86\poppler-0.68.0\bin')
for page in pages:
page.save('out.jpg', 'JPEG')
With anaconda installed on windows one can simply execute:
conda install -c conda-forge poppler
Other answers have linked to the correct download page for Windows users but do not specify how to install them for the uninitiated.
Go to this page and download the binary of your choice. In this example we will download and use poppler-0.68.0_x86
.
Extract the archive file poppler-0.68.0_x86.7z
into C:\Program Files
. Thus, the directory structure should look something like this:
C:
└ Program Files
└ poppler-0.68.0_x86
└ bin
└ include
└ lib
└ share
Add C:\Program Files\poppler-0.68.0_x86\bin
to your system PATH
by doing the following: Click on the Windows start button, search for Edit the system environment variables
, click on Environment Variables...
, under System variables
, look for and double-click on PATH
, click on New
, then add C:\Users\Program Files\poppler-0.68.0_x86\bin
, click OK
.
If you are using a terminal to execute poppler (e.g. running pdf2image
in command line), you may need to reopen your terminal for poppler to work.
Done!
Poppler is available as Chocolatey package:
choco install poppler
By default Poppler is installed in C:\ProgramData\chocolatey\lib\poppler
and shims are automatically created for the following tools: pdfdetach
, pdffonts
, pdfimages
, pdfinfo
, pdfseparate
, pdftocairo
, pdftohtml
, pdftoppm
, pdftops
, pdftotext
, pdfunite
.
To update Poppler, run:
cup poppler
Install from the main bucket:
scoop install poppler
By default Poppler is installed in ~\scoop\apps\poppler
and shims are automatically created for the following tools: pdfdetach
, pdffonts
, pdfimages
, pdfinfo
, pdfseparate
, pdftocairo
, pdftohtml
, pdftoppm
, pdftops
, pdftotext
, pdfunite
.
To update Poppler, run:
scoop update poppler
As mentioned in another answer, MiKTeX currently ships with Poppler tools, and so does another LaTeX distribution, TeX Live. From the guide:
Command-line tools.
A number of Windows ports of common Unix command-line programs are installed along with the usual TeX Live binaries. These include
gzip
,zip
,unzip
, and the utilities from the poppler suite (pdfinfo
,pdffonts
, …)
Poppler suite is located by default in C:\texlive\<year>\bin\win32
and, if you can compile your LaTeX documents, should work out of the box since this location is added to the PATH by the installer.
It looks like a version that is build-able with visual studio can be found here https://bitbucket.org/merarischroeder/poppler-for-windows/overview