How do I know my file is attached in my PDF using PyPDF2?

Deadly 提交于 2021-01-28 11:04:52

问题


I am trying to attach an .exe file into a PDF using PyPDF2.

I ran the code. It runs flawlessly, but my PDF file is still the same size.
I don't know if my file was attached or not.

This is what I am trying to do:

from PyPDF2 import PdfFileWriter, PdfFileReader
output = PdfFileWriter()
input1 = PdfFileReader(open ("doc1.pdf", "rb"))
#check it's whether work or not
print("doc1 has %d pages" % input1.getNumPages())
output.addAttachment("doc1.pdf","client.exe")

I don't know if I am doing this right or not. Please if anyone know about this help me out.


回答1:


First of all, you have to use the PdfFileWriter class properly.

You can use appendPagesFromReader to copy pages from the source PDF ("doc1.pdf") to the output PDF (ex. "out.pdf"). Then, for addAttachment, the 1st parameter is the filename of the file to attach and the 2nd parameter is the attachment data (it's not clear from the docs, but it has to be a bytes-like sequence). To get the attachment data, you can open the .exe file in binary mode, then read() it. Finally, you need to use write to actually save the PdfFileWriter object to an actual PDF file.

Here is a more working example:

from PyPDF2 import PdfFileWriter, PdfFileReader

output = PdfFileWriter()

input_pdf = PdfFileReader("doc1.pdf")
output.appendPagesFromReader(input_pdf)

with open("client.exe", "rb") as exe:
    output.addAttachment("client.exe", exe.read())

with open("out.pdf", "wb") as f:
    output.write(f)

Next, to check if attaching was successful, you can use os.stat.st_size to compare the file size (in bytes) before and after attaching the .exe file.

Here is the same example with checking for file sizes:
(I'm using Python 3.6+ for f-strings)

from PyPDF2 import PdfFileWriter, PdfFileReader
import os

output = PdfFileWriter()

print(f"size of SOURCE: {os.stat('doc1.pdf').st_size}")
input_pdf = PdfFileReader("doc1.pdf")
output.appendPagesFromReader(input_pdf)

print(f"size of EXE: {os.stat('client.exe').st_size}")
with open("client.exe", "rb") as exe:
    output.addAttachment("client.exe", exe.read())

with open("out.pdf", "wb") as f:
    output.write(f)
print(f"size of OUTPUT: {os.stat('out.pdf').st_size}")

The above code prints out

size of SOURCE: 42942
size of EXE: 989744
size of OUTPUT: 1031773

...which sort of shows that the .exe file was added to the PDF.

Of course, you can manually check it by opening the PDF in Adobe Reader:

As a side note, I am not sure what you want to do with attaching exe files to PDF, but it seems you can attach them but Adobe treats them as security risks and may not be possible to be opened. You can use the same code above to attach another PDF file (or other documents) instead of an executable file, and it should still work.



来源:https://stackoverflow.com/questions/59521753/how-do-i-know-my-file-is-attached-in-my-pdf-using-pypdf2

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!