Will have 50-100 single PDF's that we'll be generating with a php script. PDF's are generally grouped into groups of 10-20. Each group needs to have it's own Table of Contents or Index, and then there also needs to be a Master Table of Contents or Index at the beginning.
Or if that is too difficult we could get away with a single Table of Contents at the beginning.
What's the best way to go about this?
Will we need to create the Table of Contents and then export that to PDF and append it to the beginning and mash the rest of the files after that? Or is there a better solution?
And what's the best tool for us to merge the pdf's?
Will be running on a Linux server.
And what's the best tool for us to merge the pdf's?
On linux distributions you can install an useful little program, pdftk
. It works well to bind PDF's together. For example:
$ pdftk in1.pdf in2.pdf in3.pdf in4.pdf in5.pdf in6.pdf cat output out.pdf
where in*.pdf
are the input files and out.pdf
is the result.
With Version 1.45 - December 6, 2012 is pdftk
able to create bookmarks with update_info, which could be used as toc.
Its done in 3 Steps:
Precondition for my example
3 PDF files. Single page.
page1.pdf
page2.pdf
page3.pdf
1. Create the bookmark information
# build the bookmark out of an example file
pdftk page1.pdf dump_data output meta.txt
# Edit meta.txt as you need
Heres an example that worked for me, meta.txt:
InfoBegin
InfoKey: Creator
InfoValue: PDFTK
NumberOfPages: 3
PageMediaBegin
PageMediaNumber: 1
PageMediaRotation: 0
PageMediaRect: 0 0 595.32 841.92
PageMediaDimensions: 595.32 841.92
BookmarkBegin
BookmarkTitle: Page 1
BookmarkLevel: 1
BookmarkPageNumber: 1
BookmarkBegin
BookmarkTitle: Page 2
BookmarkLevel: 1
BookmarkPageNumber: 2
BookmarkBegin
BookmarkTitle: Page 3
BookmarkLevel: 1
BookmarkPageNumber: 3
2. Create an temporary merged pdf file
pdftk page* cat output temp.pdf
3. Add the bookmarks to the pdf
pdftk temp.pdf update_info meta.txt output final.pdf
When you open the final.pdf in acrobat reader you see the bookmarks on the left side.
To simplify the process of generating table of contents for PDF files, I've successfully generated LaTeX code from php and then used latex2pdf to generate the PDF file.
Maybe this could be an alternative, in your case.
To merge PDFs, I agree with Federico that pdftk is the best solution.
If you can use PDF bookmarks instead of a table of Contents page you can use pdfrecycle to merge the PDF files and create the bookmarks.
You can use gs
like this:
gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -sOutputFile=combined.pdf file1.pdf file2.pdf
来源:https://stackoverflow.com/questions/2418871/need-to-merge-multiple-pdfs-into-a-single-pdf-with-table-of-contents-sections