My overall goal is to transfer a set of PDFs (input files) into a Corpus. Some of the input files contain pages with multiple (let\'s say 2) columns. pdftools\' read_pdf fun