So, basically I have about 350 files, each containing multiple housing certificates. My first problem is uploading all of these files in to PySpark. What is the fastest wa