For one of my assignments, I am supposed to extract data from a PDF file and then save it to a text file, which will be used later to create an NLP model for document extrac