Is there a way to extract only text from the PDF using PDFBox? The reason I am asking is that the pdf can have malicious links, any action, button or form can also embed som