If you are processing PDF files in order to import data into a database, I suggest considering ByteScout PDF Extractor SDK. Some of its useful features are:
- table detection;
- text extraction as CSV, XML or formatted text (with the optional layout restoration);
- text search with support for regular expressions;
- low-level API to access text objects.
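
Once the text has been extracted as CSV, the database import itself is a small step. Below is a minimal sketch of that step in Python using only the standard library. It does not call the SDK. It assumes the CSV text has already been produced and that its first row is a header. The function name `load_csv_into_sqlite`, the table name `extracted_rows`, and the sample data are hypothetical and only for illustration.

```python
import csv
import io
import sqlite3


def _quote(identifier):
    """Quote an SQL identifier so headers containing spaces are accepted."""
    return '"' + identifier.replace('"', '""') + '"'


def load_csv_into_sqlite(csv_text, conn, table="extracted_rows"):
    """Load CSV text (header row first) into a SQLite table.

    Every column is stored as TEXT. Rows whose field count differs from
    the header are skipped. Returns the number of rows inserted.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return 0
    header = rows[0]
    # Keep only well-formed data rows (same number of fields as the header).
    data = [row for row in rows[1:] if len(row) == len(header)]

    columns = ", ".join(f"{_quote(name)} TEXT" for name in header)
    conn.execute(f"CREATE TABLE IF NOT EXISTS {_quote(table)} ({columns})")

    placeholders = ", ".join("?" for _ in header)
    conn.executemany(
        f"INSERT INTO {_quote(table)} VALUES ({placeholders})", data
    )
    conn.commit()
    return len(data)


# Hypothetical sample standing in for CSV extracted from a PDF table.
sample_csv = (
    "invoice,date,total\n"
    "INV-001,2020-01-15,120.50\n"
    "INV-002,2020-02-03,89.99\n"
)

if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    inserted = load_csv_into_sqlite(sample_csv, connection)
    print(inserted, "rows inserted")
```

Storing everything as TEXT keeps the sketch simple. A real import would probably convert dates and amounts to proper column types after validating them.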
DISCLAIMER: I'm affiliated with ByteScout.