I am currently building a .NET application, and one of the requirements is that it has to convert a PDF file to an XML file. Has anyone had success doing this? If so what have yo
Have a look at pdf2Data.
http://itextpdf.com/blog/pdf2data-extract-information-invoices-and-templates
It converts PDF files to XML files based on a template. Templates are defined using selectors that let the end user specify things like "select the table on the 2nd page" or "select the text written in this particular font".
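To make the template idea concrete, here is a rough sketch of selector-driven extraction. It is not the pdf2Data API. The `Span` records, the `by_page` and `by_font` helpers, and the `apply_template` function are all made up for illustration, and the sketch assumes the text has already been pulled out of the PDF along with its page number and font name. It is written in Python only to keep it short; the same shape would carry over to C# in a .NET application.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Span:
    """A piece of text already extracted from a PDF (hypothetical input)."""
    text: str
    page: int
    font: str


# A selector is just a predicate over spans.
Selector = Callable[[Span], bool]


def by_page(page: int) -> Selector:
    """Select everything on the given page."""
    return lambda span: span.page == page


def by_font(font: str) -> Selector:
    """Select everything written in the given font."""
    return lambda span: span.font == font


def apply_template(spans: List[Span], template: Dict[str, Selector]) -> str:
    """Build one XML element per template field from the matching spans."""
    root = ET.Element("document")
    for field, selector in template.items():
        element = ET.SubElement(root, field)
        element.text = " ".join(s.text for s in spans if selector(s))
    return ET.tostring(root, encoding="unicode")


# Made-up sample data standing in for real PDF content.
spans = [
    Span("Invoice 42", 1, "Helvetica-Bold"),
    Span("Widget x 3", 2, "Courier"),
    Span("Total: 30.00", 2, "Helvetica-Bold"),
]

# The template maps output field names to selectors.
template = {
    "headings": by_font("Helvetica-Bold"),
    "page2": by_page(2),
}

xml_output = apply_template(spans, template)
print(xml_output)
```

The point of the design is that the template is plain data: changing what ends up in the XML means editing the selectors, not the extraction code.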
Keep in mind, I am affiliated with iText so even though my knowledge of PDF is extensive, I may be considered biased towards iText products (seeing as I help develop them).