I am developing a full-text search engine for indexing popular binary formats. I know that there are hundreds of such questions (and solutions) already, but I found it toug
If you can run OpenOffice on the server side, you can use unoconv, which converts between any document formats supported by OpenOffice.
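As a rough sketch of how that could feed an indexer, you might shell out to unoconv and capture plain text from its standard output. This assumes unoconv is installed and on PATH together with a working OpenOffice/LibreOffice, and that the `txt` output format suits your documents. The helper names `build_unoconv_command` and `extract_text`, the 60-second timeout, and the UTF-8 decoding are my own assumptions, not part of unoconv.

```python
import shutil
import subprocess
from pathlib import Path


def build_unoconv_command(source: Path, fmt: str = "txt") -> list[str]:
    """Build an unoconv command line that writes the converted file to stdout.

    Hypothetical helper: -f selects the output format, --stdout sends the
    result to standard output instead of writing a file next to the source.
    """
    return ["unoconv", "-f", fmt, "--stdout", str(source)]


def extract_text(source: Path, timeout: float = 60.0) -> str:
    """Convert a document to plain text with unoconv and return the text.

    Hypothetical helper: the timeout and the UTF-8 decoding are assumptions;
    adjust them to your documents and locale.
    """
    if shutil.which("unoconv") is None:
        raise RuntimeError("unoconv was not found on PATH")
    result = subprocess.run(
        build_unoconv_command(source),
        capture_output=True,  # collect the converted text from stdout
        check=True,           # raise CalledProcessError on a failed conversion
        timeout=timeout,      # avoid hanging the indexer on a stuck conversion
    )
    return result.stdout.decode("utf-8", errors="replace")
```

You would then call something like `extract_text(Path("report.doc"))` for each file and pass the returned string to your indexer. Conversions start an office process, so for large batches it is probably worth checking unoconv's listener mode rather than paying the startup cost per file.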