I am loading a lot of xml documents and some of them return errors like \"hexadecimal value 0x12, is an invalid character\" and there are different character. How to remove
This is essentially a special case of this question. I suggest you use one of the answers from there.