I have an XML document that\'s being generated from some content that people are copy/pasting from all sorts of places (Word documents mostly though).
It looks like
U+001A is not a valid character in an XML document. The valid range of characters according to the specification is:
Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */