XML spec defines a subset of Unicode characters which are allowed in XML documents: http://www.w3.org/TR/REC-xml/#charsets.
How do I filter out these characters from
Using StringEscapeUtils.escapeXml(xml) from commons-lang will escape, not filter the characters.