Keep XML entities in output (jSoup)
问题 I'm using jsoup to do some xml processing. Problem is, it is replacing xml entities, ie.: » with html entities: » How could I keep original (xml) entities? Groovy script: import org.jsoup.Jsoup import org.jsoup.nodes.Document import org.jsoup.nodes.Entities import org.jsoup.parser.Parser String HTML_STRING = ''' <html> <div></div> <div>Some text »</div> </html> ''' Document doc = Jsoup.parse(new ByteArrayInputStream(HTML_STRING.getBytes("UTF-8")), "UTF-8", "", Parser.xmlParser()) doc