Encoding for an XML document containing U+001A

后端 未结 3 619
醉酒成梦
醉酒成梦 2021-01-16 03:53

I have an XML document that\'s being generated from some content that people are copy/pasting from all sorts of places (Word documents mostly though).

It looks like

3条回答
  •  Happy的楠姐
    2021-01-16 04:27

    U+001A is not a valid character in an XML document. The valid range of characters according to the specification is:

    Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */
    

提交回复
热议问题