DocumentBuilder parse produces invalid byte 2 of 4-byte UTF-8 sequence error

回眸只為那壹抹淺笑 提交于 2020-01-04 05:51:13

问题


I am trying to parse a bytearray which contains a string Impresión in XML

final DocumentBuilderFactory builderFactory = DocumentBuilderFactory.newInstance();
final DocumentBuilder builder = builderFactory.newDocumentBuilder();
final Document document;
try (final InputStream stream = new ByteArrayInputStream(bytearray)) {
     document = builder.parse(stream); // gives Invalid byte 2 of 4-byte UTF-8 sequence error
}

It produces Invalid byte 2 of 4-byte UTF-8 sequence error. But when i have Unicode Impresi\u00f3n, it works fine. But i want to parse actual character.

Anybody help me to solve it.

来源:https://stackoverflow.com/questions/49686444/documentbuilder-parse-produces-invalid-byte-2-of-4-byte-utf-8-sequence-error

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!