How to unescape HTML character entities in Java?

前端 未结 11 1909
耶瑟儿~
耶瑟儿~ 2020-11-21 22:38

Basically I would like to decode a given Html document, and replace all special chars, such as \" \" -> \" \", \">\" -

11条回答
  •  深忆病人
    2020-11-21 23:21

    This did the job for me,

    import org.apache.commons.lang.StringEscapeUtils;
    ...
    String decodedXML= StringEscapeUtils.unescapeHtml(encodedXML);
    

    or

    import org.apache.commons.lang3.StringEscapeUtils;
    ...
    String decodedXML= StringEscapeUtils.unescapeHtml4(encodedXML);
    

    I guess its always better to use the lang3 for obvious reasons. Hope this helps :)

提交回复
热议问题