XML Unicode Safe Encoding

╄→尐↘猪︶ㄣ 提交于 2020-01-02 06:01:34

问题


I'm looking for a way to encode an XML document using the #&233; encoding.

Using this basic code

var xmlDoc = new XmlDocument();
xmlDoc.Load(@"D:\Temp\XmlDocBase.xml");
xmlDoc.Save(@"D:\Temp\XmlDocBaseCopy.xml");

my Xml document pass from:

<?xml version="1.0"?>
<Tag1>
  <comment>entit&#233;</comment>
</Tag1>

to

<?xml version="1.0"?>
<Tag1>
  <comment>entité</comment>
</Tag1>

Regards


回答1:


You can force encoding that does not support all unicode characters (i.e. ASCII). As result writer will be forced to use entities.

    XmlDocument doc = new XmlDocument();
    doc.LoadXml("<Tag1><comment>entit&#233;</comment></Tag1>");

    var writer = XmlTextWriter.Create(
        @"c:\temp\o.xml",
        new XmlWriterSettings { Encoding = System.Text.ASCIIEncoding.ASCII });
    doc.Save(writer);

Results in:

<?xml version="1.0" encoding="us-ascii"?><Tag1><comment>entit&#xE9;</comment></Tag1>



回答2:


You can call HttpUtility.HtmlEncode on a string.

return HttpUtility.HtmlEncode("entité");

Returns entit&#233;

HttpUtility is part of System.Web.



来源:https://stackoverflow.com/questions/4279315/xml-unicode-safe-encoding

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!