问题
I'm trying to insert some HTML into a page using javascript, and the HTML I'm inserting contains CDATA blocks.
I'm finding, in Firefox and Chrome, that the CDATA is getting converted to a comment.
The HTML is not under my control, so it's difficult for me to avoid using CDATA.
The following test case, when there is a div on the page with id "test":
document.getElementById('test').innerHTML = '<![CDATA[foo]]> bar'
causes the following HTML to be appeded to the 'test' div:
<!--[CDATA[foo]]--> bar
Is there any way I can insert, verbatim, HTML containing CDATA into a document using javascript?
回答1:
document.createCDATASection should do it, but the real answer to your question is that although HTML 5 does have CDATA sections cross-browser support for them is pretty spotty.
EDIT
The CDATA sections just aren't in the HTML 4 definition, so most browsers won't recognize them.
But it doesn't require a full DOM parser. Here's a simple lexical solution that will fix the problem.
function htmlWithCDATASectionsToHtmlWithout(html) {
var ATTRS = "(?:[^>\"\']|\"[^\"]*\"|\'[^\']*\')*",
// names of tags with RCDATA or CDATA content.
SCRIPT = "[sS][cC][rR][iI][pP][tT]",
STYLE = "[sS][tT][yY][lL][eE]",
TEXTAREA = "[tT][eE][xX][tT][aA][rR][eE][aA]",
TITLE = "[tT][iI][tT][lL][eE]",
XMP = "[xX][mM][pP]",
SPECIAL_TAG_NAME = [SCRIPT, STYLE, TEXTAREA, TITLE, XMP].join("|"),
ANY = "[\\s\\S]*?",
AMP = /&/g,
LT = /</g,
GT = />/g;
return html.replace(new RegExp(
// Entities and text
"[^<]+" +
// Comment
"|<!--"+ANY+"-->" +
// Regular tag
"|<\/?(?!"+SPECIAL_TAG_NAME+")[a-zA-Z]"+ATTRS+">" +
// Special tags
"|<\/?"+SCRIPT +"\\b"+ATTRS+">"+ANY+"<\/"+SCRIPT +"\\s*>" +
"|<\/?"+STYLE +"\\b"+ATTRS+">"+ANY+"<\/"+STYLE +"\\s*>" +
"|<\/?"+TEXTAREA+"\\b"+ATTRS+">"+ANY+"<\/"+TEXTAREA+"\\s*>" +
"|<\/?"+TITLE +"\\b"+ATTRS+">"+ANY+"<\/"+TITLE +"\\s*>" +
"|<\/?"+XMP +"\\b"+ATTRS+">"+ANY+"<\/"+XMP +"\\s*>" +
// CDATA section. Content in capturing group 1.
"|<!\\[CDATA\\[("+ANY+")\\]\\]>" +
// A loose less-than
"|<", "g"),
function (token, cdataContent) {
return "string" === typeof cdataContent
? cdataContent.replace(AMP, "&").replace(LT, "<")
.replace(GT, ">")
: token === "<"
? "<" // Normalize loose less-thans.
: token;
});
}
Given
<b>foo</b><![CDATA[<i>bar</i>]]>
it produces
<b>foo</b><i>bar</i>
and given something that looks like a CDATA section inside a script
or other special tag or comment, it correctly does not muck with it:
<script>/*<![CDATA[*/foo=bar<baz&//]]></script><![CDATA[fish: <><]]>
becomes
<script>/*<![CDATA[*/foo=bar<baz&//]]></script>fish: <><
回答2:
You could try to use innerText
instead of innerHTML
.
回答3:
I would just strip the CDATA tags using a regular expression like so:
document.getElementById('test').innerHTML = '<![CDATA[foo]]> bar'.replace(/<!\[CDATA\[(.*)\]\]>/g, "$1")
Which results in 'test' having:
foo bar
That way the content of the CDATA sections is preserved without one having to worry about any of it becoming commented out. Unfortunately, this may break whatever required your documents to use CDATA sections to begin with.
回答4:
convert <, > and & signs like this:
document.getElementById('test').innerHTML = '<![CDATA[foo]]> bar'
回答5:
That is because CDATA
converts <
and >
(<
and >
) to their html entities. Try to convert the entities back to <
and >
.
You can read more about it here.
回答6:
If you make your page XHTML rather than HTML then the auto-comment "feature" of the CDATA might not happen. You do need to jump through the hoops that XHTML requires, such as a DOCTYPE, and whatever else.
Seems a bit arbitrary, any application that depends on CDATA is broken IMHO, but hopefully you get it working.
来源:https://stackoverflow.com/questions/7065615/innerhtml-converts-cdata-to-comments