I would need to remove anything between XML tags, especially whitespace and newlines.
For example, removing whitespace and newlines from: \\n<
You shouldn't use regex to parse XML or HTML: it isn't reliable, and there are far too many edge cases. Use an XML/HTML parser for this kind of thing instead.
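As a rough sketch of the parser approach, here is one way it could look in Python with the standard library's `xml.etree.ElementTree`. It assumes the input is well-formed XML and that "anything between tags" means text that is whitespace only; the function name `strip_whitespace_between_tags` is just made up for this example.

```python
import xml.etree.ElementTree as ET


def strip_whitespace_between_tags(xml_text: str) -> str:
    """Drop whitespace-only text that sits between tags (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        # elem.text is the text right after the opening tag.
        if elem.text is not None and not elem.text.strip():
            elem.text = None
        # elem.tail is the text right after the closing tag.
        if elem.tail is not None and not elem.tail.strip():
            elem.tail = None
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = "<root>\n  <a>hello</a>\n  <b>\n    <c>x y</c>\n  </b>\n</root>"
    print(strip_whitespace_between_tags(sample))
```

Text that contains real content (like `x y` above) is left alone, including its inner spaces. If you want to trim or remove that too, change the two `if` checks accordingly.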