We have a bunch of files that are html pages but which contain additional xml elements (all prefixed with our company name \'TLA\') to provide data and structure for an older pr
This should do it:
When run on your sample input (once the missing namespace declaration is added), the result is: