I would like to parse an HTML document and ignore span elements (but keep their contents) so that I can iterate the strings in the document the way a user would see them, as