I'm working on a small Python script to clean up HTML documents. It works by accepting a list of tags to KEEP and then parsing through the HTML code, trashing tags that are
Read:
Repent.
Use a real HTML parser, like BeautifulSoup.
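For what it's worth, here is a minimal sketch of the whitelist idea done with a parser instead of regexes. It uses the standard library's `html.parser.HTMLParser` rather than BeautifulSoup so that it runs without installing anything; that is a swap on my part, and the names `TagWhitelistFilter` and `strip_tags` are made up for illustration. It assumes you want to drop only the tags themselves and keep the text inside them, and it silently drops comments and doctype declarations.

```python
from html import escape
from html.parser import HTMLParser


class TagWhitelistFilter(HTMLParser):
    """Re-emit HTML, dropping every tag whose name is not in keep_tags."""

    def __init__(self, keep_tags):
        # convert_charrefs=False so entities arrive through
        # handle_entityref / handle_charref and can be written back as-is.
        super().__init__(convert_charrefs=False)
        self.keep_tags = {t.lower() for t in keep_tags}
        self.parts = []

    def _format_attrs(self, attrs):
        # The parser hands over unescaped attribute values, so escape again.
        out = []
        for name, value in attrs:
            if value is None:
                out.append(f" {name}")
            else:
                out.append(f' {name}="{escape(value, quote=True)}"')
        return "".join(out)

    def handle_starttag(self, tag, attrs):
        if tag in self.keep_tags:
            self.parts.append(f"<{tag}{self._format_attrs(attrs)}>")

    def handle_startendtag(self, tag, attrs):
        if tag in self.keep_tags:
            self.parts.append(f"<{tag}{self._format_attrs(attrs)} />")

    def handle_endtag(self, tag):
        if tag in self.keep_tags:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        # Text is always kept, including text inside dropped tags.
        self.parts.append(data)

    def handle_entityref(self, name):
        self.parts.append(f"&{name};")

    def handle_charref(self, name):
        self.parts.append(f"&#{name};")


def strip_tags(html, keep_tags):
    """Return html with every tag not listed in keep_tags removed."""
    parser = TagWhitelistFilter(keep_tags)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


if __name__ == "__main__":
    sample = '<div><p>Hello <b>world</b> &amp; <a href="x">link</a></p></div>'
    print(strip_tags(sample, ["p", "a"]))
```

Note that the text of a dropped `<script>` or `<style>` element is kept as plain text here, so if the goal is sanitizing untrusted input this sketch is not enough on its own. With BeautifulSoup the same idea is usually written by walking the tree and calling `unwrap()` on each tag you don't want.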