I am converting some HTML cleanup code from JS to Python with BS4. The JS version relies on nasty regex hacks to do the cleanup, and I've converted a bunch of them to much nicer, idioma