I need to be able to modify every single link in an HTML document. I know that I need to use the SoupStrainer but I\'m not 100% positive on how to implement it.
SoupStrainer
I tried this and it worked, it's easier to avoid using regexp for matching each 'href':
'href'
from bs4 import BeautifulSoup as bs soup = bs(htmltext) for a in soup.findAll('a'): a['href'] = "mysite"
Check it out, on bs4 docs.