I\'d like one or more regexes that can:
1) Take the html of a large page.
2) Find the urls contained in all links, for example:
/]+href\s*=\s*["']([^"']+)["'][^>]*>(.*?)<\/a>/mis