import urllib2 website = \"WEBSITE\" openwebsite = urllib2.urlopen(website) html = getwebsite.read() print html
So far so good.
But I wa
Look at using the beautiful soup html parsing library.
http://www.crummy.com/software/BeautifulSoup/
You will do something like this:
import BeautifulSoup soup = BeautifulSoup.BeautifulSoup(html) for link in soup.findAll("a"): print link.get("href")