I want a regular expression to extract the title from a HTML page. Currently I have this:
title = re.search(\'.*\', html, re.IGNOR
May I recommend you to Beautiful Soup. Soup is a very good lib to parse all of your html document.
soup = BeatifulSoup(html_doc) titleName = soup.title.name