When trying to extract the title of a html-page I have always used the following regex:
(?<=)([\\s\\S]*)(?=)
What about something like:
r = re.compile("()([\s\S]*)()") title = r.search(page).group(2)