I'd like to extract the text from an HTML file using Python. I want essentially the same output I would get if I copied the text from a browser and pasted it into Notepad.
You can also use the html2text function from the stripogram library.
    from stripogram import html2text
    text = html2text(your_html_string)
To install stripogram, run:
    sudo easy_install stripogram
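For comparison, here is a rough sketch of the same idea using only the html.parser module from Python 3's standard library. It is not how stripogram works internally, and the names TextExtractor and extract_text are hypothetical, chosen only for this illustration. It assumes that skipping script and style contents and collapsing whitespace is close enough to what a browser copy-paste would give.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text from an HTML string."""

    # Contents of these tags are never shown by a browser.
    SKIPPED = {"script", "style"}
    # Assumed set of tags that should separate words in the output.
    BLOCK = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self._parts.append(" ")

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag in self.BLOCK:
            self._parts.append(" ")

    def handle_data(self, data):
        # Keep text only when we are outside <script>/<style>.
        if self._skip_depth == 0:
            self._parts.append(data)

    def text(self):
        # Collapse runs of whitespace into single spaces.
        return " ".join("".join(self._parts).split())


def extract_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Calling extract_text(your_html_string) returns a single line of text. Unlike a browser, this sketch does not preserve line breaks between paragraphs or handle hidden elements, so treat it as a starting point rather than a replacement for a dedicated library.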