I am learning to use both the re
module and the urllib
module in python and attempting to write a simple web scraper. Here\'s the code I\'ve writte
It is recommended that you use Beautiful Soup or any other parser to parse HTML, but if you badly want regex the following piece of code would do the job.
The regex code:
(.+?)
How it works:
Produces:
['Google']
['Welcome to Facebook - Log In, Sign Up or Learn More']
['reddit: the front page of the internet']