Question
I found the HTML Agility Pack useful and easy to use for screen scraping web sites. What are the equivalent HTML screen scraping libraries for Java, Ruby, and Python?
Answer 1:
Found what I was looking for in another question: "Options for HTML scraping?"
Answer 2:
BeautifulSoup is the standard Python screen scraping tool.
Recently, however, I used pyQuery (incomplete at the moment), which is more or less a port of jQuery to Python, and found it very useful.
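
For illustration, here is a minimal sketch of what scraping with BeautifulSoup can look like. It assumes the `beautifulsoup4` package is installed; the sample HTML, the `example.com` URLs, and the `extract_links` helper are made up for this example and are not part of the library.

```python
from bs4 import BeautifulSoup  # third-party: pip install beautifulsoup4

# Hypothetical sample page; a real scraper would fetch this over HTTP.
SAMPLE_HTML = """
<html><body>
  <h1>Scraping libraries</h1>
  <ul>
    <li><a href="https://example.com/a">First</a></li>
    <li><a href="https://example.com/b">Second</a></li>
    <li><a>No href here</a></li>
  </ul>
</body></html>
"""


def extract_links(html):
    """Return (text, href) pairs for every <a> element that has an href."""
    # "html.parser" is the parser from the standard library, so lxml is not needed.
    soup = BeautifulSoup(html, "html.parser")
    # href=True keeps only anchors that actually carry an href attribute.
    return [(a.get_text(strip=True), a["href"])
            for a in soup.find_all("a", href=True)]


if __name__ == "__main__":
    for text, href in extract_links(SAMPLE_HTML):
        print(text, href)
```

BeautifulSoup also accepts CSS selectors through `soup.select(...)`, which may feel closer to the jQuery style that pyQuery aims for.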
Source: https://stackoverflow.com/questions/1060484/html-agility-pack-or-html-screen-scraping-libraries-for-java-ruby-python