Is there a method to follow a link using Nokogiri for scraping? I know I can extract the href and open it, but I thought I saw a method to do this using hpricot and was wondering if there was something like that in Nokogiri.
dbyrne
Here is an excellent screen scraping guide for using Ruby, Nokigiri, Hpricot, and Firebug.
Personally I am a big fan of using Mechanize, which is a headless browser, for screen scraping. You can use mechanize to navigate links and fill out forms and it will handle all the tricky stuff like cookies.
来源:https://stackoverflow.com/questions/2807500/following-a-link-using-nokogiri-for-scraping