How do you parse HTML with a variety of languages and parsing libraries?
When answering:
Individual comments will be linked to in answers to questions
Language: Ruby
Library: Hpricot
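#!/usr/bin/ruby
require 'hpricot'

# Build a sample document (anchor markup and URLs are illustrative placeholders).
html = '<html><body>'
['foo', 'bar', 'baz'].each { |link| html += "<a href=\"http://#{link}.example.com\">#{link}</a>" }
html += '</body></html>'

# Parse it and print the href of every anchor.
doc = Hpricot(html)
doc.search('//a').each { |elm| puts elm.attributes['href'] }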