Nokogiri HTML parsing not working

为君一笑 提交于 2019-12-11 10:28:32

问题


I am trying to parse some HTML with Nokogiri, but I am not getting anything back from the css or xpath methods.

require 'rubygems'
require 'open-uri'
require 'nokogiri'

doc = Nokogiri::HTML(open("http://www.google.com"))
doc.css('div').each do |div|
   puts div.content
end
doc.xpath('//div').each do |div|
   puts div.content
end

Nothing gets printed to the screen, so css and xpath are returning empty arrays. There are at least 100 divs in Google's homepage.

doc.to_html returns:

<!DOCTYPE html>\n\n

doc.validate returns:

[#<Nokogiri::XML::SyntaxError: no root element>]

I uninstalled Nokogiri, and reinstalled libxml2 and libxslt as mentioned in "Installing Nokogiri". Everything's working now.

来源:https://stackoverflow.com/questions/7335877/nokogiri-html-parsing-not-working

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!