How to get redirect log in Mechanize?

Posted by 与世无争的帅哥 on 2019-12-12 04:48:07

Question


In Ruby, if you use Mechanize to follow 301/302 redirects like this:

require 'mechanize'

m = WWW::Mechanize.new
m.get('http://google.com')

how do I get the list of pages Mechanize was redirected through? (For example, http://google.com => http://www.google.com => http://google.com.ua)

Here is the code in Mechanize responsible for redirection:

      elsif res_klass <= Net::HTTPRedirection
        return page unless follow_redirect?
        log.info("follow redirect to: #{ response['Location'] }") if log
        from_uri  = page.uri
        raise RedirectLimitReachedError.new(page, redirects) if redirects + 1 > redirection_limit
        redirect_verb = options[:verb] == :head ? :head : :get
        page = fetch_page(  :uri => response['Location'].to_s,
                            :referer => page,
                            :params  => [],
                            :verb => redirect_verb,
                            :redirects => redirects + 1
                         )
        @history.push(page, from_uri)
        return page

But calling m.history.map { |p| puts p.uri } prints the URI of the last page three times.


Answer 1:


I'm not certain, but here are a couple of things to try:

  1. See what's in m.history[i].uri for each entry after the get().

  2. You might need something like:

    # Raise the redirect limit one step at a time until the request succeeds.
    (0..99).each do |limit|
      m.redirection_limit = limit
      begin
        m.get(url)
        break
      rescue WWW::Mechanize::RedirectLimitReachedError
        # code here could get control at
        # intermediate redirection levels
      end
    end
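The idea of getting control at each intermediate redirection level can also be sketched without touching the redirect limit: follow one hop at a time and record every URL. The sketch below is not Mechanize's API. `redirect_chain` and the `next_location` block are hypothetical names, and the block stands in for whatever fetches a URL and returns its Location header (or nil when there is no redirect). A stub hash replaces real HTTP requests so the example runs offline.

```ruby
# Follows a chain of redirects one hop at a time and records every URL visited.
# `next_location` is a hypothetical callable: given a URL, it returns the
# redirect target, or nil when the response is not a redirect.
def redirect_chain(start_url, limit: 20, &next_location)
  chain = [start_url]
  while (target = next_location.call(chain.last))
    # chain.size - 1 redirects have been followed so far; stop before exceeding the limit.
    raise "redirect limit (#{limit}) reached" if chain.size > limit
    chain << target
  end
  chain
end

# Stub standing in for real HTTP responses (assumed data, taken from the
# example chain in the question).
hops = {
  'http://google.com'     => 'http://www.google.com',
  'http://www.google.com' => 'http://google.com.ua'
}

chain = redirect_chain('http://google.com') { |url| hops[url] }
puts chain.join(' => ')
```

In a real script the block would issue a single request with redirect following turned off and read the Location header from the response; how to wire that up depends on the Mechanize version in use.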



Answer 2:


The key here is to take advantage of the built-in logging in Mechanize. Here's a full code sample using Ruby's standard-library Logger class (Rails is not required).

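require 'mechanize'
require 'logger'

# Logger does not create missing directories, so log/ must already exist.
mechanize_logger = Logger.new('log/mechanize.log')
mechanize_logger.level = Logger::INFO

url = 'http://google.com'

agent = Mechanize.new
agent.log = mechanize_logger  # Mechanize writes its log messages here
agent.get(url)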

Then check log/mechanize.log; it records the whole Mechanize process, including the intermediate URLs.
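If the redirect list is needed inside the program rather than in a file, the same logging approach can write to memory and be parsed afterwards. This is a minimal sketch, assuming each redirect is logged as "follow redirect to: <location>", which is the format in the source snippet quoted in the question; other Mechanize versions may word it differently. `redirect_targets` is a hypothetical helper, and the two log messages are written by hand so the example runs without the gem or network access.

```ruby
require 'logger'
require 'stringio'

# Extracts redirect targets from log text, assuming the message format
# "follow redirect to: <location>" (taken from the snippet in the question).
def redirect_targets(log_text)
  log_text.scan(/follow redirect to: (\S+)/).flatten
end

# Log to an in-memory buffer instead of a file.
buffer = StringIO.new
logger = Logger.new(buffer)
logger.level = Logger::INFO

# With Mechanize you would set `agent.log = logger` and call `agent.get(url)`.
# These two lines simulate what it would log for the chain in the question.
logger.info('follow redirect to: http://www.google.com')
logger.info('follow redirect to: http://google.com.ua')

puts redirect_targets(buffer.string)
```

Parsing log text is brittle compared with a dedicated API, so treat the regular expression as something to verify against the log output of the Mechanize version you actually run.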



Source: https://stackoverflow.com/questions/1352178/how-to-get-redirect-log-in-mechanize
