Scraping from a website that requires a login?

前端 未结 5 1971
旧时难觅i
旧时难觅i 2020-12-17 04:30

Can this be done if so, how? I want to scrape data from xbox.com but the pages I need to scrape only appear after a successful login.

5条回答
  •  情话喂你
    2020-12-17 05:17

    Most login forms will set a cookie. So you should use a HTTP class like Zend_Http that can store them for further requests. It's presumably as simple as:

    $client = new Zend_Http_Client();
    $client->setCookieJar();   // this is the crucial part for "logging in"
    
    // make login request
    $client->setUri("http://xbox.com/login");
    $client->setParameterPost("login", "hackz0r");
    $result = $client->request('POST');
    
    // go scraping
    ...
    

提交回复
热议问题