Scrapy Shell and Scrapy Splash

前端 未结 3 847
刺人心
刺人心 2020-12-07 14:53

We\'ve been using scrapy-splash middleware to pass the scraped HTML source through the Splash javascript engine running inside a docker container.

If we

3条回答
  •  隐瞒了意图╮
    2020-12-07 15:33

    For the windows users, who use Docker Toolbox:

    1. Change the single inverted comma with double inverted comma for preventing the invalid hostname:http error.

    2. change the localhost to the docker ip address which is below the whale logo. for me it was 192.168.99.100.

    Finally i got this:

    scrapy shell "http://192.168.99.100:8050/render.html?url="https://samplewebsite.com/category/banking-insurance-financial-services/""

提交回复
热议问题