scrapyjs

Splash do not render the whole page

社会主义新天地 提交于 2019-12-13 09:11:17
问题 I like to use scrapy and splash to grabb some data but poorly splash seems not to render the whole --> page <--. The page should look like this: But it looks like this: So some of the more important information is missing. I already tried to increase the waiting time but this had no positive effect. Does anyone have an idea what I could do, to make this work? 回答1: Try to look at Splash FAQ where common problems with page rendering are discussed. Especially, I've often seen problem with

Scrapyjs + Splash click controller button

[亡魂溺海] 提交于 2019-11-29 20:06:49
问题 Hello I have installed Scrapyjs + Splash and I use the following code import json import scrapy from scrapy.linkextractors import LinkExtractor from scrapy.spider import Spider from scrapy.selector import Selector import urlparse, random class DmozSpider(scrapy.Spider): name = "dmoz" allowed_domains = ["whoscored.com"] start_urls = ['http://www.whoscored.com/Regions/81/Tournaments/3/Seasons/4336/Stages/9192/Fixtures/Germany-Bundesliga-2014-2015'] def start_requests(self): for url in self