Following hyperlink and “Filtered offsite request”

前端 未结 2 1729
悲哀的现实
悲哀的现实 2020-12-16 13:26

I know that there are several related threads out there, and they have helped me a lot, but I still can\'t get all the way. I am at the point where running the code doesn\'t

2条回答
  •  死守一世寂寞
    2020-12-16 13:56

    You need to modify your yielded Request in parse to use parse2 as its callback.

    EDIT: allowed_domains shouldn't include the http prefix eg:

    allowed_domains = ["boliga.dk"]
    

    Try that and see if your spider still runs correctly instead of leaving allowed_domains blank

提交回复
热议问题