Scrapy start_urls

后端 未结 6 1562
有刺的猬
有刺的猬 2020-12-28 23:23

The script (below) from this tutorial contains two start_urls.

from scrapy.spider import Spider
from scrapy.selector import Selector

from dirb         


        
6条回答
  •  谎友^
    谎友^ (楼主)
    2020-12-29 00:04

    start_urls contain those links from which the spider start crawling. If you want crawl recursively you should use crawlspider and define rules for that. http://doc.scrapy.org/en/latest/topics/spiders.html look there for example.

提交回复
热议问题