Scrapy doesn't seem to be doing DFO

后端 未结 3 1114
臣服心动
臣服心动 2021-02-20 03:21

I have a website for which my crawler needs to follow a sequence. So for example, it needs to go a1, b1, c1 before it starts going a2 etc. each of a, b and c are handled by diff

3条回答
  •  广开言路
    2021-02-20 03:42

    Depth first searching is exactly what you are describing:

    search as deep into a's as possible before moving to b's
    

    To change Scrapy to do breadth-first searching (a1, b1, c1, a2, etc...), change these settings:

    DEPTH_PRIORITY = 1
    SCHEDULER_DISK_QUEUE = 'scrapy.squeue.PickleFifoDiskQueue'
    SCHEDULER_MEMORY_QUEUE = 'scrapy.squeue.FifoMemoryQueue'
    

    *Found in the doc.scrapy.org FAQ

提交回复
热议问题