How can I make scrapy crawl break and exit when encountering the first exception?

伪装坚强ぢ 2020-12-14 02:20

For development purposes, I would like to stop all scrapy crawling activity as soon as the first exception (in a spider or a pipeline) occurs.

Any advice?

3 Answers
  •  情书的邮戳
    2020-12-14 02:52

    In a spider, you can just raise a CloseSpider exception.

    from scrapy.exceptions import CloseSpider

    def parse_page(self, response):
        # response.body is bytes; use response.text for a str comparison
        if 'Bandwidth exceeded' in response.text:
            raise CloseSpider('bandwidth_exceeded')
    

    For other components (middlewares, pipelines, etc.), you can manually call close_spider, as akhter mentioned; see the sketch below.
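
    Here is a minimal sketch of that approach from an item pipeline, assuming the pipeline obtains a crawler reference through the from_crawler hook. The FailFastPipeline name, the 'first_error' reason string, and the validate step are illustrative, not part of Scrapy; only crawler.engine.close_spider is Scrapy's own API.

    class FailFastPipeline:
        """Illustrative pipeline that stops the whole crawl on the first error."""

        def __init__(self, crawler):
            self.crawler = crawler

        @classmethod
        def from_crawler(cls, crawler):
            # Scrapy calls this hook and passes in the running crawler
            return cls(crawler)

        def process_item(self, item, spider):
            try:
                self.validate(item)  # hypothetical per-item check
            except Exception:
                # Ask the engine to shut the spider down, then re-raise
                # so the original failure is still logged.
                self.crawler.engine.close_spider(spider, reason='first_error')
                raise
            return item

        def validate(self, item):
            # Placeholder; replace with whatever should fail fast.
            if not item:
                raise ValueError('empty item')

    For exceptions raised in spider callbacks specifically, Scrapy's built-in CloseSpider extension offers a settings-only alternative: setting CLOSESPIDER_ERRORCOUNT to 1 should close the spider after the first callback error.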
