发表新帖

发表新帖

Screen scraping: getting around “HTTP Error 403: request disallowed by robots.txt”

前端未结

关注

 8  1122

借酒劲吻你 2020-12-12 17:15

Is there a way to get around the following?

httperror_seek_wrapper: HTTP Error 403: request disallowed by robots.txt

Is the only way around

8条回答

离开以前 (楼主)

2020-12-12 17:48
oh you need to ignore the robots.txt
```
br = mechanize.Browser()
br.set_handle_robots(False)
```
0 讨论(0)

查看其它8个回答
发布评论:

提交评论
- 加载中...

热议问题