Python爬虫从入门到放弃(十九)之 Scrapy爬取所有知乎用户信息(下)
在上一篇文章中主要写了关于爬虫过程的分析,下面是代码的实现,完整代码在: https://github.com/pythonsite/spider items中的代码主要是我们要爬取的字段的定义 class UserItem(scrapy.Item): id = Field() name = Field() account_status = Field() allow_message= Field() answer_count = Field() articles_count = Field() avatar_hue = Field() avatar_url = Field() avatar_url_template = Field() badge = Field() business = Field() employments = Field() columns_count = Field() commercial_question_count = Field() cover_url = Field() description = Field() educations = Field() favorite_count = Field() favorited_count = Field() follower_count = Field() following_columns