Writing a crawler to parse a site in Scrapy using BaseSpider
Question

I am confused about how to design the architecture of the crawler. The search I want to crawl has pagination, with next-page links to follow; a list of products on each page; and individual product links that must be crawled to get the description.

I have the following code:

    def parse_page(self, response):
        hxs = HtmlXPathSelector(response)
        # Each <li> under the result list is one product entry.
        sites = hxs.select('//ol[@id=\'result-set\']/li')
        items = []
        for site in sites[:2]:
            item = MyProduct()
            item['product'] = myfilter(site.select('h2/a').select("string()").extract())
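One common way to structure the three levels described above (pagination, product list, product detail) is to chain callbacks: the listing callback yields one request per product plus one request for the next page, and the detail callback completes the item. The sketch below is a standard-library simulation of that flow, not Scrapy code. `FAKE_SITE`, `Request`, `Response`, `parse_product`, and `crawl` are hypothetical stand-ins invented for illustration; in Scrapy the equivalent would be to yield `Request(url, callback=..., meta=...)` objects from the spider callbacks and let the engine do the scheduling.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical in-memory "site" standing in for real HTTP responses.
FAKE_SITE = {
    "/search?page=1": {"products": ["/p/1", "/p/2"], "next": "/search?page=2"},
    "/search?page=2": {"products": ["/p/3"], "next": None},
    "/p/1": {"description": "first"},
    "/p/2": {"description": "second"},
    "/p/3": {"description": "third"},
}


@dataclass
class Request:
    """Simplified stand-in for a crawler request (assumed, not Scrapy's class)."""
    url: str
    callback: Callable
    meta: dict = field(default_factory=dict)


@dataclass
class Response:
    """Simplified stand-in for a downloaded page."""
    url: str
    data: dict
    meta: dict


def parse_page(response):
    """Listing callback: one request per product, then the next page."""
    for link in response.data["products"]:
        item = {"product": link}
        # Carry the partially filled item along to the detail callback.
        yield Request(link, parse_product, meta={"item": item})
    next_url = response.data["next"]
    if next_url:
        yield Request(next_url, parse_page)


def parse_product(response):
    """Detail callback: complete the item started on the listing page."""
    item = response.meta["item"]
    item["description"] = response.data["description"]
    yield item


def crawl(start_url):
    """Tiny FIFO driver that plays the role of the crawling engine."""
    queue = deque([Request(start_url, parse_page)])
    seen, items = set(), []
    while queue:
        request = queue.popleft()
        if request.url in seen:
            continue  # skip URLs that were already fetched
        seen.add(request.url)
        response = Response(request.url, FAKE_SITE[request.url], request.meta)
        for result in request.callback(response):
            if isinstance(result, Request):
                queue.append(result)
            else:
                items.append(result)
    return items
```

Calling `crawl("/search?page=1")` walks both listing pages and returns one completed item per product. The point of the design is that `parse_page` never builds a list of finished items itself: it hands each half-built item to the detail callback through `meta`, so pagination and detail scraping stay independent.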