如何通过命令生成url让scrapy

def parse(self, response): selector = Selector(response) sites = selector.xpath("//h3[@class='r']/a/@href") for index, site in enumerate(sites): url = result.group(1) print url yield Request(url = site.extract(),callback = self.parsedetail) def parsedetail(self,response): print response.url ... obj = Store.objects.filter(id=store_obj.id,add__isnull=True) if obj: obj.update(add=add)

1条回答

网友

1楼 · 发布于 2024-09-26 23:23:49

默认情况下，未定义计划和发送报废请求的顺序。但是，您可以使用^{} keyword argument来控制它：

priority (int) – the priority of this request (defaults to 0). The priority is used by the scheduler to define the order used to process requests. Requests with a higher priority value will execute earlier. Negative values are allowed in order to indicate relatively low-priority.

您还可以通过在meta字典中传递callstack使爬行同步，例如请参见this answer。在

编程相关推荐

java为TestNG中的并行测试初始化和清理pertest数据
java如何遍历ArrayList并从每个对象中仅提取1个字段
如何在Java中读取用户的下一个按键
圆内和圆上的java点
java如何在此上下文中使用interrupt（）
java如何在Google Guice中使用Jersey ExceptionMapper？
爪哇罐头http://localhost:8080/tunnelweb/jsonws是否在默认安装时打开？
硬件如何在Linux、Windows和Mac上使用Java+JNI检索硬盘的唯一ID
java如何使用Hibernate获取不完整的集合？
如何将SOAP消息从Tomcat Java应用捕获到外部服务器？

相关问题更多 >

编程相关推荐

热门问题

热门文章