擅长:python、mysql、java
<p><code>response</code>不包含<code>xpath</code>方法的原因是下载中间件的<code>process_response</code>方法中的<code>response</code>参数属于{a1}类型,参见<a href="https://doc.scrapy.org/en/latest/topics/downloader-middleware.html#scrapy.downloadermiddlewares.DownloaderMiddleware.process_response" rel="nofollow noreferrer">documentation</a>。只有<a href="https://doc.scrapy.org/en/latest/topics/request-response.html#scrapy.http.TextResponse" rel="nofollow noreferrer">^{<cd6>}</a>(和<a href="https://doc.scrapy.org/en/latest/topics/request-response.html#scrapy.http.HtmlResponse" rel="nofollow noreferrer">^{<cd7>}</a>)有<code>xpath</code>方法。所以在使用<code>xpath</code>之前,从<code>response</code>创建{<cd10>}对象。相应的部分将变成:</p>
<pre><code>...
new_response = scrapy.http.HtmlResponse(response.url, body=response.body)
if new_response.xpath(xpath).extract():
found = True
break
...
</code></pre>