如何使用“if”处理两个或更多xpath？问题的回答

如何使用“if”处理两个或更多xpath？

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

通过下面的代码，我正在用python培训web抓取 但其中一个数据有两个xpath，我想知道是否有一种方法可以使用“if”条件捕获这两个xpath，但我不知道如何将其插入到代码中。有人能指引我吗 例如，如果其中一个xpath为null，那么肯定是另一个。我不知道它是否解释得很好，但是如果我有a和b，如果a为空，那么b “vlr_atual”可分别为： <pre><code>product.xpath(".//span[@id='priceblock_ourprice']/text()").get() product.xpath(".//span[@id='priceblock_saleprice']/text()").get() </code></pre> <a href="https://www.amazon.com.br/Monitor-LG-19-5-LED-Inclina%C3%A7%C3%A3o/dp/B084TKF88Q/ref=sr_1_1?dchild=1&qid=1615682905&s=computers&sr=1-1" rel="nofollow noreferrer">https://www.amazon.com.br/Monitor-LG-19-5-LED-Inclina%C3%A7%C3%A3o/dp/B084TKF88Q/ref=sr_1_1?dchild=1&qid=1615682905&s=computers&sr=1-1</a> <a href="https://www.amazon.com.br/Monitor-Gamer-Dell-S2421HGF-23-8/dp/B086M269P3/ref=sr_1_19?dchild=1&qid=1615682905&s=computers&sr=1-19" rel="nofollow noreferrer">https://www.amazon.com.br/Monitor-Gamer-Dell-S2421HGF-23-8/dp/B086M269P3/ref=sr_1_19?dchild=1&qid=1615682905&s=computers&sr=1-19</a> <pre><code>import scrapy import datetime class ProductsSpider(scrapy.Spider): name = 'products' allowed_domains = ['www.amazon.com.br'] start_urls = ['https://www.amazon.com.br/s?i=computers&bbn=16339926011&rh=n%3A16364756011&fs=true&qid=1615634908&ref=sr_pg_1'] def parse(self, response): for produto in response.xpath("//div[@class='a-section a-spacing-medium']"): selo = produto.xpath(".//span[@class='a-badge-text']/text()").get() link = response.urljoin(produto.xpath(".//h2/a/@href").get()) yield response.follow(url=link, callback=self.parse_details, meta={'selo' : selo}) next_page = response.urljoin(response.xpath("//li[@class='a-last']/a/@href").get()) if next_page: yield scrapy.Request(url=next_page, callback=self.parse) def parse_details(self, response): selo = response.request.meta['selo'] for produto in response.xpath("//div[@id='dp']"): vlr_atual = produto.xpath(".//span[@id='priceblock_ourprice']/text()").get() if vlr_atual is None: vlr_atual = produto.xpath(".//span[@id='priceblock_saleprice']/text()").get() yield{ 'data' : datetime.datetime.now().strftime("%Y%m%d"), 'selo': selo, 'nome': produto.xpath("normalize-space(.//span[@id='productTitle']/text())").get(), 'vlr_atual': vlr_atual, 'estoque': produto.xpath("normalize-space(.//select[@name='quantity']/option[last()]/text())").get(), 'ean': produto.xpath("normalize-space(.//table[@id='productDetails_techSpec_section_1']//tr[last()]/td/text())").get(), } </code></pre>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

如何使用“if”处理两个或更多xpath？

1 个回答

相关Python问题