<p>Give this a try. In a Scrapy project you can use <code>xpaths</code>, as shown below:</p>
<pre><code>import scrapy


class AutotraderSpider(scrapy.Spider):
    name = "autotrader"
    allowed_domains = ["autotrader.co.uk"]
    start_urls = ["https://www.autotrader.co.uk/car-dealers/search?advertising-location=at_cars&postcode=m43aq&radius=1500&forSale=on&toOrder=on&sort=with-retailer-reviews&page=822"]

    def parse(self, response):
        # Each dealer listing is one article element with this class.
        for items in response.xpath("//article[@class='dealerList__item']"):
            name = items.xpath(".//span[@itemprop='legalName']/text()").extract_first()
            # Collapse stray whitespace inside each address text node, then join the nodes.
            address = ' '.join([' '.join(item.split()) for item in items.xpath(".//p[@class='dealerList__itemAddress']/text()").extract()])
            cars = items.xpath(".//span[@class='dealerList__itemCountNumber']/text()").extract_first()
            yield {"Name": name, "Address": address, "Cars": cars}
</code></pre>
<p>Partial output:</p>
<pre><code>Midland Motors Leicester Street, Burton-On-Trent, Staffordshire DE14 3BA 2
Ns Cars 69 Eldon Street, Burton-On-Trent, Staffordshire DE15 0LT 1
RS Sales Nottingham Ltd Unit 1 TRINITY PARK, RANDALL PARK WAY, Retford, Nottinghamshire DN22 7WF 1
Adc Ltd Unit 3 HUCKNALL LANE, Nottingham, Nottinghamshire NG6 8AJ 5
</code></pre>
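<p>The address expression in the spider is the least obvious part, so here is a minimal standalone sketch of what it does. The helper name <code>normalize_whitespace</code> and the sample fragments are hypothetical, made up for illustration; they only imitate the kind of text nodes that <code>.extract()</code> might return for an address paragraph. Unlike the one-liner in the spider, this version also drops fragments that are empty after cleaning, which is an extra assumption on my part.</p>

```python
def normalize_whitespace(fragments):
    """Collapse whitespace runs inside each fragment, then join the fragments.

    str.split() with no arguments splits on any run of whitespace
    (spaces, tabs, newlines) and discards leading/trailing whitespace.
    """
    cleaned = (" ".join(fragment.split()) for fragment in fragments)
    # Skip fragments that were whitespace-only, so no double spaces appear.
    return " ".join(piece for piece in cleaned if piece)


# Hypothetical text nodes, shaped like what .extract() could return
# for an address spread over several lines of markup.
raw_fragments = [
    "\n   Leicester Street,\n  ",
    "  Burton-On-Trent,   Staffordshire ",
    " DE14 3BA\n",
]

print(normalize_whitespace(raw_fragments))
# Leicester Street, Burton-On-Trent, Staffordshire DE14 3BA
```

<p>Keeping the cleanup in a small function like this makes it easy to reuse for other fields and to check without running the crawler.</p>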